I have been given some wonderfully complicated and not at all well formatted Excel spreadsheets at work. These spreadsheets are kept by the nurses I work with and have important patient info in. I'm trying to consolidate the data and then clean it in a hope to make everyone's lives easier.
So Excel #1 looks like this and has info on every patient in a study. But it is missing lots of data.
[TABLE="width: 500"]
<tbody>[TR]
[TD]ID[/TD]
[TD]DOB[/TD]
[TD]Consent[/TD]
[TD]Surgery Type[/TD]
[TD]Disease Stage[/TD]
[/TR]
[TR]
[TD]EX-0001[/TD]
[TD]01/01/1970[/TD]
[TD]Y[/TD]
[TD]Laser Lap[/TD]
[TD]1[/TD]
[/TR]
[TR]
[TD]EX-0002[/TD]
[TD]02/02/1971[/TD]
[TD]Y[/TD]
[TD]Diag Lap[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[/TR]
[TR]
[TD]EX-0997[/TD]
[TD]31/12/1999[/TD]
[TD]N[/TD]
[TD]Rx Endo[/TD]
[TD]1[/TD]
[/TR]
</tbody>[/TABLE]
Excel #2 looks like this and has a subset of patients but has a lot more data (eg red column).
[TABLE="width: 500"]
<tbody>[TR]
[TD]ID[/TD]
[TD]DOB[/TD]
[TD]Consent[/TD]
[TD]Disease Tissue[/TD]
[TD]Surgery Type[/TD]
[/TR]
[TR]
[TD]EX-0001[/TD]
[TD]01/01/1970[/TD]
[TD]Y[/TD]
[TD]1[/TD]
[TD]Laser Lap[/TD]
[/TR]
[TR]
[TD]EX-0004[/TD]
[TD]04/04/1974[/TD]
[TD]N[/TD]
[TD]1,2[/TD]
[TD]Rx Endo[/TD]
[/TR]
[TR]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[/TR]
[TR]
[TD]EX-0996[/TD]
[TD]30/12/1999[/TD]
[TD]N[/TD]
[TD]2,4[/TD]
[TD]Laser Lap[/TD]
[/TR]
</tbody>[/TABLE]
I am trying to find all the IDs that are in #1 but NOT in #2 . So in the example above, EX-0002 and EX-0997 would be highlighted. Ideally the rows for the missing IDs would be left and everything else removed.
The columns are ordered differently in both spreadsheets and shared info is formatted differently. #2 has more up to date info and more data in general and I am hoping to add the data from #1 to #2 and then populate the missing info.
I've noticed that these kinds of tasks are regularly done by the nurses and take up so much of their time. They're often given a list of IDs in an email and asked for a subset of columns for those IDs and the nurses usually do it by hand and have to take info from one spreadsheet and info from another and then type it all up again.
Any help would be much appreciated!
So Excel #1 looks like this and has info on every patient in a study. But it is missing lots of data.
[TABLE="width: 500"]
<tbody>[TR]
[TD]ID[/TD]
[TD]DOB[/TD]
[TD]Consent[/TD]
[TD]Surgery Type[/TD]
[TD]Disease Stage[/TD]
[/TR]
[TR]
[TD]EX-0001[/TD]
[TD]01/01/1970[/TD]
[TD]Y[/TD]
[TD]Laser Lap[/TD]
[TD]1[/TD]
[/TR]
[TR]
[TD]EX-0002[/TD]
[TD]02/02/1971[/TD]
[TD]Y[/TD]
[TD]Diag Lap[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[/TR]
[TR]
[TD]EX-0997[/TD]
[TD]31/12/1999[/TD]
[TD]N[/TD]
[TD]Rx Endo[/TD]
[TD]1[/TD]
[/TR]
</tbody>[/TABLE]
Excel #2 looks like this and has a subset of patients but has a lot more data (eg red column).
[TABLE="width: 500"]
<tbody>[TR]
[TD]ID[/TD]
[TD]DOB[/TD]
[TD]Consent[/TD]
[TD]Disease Tissue[/TD]
[TD]Surgery Type[/TD]
[/TR]
[TR]
[TD]EX-0001[/TD]
[TD]01/01/1970[/TD]
[TD]Y[/TD]
[TD]1[/TD]
[TD]Laser Lap[/TD]
[/TR]
[TR]
[TD]EX-0004[/TD]
[TD]04/04/1974[/TD]
[TD]N[/TD]
[TD]1,2[/TD]
[TD]Rx Endo[/TD]
[/TR]
[TR]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[TD]...[/TD]
[/TR]
[TR]
[TD]EX-0996[/TD]
[TD]30/12/1999[/TD]
[TD]N[/TD]
[TD]2,4[/TD]
[TD]Laser Lap[/TD]
[/TR]
</tbody>[/TABLE]
I am trying to find all the IDs that are in #1 but NOT in #2 . So in the example above, EX-0002 and EX-0997 would be highlighted. Ideally the rows for the missing IDs would be left and everything else removed.
The columns are ordered differently in both spreadsheets and shared info is formatted differently. #2 has more up to date info and more data in general and I am hoping to add the data from #1 to #2 and then populate the missing info.
I've noticed that these kinds of tasks are regularly done by the nurses and take up so much of their time. They're often given a list of IDs in an email and asked for a subset of columns for those IDs and the nurses usually do it by hand and have to take info from one spreadsheet and info from another and then type it all up again.
Any help would be much appreciated!