Hi,
I have a spreadsheet with 130000 lines, and I would really appreciate some help in merging some of the data. I have searched the forum for help, and even though I found some related post, it did not solve my problem. I have lost of rows with duplicates of an unique ID (column A), and each ID may be associated with different values (Column B). I need to merge all the duplicate ID's into one row, and then pick the most frequent value in Column B as the new value for that row. Column A contain numbers, and Column B contain text strings. If there are eg 2 values in Column B that occur equally often, I dont care which one is chosen.
I have this:
A B
[TABLE="width: 110"]
<colgroup><col span="2"></colgroup><tbody>[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]d[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]g[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]d[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]e[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]f[/TD]
[/TR]
</tbody>[/TABLE]
I want this:
[TABLE="width: 110"]
<colgroup><col span="2"></colgroup><tbody>[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]c[/TD]
[/TR]
</tbody>[/TABLE]
Help on this is much appreciated, because manually doing this is not going to happen
Thanks
Cato
I have a spreadsheet with 130000 lines, and I would really appreciate some help in merging some of the data. I have searched the forum for help, and even though I found some related post, it did not solve my problem. I have lost of rows with duplicates of an unique ID (column A), and each ID may be associated with different values (Column B). I need to merge all the duplicate ID's into one row, and then pick the most frequent value in Column B as the new value for that row. Column A contain numbers, and Column B contain text strings. If there are eg 2 values in Column B that occur equally often, I dont care which one is chosen.
I have this:
A B
[TABLE="width: 110"]
<colgroup><col span="2"></colgroup><tbody>[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]1[/TD]
[TD]d[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]g[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]c[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]d[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]e[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]f[/TD]
[/TR]
</tbody>[/TABLE]
I want this:
[TABLE="width: 110"]
<colgroup><col span="2"></colgroup><tbody>[TR]
[TD]1[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]2[/TD]
[TD]b[/TD]
[/TR]
[TR]
[TD]3[/TD]
[TD]a[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]c[/TD]
[/TR]
</tbody>[/TABLE]
Help on this is much appreciated, because manually doing this is not going to happen
Thanks
Cato