I have got a very large dataset on which I need to perform some data analysis. There are roughly 100 variables but there are multiple iterations so in the end it is closer to 15,000 variables. I need a way to select just the variables with the same header in a new worksheet as that will make it much easier as it is currently unordered.
As an example:
[TABLE="width: 500"]
<tbody>[TR]
[TD]V1[/TD]
[TD]V2[/TD]
[TD]V3[/TD]
[TD]V2[/TD]
[TD]V3[/TD]
[TD]V1[/TD]
[TD]V2[/TD]
[TD]V3[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]6[/TD]
[TD]8[/TD]
[TD]9[/TD]
[TD]7[/TD]
[TD]7[/TD]
[TD]8[/TD]
[TD]48[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]5[/TD]
[TD]4[/TD]
[TD]6[/TD]
[TD]5[/TD]
[TD]6[/TD]
[TD]5[/TD]
[TD]1[/TD]
[TD]6[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]6[/TD]
[TD]8[/TD]
[TD]4[/TD]
[TD]12[/TD]
[TD]3[/TD]
[TD]6[/TD]
[TD]5[/TD]
[TD]12[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[/TR]
</tbody>[/TABLE]
And the output I'm looking for in a new worksheet:
If I want to see every V3 Variable..
[TABLE="width: 500"]
<tbody>[TR]
[TD]V3[/TD]
[TD]V3[/TD]
[TD]V3[/TD]
[/TR]
[TR]
[TD]8[/TD]
[TD]7[/TD]
[TD]48[/TD]
[/TR]
[TR]
[TD]6[/TD]
[TD]6[/TD]
[TD]6[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]3[/TD]
[TD]12[/TD]
[/TR]
[TR]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[/TR]
</tbody>[/TABLE]
I have written out an Excel Formula that sort of does this, but my computer does not have the memory to execute (The dataset has 7,500,00 cells worth of data) so maybe a macro or a more optimised formula would work.
Just thinking off the top of my head, if there is a way to just split it up once so every variable is automatically split and outputted like this in one go rather than redoing it over and over again that would be brilliant
Cheers
As an example:
[TABLE="width: 500"]
<tbody>[TR]
[TD]V1[/TD]
[TD]V2[/TD]
[TD]V3[/TD]
[TD]V2[/TD]
[TD]V3[/TD]
[TD]V1[/TD]
[TD]V2[/TD]
[TD]V3[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]6[/TD]
[TD]8[/TD]
[TD]9[/TD]
[TD]7[/TD]
[TD]7[/TD]
[TD]8[/TD]
[TD]48[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]5[/TD]
[TD]4[/TD]
[TD]6[/TD]
[TD]5[/TD]
[TD]6[/TD]
[TD]5[/TD]
[TD]1[/TD]
[TD]6[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]6[/TD]
[TD]8[/TD]
[TD]4[/TD]
[TD]12[/TD]
[TD]3[/TD]
[TD]6[/TD]
[TD]5[/TD]
[TD]12[/TD]
[TD]..[/TD]
[/TR]
[TR]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[/TR]
</tbody>[/TABLE]
And the output I'm looking for in a new worksheet:
If I want to see every V3 Variable..
[TABLE="width: 500"]
<tbody>[TR]
[TD]V3[/TD]
[TD]V3[/TD]
[TD]V3[/TD]
[/TR]
[TR]
[TD]8[/TD]
[TD]7[/TD]
[TD]48[/TD]
[/TR]
[TR]
[TD]6[/TD]
[TD]6[/TD]
[TD]6[/TD]
[/TR]
[TR]
[TD]4[/TD]
[TD]3[/TD]
[TD]12[/TD]
[/TR]
[TR]
[TD]..[/TD]
[TD]..[/TD]
[TD]..[/TD]
[/TR]
</tbody>[/TABLE]
I have written out an Excel Formula that sort of does this, but my computer does not have the memory to execute (The dataset has 7,500,00 cells worth of data) so maybe a macro or a more optimised formula would work.
Just thinking off the top of my head, if there is a way to just split it up once so every variable is automatically split and outputted like this in one go rather than redoing it over and over again that would be brilliant
Cheers