TabulaRasa
New Member
- Joined
- Jul 14, 2015
- Messages
- 10
Hello Everyone! I'm having a bit of trouble with formatting a fairly large data set. Currently, there are ~10,000 rows of data. Each row has the following information format in it:
[TABLE="class: grid, width: 1000"]
<tbody>[TR]
[TD]A[/TD]
[TD]B[/TD]
[TD]C[/TD]
[TD]D[/TD]
[TD]E[/TD]
[TD]F[/TD]
[TD]G[/TD]
[TD]H[/TD]
[TD]I[/TD]
[TD]J[/TD]
[TD]K[/TD]
[/TR]
[TR]
[TD]TEXT[/TD]
[TD]DATE[/TD]
[TD]DATE[/TD]
[TD]TEXT[/TD]
[TD]GENERAL (INCLUDES NUMBERS)[/TD]
[TD]TEXT[/TD]
[TD]CURRENCY[/TD]
[TD]TEXT (INCLUDES NUMBERS)[/TD]
[TD]No [2], Yes [3], etc.[/TD]
[TD]Yes [3], No [2], etc.[/TD]
[TD]TEXT [123.1], TEXT [123.5], etc.[/TD]
[/TR]
</tbody>[/TABLE]
The goal is to take columns I-K and make them multiple rows, with the data staying relative between the columns (for example, the above would be the following:[TABLE="class: grid, width: 1000"]
<tbody>[TR]
[TD]A[/TD]
[TD]B[/TD]
[TD]C[/TD]
[TD]D[/TD]
[TD]E[/TD]
[TD]F[/TD]
[TD]G[/TD]
[TD]H[/TD]
[TD]I[/TD]
[TD]J[/TD]
[TD]K[/TD]
[/TR]
[TR]
[TD]TEXT[/TD]
[TD]DATE[/TD]
[TD]DATE[/TD]
[TD]TEXT[/TD]
[TD]GENERAL (INCLUDES NUMBERS)[/TD]
[TD]TEXT[/TD]
[TD]CURRENCY[/TD]
[TD]TEXT (INCLUDES NUMBERS)[/TD]
[TD]No [2][/TD]
[TD]Yes [3][/TD]
[TD]TEXT [123.1][/TD]
[/TR]
[TR]
[TD]TEXT[/TD]
[TD]DATE[/TD]
[TD]DATE[/TD]
[TD]TEXT[/TD]
[TD]GENERAL (INCLUDES NUMBERS)[/TD]
[TD]TEXT[/TD]
[TD]CURRENCY[/TD]
[TD]TEXT (INCLUDES NUMBERS)[/TD]
[TD]Yes [3][/TD]
[TD]No [2][/TD]
[TD]TEXT [123.5][/TD]
[/TR]
</tbody>[/TABLE]
There are two additional complications. The first is that columns A-H need to be duplicated along with the now separated information from columns I-K (as it is identifying information). The second complication is that the TEXT information in column K contains commas, (for example: obesity, adult) so any comma delimited approach that I could think of will not work.
Hopefully this explains the issue and the goal well enough! I really appreciate everyone's assistance.
[TABLE="class: grid, width: 1000"]
<tbody>[TR]
[TD]A[/TD]
[TD]B[/TD]
[TD]C[/TD]
[TD]D[/TD]
[TD]E[/TD]
[TD]F[/TD]
[TD]G[/TD]
[TD]H[/TD]
[TD]I[/TD]
[TD]J[/TD]
[TD]K[/TD]
[/TR]
[TR]
[TD]TEXT[/TD]
[TD]DATE[/TD]
[TD]DATE[/TD]
[TD]TEXT[/TD]
[TD]GENERAL (INCLUDES NUMBERS)[/TD]
[TD]TEXT[/TD]
[TD]CURRENCY[/TD]
[TD]TEXT (INCLUDES NUMBERS)[/TD]
[TD]No [2], Yes [3], etc.[/TD]
[TD]Yes [3], No [2], etc.[/TD]
[TD]TEXT [123.1], TEXT [123.5], etc.[/TD]
[/TR]
</tbody>[/TABLE]
The goal is to take columns I-K and make them multiple rows, with the data staying relative between the columns (for example, the above would be the following:[TABLE="class: grid, width: 1000"]
<tbody>[TR]
[TD]A[/TD]
[TD]B[/TD]
[TD]C[/TD]
[TD]D[/TD]
[TD]E[/TD]
[TD]F[/TD]
[TD]G[/TD]
[TD]H[/TD]
[TD]I[/TD]
[TD]J[/TD]
[TD]K[/TD]
[/TR]
[TR]
[TD]TEXT[/TD]
[TD]DATE[/TD]
[TD]DATE[/TD]
[TD]TEXT[/TD]
[TD]GENERAL (INCLUDES NUMBERS)[/TD]
[TD]TEXT[/TD]
[TD]CURRENCY[/TD]
[TD]TEXT (INCLUDES NUMBERS)[/TD]
[TD]No [2][/TD]
[TD]Yes [3][/TD]
[TD]TEXT [123.1][/TD]
[/TR]
[TR]
[TD]TEXT[/TD]
[TD]DATE[/TD]
[TD]DATE[/TD]
[TD]TEXT[/TD]
[TD]GENERAL (INCLUDES NUMBERS)[/TD]
[TD]TEXT[/TD]
[TD]CURRENCY[/TD]
[TD]TEXT (INCLUDES NUMBERS)[/TD]
[TD]Yes [3][/TD]
[TD]No [2][/TD]
[TD]TEXT [123.5][/TD]
[/TR]
</tbody>[/TABLE]
There are two additional complications. The first is that columns A-H need to be duplicated along with the now separated information from columns I-K (as it is identifying information). The second complication is that the TEXT information in column K contains commas, (for example: obesity, adult) so any comma delimited approach that I could think of will not work.
Hopefully this explains the issue and the goal well enough! I really appreciate everyone's assistance.