Flattening a hierarchy from 5 columns into a unique one

tomoonot · May 22, 2023

I'm trying to flatten a hierarchy of 5 columns into one singular column with only unique values from an original file that is copypasted into the InputTable The output column is not too important, so I am trying to create this hierarchy by ommitting it.

It is kinda working, but it's not optimal, since I cannot seem to be able to figure out, how to lookup a value and retrieve its header.
So I have to circle back a using a hardcoded sequence of 5 repeating values and then applying unique to those and then filtering them.
Also not sure if it's possible to have a table of dynamic size, since I am copy pasting into it, I would like to have it adjust automatically how many rows are not empty under the header, but no idea how to do so.

And since there are 4 Columns of Rows, in the 2nd table, where I try to flatten it, I had to pre-allocate 4-5x rows for the output of the flattenning.
In case where I had a 1000 rows, I had to pre-allocate 5000, which seemed to be quite slow.
I'm pretty sure it could be done more elegantly, dynamically and in a way that is not so resource intensive,

I'm imagining that if I could just get the header row from a formula without using the hard coded workaround and 2 helper tables, it should be possible to do the whole flattenning, removing duplicates and filtering out the output in one go, but I am struggling to find out how.

Also watched some vdeos on dynamic arrays using NamedRange# with the has/pound sign in the end, but haven't really gotten to trying it out as I am mostly getting by by using index and 1 result at a time by ROW()-1, but perhaps arrays could be better? Tried something similar a few times before but inside a named table they'd spill, not sure if I had used the correct notation at the time though.

Input table: Source data

Dymmy_Sheet - Copy.xlsx

A

B

C

D

E

F

1

Stage

Level1

Level2

Level3

Level4

Output

2

Stage 1

A100: Level1 A

B200: Level2 AB

C300: Level3 ABC

XYZ10: Name 1

Output 1

3

Stage 1

A100: Level1 A

B200: Level2 AB

C300: Level3 ABC

XYZ10: Name 1

Output 2

4

Stage 1

A100: Level1 A

B200: Level2 AB

C300: Level3 ABC

XYZ15: Name 2

Output 1

5

Stage 1

A100: Level1 A

B200: Level2 AB

C300: Level3 ABC

XYZ15: Name 2

Output 2

6

Stage 1

A100: Level1 A

B200: Level2 AB

C300: Level3 ABC

XYZ15: Name 2

Output 3

7

Stage 1

A100: Level1 A

B200: Level2 AB

C350: Level3 ABC

XYZ20: Name 1

Output 4

8

Stage 1

A100: Level1 A

B200: Level2 AB

C350: Level3 ABC

XYZ20: Name 1

Output 5

9

Stage 1

A100: Level1 A

B200: Level2 AB

C350: Level3 ABC

XYZ25: Name 2

Output 6

10

Stage 1

A100: Level1 A

B250: Level2 AB

C400: Level3 ABC

XYZ30: Name 3

Output 7

11

Stage 2

A150: Level1 A

B300: Level2 AB

C450: Level4 ABC

SDT20: Name 1

Output 11

12

Stage 2

A150: Level1 A

B300: Level2 AB

C450: Level4 ABC

SDT20: Name 1

Output 12

13

Stage 2

A150: Level1 A

B300: Level2 AB

C500: Level4 ABC

SDT25: Name 1

Output 20

14

Stage 3

A200: Level1 A

B350: Level2 AB

C550: Level4 ABC

SDT30: Name 1

Output 25

Transform

Cells with Conditional Formatting
Cell	Condition		Cell Format	Stop If True
A2:F14	Cell Value	duplicates	text	NO
E:E	Cell Value	duplicates	text	NO
F:F	Cell Value	duplicates	text	NO

AllHierarchyTable: Flattened table

Dymmy_Sheet - Copy.xlsx

H

I

J

K

1

allHierarchy

Type

Test1

Test2

2

Cell Formulas
Range	Formula
H2:H51	H2	=IFERROR(INDEX(InputTable[[Level1]:[Output]], INT((ROW()-2)/COLUMNS(InputTable[[Level1]:[Output]]))+1, MOD(ROW()-2, COLUMNS(InputTable[[Level1]:[Output]]))+1), "")
I2:I51	I2	=IF(allHierarchy<>"", CHOOSE(MOD(ROW()-2,5)+1, "Level1", "Level2", "Level3", "Level4", "Output"), "")
J2:J51	J2	=INDEX(InputTable[#Headers], MATCH([@allHierarchy], INDEX(InputTable, 0, MATCH([@allHierarchy],InputTable, 0)), 0))
K2:K51	K2	=IF([@allHierarchy]<>"",XLOOKUP([@allHierarchy],InputTable,InputTable[#Headers],,0,1),"")

Named Ranges
Name	Refers To	Cells
allHierarchy	=FlattenTable[allHierarchy]	J2:K2, I2:I51
Level1	=InputTable[Level1]	H2:H51, J2:K51
Level2	=InputTable[Level2]	H2:H51, J2:K51
Level3	=InputTable[Level3]	H2:H51, J2:K51
Level4	=InputTable[Level4]	H2:H51, J2:K51
Output	=InputTable[Output]	H2:H51, J2:K51
Stage	=InputTable[Stage]	J2:K51

Cell Formulas
Range	Formula
M2:M24	M2	=IFERROR(IF(allHierarchy<>"",CHOOSE(MATCH([@Type2], {"Level1","Level2","Level3","Level4","Output"}, 0), "Alpha", "Beta", "Gamma", "Delta", "Epsilon"), ""),"")
N2:N24	N2	=IFERROR(INDEX(UNIQUE(FILTER(FlattenTable[allHierarchy], (FlattenTable[allHierarchy]<>"")*(FlattenTable[allHierarchy]<>0))), ROW()-1), "")
O2:O24	O2	=IF([@Uniques]<>"",XLOOKUP([@Uniques],FlattenTable[allHierarchy],FlattenTable[Type],,0), "")

Cell Formulas
Range	Formula
Q2:Q16	Q2	=IF([@[Name/ID]]<>"", INDEX(UniqueTable[Kind], MATCH([@[Name/ID]], UniqueTable[Uniques], 0)), "")
R2:R16	R2	=IFERROR( INDEX(FILTER(UniqueTable[[Kind]:[Uniques]], ( UniqueTable[Kind]="Alpha")+( UniqueTable[Kind]="Beta")+( UniqueTable[Kind]="Gamma")+( UniqueTable[Kind]="Delta") ), ROW()-1,2), "")
S2:S16	S2	=IF([@Kind]="Alpha",XLOOKUP([@[Name/ID]],Level1,Stage,,),"")

Flattening a hierarchy from 5 columns into a unique one

tomoonot

New Member

Kerryx

Well-known Member

Similar threads

Forum statistics

Share this page

We've detected that you are using an adblocker.

Which adblocker are you using?

Disable AdBlock

Disable AdBlock Plus

Disable uBlock Origin

Disable uBlock