Power Query: Extracting Left 2 Characters From a Column
February 25, 2020 - by Bill Jelen
Note
This is one of a series of articles detailing solutions sent in for the Podcast 2316 challenge.
In my original video about reshaping data, I arrived at a point where I needed to get the first 2 characters from a column. My method involved Split Column to generate the first 2 characters and everything else. I then deleted everything else.
Mike Girvin from the ExcelisFun channel, as well as Brandon Brimberry, Wyn Hopkins from Access Analytic, Geert Demulle noted that a faster way would be to use the Transform, Extract, First Characters.
data:image/s3,"s3://crabby-images/6d32d/6d32db06807a0f2889500ec4c529d395290183d1" alt="Transform, Extract, First Characters"
This keeps just the quarter information with no need to delete the rest of the column.
data:image/s3,"s3://crabby-images/1d728/1d728ec5c6a417708df5a832402e3bbc933c8d27" alt="Result"
Bohdan Duda used Extracted Text Before Delimiter to get everything before the _ character.
data:image/s3,"s3://crabby-images/62be1/62be17c541099c699d73a8266ebdce9be5f9c29c" alt="Text Before Delimiter"
Mike Girvin noted that his solution was “on the fly, quick before I am out the door.” Here is his code:
let
Source = Excel.CurrentWorkbook(){[Name="BadStartData"]}[Content],
#"Unpivoted Other Columns" = Table.UnpivotOtherColumns(Source, {"Category Description"}, "Quarter", "Amount"),
#"Added Custom" = Table.AddColumn(#"Unpivoted Other Columns", "Employee", each if not Text.StartsWith([Quarter],"Q") then [Quarter] else null),
#"Filled Down" = Table.FillDown(#"Added Custom",{"Employee"}),
#"Extracted First Characters" = Table.TransformColumns(#"Filled Down", {{"Quarter", each Text.Start(_, 2), type text}}),
#"Filtered Rows" = Table.SelectRows(#"Extracted First Characters", each Text.StartsWith([Quarter], "Q")),
#"Pivoted Column" = Table.Pivot(#"Filtered Rows", List.Distinct(#"Filtered Rows"[Quarter]), "Quarter", "Amount", List.Sum),
#"Filtered Rows1" = Table.SelectRows(#"Pivoted Column", each ([Employee] <> "Dept. Total")),
#"Inserted Sum" = Table.AddColumn(#"Filtered Rows1", "Total", each List.Sum({[Q1], [Q2], [Q3], [Q4]}), type number),
#"Sorted Rows" = Table.Sort(#"Inserted Sum",{{"Employee", Order.Ascending},{"Category Description", Order.Ascending}}),
#"Changed Type" = Table.TransformColumnTypes(#"Sorted Rows",{{"Q1", type number}, {"Q2", type number}, {"Q3", type number}, {"Q4", type number}, {"Total", type number}, {"Employee", type text}, {"Category Description", type text}})
in
#"Changed Type"
Here is Geert DeMulle's solution:
let
Source = Excel.CurrentWorkbook(){[Name="tblUglyData"]}[Content],
#"Changed Type" = Table.TransformColumnTypes(Source,{{"Category Description", type text}, {"Dept. Total", type number}, {"Q1", type number}, {"Q2", type number}, {"Q3", type number}, {"Q4", Int64.Type}, {"Employee 1", type number}, {"Q12", type number}, {"Q23", type number}, {"Q34", Int64.Type}, {"Q45", Int64.Type}, {"Employee 2", Int64.Type}, {"Q16", Int64.Type}, {"Q27", Int64.Type}, {"Q38", Int64.Type}, {"Q49", Int64.Type}, {"Employee 3", Int64.Type}, {"Q110", Int64.Type}, {"Q211", Int64.Type}, {"Q312", Int64.Type}, {"Q413", Int64.Type}, {"Employee 4", type number}, {"Q114", type number}, {"Q215", type number}, {"Q316", type number}, {"Q417", Int64.Type}}),
#"Removed Columns" = Table.RemoveColumns(#"Changed Type",{"Dept. Total", "Q1", "Q2", "Q3", "Q4"}),
#"Unpivoted Other Columns" = Table.UnpivotOtherColumns(#"Removed Columns", {"Category Description"}, "Attribute", "Value"),
#"Added Conditional Column" = Table.AddColumn(#"Unpivoted Other Columns", "Employee", each if Text.StartsWith([Attribute], "Employee") then [Attribute] else null),
#"Filled Down" = Table.FillDown(#"Added Conditional Column",{"Employee"}),
#"Filtered Rows" = Table.SelectRows(#"Filled Down", each Text.StartsWith([Attribute], "Q")),
#"Extracted First Characters" = Table.TransformColumns(#"Filtered Rows", {{"Attribute", each Text.Start(_, 2), type text}}),
#"Renamed Columns" = Table.RenameColumns(#"Extracted First Characters",{{"Attribute", "Quarter"}}),
#"Pivoted Column" = Table.Pivot(#"Renamed Columns", List.Distinct(#"Renamed Columns"[Quarter]), "Quarter", "Value"),
#"Changed Type1" = Table.TransformColumnTypes(#"Pivoted Column",{{"Employee", type text}}),
#"Inserted Sum" = Table.AddColumn(#"Changed Type1", "Total", each List.Sum({[Q1], [Q2], [Q3], [Q4]}), type number),
#"Sorted Rows" = Table.Sort(#"Inserted Sum",{{"Employee", Order.Ascending}, {"Category Description", Order.Ascending}})
in
#"Sorted Rows"
Return to the main page for the Podcast 2316 challenge.
Read the next article in this series: Power Query: Adding a Total Column.