JackDanIce
Well-known Member
- Joined
- Feb 3, 2010
- Messages
- 9,922
- Office Version
- 365
- Platform
- Windows
Hi,
Couldn't decide if this room or the General Excel and other programmes room was where to post; apologies if in wrong place.
Looking for general comments/experiences for following:
I have a table of data (~10k rows) which needs to be grouped by client name. Data entry has been entered by multiple persons so sheet has client names like:
ABC Limited
ABC Ltd
ABC Emea Ltd
Which needs to be aggregated into a single row.
I'm trying to determine a formula or set of steps to group names as similar as possible and then unfortunately it will be a manual sense check to find matches.
Initial thoughts
- #1 (helper columns)
Create table of words that can be replaced with nullstring (e.g. Limited, Ltd, Inc etc)
Remove spaces
List data alphabetically
A formula that calculates "how much" of the words match e.g. ABCEmea vs ABC (something like LEN and SUBSTITUTE or something)
Problem with this is ABC Ltd may be an entirely separate client to ABC Limited or ABC itself.
- #2
Use something like Alteryx, Vero Analytics or Tableau which supposedly finds links between data sets but unable to determine how suitable their services might be for this task.
Does anyone have experience of matching text in Excel or the above software providers to suggest best approaches for this?
Thank you in advance!
Jack
Couldn't decide if this room or the General Excel and other programmes room was where to post; apologies if in wrong place.
Looking for general comments/experiences for following:
I have a table of data (~10k rows) which needs to be grouped by client name. Data entry has been entered by multiple persons so sheet has client names like:
ABC Limited
ABC Ltd
ABC Emea Ltd
Which needs to be aggregated into a single row.
I'm trying to determine a formula or set of steps to group names as similar as possible and then unfortunately it will be a manual sense check to find matches.
Initial thoughts
- #1 (helper columns)
Create table of words that can be replaced with nullstring (e.g. Limited, Ltd, Inc etc)
Remove spaces
List data alphabetically
A formula that calculates "how much" of the words match e.g. ABCEmea vs ABC (something like LEN and SUBSTITUTE or something)
Problem with this is ABC Ltd may be an entirely separate client to ABC Limited or ABC itself.
- #2
Use something like Alteryx, Vero Analytics or Tableau which supposedly finds links between data sets but unable to determine how suitable their services might be for this task.
Does anyone have experience of matching text in Excel or the above software providers to suggest best approaches for this?
Thank you in advance!
Jack