Hi Everybody,
I am supposed to do client analysis, howevere, I have a big problem with data quality starting with client's names
I am supposed to match our key accounts for the largest companies in our region. The official list contains full official names of the companies, e.g. John Smith Ltd.
What we have in our system is completely different. Examples:
John Smith
J. Smith
Smith
Any ideas how to do data cleansing for this?
I am supposed to do client analysis, howevere, I have a big problem with data quality starting with client's names
I am supposed to match our key accounts for the largest companies in our region. The official list contains full official names of the companies, e.g. John Smith Ltd.
What we have in our system is completely different. Examples:
John Smith
J. Smith
Smith
Any ideas how to do data cleansing for this?