Closest Duplicate remove

ClementA

New Member
Joined
Dec 8, 2018
Messages
2
Hi,

I have a column with huge list of company names. I have removed the exact duplicate companies presented in the column.

However, there are plenty of repeated companies name in the column with partial and real spelling of the company name.

Eg:
ABC inc
ABC
XYZ Data Inc
XYZ Inc
XYZ
Northeast corp.
North corp.
Northeast

(hope you understood the duplicate types in the column)

Only first a few letters of company name is similar between the cells randomly.

I am finding very difficult to delete the duplicate row manually by sorting option. I humbly request you to help me identifying the cells contains of first "any range" of same letters presented.
OR help me highlight the partially matched cell in the entire column.

Please help me with the solutions.
Awaiting for help!

Thank you so much.
 
Last edited by a moderator:

Excel Facts

Workdays for a market open Mon, Wed, Friday?
Yes! Use "0101011" for the weekend argument in NETWORKDAYS.INTL or WORKDAY.INTL. The 7 digits start on Monday. 1 means it is a weekend.
IF YOU HAVE Northeast corp AND Northwest corp where these are 2 different companies no partial match system will work. I once worked with a spreadsheet that had King's Cross, Kings Cross, K Cross, KX, Kings X and many more - I sorted alphabetically and and replaced all of the above variants with Kings Cross as each row was details of an accident at that location..............
 
Upvote 0
Yes, I am also doing the same way, I'm sorting the list alphabetically and deleting the duplicate row. I got nearly 15000 names to be checked. If I have any option like highlighting or true/false if first four letters in the cells are similar. Then I will do filter and will be able to remove them easily.
 
Upvote 0
You could do something like


Excel 2013/2016
AB
2YO95 1BTFALSE
3YO91 1XYTRUE
4YO91 1RTFALSE
5YO90 1YDFALSE
6YO8 9ZZTRUE
7YO6 9ZTFALSE
8YO8 9ZSTRUE
9YO8 9XATRUE
10YO8 9TYTRUE
11YO8 8YYTRUE
12YO8 7ZZTRUE
13YO8 7ZYTRUE
14YO10 6XPFALSE
15YO8 7PYTRUE
16YO8 7ENTRUE
Sheet1
Cell Formulas
RangeFormula
B2=COUNTIF(A2:A30,LEFT(A2,4)&"*")>1


and then filter on TRUE
 
Upvote 0

Forum statistics

Threads
1,223,893
Messages
6,175,240
Members
452,621
Latest member
Laura_PinksBTHFT

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top