Hello,
I am trying to clean up a spreadsheet so the data is more uniform, based on Brand (column A) and PolishName (column B). Using the AbleBits Fuzzy Duplicate Finder ( https://www.ablebits.com/excel-find-similar/find-fuzzy-excel-duplicates.php ), I was able to find most of the variations in the Brand column and make them uniform. Now I am trying to fuzzy-find the duplicates in the PolishName column, but the matching has to be limited to PolishNames associated with the same Brand. Once the correct PolishName is determined, the matching entries should be updated to it and the duplicates deleted. My example sheet is here:
http://www.nailmob.com/TEST-Sheet-V3-unify.xlsx
Issues I've run into:
Some brands have PolishNames that are similar to other brands' PolishNames, so I need to make sure duplicates are only matched within the same brand. I could run the fuzzy duplicate finder on the range of PolishNames for each brand if I separated them into one sheet per brand, but that would literally be thousands of sheets. My main data set has over 120,000 rows and 5000+ different brands. I need to automate this somehow, but I'm not really familiar with writing macros/VBA. Looking for some ideas from the experts.
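To show what I mean by keeping the fuzzy matching inside each brand, here is a rough sketch in Python rather than VBA. It is only an illustration under my own assumptions: the function names, the 0.85 similarity threshold, and the rule "the most frequent spelling within a brand is the correct one" are all made up for this example, and the similarity measure is Python's difflib, not whatever AbleBits uses internally.

```python
from collections import Counter, defaultdict
from difflib import SequenceMatcher


def similar(a, b, threshold):
    """True when two names are close enough to be treated as the same polish."""
    ratio = SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()
    return ratio >= threshold


def unify_polish_names(rows, threshold=0.85):
    """rows: iterable of (brand, polish_name) pairs, e.g. columns A and B.

    Returns one (brand, polish_name) pair per distinct polish, with fuzzy
    duplicates collapsed only within the same brand.
    """
    # Count every spelling separately for each brand.
    by_brand = defaultdict(Counter)
    for brand, name in rows:
        by_brand[brand][name] += 1

    result = []
    for brand, counts in by_brand.items():
        canonicals = []
        # Assumption: the most frequent spelling is the correct one, so
        # visit spellings from most to least common.
        for name, _ in counts.most_common():
            if not any(similar(name, kept, threshold) for kept in canonicals):
                canonicals.append(name)
        result.extend((brand, kept) for kept in canonicals)
    return result


# Made-up sample rows, not taken from the real sheet.
sample = [
    ("BrandA", "Cherry Red"),
    ("BrandA", "Cherry Red"),
    ("BrandA", "Chery Red"),
    ("BrandB", "Cherry Red"),
    ("BrandB", "Ocean Blue"),
]
for brand, name in unify_polish_names(sample):
    print(brand, name)
```

In this sample, "Chery Red" is folded into "Cherry Red" for BrandA only, while BrandB keeps its own "Cherry Red" untouched. The comparison is pairwise within a brand, which should stay manageable since 120,000 rows over 5000+ brands averages roughly 24 rows per brand, though a few very large brands could still be slow.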
Thanks!