I believe I am using Fuzzy Lookup correctly, but it is not behaving as expected.
I have a list of over 20,000 names of businesses. A large percentage of the records are duplicates that either have misspellings or extra words. I want to use the Fuzzy Lookup add-in to find the duplicates. However, I don't want Fuzzy Lookup to search all 20,000+ businesses in trying to find matches for any given business. That's because each business is also assigned an index number that represents a geographic area, and there aren't more than around 40 businesses that share any geographic area index number. So I want to limit my Fuzzy Lookup query to just the records that share the index number of any given record. The way I have gone about doing this is by creating two match criteria. One uses the default matching criterion on the "name" field. The other uses the "exact match" criterion on the index number field. However, the output list is still returning matches between records that do not share the same index number.
Here is a screenshot, showing the problem. Columns W ("column 2") and Z ("column 9") make up the Left Table, and columns AB ("column 2") and AC ("column 9") make up the right table. Note how I set an exact match on the two column 9s and a default match on the two column 2s. I expected to only get matches between records sharing the same index number in the column 9s, but as you can see, it has returned many records that do not share the same number (the number "1") found in column 9 of the Left Table. It is showing non-zero values for these records despite my "exact match" criterion.
I have tried searching google about this problem and have come across a couple other posts that are very similar, but there has never been a solution posted.
Some details: Windows 10 desktop, Microsoft Office 365.
I have a list of over 20,000 names of businesses. A large percentage of the records are duplicates that either have misspellings or extra words. I want to use the Fuzzy Lookup add-in to find the duplicates. However, I don't want Fuzzy Lookup to search all 20,000+ businesses in trying to find matches for any given business. That's because each business is also assigned an index number that represents a geographic area, and there aren't more than around 40 businesses that share any geographic area index number. So I want to limit my Fuzzy Lookup query to just the records that share the index number of any given record. The way I have gone about doing this is by creating two match criteria. One uses the default matching criterion on the "name" field. The other uses the "exact match" criterion on the index number field. However, the output list is still returning matches between records that do not share the same index number.
Here is a screenshot, showing the problem. Columns W ("column 2") and Z ("column 9") make up the Left Table, and columns AB ("column 2") and AC ("column 9") make up the right table. Note how I set an exact match on the two column 9s and a default match on the two column 2s. I expected to only get matches between records sharing the same index number in the column 9s, but as you can see, it has returned many records that do not share the same number (the number "1") found in column 9 of the Left Table. It is showing non-zero values for these records despite my "exact match" criterion.
I have tried searching google about this problem and have come across a couple other posts that are very similar, but there has never been a solution posted.
Some details: Windows 10 desktop, Microsoft Office 365.