Please help me out with this problem! It seems simple, yet I can't figure out how to solve it.
I have 3 lists of company names that I acquired from three different sources. The companies are the same in each list (though each list may contain a few additional names), but because the data come from 3 different sources, the names are worded slightly differently. For example, 'Aberdeen UK smaller Cos' might appear in another list as 'Aberdeen UK Smllr Companies', etc. I could match all the companies across the 3 lists manually, but the lists are too long for me to do that in the time I have. Is it possible to do some sort of 'near matching' in Excel (preferably), or in some other software?
I can provide a sample Excel sheet with the 3 lists if needed. I need to match these lists because I want to merge the data from each source for each company in Stata, so that I can run regressions.
Any kind of help will be highly appreciated!