mzza
Board Regular
- Joined
- Nov 8, 2006
- Messages
- 55
Hi,
I'm not convinced there's a way of doing this without using VBA which I would really like to avoid, but here goes...
I have been tasked with matching 2 spreadsheets from entirely different sources. Each is a list of distinct vehicles with a unique identifier (which is proprietary to each source, so the IDs are different although the vehicles are the same).
To try and match vehicles my first line of thought was to concatenate data to create a string of (for example) [Manufacturer]+[Model]+[Year]+[Engine CC]+etc, then do a Vlookup based on this concatenated string to find the matching vehicle in the other spreadsheet (and thereby build a list of 'Unique ID A' matched to 'Unique ID B', the ultimate aim of the exercise).
The problem is, owing to slight variances in how such things are measured, the (e.g.) Engine CC may be slightly different in each spreadsheet for the exact same vehicle. So the "vlookup" (or equivalent jury-rigged operation) needs to allow for a tolerance of +/-5 on the Engine CC to still count as a potential "match". As an example, I would need the (Ford+Mustang+1994+1799CC) to match to the (Ford+Mustang+1994+1803CC); however, that's just accounting for 4 variables, one of which has a tolerance of +/-5; there are many more variables I need to match on, and 7 of those variables have tolerances rather than exact values.
I've been wracking my brains but cannot fathom an efficient/elegant way of doing this... or even an inelegant one that at least does the job!
As I say, I would very much prefer to avoid coding if possible, but beggars can't be choosers to if that's the only realistic solution I'm open to all suggestions.
TIA for any help anyone can provide.
I'm not convinced there's a way of doing this without using VBA which I would really like to avoid, but here goes...
I have been tasked with matching 2 spreadsheets from entirely different sources. Each is a list of distinct vehicles with a unique identifier (which is proprietary to each source, so the IDs are different although the vehicles are the same).
To try and match vehicles my first line of thought was to concatenate data to create a string of (for example) [Manufacturer]+[Model]+[Year]+[Engine CC]+etc, then do a Vlookup based on this concatenated string to find the matching vehicle in the other spreadsheet (and thereby build a list of 'Unique ID A' matched to 'Unique ID B', the ultimate aim of the exercise).
The problem is, owing to slight variances in how such things are measured, the (e.g.) Engine CC may be slightly different in each spreadsheet for the exact same vehicle. So the "vlookup" (or equivalent jury-rigged operation) needs to allow for a tolerance of +/-5 on the Engine CC to still count as a potential "match". As an example, I would need the (Ford+Mustang+1994+1799CC) to match to the (Ford+Mustang+1994+1803CC); however, that's just accounting for 4 variables, one of which has a tolerance of +/-5; there are many more variables I need to match on, and 7 of those variables have tolerances rather than exact values.
I've been wracking my brains but cannot fathom an efficient/elegant way of doing this... or even an inelegant one that at least does the job!
As I say, I would very much prefer to avoid coding if possible, but beggars can't be choosers to if that's the only realistic solution I'm open to all suggestions.
TIA for any help anyone can provide.
Last edited: