Hi, hoping someone can help with a string testing problem.
I want to test for similarity. I have two columns of data, both containing filenames. For each filename in the first column, there is an *almost* identical filename somewhere in the second column. The function needs to look up the second column and return the *most similar* filename.
I figure the answer lies in comparing the characters? If that is too computationally intensive to do across each 30-odd character filename, then perhaps a function that just tests for a match on at least 5 of the first 8 characters from the left, or something like that? I can't figure out how to structure this "at least x of y characters" test. If anyone can help I'd sure appreciate it!
thanks in advance,
patsnip
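
For reference, here is a rough sketch of both ideas from the question, assuming Python is an option (the post does not say which tool holds the two columns). The function names `prefix_matches` and `most_similar` and the sample filenames are made up for illustration. The full-string comparison uses `difflib.SequenceMatcher` from the standard library, which is one similarity measure among several and may not be the best fit for every naming scheme.

```python
from difflib import SequenceMatcher


def prefix_matches(a, b, window=8, needed=5):
    """True if at least `needed` of the first `window` characters
    agree, compared position by position from the left."""
    hits = sum(1 for x, y in zip(a[:window], b[:window]) if x == y)
    return hits >= needed


def most_similar(name, candidates):
    """Return the candidate with the highest similarity ratio to `name`,
    or None if there are no candidates."""
    if not candidates:
        return None
    return max(candidates, key=lambda c: SequenceMatcher(None, name, c).ratio())


# Invented sample data standing in for the two columns.
first_column = ["report_2019_final.txt", "holiday_photo_01.jpg"]
second_column = ["holiday_photo_1.jpg", "budget.xls", "report_2019_finl.txt"]

for name in first_column:
    print(name, "->", most_similar(name, second_column))
```

For filenames of around 30 characters, comparing whole strings like this is usually cheap enough that the "5 of the first 8" shortcut is not needed for speed. The prefix test can still serve as a quick filter, though a position-by-position count breaks down as soon as one name has an inserted or missing character near the start.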