Hi guys, I am trying to create a way to check for fraud in our system. We have sales reps that go around different parts of the country and they turn in info about their sales. The problem is some reps have found a way to submit multiple entries that count as sales, when in actuality, these people do not exist. For instance in the customer name field, the reps can put in a "-" or a "Jr." "Sr." or a II/III and the system will accept it. Im looking for a way to check just that one column to see if names are similar or close enough that they look like fraud. I can group the UserIDs to get only the customer names they did, but can not figure out a way to check them. I have tried the Levenshtein Distance only to realize that i must plug in a string text which would make this utterly time consuming. We have almost 15,000 new entries daily. I am still fairly new to Excel and am unsure on how to go about this. Then I tried my hand at a fuzzy lookup and that also was not what i needed.
The second method to detect fraud is the time stamp for each agent. Once again i can group the agents just to see their Create Date Time. I need a way to find if the difference between two of the times is less than 4 minutes. This would also be a alert to something fishy occurring. Does Excel 2013 not have Datediff? I also tried doing =TEXT(E3-E2),"mm") but this returned number values that were under .0008^n. or along those lines of small numbers that couldn't mean minutes. The TransactionID are unique numbers.
This is a sample of how the info looks:Just imagine another 14993 lines of information. If the question is why do we not use some other method of storing and examining the information, the development team is working on something but we need something right now.
[TABLE="width: 774"]
<tbody>[TR]
[TD]User ID[/TD]
[TD]Transaction ID[/TD]
[TD]Application ID[/TD]
[TD]Status ID[/TD]
[TD]Create Date Time[/TD]
[TD]Customer Name[/TD]
[/TR]
[TR]
[TD]PCRE5181[/TD]
[TD]104620811[/TD]
[TD]198042515[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:53:29[/TD]
[TD]jessica adams[/TD]
[/TR]
[TR]
[TD]PCRE5181[/TD]
[TD]104620808[/TD]
[TD]198042516[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:51:33[/TD]
[TD]walter white[/TD]
[/TR]
[TR]
[TD]PCRE70D3[/TD]
[TD]104620798[/TD]
[TD]198042500[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:45:04[/TD]
[TD]robert jordan[/TD]
[/TR]
[TR]
[TD]PCRE5619[/TD]
[TD]104620795[/TD]
[TD]198042505[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:43:29[/TD]
[TD]guangming liu[/TD]
[/TR]
[TR]
[TD]PCRE5181[/TD]
[TD]104620792[/TD]
[TD]198042497[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:41:32[/TD]
[TD]richard nixon[/TD]
[/TR]
[TR]
[TD]PCRE70D3[/TD]
[TD]104620789[/TD]
[TD]198042492[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:40:21[/TD]
[TD]tommy pickles[/TD]
[/TR]
[TR]
[TD]PCRE70C4[/TD]
[TD]104620785[/TD]
[TD]198042489[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:38:32[/TD]
[TD]angelo mayo[/TD]
[/TR]
</tbody>[/TABLE]
[TABLE="width: 774"]
<tbody>[TR]
[TD="class: xl65, width: 84"]PCRE5181[/TD]
[TD="class: xl65, width: 131"]104620776[/TD]
[TD="class: xl65, width: 129"]198042472[/TD]
[TD="class: xl66, width: 95"]Completed[/TD]
[TD="class: xl67, width: 151"]2015/05/22 23:33:32[/TD]
[TD="class: xl65, width: 184"]jessica-adams II[/TD]
[/TR]
</tbody>[/TABLE]
Thank for any and all help, and sort of advice/direction is greatly appreciated and I will love you forever!!!
The second method to detect fraud is the time stamp for each agent. Once again i can group the agents just to see their Create Date Time. I need a way to find if the difference between two of the times is less than 4 minutes. This would also be a alert to something fishy occurring. Does Excel 2013 not have Datediff? I also tried doing =TEXT(E3-E2),"mm") but this returned number values that were under .0008^n. or along those lines of small numbers that couldn't mean minutes. The TransactionID are unique numbers.
This is a sample of how the info looks:Just imagine another 14993 lines of information. If the question is why do we not use some other method of storing and examining the information, the development team is working on something but we need something right now.
[TABLE="width: 774"]
<tbody>[TR]
[TD]User ID[/TD]
[TD]Transaction ID[/TD]
[TD]Application ID[/TD]
[TD]Status ID[/TD]
[TD]Create Date Time[/TD]
[TD]Customer Name[/TD]
[/TR]
[TR]
[TD]PCRE5181[/TD]
[TD]104620811[/TD]
[TD]198042515[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:53:29[/TD]
[TD]jessica adams[/TD]
[/TR]
[TR]
[TD]PCRE5181[/TD]
[TD]104620808[/TD]
[TD]198042516[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:51:33[/TD]
[TD]walter white[/TD]
[/TR]
[TR]
[TD]PCRE70D3[/TD]
[TD]104620798[/TD]
[TD]198042500[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:45:04[/TD]
[TD]robert jordan[/TD]
[/TR]
[TR]
[TD]PCRE5619[/TD]
[TD]104620795[/TD]
[TD]198042505[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:43:29[/TD]
[TD]guangming liu[/TD]
[/TR]
[TR]
[TD]PCRE5181[/TD]
[TD]104620792[/TD]
[TD]198042497[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:41:32[/TD]
[TD]richard nixon[/TD]
[/TR]
[TR]
[TD]PCRE70D3[/TD]
[TD]104620789[/TD]
[TD]198042492[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:40:21[/TD]
[TD]tommy pickles[/TD]
[/TR]
[TR]
[TD]PCRE70C4[/TD]
[TD]104620785[/TD]
[TD]198042489[/TD]
[TD]Completed[/TD]
[TD]2015/05/22 23:38:32[/TD]
[TD]angelo mayo[/TD]
[/TR]
</tbody>[/TABLE]
[TABLE="width: 774"]
<tbody>[TR]
[TD="class: xl65, width: 84"]PCRE5181[/TD]
[TD="class: xl65, width: 131"]104620776[/TD]
[TD="class: xl65, width: 129"]198042472[/TD]
[TD="class: xl66, width: 95"]Completed[/TD]
[TD="class: xl67, width: 151"]2015/05/22 23:33:32[/TD]
[TD="class: xl65, width: 184"]jessica-adams II[/TD]
[/TR]
</tbody>[/TABLE]
Thank for any and all help, and sort of advice/direction is greatly appreciated and I will love you forever!!!