I have a data base that has over 200k rows.
They are individuals that have been termed from the company.
The issue is that from multiple reports I merged together, I have the same Employee showing up multiple times based on the fact the reports that the data came from shows him up more than one.
So, I am trying to remove the duplicate Employee and only keeping the most recent term date for the employee. So in the example below I want to keep only the second from last which is file 9 which is the most recent file. I have employees that show up only twice and some showing up 6. It varies based on the number of reports I merged into the file. Is there a formula to look at the name then id which file for me to keep or delete. Since the DB is so huge I can't think of an easier method because filters don't work and slow my computer to a crawl. I appreciate any advice. Thanks
EE_NAME EMPLOYEE_NUMBER ACTUAL_TERMINATION_DATE FILE EE COUNT FILE REF
Abbott, Costello 123456 15-Jan-16 TERMS_2015 9 1
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_01APR2016 9 3
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_06JUN2016 9 7
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_12MAY2016 9 6
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_14APR2016 9 4
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_14JUN2016 9 8
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_19APR2016 9 5
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_22JUN2016 9 9
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_22MAR2016 9 2
They are individuals that have been termed from the company.
The issue is that from multiple reports I merged together, I have the same Employee showing up multiple times based on the fact the reports that the data came from shows him up more than one.
So, I am trying to remove the duplicate Employee and only keeping the most recent term date for the employee. So in the example below I want to keep only the second from last which is file 9 which is the most recent file. I have employees that show up only twice and some showing up 6. It varies based on the number of reports I merged into the file. Is there a formula to look at the name then id which file for me to keep or delete. Since the DB is so huge I can't think of an easier method because filters don't work and slow my computer to a crawl. I appreciate any advice. Thanks
EE_NAME EMPLOYEE_NUMBER ACTUAL_TERMINATION_DATE FILE EE COUNT FILE REF
Abbott, Costello 123456 15-Jan-16 TERMS_2015 9 1
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_01APR2016 9 3
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_06JUN2016 9 7
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_12MAY2016 9 6
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_14APR2016 9 4
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_14JUN2016 9 8
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_19APR2016 9 5
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_22JUN2016 9 9
Abbott, Costello 123456 15-JAN-16 TERMS_YTD_22MAR2016 9 2