I have a CSV file with n columns and roughly 500k rows. Could someone kindly suggest a way, in Excel if possible, to pick n+1 rows such that any two of the picked rows have exactly one value in common, and every value appearing in the n+1 picked rows appears exactly twice across them? I also have trouble opening such a huge CSV file. Separately, can I remove duplicate rows where one row is a permutation of another? For example, with 5 columns, if one row contains a, b, c, d, e and another contains b, c, a, e, d, the second is just a reordering of the first. All fields are strings. How can I identify such duplicate rows and delete them, keeping only one of each?
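
To make the permuted-duplicate part concrete, here is a minimal sketch of what I have in mind, written in Python rather than Excel since the file is large. It reads the CSV one row at a time, so the whole file never has to be opened in a spreadsheet, and treats two rows as duplicates when their sorted values match. The function name `dedupe_permuted_rows` and the file paths are hypothetical, and the sketch assumes the file is UTF-8 and has no header row.

```python
import csv

def dedupe_permuted_rows(src_path, dst_path):
    """Copy src_path to dst_path, keeping only the first row seen for each
    set of values regardless of column order. Returns the number of rows kept.

    Assumptions (not from the original data): UTF-8 encoding, no header row.
    """
    seen = set()   # canonical keys of rows already written
    kept = 0
    with open(src_path, newline="", encoding="utf-8") as src, \
         open(dst_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.writer(dst)
        for row in csv.reader(src):
            # Sorting the fields gives the same key for every permutation
            # of the same values, e.g. a,b,c,d,e and b,c,a,e,d.
            key = tuple(sorted(row))
            if key in seen:
                continue
            seen.add(key)
            writer.writerow(row)  # the first occurrence keeps its original order
            kept += 1
    return kept
```

Calling `dedupe_permuted_rows("input.csv", "deduped.csv")` would write the reduced file, which may then be small enough to open comfortably. Comparison here is exact and case-sensitive, so values that differ only in case or surrounding spaces are treated as different.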