Help to find the right duplicates?

frank265

New Member
Joined
Feb 5, 2013
Messages
33
Office Version
  1. 365
Platform
  1. MacOS
Hi, I was hoping someone smarter than me might be able to help me with this excel conundrum...

I have some software which can look for duplicate images on a hard drive and then export the results into a csv file. As you will see from the spreadsheet below it pairs the duplicates into what it calls groups as shown in column A. So A1 and A2 are a close match in duplication and they are both found in the folder called NAS FILES on drive E. The results I am interested in is where there is a reference to a folder called Book Images. In that case I want to take the corresponding image in the matching group. As an example in row 30 and row 31, there is a match in Group 15 and the images are found in the folders Book Images and Web recipe images low res. I would like rows 30 and 31 (and all other matches like that) to be marked in the spreadsheet somehow so that they could be filtered and all other groups that do not contain the Book Images folder to be ignored. I hope this makes sense...

Recipe Dupes.xlsx
ABC
1Group NumPicture FilenameSimilarity
21E:\NAS FILES\Jessica- Jul2017\0611_ljusare.jpg72
31E:\NAS FILES\Jessica- Jul2017\0611.jpg72
42E:\NAS FILES\Jessica- Jul2017\0633_ljusare.jpg74
52E:\NAS FILES\Jessica- Jul2017\0633.jpg74
63E:\NAS FILES\Jessica- Jul2017\0664_snabbfri-ko.jpg83
73E:\NAS FILES\Jessica- Jul2017\0664.jpg83
84E:\NAS FILES\Jessica- Jul2017\0674-ljusare.jpg69
94E:\NAS FILES\Jessica- Jul2017\0674.jpg69
105E:\NAS FILES\Jessica- Jul2017\0681_ljusare.jpg85
115E:\NAS FILES\Jessica- Jul2017\0681.jpg85
126E:\NAS FILES\Jessica- Jul2017\0701_ljjusare.jpg69
136E:\NAS FILES\Jessica- Jul2017\0701.jpg69
147E:\NAS FILES\Jessica- Jul2017\2540.jpg99
157E:\NAS FILES\Jessica- Jul2017\Glögg drink recept.jpg99
168E:\NAS FILES\Jessica- Jul2017\Glögg drink recept.jpg83
178E:\NAS FILES\Jessica- Jul2017\Glögg drink recept web .jpg83
189E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster_ret.jpg86
199E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster.jpg86
2010E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster.jpg86
2110E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja.jpg86
2211E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja.jpg76
2311E:\NAS FILES\Jessica- Jul2017\jessica.jpg76
2412E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\0107.jpg100
2512E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\minipizzor med tryffelfeta.jpg100
2613E:\Book Images\2018-02-09_2386.jpg85
2713E:\Book Images\klassiska röror.jpg85
2814E:\NAS FILES\Jessica- Jul2017\0611.jpg62
2914E:\Web Recipe Images Low Res\Agghalvor-med-oliver.jpg62
3015E:\Book Images\psari me fasolia 1 1.jpg82
3115E:\Web Recipe Images Low Res\Black-eye-bonor-med-seabream.jpg82
3216E:\Book Images\kalamarakia.jpg79
3316E:\Web Recipe Images Low Res\Blackfisk-med-gronsaker-och-grilloumi.jpg79
3417E:\Book Images\pizza xoriatiki.jpg80
3517E:\Web Recipe Images Low Res\Bondpizza.jpg80
3618E:\Book Images\psomia 1 1.jpg74
3718E:\Web Recipe Images Low Res\Brod-fran-Xiliato.jpg74
3819E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\0092.jpg82
3919E:\Web Recipe Images Low Res\Burgare-med-halloumi.jpg82
4020E:\NAS FILES\Jessica- Jul2017\0701.jpg66
4120E:\Web Recipe Images Low Res\Dadelbollar.jpg66
Sheet 1 - Recipe Dupes
 

Excel Facts

Excel Joke
Why can't spreadsheets drive cars? They crash too often!
will this work for you

=COUNTIFS(A:A,A2,B:B,"*"&"book images"&"*")

it counts/flags the groups that also have "book images" in the picture file name

however Group 13 has 2 rows - both have book images in the file

if this is the results - then you can filter on >0

Book1
ABCDE
1Group NumPicture FilenameSimilarityCount of group & Book Images
21E:\NAS FILES\Jessica- Jul2017\0611_ljusare.jpg720
31E:\NAS FILES\Jessica- Jul2017\0611.jpg720
42E:\NAS FILES\Jessica- Jul2017\0633_ljusare.jpg740
52E:\NAS FILES\Jessica- Jul2017\0633.jpg740
63E:\NAS FILES\Jessica- Jul2017\0664_snabbfri-ko.jpg830
73E:\NAS FILES\Jessica- Jul2017\0664.jpg830
84E:\NAS FILES\Jessica- Jul2017\0674-ljusare.jpg690
94E:\NAS FILES\Jessica- Jul2017\0674.jpg690
105E:\NAS FILES\Jessica- Jul2017\0681_ljusare.jpg850
115E:\NAS FILES\Jessica- Jul2017\0681.jpg850
126E:\NAS FILES\Jessica- Jul2017\0701_ljjusare.jpg690
136E:\NAS FILES\Jessica- Jul2017\0701.jpg690
147E:\NAS FILES\Jessica- Jul2017\2540.jpg990
157E:\NAS FILES\Jessica- Jul2017\Glögg drink recept.jpg990
168E:\NAS FILES\Jessica- Jul2017\Glögg drink recept.jpg830
178E:\NAS FILES\Jessica- Jul2017\Glögg drink recept web .jpg830
189E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster_ret.jpg860
199E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster.jpg860
2010E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster.jpg860
2110E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja.jpg860
2211E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja.jpg760
2311E:\NAS FILES\Jessica- Jul2017\jessica.jpg760
2412E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\0107.jpg1000
2512E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\minipizzor med tryffelfeta.jpg1000
2613E:\Book Images\2018-02-09_2386.jpg852
2713E:\Book Images\klassiska röror.jpg852
2814E:\NAS FILES\Jessica- Jul2017\0611.jpg620
2914E:\Web Recipe Images Low Res\Agghalvor-med-oliver.jpg620
3015E:\Book Images\psari me fasolia 1 1.jpg821
3115E:\Web Recipe Images Low Res\Black-eye-bonor-med-seabream.jpg821
3216E:\Book Images\kalamarakia.jpg791
3316E:\Web Recipe Images Low Res\Blackfisk-med-gronsaker-och-grilloumi.jpg791
3417E:\Book Images\pizza xoriatiki.jpg801
3517E:\Web Recipe Images Low Res\Bondpizza.jpg801
3618E:\Book Images\psomia 1 1.jpg741
3718E:\Web Recipe Images Low Res\Brod-fran-Xiliato.jpg741
3819E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\0092.jpg820
3919E:\Web Recipe Images Low Res\Burgare-med-halloumi.jpg820
4020E:\NAS FILES\Jessica- Jul2017\0701.jpg660
4120E:\Web Recipe Images Low Res\Dadelbollar.jpg660
Sheet1
Cell Formulas
RangeFormula
E2:E41E2=COUNTIFS(A:A,A2,B:B,"*"&"book images"&"*")
 
Upvote 0
Solution
will this work for you

=COUNTIFS(A:A,A2,B:B,"*"&"book images"&"*")

it counts/flags the groups that also have "book images" in the picture file name

however Group 13 has 2 rows - both have book images in the file

if this is the results - then you can filter on >0

Book1
ABCDE
1Group NumPicture FilenameSimilarityCount of group & Book Images
21E:\NAS FILES\Jessica- Jul2017\0611_ljusare.jpg720
31E:\NAS FILES\Jessica- Jul2017\0611.jpg720
42E:\NAS FILES\Jessica- Jul2017\0633_ljusare.jpg740
52E:\NAS FILES\Jessica- Jul2017\0633.jpg740
63E:\NAS FILES\Jessica- Jul2017\0664_snabbfri-ko.jpg830
73E:\NAS FILES\Jessica- Jul2017\0664.jpg830
84E:\NAS FILES\Jessica- Jul2017\0674-ljusare.jpg690
94E:\NAS FILES\Jessica- Jul2017\0674.jpg690
105E:\NAS FILES\Jessica- Jul2017\0681_ljusare.jpg850
115E:\NAS FILES\Jessica- Jul2017\0681.jpg850
126E:\NAS FILES\Jessica- Jul2017\0701_ljjusare.jpg690
136E:\NAS FILES\Jessica- Jul2017\0701.jpg690
147E:\NAS FILES\Jessica- Jul2017\2540.jpg990
157E:\NAS FILES\Jessica- Jul2017\Glögg drink recept.jpg990
168E:\NAS FILES\Jessica- Jul2017\Glögg drink recept.jpg830
178E:\NAS FILES\Jessica- Jul2017\Glögg drink recept web .jpg830
189E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster_ret.jpg860
199E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster.jpg860
2010E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja utan fönster.jpg860
2110E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja.jpg860
2211E:\NAS FILES\Jessica- Jul2017\jessica mörkgrön tröja.jpg760
2311E:\NAS FILES\Jessica- Jul2017\jessica.jpg760
2412E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\0107.jpg1000
2512E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\minipizzor med tryffelfeta.jpg1000
2613E:\Book Images\2018-02-09_2386.jpg852
2713E:\Book Images\klassiska röror.jpg852
2814E:\NAS FILES\Jessica- Jul2017\0611.jpg620
2914E:\Web Recipe Images Low Res\Agghalvor-med-oliver.jpg620
3015E:\Book Images\psari me fasolia 1 1.jpg821
3115E:\Web Recipe Images Low Res\Black-eye-bonor-med-seabream.jpg821
3216E:\Book Images\kalamarakia.jpg791
3316E:\Web Recipe Images Low Res\Blackfisk-med-gronsaker-och-grilloumi.jpg791
3417E:\Book Images\pizza xoriatiki.jpg801
3517E:\Web Recipe Images Low Res\Bondpizza.jpg801
3618E:\Book Images\psomia 1 1.jpg741
3718E:\Web Recipe Images Low Res\Brod-fran-Xiliato.jpg741
3819E:\NAS FILES\Jessica Frej - receptbilder sommar för Web\0092.jpg820
3919E:\Web Recipe Images Low Res\Burgare-med-halloumi.jpg820
4020E:\NAS FILES\Jessica- Jul2017\0701.jpg660
4120E:\Web Recipe Images Low Res\Dadelbollar.jpg660
Sheet1
Cell Formulas
RangeFormula
E2:E41E2=COUNTIFS(A:A,A2,B:B,"*"&"book images"&"*")

Hi etaf, I can't say thank you enough for this. It's super helpful and worked absolutely brilliantly. It's exactly what I needed. Thank you for taking the time out to help with this.
Wish you a great day ahead too.

F
 
Upvote 0

Forum statistics

Threads
1,223,903
Messages
6,175,284
Members
452,630
Latest member
OdubiYouth

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top