Hi ludo2016, welcome to the boards.Hello
I have a dataset with about 200 different post codes spread across 6,000 company names.
I want to find out how many companies are in each post code.
how should i do it?
hopefully, my end product is a bell curve distribution chart..
I don't have a column with a unique post code per row. I have a column with about 200 different codes across 6,000 rows.
In order to go thru with your method, first of all, how would I extrapolate the different codes and distribute them across 200 rows?
(Btw the codes are all 5 numerical digits)
1> copy the column and paste in a different worksheet
2>select the new column and press ' remove duplicates' in the DATA section
3> use the countif formula.
Not sure if people follow..Is there a way to attach a spreadsheet to illustrate? Would that help you help me?
HERE is a simplified example of my suggestion. It is simplified by:ok thanks fishboy and bhos. its still not getting me a proper result.
removing duplicates worked but the countif suggestion is not working
as per fishboy's idea here is the formula I'm using:
=COUNTIF($A$5:$A$11477,"*"&Sheet1!B2&"*")
As I don't know what the asterix is for, I left it blank. The formula resulted in a blank which is not right but even if it did work, fishboy is suggesting dragging it down across the 200 or so rows of unique post codes... but don't I need to be dragging it down across the 600 or so rows of non-unique post codes?
(there are about 600 company names that I'm looking at having filtered the ones I want within A2:A11477).
And as per bhos instruction Sheet1!B2 is the first row of the unique post code column.
Not sure if people follow..Is there a way to attach a spreadsheet to illustrate? Would that help you help me?
ok thanks fishboy and bhos. its still not getting me a proper result.
removing duplicates worked but the countif suggestion is not working
as per fishboy's idea here is the formula I'm using:
=COUNTIF($A$5:$A$11477,"*"&Sheet1!B2&"*")
As I don't know what the asterix is for, I left it blank. The formula resulted in a blank which is not right but even if it did work, fishboy is suggesting dragging it down across the 200 or so rows of unique post codes... but don't I need to be dragging it down across the 600 or so rows of non-unique post codes?
(there are about 600 company names that I'm looking at having filtered the ones I want within A2:A11477).
And as per bhos instruction Sheet1!B2 is the first row of the unique post code column.
Not sure if people follow..Is there a way to attach a spreadsheet to illustrate? Would that help you help me?
Ah! Indeed, if there are no addresses just postcode then the whole wildcard bit is most definitely redundant and unnecessary.Thanks Fishboy. Problem solved!
btw, I'm not using addresses, only postcodes, so don't think there was an absolute need to cater for wildcards? but anyhow.. it seems to have worked.
now I'm learning about bell curves from https://www.youtube.com/watch?v=_PqnDYMO3lw. Thanks once again Fishboy et al, and thanks Mr Excel!