I am trying to analyze our company's sales data to better understand which items frequently sell together so that we can better organize our inventory and cut down on time to assemble orders.
The data is organized by order number and product number.
My initial thought was to take the top 1000 or so items and set them up along the x and y axes. I would run some kind of count formula that would identify the distinct number of order numbers that included the items from both the x axis and y axis. Does anyone know how I could set up that count formula?
[TABLE="class: grid, width: 200"]
<tbody>[TR]
[TD]Order Number[/TD]
[TD]Product Number[/TD]
[/TR]
[TR]
[TD]111[/TD]
[TD]123[/TD]
[/TR]
[TR]
[TD]111[/TD]
[TD]456[/TD]
[/TR]
[TR]
[TD]111[/TD]
[TD]789[/TD]
[/TR]
[TR]
[TD]222[/TD]
[TD]456[/TD]
[/TR]
[TR]
[TD]222[/TD]
[TD]789[/TD]
[/TR]
[TR]
[TD]333[/TD]
[TD]123[/TD]
[/TR]
[TR]
[TD]333[/TD]
[TD]789[/TD]
[/TR]
</tbody>[/TABLE]
[TABLE="class: grid, width: 200"]
<tbody>[TR]
[TD][/TD]
[TD]123[/TD]
[TD]456[/TD]
[TD]789[/TD]
[/TR]
[TR]
[TD]123[/TD]
[TD]x[/TD]
[TD]1[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]456[/TD]
[TD]1[/TD]
[TD]x[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]789[/TD]
[TD]2[/TD]
[TD]2[/TD]
[TD]x[/TD]
[/TR]
</tbody>[/TABLE]
Any thoughts on how to accomplish this kind of analysis?
Thanks!
The data is organized by order number and product number.
My initial thought was to take the top 1000 or so items and set them up along the x and y axes. I would run some kind of count formula that would identify the distinct number of order numbers that included the items from both the x axis and y axis. Does anyone know how I could set up that count formula?
[TABLE="class: grid, width: 200"]
<tbody>[TR]
[TD]Order Number[/TD]
[TD]Product Number[/TD]
[/TR]
[TR]
[TD]111[/TD]
[TD]123[/TD]
[/TR]
[TR]
[TD]111[/TD]
[TD]456[/TD]
[/TR]
[TR]
[TD]111[/TD]
[TD]789[/TD]
[/TR]
[TR]
[TD]222[/TD]
[TD]456[/TD]
[/TR]
[TR]
[TD]222[/TD]
[TD]789[/TD]
[/TR]
[TR]
[TD]333[/TD]
[TD]123[/TD]
[/TR]
[TR]
[TD]333[/TD]
[TD]789[/TD]
[/TR]
</tbody>[/TABLE]
[TABLE="class: grid, width: 200"]
<tbody>[TR]
[TD][/TD]
[TD]123[/TD]
[TD]456[/TD]
[TD]789[/TD]
[/TR]
[TR]
[TD]123[/TD]
[TD]x[/TD]
[TD]1[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]456[/TD]
[TD]1[/TD]
[TD]x[/TD]
[TD]2[/TD]
[/TR]
[TR]
[TD]789[/TD]
[TD]2[/TD]
[TD]2[/TD]
[TD]x[/TD]
[/TR]
</tbody>[/TABLE]
Any thoughts on how to accomplish this kind of analysis?
Thanks!