Hey guys,
I need help with a data mining assignment. I have attached dummy data that will help you understand how my data is structured.
What I need to achieve:
1. Filter the data by multiple criteria at once (e.g. people who have used the app "MO" and are from "USA").
2. Filter the data by interest as well (e.g. people who have an interest in "Movies" and are from "GB").
3. Once the filtering is done, get the number of records that meet my criteria and, in case 1, their interests. For example, I want to know the top 5 interests of the users who used the app "MO" and are from "USA".
4. After that, creating charts to show these numbers would be of added value.
My objective for this assignment is to answer the question HOW MANY, not WHO. So if I query "MO" users from "GB", I want to know how many people fit these criteria and what their top 5 interests are.
Which direction should I take to achieve this objective? Should I use pivot tables? (I'm having problems with the interests data being split across 3 columns.) Or should I not use Excel for this at all?
Awaiting your answers.
Cheers.
[TABLE="width: 773"]
<colgroup><col><col span="2"><col><col><col><col></colgroup><tbody>[TR]
[TD]App Name[/TD]
[TD]Name[/TD]
[TD]Age[/TD]
[TD]Country[/TD]
[TD]Interest 1[/TD]
[TD]Interest 2[/TD]
[TD]Interest 3[/TD]
[/TR]
[TR]
[TD]MO[/TD]
[TD]Jean[/TD]
[TD]20[/TD]
[TD]USA[/TD]
[TD]Movies[/TD]
[TD]Music[/TD]
[TD]Food[/TD]
[/TR]
[TR]
[TD]MO[/TD]
[TD]Val[/TD]
[TD]20[/TD]
[TD]GB[/TD]
[TD]Movie[/TD]
[TD]Food[/TD]
[TD]Tech[/TD]
[/TR]
[TR]
[TD]MO[/TD]
[TD]Maurice[/TD]
[TD]25[/TD]
[TD]USA[/TD]
[TD]Music[/TD]
[TD]Movies[/TD]
[TD]Books[/TD]
[/TR]
[TR]
[TD]WAK[/TD]
[TD]Marc[/TD]
[TD]25[/TD]
[TD]GB[/TD]
[TD]Music[/TD]
[TD]Tech[/TD]
[TD]Books[/TD]
[/TR]
[TR]
[TD]WAK[/TD]
[TD]Paul[/TD]
[TD]30[/TD]
[TD]GB[/TD]
[TD]Food[/TD]
[TD]Movies[/TD]
[TD]Books[/TD]
[/TR]
[TR]
[TD]WAK[/TD]
[TD]Pule[/TD]
[TD]18[/TD]
[TD]USA[/TD]
[TD]Food[/TD]
[TD]Movies[/TD]
[TD]Music[/TD]
[/TR]
</tbody>[/TABLE]
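To make the goal concrete, here is a rough sketch of the kind of query I have in mind, written in plain Python rather than Excel. It is only an illustration under my own assumptions: the names `ROWS`, `matching` and `top_interests` are made up for this example, and the rows are copied from the dummy table above. The key idea is that the three interest columns are counted together as one pool, which is exactly the part I can't get a pivot table to do.

```python
from collections import Counter

# Rows copied from the dummy table above, in column order:
# App Name, Name, Age, Country, Interest 1, Interest 2, Interest 3.
# Val's "Movie" is kept exactly as it appears in the table.
ROWS = [
    ("MO", "Jean", 20, "USA", "Movies", "Music", "Food"),
    ("MO", "Val", 20, "GB", "Movie", "Food", "Tech"),
    ("MO", "Maurice", 25, "USA", "Music", "Movies", "Books"),
    ("WAK", "Marc", 25, "GB", "Music", "Tech", "Books"),
    ("WAK", "Paul", 30, "GB", "Food", "Movies", "Books"),
    ("WAK", "Pule", 18, "USA", "Food", "Movies", "Music"),
]


def matching(rows, app=None, country=None, interest=None):
    """Return the rows that satisfy every criterion given (None means 'any')."""
    result = []
    for row in rows:
        row_app, _name, _age, row_country, *interests = row
        if app is not None and row_app != app:
            continue
        if country is not None and row_country != country:
            continue
        # An interest matches if it appears in any of the three interest columns.
        if interest is not None and interest not in interests:
            continue
        result.append(row)
    return result


def top_interests(rows, n=5):
    """Count interests across all three interest columns; return the n most common."""
    counts = Counter()
    for row in rows:
        counts.update(row[4:])  # the last three fields are the interests
    return counts.most_common(n)


# Query 1: users of app "MO" from "USA" -> how many, and their top interests.
mo_usa = matching(ROWS, app="MO", country="USA")
print(len(mo_usa))
print(top_interests(mo_usa))

# Query 2: users interested in "Movies" from "GB" -> how many.
movies_gb = matching(ROWS, country="GB", interest="Movies")
print(len(movies_gb))
```

With the dummy data, query 1 finds 2 users (Jean and Maurice), with Movies and Music counted twice each and Food and Books once each. Query 2 finds only Paul, because the match is exact and Val's row says "Movie" rather than "Movies".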