WaqasTariq
Board Regular
- Joined: Jun 26, 2012
- Messages: 58
- Office Version: 365
I am trying to find a way to choose an optimal bin size for a histogram built from user-provided data. The number of rows varies with each input. The data consists of times within a 24-hour period, each with a varying quantity.
I found two methods online, but both had issues:
- The first method takes the count of data points, takes its square root (rounded) to get the number of bins, and then divides the specification tolerance by that number of bins, using a table the article provided.
- The second used Kernel Density Estimation (KDE), which is what I want to use, but it required buying their plugin.
How can I use Kernel Density Estimation (KDE) to find the optimal bin size for my data? The number of rows will vary each time.
Here is what the data looks like; an example Excel file is attached as well.
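For reference, the first method can be sketched outside Excel in a few lines of Python. This is only a rough illustration: it assumes the data range (max minus min) stands in for the "specification tolerance", since the article's table is not reproduced here, and the function name `sqrt_rule_bins` is made up for this example.

```python
import math

def sqrt_rule_bins(values):
    """Square-root rule: bins = round(sqrt(n)), width = range / bins.

    Assumption: the data range is used in place of the article's
    'specification tolerance'.
    """
    n = len(values)
    if n == 0:
        raise ValueError("need at least one value")
    bins = max(1, round(math.sqrt(n)))
    width = (max(values) - min(values)) / bins
    return bins, width

# Times from the sample table, converted to minutes after midnight.
times = [30, 75, 90, 105, 125, 145, 165, 195] * 2
bins, width = sqrt_rule_bins(times)
print(bins, width)  # 16 rows give 4 bins of 41.25 minutes each
```

Note that this rule ignores the Quantity column entirely; it depends only on the row count and the spread of the times.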
Time Quantity
0:30 1
1:15 2
1:30 4
1:45 5
2:05 6
2:25 7
2:45 6
3:15 4
0:30 1
1:15 2
1:30 4
1:45 5
2:05 6
2:25 7
2:45 6
3:15 4
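One possible reading of "using KDE to pick a bin size" is to compute a KDE bandwidth and use it as the histogram bin width. The Python sketch below does that with Silverman's rule of thumb for a Gaussian kernel. Several things here are assumptions rather than anything from the plugin or article mentioned above: treating the bandwidth as the bin width, repeating each time by its quantity to weight it, and the helper name `silverman_bandwidth`.

```python
import math
import statistics

def silverman_bandwidth(values):
    """Silverman's rule of thumb for a Gaussian kernel:
    h = 0.9 * min(stdev, IQR / 1.34) * n ** (-1/5)
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values")
    stdev = statistics.stdev(values)
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    # Fall back to the standard deviation if the IQR collapses to zero.
    spread = min(stdev, iqr / 1.34) if iqr > 0 else stdev
    if spread == 0:
        raise ValueError("all values are identical; bandwidth is undefined")
    return 0.9 * spread * n ** (-1 / 5)

# (minutes after midnight, quantity) rows from the sample table.
rows = [(30, 1), (75, 2), (90, 4), (105, 5),
        (125, 6), (145, 7), (165, 6), (195, 4)] * 2

# Assumption: weight each time by repeating it 'quantity' times.
samples = [t for t, qty in rows for _ in range(qty)]

h = silverman_bandwidth(samples)  # bandwidth in minutes
bins = max(1, math.ceil((max(samples) - min(samples)) / h))
print(f"bin width ~ {h:.1f} min, {bins} bins")
```

Because the bandwidth is recomputed from whatever rows are supplied, the bin width adapts to the size of each input. In Excel the same quantities could be built from STDEV.S, QUARTILE.INC and COUNT, although QUARTILE.INC uses a different quantile convention than the Python call here, so the numbers may differ slightly.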