Showing posts with label Data Mining. Show all posts
Showing posts with label Data Mining. Show all posts

Thursday, May 18, 2023

Binning (of a column or 1D Numerical Data)

Equal-frequency partitioning

Suppose a group of 12 sales price records has been sorted as follows: 5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215 Partition them into three bins by each of the following methods. equal-frequency partitioning bin 1: 5, 10, 11, 13 bin 2: 15, 35, 50, 55 bin 3: 72, 92, 204, 215

Equal-width partitioning

Suppose a group of 12 sales price records has been sorted as follows: 5, 10, 11, 13, 15, 35, 50, 55, 72, 92, 204, 215 Partition them into three bins by each of the following methods. The width of each interval is (215 – 5)/3 = 70. bin 1: 5, 10, 11, 13, 15, 35, 50, 55, 72 bin 2: 92 bin 3: 204, 215

For ex: if we had 3 bins instead of 4. The bins would look like this: