I am analyzing a data set that does not follow a normal distribution, and I am wondering how to account for the different sample size in each bin.
For example, I have a data set of songs I have streamed, including the hour of the day each song was streamed and its BPM. I would like to chart BPM against the hour of day streamed.
What would be the best way to approach this analysis, and how should I account for the uneven number of songs streamed in each hour?
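To make the question concrete, here is a minimal sketch of the kind of per-hour summary I have in mind; it is not a settled method. It groups BPM values by hour and reports each hour's count alongside its mean, median, and standard error of the mean, so that hours with few streams show up as less certain. The function name `summarize_bpm_by_hour`, the `(hour, bpm)` pair layout, and the sample values are all made up for illustration.

```python
from collections import defaultdict
from math import sqrt
from statistics import mean, median, stdev


def summarize_bpm_by_hour(streams):
    """Summarize BPM per hour from an iterable of (hour, bpm) pairs.

    Hypothetical helper: the input layout is an assumption, not my real schema.
    """
    by_hour = defaultdict(list)
    for hour, bpm in streams:
        by_hour[hour].append(bpm)

    summary = {}
    for hour in sorted(by_hour):
        values = by_hour[hour]
        n = len(values)
        # Standard error of the mean shrinks as n grows; undefined for n == 1.
        sem = stdev(values) / sqrt(n) if n > 1 else None
        summary[hour] = {
            "n": n,                    # sample size for this hour
            "mean": mean(values),
            "median": median(values),  # less sensitive to skew than the mean
            "sem": sem,
        }
    return summary


# Made-up sample data: four streams at 08:00, two at 23:00.
streams = [(8, 120), (8, 100), (8, 110), (8, 130), (23, 90), (23, 150)]
summary = summarize_bpm_by_hour(streams)
for hour, stats in summary.items():
    print(hour, stats)
```

Because each hour is summarized on its own, the mean and median are not pulled toward the hours with the most streams. The count and standard error only indicate how much to trust each hour's value.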
Thanks!