When plotting quite dissimilar values, it is common to use a logarithmic scale for the Y axis. Let's say, I have the following lists (a and b) of values.

a = [6.43x10^6, 9.42x10^6, 12.7x10^6, 12.9x10^6, 18x10^6, 16.3x10^6]

b= [5.19x10^6, 5.64x10^6, 7.06x10^6, 6.73x10^6]

If I want to compare 'a' and 'b', let's say by using the mean and the standard deviation I have at least two options:

1. I compute the average and the std.dev of each list, but the standard deviation of 'a' will be huge and the 'a' and 'b' means won't be different.

2. I could transform each value in 'a' and 'b' by using Log10 and then calculate the mean and the standard deviation of such logarithms.

1 and 2 produce substantially different results. Intuitively it looks to me that 1 is correct and 2 is not.

What's your opinion?

More Juan Jovel's questions See All
Similar questions and discussions