08 February 2022 2 920 Report

We work on comparing the effects of different treatments (e.g. temperature increase, ocean acidification, eutrophication) on different physiological parameters of a coral species (e.g. its respiration). The goal would basically be a classical ANOVA to compare the means of different groups. As the data comes from many different experiments, using the absolute measured values is not an option, as the baselines are different between the experiments. Therefore, we thought of comparing the percentage difference (last day of the treatment compared to baseline) in the response variables, to have a more general look at the overall trends.

Attached some dummy data to better explain our point.

The dummy data is extreme, but depending on the parameters we look at, we end up having many negative percentage values, as respiration decreased, and also changes above 100%, when respiration triples as a response to a certain stressor. Separating between the negative and positive percentage changes is important, as one factor might increase respiration by 10% (+10) while another one decreases it by 10% (-10), and it would be interesting to see if there is a significant difference between these two treatments. Just converting it all into positive percentage change values would hide these opposite effects of treatments.

Researching online and looking at papers working with percentage data, we thought to convert the percentages into proportions and then conduct an arcsin squareroot transformation. The problem is, that for this transformation the data cannot be negative, and we also run into problems with values bigger than 1. So, we thought of normalizing the data to have them in a range of 0 to 1, meaning that a value of 0.5 equals a percentage change of zero and then conduct an arcsin squareroot transformation on this normalized data.

The question we are asking ourselves: How do we have to deal with the percentage data before being allowed to use it in a statistical test like an ANOVA? Is a transformation really necessary, and if so, which one (that can be used having negative percentages and percentages higher than 100)?

More Selma Mezger's questions See All
Similar questions and discussions