Hi everyone, we need some help with a request from a reviewer.
Here's the issue:
In our study we investigated the impairment in the ability to recognize facial emotions when a facemask is present (standard mask, transparent mask, and no mask). One reviewer argued that we employed a too small number of trials (40 items: 10 faces * 4 facial expressions). Moreover, due to the non-normality of data, we ran non-parametric tests (Kruskal-Wallis; Mann-Whitney; Friedman test; Wilcoxon signed-rank test). Does it make sense to compute the observed power of the analysis in response to the reviewer's concern? And in particular, how can we calculate the observed power of a Kruskal-Wallis test (SPSS does not have the point and click option for observed power as for the non-parametric analysis)?
Furthermore, would it be the same to compute the observed power (post hoc) of the relative parametric tests (i.e., compute the observed power of the ANOVA relative to the Kruskal Wallis test)?
Lastly, in a bunch of forum many staticians clearly state that any post hoc computation of observed power is completely useless (not to say "nonsense"). But, aside from blog posts, is there some relevant paper we could quote to support this claim? Also, if that's the case, how can we justify our number of trials without using observed power?
P.S.: In our case we always observed significant results with very low p values (always .90 observed power.
How can we sort this out?
thank you guys in advance,