Good afternoon stats experts,

I know this may be excessive, but I am currently working on a project (N = 61) investigating the effects of white matter metrics on memory development in children. I have run 4 sets of 9 beta regressions (using betareg in R) whose formulae look like this: [Memory Measure] ~ [IQ] + [Control Tract] + [Tract of Interest]. The only thing that varies within the sets is the memory measure (there are 9), and the only thing that varies between the sets is the type of white matter index we are looking at (i.e. FA, MD, streamlines, etc.). IQ and the control tract index are our control measures. The only variable we are truly interested in in any of these models is our tract of interest. With the way I have things set up currently, I have run a whopping 36 beta regressions. Note that all of these analyses are theory- and hypothesis-driven.

It is also important to note that I get a chi-square and p-value in each of these models in order to assess whether or not the inclusion of our variable of interest has given way to a significant change in the log-likelihood function yielded by each model (using lrtest() in R).

When I brought up this issue with my PI, she recommended that I control for multiple comparisons using an FDR correction due to the shear number of models I have run - not due to the number of predictors in each model. However, I an unsure (1) if this is necessary, and (2) how to approach this technically speaking. I am trying to use the p.adjust() function in R, which requires you to input a vector of p-values for which you are trying to correct. If this is the correct approach to be taking, I need to know the following:

  • When listing the vector of p-values in the p.adjust() function, should I be doing this for just the three p-values yielded for each predictor? That is, should I do the adjustment separately fore each model in each set (i.e. 36 times)? Should I included the p-value of the log-likelihood function as well?
  • Would it be correct instead for me to list all the p-values of all the models in each set in order to perform a correction?
  • Would it be correct for me to list all of the p-values across sets in order to perform the correction?
  • Is any correction needed at all?
  • Thank you in advanced for your expertise! Any advise would be sincerely appreciated.

    Kind regards,

    Linda

    More Linda Hoffman's questions See All
    Similar questions and discussions