I have data on numbers of taxa found in water samples from different locations (lakes) as the dependent variable, and many potential predictor variables (e.g. concentrations of various chemicals). I am mainly interested in the effects of one of these predictors, but this predictor is strongly correlated with several other predictors.
Is it reasonable to do the following:
1. Keep the main predictor of interest and remove all others with which it is strongly correlated.
2. Introduce a categorical variable "location" as an additional predictor.
3. Perform random effects modeling (e.g. Poisson regression) where the intercepts and/or slopes for the predictor of interest vary by this categorical variable "location".
In other words, the effects of removed variables which were correlated with the predictor of interest would be included in the random effects of location. Alternatively, I am also thinking about using gradient boosting (e.g. GBM package in r) on these data.
I would be very grateful for advice from anyone interested!