Our research campaing involved  sampling of 4 rivers (at three different altitude stations each river). At every station we collected three different samples in a longitudinal 100 m-transect of the river taking special care to sample the full heterogeneity of substrata, and analyzed for benthic macroinvertebrates.

At the same time we evaluated numerous catchment variables in order to test the relevance of the land use and catchment properties on the macroinvertebrate community.

 Therefore, we ended up with 4 rivers * 3 stations * 3 samples per station = 36 samples.

 My question is  Whether all samples could be wisely included as  cases in a Random Forest Model (n=36)?…, or should I instead average macroinvertebrate samples per station to avoid pseudoreplication (n=12)?

I would greatly appreciate any help and advice on this issue.

Salud, y gracias

Manuel

Similar questions and discussions