Hello,

I am trying to decide on the best approach for statistical analyses of my data. I have never done this before and was wanting a second opinion.

I am studying the impacts of nitrogen pollution on the near-shore nekton communities. There are three different estuaries each divided in to three stations: an upper (impacted region), middle (less impacted region), and lower (less impacted region). The upper impacted regions tend to be dominated by macroalgae are are subjected to frequent anoxic events. The two lower sites generally are dominated by seagrass.

The the estuaries' station are going to be sampled twice: in 2020 (July and August, already completed) and again in 2021 (June, July, August) to account for temporal variation. Five seine net samples will be done at each station. Environmental data will be collected as well: temperature, salinity, dissolved oxygen, vegetation cover.

My main question is if the upper station's community differs from the lower two stations' communities. I am also interested in determining if the communities between estuaries differ, but this is not the main question. The response variable is the proportional abundances of species in the estuary. The ultimate goal would be to identify what species appear to drive this change.

If I look at each estuary individually I will have two factors: Station (Levels: 3 (upper, middle, lower); Factor effect: I think is fixed) and Month (Levels: 2 in 2020 (July, August), 3 in 2021 (June, July, August); Factor effect: fixed). I am wondering If I would consider the stations fixed, as I revisited the same spots? I think Month would be considered ordered, as July comes before august. Would I consider stations to be ordered? as they are in an order from upper to lower.

I am using PRIMER_e as the stats program. I looked over the data I have so far and I decided to standardize the data first so it proportional abundance per seine, then it seems that either 4th root or log transformation shows rare taxa the same so I went with Log transformed. I want rare taxa to be represented. I think a Bray -Curtis similarity matrix will suit my purposes, but several of the species are schooling fish, so we caught thousands in on seine but nothing in the next. I was wondering how best to deal with that?

I thought a two-factor crossed ANOSIM would be the best way of analyzing each estuary. But as I understand it ANOSIM can only tell you if a difference exists, nothing more, and PERMANOVA can tell you the magnitude of difference. I was wondering what anyone thinks which would be the better option for answer my question, or will it not matter that much?

I would really appreciate any feedback, I am just learning about multivariate statistics and I have so much to learn.

More Mark Saunders's questions See All
Similar questions and discussions