How to build a correct GAM with repeated measurements and replicates?


I ran an experiment with repeated measurements over time, and I was advised to use a GAM to describe my data.

The experiment ran for 6 weeks on 12 mesocosms (big tanks) to test the crossed effect of two treatments, A (TA) and B (TB), on the variation of chlorophyll. Each treatment had two levels: A1 or A2, and B1 or B2. The levels were crossed and replicated three times, so in the end I worked with 4 treatment combinations:

3 tanks (replicates) subjected to A1 + B1
3 tanks (replicates) subjected to A1 + B2
3 tanks (replicates) subjected to A2 + B1
3 tanks (replicates) subjected to A2 + B2

Every two or three days, we measured the chlorophyll concentration in each tank. So my model needs to account for two things:

The measurements were repeated on the same tank.

In addition, there were replicate tanks within each treatment combination.

My variables are:

date: numeric (written as day of the year, for example: 2 Sept = 245th day of the year)
tank: factor with 12 levels (the different tanks)
TA: factor with two levels (A1 and A2)
TB: factor with two levels (B1 and B2)
treat: crossed treatments A1_B1, A1_B2, A2_B1, A2_B2
chloro: response variable
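
A minimal sketch of how the variables above could be laid out in long format in R, with one row per tank per sampling day. The column names follow the variable list; the tank labels and the sampling days are invented for illustration and are not the real schedule.

```r
# Hypothetical layout: column names follow the variable list above;
# tank labels and sampling days are invented for illustration only.
tanks <- data.frame(
  tank = factor(sprintf("T%02d", 1:12)),
  TA   = factor(rep(c("A1", "A2"), each = 6)),
  TB   = factor(rep(rep(c("B1", "B2"), each = 3), times = 2))
)
tanks$treat <- interaction(tanks$TA, tanks$TB, sep = "_")

# Assumed sampling days (day of year), one every three days
days <- seq(245, 285, by = 3)

# Long format: merge() with no shared columns gives the Cartesian product,
# i.e. one row per tank per sampling day
d <- merge(tanks, data.frame(date = days))
d$chloro <- NA_real_  # placeholder for the measured chlorophyll

table(tanks$treat)  # number of replicate tanks per treatment combination
```

With the data in this shape, tank identifies the repeated-measures unit and treat identifies the treatment combination each tank belongs to.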

(see pictures).

I'm new to GAMs, which is why I'm asking for help to better understand the models I wrote and how to construct them properly. I wanted to start with simple models, then move to more complex ones, and visualize them on a graph (see figure; be careful, the models are not displayed in numerical order but in the following order: mod 1, mod 10, mod 11, mod 2, mod 3, mod 4, mod 5, mod 6, mod 7, mod 8, mod 9).

(see pictures)

I would like to know :

3. How do I correctly specify the repeated measurements while taking the replicates into account? In linear regression I would write this as a random effect, (1|variable), but here I realized there are many possible ways to write the model. I don't understand the difference between these terms: s(tank, bs="re") and s(date, tank, bs=c("fs"))

And between: s(date, tank, bs=c("fs")) and s(date, tank, bs=c("fs", "re")) and ?

Especially since the models seem identical (mod 1 vs mod 3 in the graph). How do I know which specification is the correct one to choose?

4. When I remove the tank effect from the model (mod 7) and test only the effect of treatment, all my predictions are very poor and do not fit my data as expected. I have the impression that the variable "tank" was able to explain all my data by itself in the previous models. Does mod 7 suggest that my treatment variable has no effect on my response variable?

5. When I wrote categorical predictors as in linear regression (mod 6, 8, 9 & 10), outside of a smooth function, I didn't see any difference from the models where these predictors were not present (as in mod 3 or 4).

6. How do I choose the best one among all these models? Do I have to compare the models by AIC, or is there another method?

Thank you for your help,

David Eugene Booth

https://1lib.us/book/3553646/73cc0c

The link is to Simon Wood, Generalized Additive Models, 2nd ed. Probably the best book around for what you want to do. Check it out. Best wishes, David Booth

Abolfazl Ghoodjani

I think your study is easier than what you wrote.

Date, treat, replicates are not known as Variables and are redundant in the model.

Your study is a three-way repeated-measures (RM) design, which you can analyze simply with SPSS.

This study includes 288 models.

12*2*2*6 = 288.

Holger Steinmetz

Hello Emilie,

you have a GAMM, not a GAM, as your time series are nested in tanks.

Here is a nice tutorial that IMHO fits your situation.
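
As a rough sketch of what that nesting can look like in R with the mgcv package (where the s(..., bs="re") and s(..., bs="fs") terms from the question come from): the data below are simulated stand-ins, not the real measurements, and the basis sizes k are arbitrary assumptions. Model (a) gives each tank a random intercept, so every tank follows its treatment's curve shifted up or down. Model (b) gives each tank its own smooth deviation over time.

```r
library(mgcv)

set.seed(1)
# Simulated stand-in data (NOT the real measurements): 12 tanks,
# 4 treatment combinations, assumed sampling every three days.
tanks <- data.frame(
  tank  = factor(sprintf("T%02d", 1:12)),
  treat = factor(rep(c("A1_B1", "A1_B2", "A2_B1", "A2_B2"), each = 3))
)
d <- merge(tanks, data.frame(date = seq(245, 285, by = 3)))
tank_shift <- rnorm(12, sd = 0.5)  # tank-specific offsets
d$chloro <- 5 + sin((d$date - 245) / 8) * as.integer(d$treat) +
  tank_shift[as.integer(d$tank)] + rnorm(nrow(d), sd = 0.3)

# (a) Random intercept per tank, one time smooth per treatment
m_re <- gam(chloro ~ treat + s(date, by = treat, k = 6) +
              s(tank, bs = "re"),
            data = d, method = "REML")

# (b) Factor-smooth interaction: a penalized time smooth for each tank
m_fs <- gam(chloro ~ treat + s(date, by = treat, k = 6) +
              s(date, tank, bs = "fs", k = 5, m = 1),
            data = d, method = "REML")

summary(m_re)    # parametric treatment terms and smooth terms
AIC(m_re, m_fs)  # both models share the same fixed-effect structure
```

Both models here have the same fixed effects and differ only in the tank term, so comparing the REML fits is reasonable. When comparing models that differ in their fixed effects, refitting with method = "ML" is the usual advice.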

Sóskuthy, M. (2017). Generalised additive mixed models for dynamic analysis in linguistics: a practical introduction. arXiv preprint arXiv:1703.05339.


HTH

--Holger
