GAM model is quite suitable for my research question, but I'm not sure whether my samples size is sufficient. What should I pay attention to? Thanks a lot.
Hi, Tz-Hsuan Tseng, In the following paper, it is mentioned that the sample size of GAM needs to be greater than 40 after dividing by the included covariates, which I hope will help you.
Karatekin, T., Sancak, S., Celik, G., Topcuoglu, S., Karatekin, G., Kirci, P., & Okatan, A. (2019, August). Interpretable machine learning in healthcare through generalized additive model with pairwise interactions (GA2M): Predicting severe retinopathy of prematurity. In 2019 International Conference on Deep Learning and Machine Learning in Emerging Applications (Deep-ML) (pp. 61-66). IEEE.
The GAM framework is built on a simple and appealing mental model: relationships between individual predictors and the dependent variable follow smooth patterns that might be linear or nonlinear. We may estimate these smooth connections concurrently and then forecast g(E(Y))) by adding them all up.
Look here as well: https://stats.stackexchange.com/questions/494308/is-there-a-formal-way-to-determine-the-minimum-sample-size-required-to-build-a-g