tl;dr: Why data linearization is applied in the identification o differentially expressed genes/proteins?
I am new to data analysis of big data like proteomics. As far as I know, a simple t-test is not enough as there is a high chance of false positives. I've been reading and it seems that Limma is a good package with better statistics to be applied in the identification of differentially expressed proteins (and genes). Most of the papers apply linearization in the process of identifying the genes but I would like to understand why this step is necessary.
An example of a R code that I generally see people using is this one:
design