First, you have to divide the FPKM of the second value (of the second group) on the FPKM of the first value to get the Fold Change (FC). then, put the equation in Excel =Log(FC, 2) to get the log2 fold change value from FPKM value.
I send you an approach coming from bioinformatique links sources:
Suppose 2 gene expression values A,B (treatment):
A=10
B=15
Foldchange is B/A => FC=1.5 or greater is Up regulated , and if the values were B=10,A=15 we'll have FC=0.66 it means all values less than 0.66 will be down regulated.
So to calculate log2-foldchange, its formula is
log2FC=Log2(B)-Log2(A)
which then all values greater than 0.5849 were be up regulated and all values less than -0.5849 (or FC =0.666) were be down regulated genes, protein or etc.
For calculating Fold change from log2 just do , Power(2, log2_Value)
, Power(2, 0.5849)=1.5
You can also read Microarray data normalization and transformation. Nature Genetics 32, 496 - 501 (2002) .
What about the scenario where fpkm value of a is 0 and b is 10, If we divide 10/0 we get infinity. Or consider a case where a is 10 and b is 0, in that case b/a will be zero and the log value would be inifinity. Is that the correct way of representing fold change
Hello i am new to RNA-seq data analysis and i am working with RNA-seq data analysis of Tumor and Normal samples. I have some queries regarding the FC calculation and interpretation
Question 1:
I am unable to understand the above post shown by Marius K. Somda . It says, all value greater than the calculated FC FC=0.66 is up regulated and similarly, value less than FC=0.66 is down regulated.
My question is what is "all value" in the data set, as FC=0.66 is the actual fold change, we calculated.
Lets consider B is expression value of Gene-1 in Tumor sample1 and A is expression value of Gene-1 in Normal sample1
As per above formula
FC=B(Tumor)/A(Normal)=10/15= 0.666667.
So, how do we interpret this FC value? can we say there is a down regulation of Gene-1 in the Tumor sample as compared to Normal sample?
Question 2:
I have FPKM values of multiple genes and i want to compare FPKM values of each gene in two groups i.e. Tumor vs Normal.
How do we calculate FC or Log2 value of gene expression in group os samples, i.e. FPKM value of gene-1 in 50 Tumor sample and 30 Normal sample? Should we take average of FPKM values from one group (50 Tumor) and similarly from group 2 (30 Normal), or any other way?
Do we require to do statistical analysis such as Poisson test and Fisher's exact test, as i am having gene expression values from multiple samples?
Kutti Biotech the FPKMs of a gene represents the abundance of that gene normalised by gene length and sequence depth and it doesn`t affect the stats. so you calculate the p.value as you would normally do with any other experiments
Everyone without computational experience to perform gene-count level statistical analysis in R or python, please do yourself a favor and use Biojupies. https://maayanlab.cloud/biojupies/ developed by the Maayan Lab.