24 September 2020 3 1K Report

I am currently using TCGA data available in cbioportal to perform analysis of different p53 targets. There's mRNA level data of all patients and all genes and about half of the patients have protein levels information for just some proteins. Both mRNA levels and protein levels are given in Z-scores. Since I was just getting started with the analysis and still becoming familiarized with everything, I decided to try and plot the p53 protein z-score vs the p21 mRNA z-score to see what it looked like, since p21 is a well-known p53 target. But to my surprise, the scatter plot looked quite random and uncorrelated (I attached below the image of the plot I did in R). I performed a correlation test between the p53 protein z-score and p21 mRNA z-score using Spearman's coefficient (since the distributions aren't normal) and got a very low number, around -0.002. I wanted to know if there's an error in the statistical methodology I'm applying and if not, what could be the explanation to this, since p53 has been widely proven to activate p21 so you would expect at least *some* degree of correlation.

Thanks in advance and sorry if it's a dumb question, I'm new to the field of databases and biostatisical analysis.

More Emilia Baffo's questions See All
Similar questions and discussions