Difference between power and myAUC in Seurat R package for analyzing Single Cell RNA-seq data?

17 May 2023 1 3K Report

I am attempting to use the Seurat FindAllMarkers function to validate markers for rice taken from the plantsSCRNA-db. I want to use the ROC test in order to get a good idea of how effective any of the markers are. While doing a bit of research, different stats forums say: "If we must label certain scores as good or bad, we can reference the following rule of thumb from Hosmer and Lemeshow in Applied Logistic Regression (p. 177):

0.5 = No discrimination 0.5-0.7 = Poor discrimination 0.7-0.8 = Acceptable discrimination 0.8-0.9= Excellent discrimination0.9 = Outstanding discrimination "

https://www.statology.org/what-is-a-good-auc-score/#:~:text=0.5%2D0.7%20%3D%20Poor%20discrimination,%3E0.9%20%3D%20Outstanding%20discrimination

For more background, the output of the function returns a dataframe with a row for each gene, showing myAUC: area under the Receiver Operating Characteristic, and Power: the absolute value of myAUC - 0.5 multiplied by 2. Some other statistics are included as well such as average log2FC and the percent of cells expressing the gene in one cluster vs all other clusters.

With this being said, I would assume a myAUC score of 0.7 or above would imply the marker is effective. However given the formula used to calculate power, a myAUC score of 0.7 would correlate to a power of 0.4. So with this being said, would it be fair to assume that myAUC should be ignored for the purposes of validating markers? Or should both values be taken into account somehow?

Ma'Mon Abu Hammad

In the Seurat R package for analyzing single-cell RNA-seq data, "power" and "myAUC" are both functions used for selecting the most informative features or genes in the dataset. However, they employ different approaches and criteria to achieve this.

Power: The "power" function in Seurat is used for identifying highly variable genes (HVGs) based on their expression dispersion relative to their mean expression level. This approach aims to capture genes that display biological variability across cells and are likely to be driving the observed heterogeneity in the dataset. By default, the "power" function calculates the power of a statistical test to detect differences in expression between two groups of cells, such as treatment vs. control or different cell types. It estimates the relationship between the mean expression and variance of each gene using a trend line and defines highly variable genes as those with expression levels deviating significantly from the trend line. The function outputs a list of highly variable genes ranked by their deviation.

myAUC: The "myAUC" function in Seurat stands for "Area Under the Curve" and is used to rank genes based on their differential expression between two predefined groups or conditions. It employs the area under the receiver operating characteristic (ROC) curve as a measure of differential expression, where the ROC curve represents the true positive rate against the false positive rate at various gene expression thresholds. The myAUC algorithm evaluates the discriminatory power of each gene in distinguishing between the two groups and ranks them accordingly. Genes with higher AUC values have greater discriminatory power and are considered more differentially expressed between the groups of interest.

In summary, the "power" function identifies highly variable genes based on their expression dispersion relative to mean expression, while the "myAUC" function ranks genes based on their ability to discriminate between two predefined groups or conditions using the area under the ROC curve. Both functions aim to identify genes that are potentially important for distinguishing between different cell types, states, or experimental conditions, but they use different statistical and computational approaches to achieve this goal.

Badges
Science method

Similar topics
Biological Science
Ecology

More Tim Johnson's questions See All

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

PBMC infection with virus?

In my study, I intend to infect PBMCs with SARS-CoV-2. After that, I will analyze NK cells by flow cytometry to see if their phenotype changes or if they show degranulation. After the infection, I...

01 August 2024 4,403 4 View

Culture plates for PBMC (non-adherent). For infection?

Hello, good morning. I would like to know which plate I can use to infect fresh PBMC with virus and culture for 24 hours. I have tried using 12-well and 24-well plates, but many cells adhere and...

24 July 2024 2,158 2 View

How can we, as educators, Dismantle Deficit Ideology in Educator Professional Development?

Deficit ideology, the harmful belief that certain individuals or groups are inherently lacking in skills, knowledge, or ability, is a pervasive issue in educational settings, particularly in the...

22 July 2024 7,804 2 View

How to removal of dead cells and debris from organoid cultures?

I'm currently working with human pancreatic organoids and I am wondering to remove the dead cells and debris from health organoids, any suggestions would be appreciated!

17 July 2024 8,330 1 View

Cp of Turbine is ever increasing with Tip Speed Ratio. What Could be the reason??

I am a student of Mechanical Engineering. I am doing CFD opf Archimedes Winf Turbine. I am trying to validate the result of Paper. But the problem is that tha value of Power Coefficient is ever...

05 July 2024 6,839 2 View

Which statistical test would be the best?

For my practical project at Uni, I am researching blue-green algae in a freshwater lake. I have three locations around the lake and am testing the water for phosphates, pH, and temperature. I am...

30 June 2024 4,264 3 View

Why our eyes can concisely definite left and right, high and low? But cannot near and far?

While analyzing the data of point cloud, i found some strange phenomenon about the precise. But I expect there is no best plan between LiDAR and binocular vision. However, why our eyes more...

21 June 2024 5,269 3 View

Can anyone send me the Weblink to Download ECRTool?

ECRTool is a Matlab based software tool for plotting Electrical Conductivity Relaxation graphs.

04 June 2024 8,133 1 View

How can artificial intelligence enhance personalized learning experiences in education, and what are the potential challenges?

Artificial intelligence (AI) has the potential to revolutionize education by providing personalized learning experiences tailored to individual students' needs. AI-driven systems can analyze...

21 May 2024 2,058 3 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Why Do TDS and EC Increase with Larger Wastewater Volumes, While BOD and COD Decrease?

I have carried out MFC experiments on three different volumes, 50, 500 and 1000 mL of wastewater. Results after MFC treatment shows that TDS and EC are more in larger volumes of water i.e. TDS and...

09 August 2024 9,621 0 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

Could it be a cell culture contamination?

Hello everyone! I observed in my culture (htert-RPE1 cells) an orange- red particle at the bottom of the dish. It is visible to the naked eye as a very very small red dot. Could it be a...

09 August 2024 2,824 3 View