How to check if a tissue sample is mixed with blood based on transcriptome data?

10 June 2019 2 4K Report

I want to check if decidua tissue samples in dataset (from GEO database) GSE60438 are mixed with blood. I merged dataset GSE60438 with dataset GSE73685 consisting of multiple tissues including decidua and maternal and umbilical cord blood. I selected only healthy samples with decidua and blood from both datasets, merged their expression matrices by rownames (matrices were log transformed, quantile normalized and rownames were mapped do Entrez identifiers before that) and removed batch effect using R ComBat function with model matrix based on tissue and with datasets' accession number as batch

# secondaryaccession is a column with samples' dataset accession number: GSE60438 or GSE73685

batch = as.factor(pdata$secondaryaccession)

# Biological.Specimen is a column with tissue types in phenodata dataframe pdata

mod = model.matrix(~as.factor(Biological.Specimen), data=pdata)

# mrgd is a dataframe - expression matrices of two datasets merged by rowname

exprs = ComBat(dat=as.matrix(mrgd), batch=batch, mod=mod, par.prior=TRUE, prior.plots=FALSE)

Principal component analysis showed some of the sample leaning towards blood samples. How to check more rigorously whether those samples are really mixed with blood samples?

Chinedu A. Anene

Сашко Лихенко There are many deconvolution algorithms published in the literature, which may be useful.

Check this review for a range of available tools. The performance of each tool depends on the underlying model assumptions, source of expression data and the availability of expression levels in known cell populations.

Article An assessment of computational methods for estimating purity...

I have used ABSOLUTE, ESTIMATE and MuTect, with each generating relevant results.

Rocco Piazza

I would start by testing the expression of genes whose expression is restricted to white blood cells, e.g. MPO (myeloid) or TCR (T-Lympho), Ig-chains (B-Lympho).

How to get a list of all human morphogens?

Can I count distance between my sequence and the consensus one having only sequence logo?

Why do we equate male and female arousal?

PBMC infection with virus?

CHO-K1 suspension adaptation protocol?

What is the acceptable p-value cutoff for GO enrichment analysis ?

Where to find a gene list for CRISPRa/i library screening of regulatory factors that affect pathogenic Th17 differentiation in PBMC?

Can you visualize platelets using EVOS ?

Culture plates for PBMC (non-adherent). For infection?

The best source for amplification of ADAM17 prodomain?

Detaching human-derived monocytes?

"Hello, I am trying to find public datasets containing FTIR spectra of blood samples (both healthy and disease-related)?