So I've recently been involved in an RNAseq project and my investigator is interested in comparing RNA abundances in organoids to corresponding tissue samples taken directly from the organism (a mouse). There were three experimental groups for both tissue types as well to test the effects of these treatments on both the organism and the organoid. We are primarily interested in arguing that an organoid is an appropriate model for the whole mouse. We would like to demonstrate that there are genes that exhibit similar differential abundance patterns in both the organism and the organoid for each of the three treatments.
As expected, the transcriptome of an organoid is very different from the corresponding original host tissue, so most heatmaps are going to illustrate differences between organoid and tissue rather that differences of responses to the experimental treatments. In an ideal world, samples from treatment A would look similar (on a heatmap) regardless of whether they came from a tissue sample or an organoid. Alternatively, if the samples were to be divided into two heatmaps, one for organoid samples and one for tissue samples, the pattern of color contrasts between treatments A and B would be similar on both heatmaps.
I have been playing around with different gene sets and cutoffs and I am still unsure where to go next. Are there any sorting techniques similar to unsupervised clustering that are designed to emphasize differences along one dimension and group according to similarity along the other? Perhaps there is a specific technique to select a subset of genes that will generate a heatmap in which treatment contrast is emphasized over organoid contrast. Has anyone run into similar problems? Are there any established protocols or tools that you have used to generate heatmaps according to unique constraints? Are there other bioinformaticians that have run into similar problems?
Please feel free to share your experiences, suggestions, and opinions