I want to compare baseline vs relapsed samples in an RNAseq cohort with FFPE (14) and some fresh frozen (7) samples. From clustering, it seems that FFPE and FF (fresh frozen) samples are forming two separate clusters which is a main confounding factor. Please see the attached figure. Material (FF and FFPE) is the second covariate bar in the heatmap. I think I can not perform a baseline vs relapse comparison unless I mitigate this FFPE/FF separation. I am not sure that removing the batch effect is the best answer to make it a homogeneous cohort. I am looking for some suggestions that how can i combine FFPE and FF samples before performing any downstream analysis.