I have Cuffdiff (--no-diff) output tables and I want to perform clustering :
1) better to use Count or FPKM ?
2) Are "standard" clustering algorithms appropriate for RNA-Seq data due to its discrete nature?
3) What sort of normalization (if any) should be done to RNA-Seq data before clustering?
4) Does the lower variance of long genes need to be accounted for using the output of clustering of RNA-Seq data? i.e. when doing gene set enrichment analysis of the genes in a cluster?
Thank you