I have seen many papers using gene expression data (microarrays) and the application of biclustering algorithms in order to correlate it with environmental data.
Generally, gene expression data is arranged in a data matrix, where each gene corresponds to one row and each condition to one column. Each element of the matrix represents the expression level of a gene under a certain condition. From my understanding, biclustering is the simultaneous row-column clustering.
What I can't see is how and if these algorithms can be applied to 16s amplicon sequencing data.
I have an OTU table, in which rows correspond to OTUs and columns correspond to 12 locations. The last column corresponds to the taxonomy of each OTU. In addition, I have the corresponding environmental matrix.
Does anyone share some thoughts on this?
Thanks!