14 September 2020 1 2K Report

I have a dataset with relative abundances of Genomes against their COG-ids. It is a large data frame with 4214 rows (COG-ids) and 1313 columns (Genomes).

https://drive.google.com/file/d/1byXVMMWYcCgVwqRRLTZV6UJf7ejCBucE/view?usp=sharing

I would like to make a Bray-Curtis distance-based PCoA plot with my data which will show clusters of similar COG-ids based on their relative abundances from the data frame.

do a distance-based redundancy analysis (db-RDA) to find out which COG-ids are driving the main clusters to set apart from each other.

As there are 4213 COG-ids, I want to use some kind of correlation cut-off in my analysis, so that I get top 20% COG-ids responsible for driving the functions rather than all 4213 in the RDA plot, which will make it look very clumsy.

So far, I have tried to use "capscale" function in Vegan using distance bray to do this analysis but have been getting the error "Error in cbind(x$CCA$v, x$CA$v) : number of rows of matrices must match (see arg 2)" when I am trying to plot the RDA.

Thanks in advance.

More Tahsin Khan's questions See All
Similar questions and discussions