Hi

I need to combine two 454 datasets but one was obtained with FLX (400 nt) and the other with FLX+ (700 nt), both starting with the same forward primer. I know I have to rarefy to the lowest number of sequences to calculate indexes and compare between samples. But as far as I know, in Qiime for example, you can run a pipeline without screening all sequences to the same length (and not even aligning the sequences to a reference alignment before, as it is done in Mothur).

Is this then possible to compare diversity indexes and inspect beta diversity as well without having all sequences spanning the same length? meaning more variable regions for some sequences (some samples) and less for others?

Or as well as the sampling effort (number of sequences per sample), trimming to have the same sequence length and positions (same region in all sequences using E. coli gene as reference) is a "must do"?

Thanks for your feedback!

More Susana Claudia Vazquez's questions See All
Similar questions and discussions