Hi, I have this simple but wired question that how many of you ever compared the results of the same data set using both reference-based and de novo approach, and get more loci/SNPs using the latter one.
In the paper J. Paris, J. Stevens, & J. Catchen. Lost in parameter space: a road map for Stacks. Methods in Ecology and Evolution, 8(10):1360-1373, 2017 They observed more loci/SNPs acquired from de novo mapping, while I had the opposite results for my Arctic charr data set using the Canadian Arctic charr genome.
I have found similar reports (more loci from reference-mapped) from phylogenetic studies (e.g. Tripp, Erin A., et al. "RAD seq dataset with 90% missing data fully resolves recent radiation of Petalidium (Acanthaceae) in the ultra‐arid deserts of Namibia." Ecology and Evolution 7.19 (2017): 7920-7936. ) and for their studies, the two approaches also provide different resolutions.
It is so obvious that people prefer to use reference-based mapping as it provides more detailed info for downstream analysis such as functional and ontology. However, I got clear pictures of the population structure using de novo but not in reference mapping. Therefore I would love to know your suggestions. Thank you very much!
Best regards,
Han Xiao