Hi all,
I've always wanted to try harvesting and analysing data from NCBI, but I've never really known the best way to go about it. I know that I can easily download a whole bunch of data from NCBI by searching for a group on the website, in Geneious, or with any number of other programs/scripts. But I end up with a big mess of data that seems difficult to sort through and actually use.
So, for those who routinely download NCBI sequences, particularly using R, can you point me towards packages or tutorials that make it easier to:
Perhaps there is already a document or paper that nicely summarises these issues and the best practice, but my searches haven't found it yet.
Many thanks in advance!
James