I have downloaded complete protein sequences of some bacterial genomes in March 2014 from following NCBI FTP site,
ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/
I had a list of NCBI GIs which were accessible in the on-line NCBI interface (www.ncbi.nlm.nih.gov/protein) till mid of 2014, but now when I search them on NCBI, an error appears stating "Record removed" with no reasons. The page provides me a link to obsolete version, but not to updated version which I actually needed. Few of those GIs are;
378699640
537435588
550903479
NCBI Help Center provided me the reason that NCBI has updated its FTP site in August 2014. They have also modified FASTA format with NCBI-GI removed from the header (see following NCBI News)
http://www.ncbi.nlm.nih.gov/news/08-26-2014-new-genomes-FTP-live/
http://www.ncbi.nlm.nih.gov/news/09-17-2014-simple-FASTA-headers-genomes-FTP/
In this scenario, I needed to repeat my analysis with new genomes. So that other people can access my published results.
My question to NCBI users is that how they tackle such situation ? Has anyone else encountered same issue as mine ?