I am looking for a vcf file of dbSNP release 131. Can anyone provide me with a link holding the data? Or is it possible to convert dbSNP flat files to vcf format by using any tool?
I dont have the dbSNP file with me to check, but if my memory is good I think you will have a tag mentioned in the VCF file which says from which version the SNP was added. Like for each SNP you will have dbsnp129, dbsnp132, etc... You can just pick whichever is less than or equal to dbsnp131. Correct me if am wrong.
I also thought about this. But when I greped with the tag, the count came too small. Just for an idea I am giving a approx count. According to dbSNP131 there are almost 30 million variants. Where in dbSNP137 release here are about 52 million variants in which 2 million variants containing dbSNP131 tag. Hope you will get the idea. Thanks a lot.
Sorry for being late to respond. I have tried that also. But in vain. Obviously the number of variants increased almost 10 times, but still there are ~10 millions missing counts. That is also quite obvious, because there will be so many dbSNP131 entries which must have been got updated due to incorporation of new segments into human genome. So I guess it is not enough to serve the purpose. Correct me if I am wrong.