Dear colleagues, I have amplified and sequenced (Sanger sequencing) three candidate genes (730-1600 bp) from 150 teak (Tectona grandis) genotypes. After alignment and variant detection using NovoSNP, I obtained a list of SNPs and indels. One of the genotypes was used as the reference sequence during the analysis in NovoSNP. All the corresponding gene sequences have been submitted to NCBI.

Now, I need to submit these variants to the European Variation Archive (EVA), which requires a VCF file including chromosomal positions for each variant. However, the whole genome assembly of teak is not yet available in NCBI, and only short reads (SRA entries ~101 bp) are accessible.

My questions are:

  • How can I determine or approximate chromosomal positions of my SNPs in the absence of a complete genome assembly/complete Ref sequences of teak
  • Is EVA submission still possible without exact chromosomal coordinates, especially if the study is gene-targeted and not genome-wide?
  • Has anyone faced a similar issue while submitting variants from non-model species? Any suggestions or alternative approaches would be very helpful.
  • Thank you in advance for any guidance.

    Best regards, Nuzhat Bano

    More Nuzhat Bano's questions See All
    Similar questions and discussions