I would like to obtain the spliced leader (SL) sequences from various species (such as Trypanosoma, C. elegans, Euglena, etc.). Could you please let me know if there are any databases or publications available for SL sequences?
Anuraj Nayarisseri, to get the consensus FASTQ and FASTA, I used the following commands:

samtools mpileup -uf genome.fasta rna.bam | bcftools call -c | vcfutils.pl vcf2fq > cns.fastq
seqtk seq -aQ64 -q20 -n N cns.fastq > cns.fasta
However, could you please correct me if I am wrong? My understanding is that the consensus FASTA sequence contains the consensus regions between the genome and the RNA-seq reads, and that those regions of the genome are expressed. Thank you.
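One way to check this on the output itself is to count how much of each consensus sequence was actually called. The following is a minimal sketch, not a definitive method. It assumes that positions without sufficient read support are written as N or n in cns.fasta, which is worth verifying on your own data. The function name called_fraction is hypothetical and not part of samtools, bcftools, or seqtk. Note that a called base only indicates read support at that position; it does not by itself measure expression level.

```python
from typing import Dict, Iterable, Tuple


def called_fraction(fasta_lines: Iterable[str]) -> Dict[str, Tuple[int, int]]:
    """Return {sequence_name: (called_bases, total_bases)} for FASTA lines.

    A base is counted as "called" when it is anything other than N or n.
    Assumption: masked or unsupported positions appear as N/n in the input.
    """
    counts: Dict[str, Tuple[int, int]] = {}
    name = None
    for raw in fasta_lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the name.
            parts = line[1:].split()
            name = parts[0] if parts else ""
            counts[name] = (0, 0)
        elif name is not None:
            called, total = counts[name]
            n_count = sum(1 for base in line if base in "Nn")
            counts[name] = (called + len(line) - n_count, total + len(line))
    return counts
```

To use it on the file produced above, open cns.fasta and pass the file handle, for example `with open("cns.fasta") as handle: stats = called_fraction(handle)`. Each value in the result is a pair (called bases, total bases), so a sequence with a low called count has little read support under the stated assumption.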