How can I filter out reference sequences that get multiple hits in BWA MEM ?

More Chrystelle Delord's questions See All

Are there R packages able to read genetic datasets in .arp format (Arlequin)?

Hi all! Hope you had a great summer! I was wondering if anyone knew an R package able to load genetic data from .arp files (that is, the Arlequin input file format,...

04 September 2020 6,403 2 View

[Detecting "source" populations] Should I trust these BayesAss results?

Dear all, I would like to use the software BayesAss (Wilson and Rannala, 2003) to estimate contemporary gene flow between several populations sampled from a river network (fish species). I am...

06 May 2018 3,734 0 View

Sample size correction for P (polymorphism rate) in population genetics?

Hi everyone, I have a population genetics dataset for several sampling location, with sample size N varying sometimes quite much (sometimes by a factor 2 between two different locations). I would...

12 October 2017 2,950 4 View

[Opinion] What are the best SNP-based measures of genetic diversity ?

Hi everyone, I would be interested in your opinion about most widely used metrics to assess genetic diversity, particularly when considering markers specific features. Expected heterozygosity,...

31 May 2017 2,578 5 View

Max number of libraries on a HiSeq sequencing lane ?

Hi everyone ! I would be interesting in knowing, based on your own experience with Illumina HiSeq sequecing, (1) what was your typical number of libraries to be sequenced simultaneously on a...

06 March 2017 4,399 4 View

Pooled RAD data and SNP calling with Snape (on non-model species) ?

Hi everyone, I'm working on RAD-seq pooled data with several species, but only one pool/species ! (RAD data are just meant to be used for SNP calling as another genotyping procedure will be...

17 July 2016 2,029 2 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

I can't see the ssDNA band after performing asymmetric PCR. Is there any way to do this?

After performing symmetric PCR, PCR purification was performed. Afterwards, asymmetric PCR was performed using the PCR purification product as a template, but no ssDNA band was confirmed in the...

08 August 2024 1,668 3 View

E.coli contamination in human RNA seq data ?

Recently, we observed that 99% of the sequences in our RNA-seq data corresponded to the E. coli genome. Despite multiple DNAse treatments after RNA extraction and ribosomal depletion, we were...

06 August 2024 807 3 View

How much total RNA concentration to be extracted from sorted plasma cells from bone marrow of C57BL/6 mice for RT-PCR ?

i have sorted anti-NP specific plasma cells from bone marrow of C57BL/6 mice at certain times after immunization with variable counts and isolated total RNA using TRIZOL method for RT-PCR using...

05 August 2024 8,835 1 View

Does anyone have issues using Prepman Ultra reagent for MicroSeq ID bacterial, fungal and yeast sample preparation?

I have been attempting to extract DNA from Bacterial, Fungal and Yeast banked samples (>1e7 cells) using Prepman Ultra reagent and I seem to be struggling to obtain a sequence. Although the...

01 August 2024 2,079 0 View

What is the acceptable p-value cutoff for GO enrichment analysis ?

I have an RNA-seq data that I have analysed using Limma-voom and have extracted the gene IDs, log2FC and the p-values. At p value < 0.05, I have over 10,000 DEGs, however, when I run the GO...

31 July 2024 225 2 View

PCR showing no bands - Master mix and primers didn't mix?

Hello everyone, I performed a PCR yesterday, and the results showed no bands on the gel. Of course, I probably missed some crucial steps, like adding my samples to the PCR strips themselves, for...

31 July 2024 2,406 6 View

Dimensions of an MJ Research 96-well alpha module?

Can anyone with an MJ research / BioRad PCR machine from ~2010 or earlier tell me the external measurements (LxWxH) of the removable standard PCR alpha modules that can be removed from a PTC200 or...

30 July 2024 2,867 0 View

How to retain amplicon yield after Ampure bead cleanup, following a 2-stage PCR protocol?

Hello all, I have been trying to follow a 2-stage PCR protocol used to amplify barcodes of a large yeast library, as per Nyugen et al. (2022) -...

30 July 2024 841 2 View

Someone have the key or the installer of the program Mx pro 3000 (PCR real time)?

I need to install this program

25 July 2024 4,756 0 View

Fabiano Sillo

Hi Chrystelle,

As far as I know, read which maps to multiple places in the reference is defined as non-unique read and it has a mapping quality of 0 (see BWA-MEM). By using a threshold of 30 (samtools view -q 30 etc.) you should filter out reads with multiple alignments. If you use "-q 0" instead, you will get reads with poor quality and/or more than one hit on reference. Once you have this set of non-unique reads you can get their positions on reference by using awk or bedtools (-bamtobed command). With the positions (contig ID, start, end) you can filter out your "500 sequences" reference. I hope this helps.

Chrystelle Delord

Hi Fabio,

Thank you very much for your answer ! For my specific issue I finally chose to develop a python script that works directly on the SAM file (so I can run multiple filtering test simultaneously), but I did not know about BEDtools, I'll sure have a look at it :)

Have a very nice day !