RNA-seq variant calling: why does STAR extend reads with mismatched overhangs into introns instead of calling splice junctions?

Francesco Antonio Tucci @Francesco-Antonio-Tucci-2

07 October 2020

Hi, I'm doing variant calling on RNA-seq data from a cell line (paired-end, 2x100 bp, Illumina).

I'm following the GATK workflows and everything works well overall: I can find all the expected mutations already characterized in my cell line. The problem is that I also see some additional mutations that are clearly the result of alignment errors (e.g. NM_001364837:exon8:c.727+1G>A, rs781065280).

I'm running the alignment with STAR 2.7.6a (the latest release at the time of writing) and the following options:

STAR --runThreadN 8 --genomeDir $GENOME_DIR \
    --readFilesIn ... --readFilesCommand gunzip -c \
    --outSAMattrRGline ... \
    --outSAMtype BAM SortedByCoordinate \
    --twopassMode Basic \
    --outSAMmapqUnique 60 \
    --outFilterType BySJout --alignIntronMin 20 \
    --alignIntronMax 1000000 --alignMatesGapMax 1000000 \
    --alignSJoverhangMin 8 --alignSJDBoverhangMin 3 --scoreGenomicLengthLog2scale 0 \
    --outFileNamePrefix $OUTPUT_DIR/...

The problem is that STAR aligns some reads with a mismatched 4 bp overhang at the beginning of the intron instead of placing those bases as a perfect match at the beginning of the next exon (see the screenshot at: https://i.ibb.co/LRDYNt3/mismatch.png).

As you can see, `--alignSJDBoverhangMin` is set to 3 and the overhang is 4 bp long, so the overhang should be long enough for a spliced alignment.
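The two overhang options in the command apply to different classes of junctions, which matters for a 4 bp overhang. The following is a minimal sketch of that rule as described in the STAR manual, not STAR's actual implementation: `--alignSJDBoverhangMin` is the minimum overhang for junctions present in STAR's splice-junction database (sjdb), while `--alignSJoverhangMin` is the minimum for all other spliced alignments. The function name `spliced_overhang_allowed` is hypothetical, and whether the junction in question is in the database for this run is an assumption that would need to be checked.

```python
def spliced_overhang_allowed(overhang, in_sjdb,
                             sj_overhang_min=8, sjdb_overhang_min=3):
    """Simplified check of the minimum-overhang rule for a spliced alignment.

    overhang          -- number of read bases on the short side of the junction
    in_sjdb           -- True if the junction is in the splice-junction database
    sj_overhang_min   -- value passed to --alignSJoverhangMin (8 in the command above)
    sjdb_overhang_min -- value passed to --alignSJDBoverhangMin (3 in the command above)
    """
    threshold = sjdb_overhang_min if in_sjdb else sj_overhang_min
    return overhang >= threshold


# A 4 bp overhang, as in the screenshot, under both assumptions.
for in_sjdb in (True, False):
    allowed = spliced_overhang_allowed(4, in_sjdb)
    print(f"junction in sjdb: {in_sjdb} -> 4 bp overhang allowed: {allowed}")
```

Under this simplified reading, a 4 bp overhang passes only when the junction is treated as a database junction; otherwise the 8 bp minimum applies and the read cannot be spliced there.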

Also, the splice junction should be canonical, so a spliced alignment should carry no penalty.
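One way to check this assumption is to look the junction up in the `SJ.out.tab` file that STAR writes alongside the BAM. The sketch below parses that file using the nine-column layout described in the STAR manual (chromosome, intron start, intron end, strand, intron motif, annotation flag, uniquely mapped reads, multi-mapped reads, maximum overhang). The sample row, its coordinates, and the function names `read_junctions` and `find_junction` are invented for illustration and do not come from the data in this question.

```python
import csv
import io

# Column layout of SJ.out.tab as described in the STAR manual.
COLUMNS = ["chrom", "intron_start", "intron_end", "strand", "motif",
           "annotated", "unique_reads", "multi_reads", "max_overhang"]

# Intron motif codes from the STAR manual (0 means non-canonical).
MOTIFS = {0: "non-canonical", 1: "GT/AG", 2: "CT/AC", 3: "GC/AG",
          4: "CT/GC", 5: "AT/AC", 6: "GT/AT"}


def read_junctions(handle):
    """Parse SJ.out.tab rows into dictionaries with integer fields."""
    junctions = []
    for fields in csv.reader(handle, delimiter="\t"):
        if not fields:
            continue  # skip blank lines
        row = dict(zip(COLUMNS, fields))
        for key in COLUMNS[1:]:
            row[key] = int(row[key])
        junctions.append(row)
    return junctions


def find_junction(junctions, chrom, intron_start, intron_end):
    """Return the junction with these 1-based intron coordinates, or None."""
    for row in junctions:
        if (row["chrom"], row["intron_start"], row["intron_end"]) == (
                chrom, intron_start, intron_end):
            return row
    return None


# Hypothetical example row; replace with open("SJ.out.tab") on real output.
SAMPLE = "chr1\t1000\t2000\t1\t1\t1\t25\t0\t38\n"

junctions = read_junctions(io.StringIO(SAMPLE))
hit = find_junction(junctions, "chr1", 1000, 2000)
if hit is None:
    print("junction not reported by STAR")
else:
    print(MOTIFS[hit["motif"]],
          "annotated" if hit["annotated"] else "unannotated",
          "unique reads:", hit["unique_reads"],
          "max overhang:", hit["max_overhang"])
```

If the junction is missing from the file, or is reported with a non-canonical motif or very few supporting reads, that would be worth knowing before concluding that the spliced alignment should have been preferred.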

Am I doing something wrong?
