I am presently analysing 300 bp paired-end Illumina reads, and after some generous trimming (with Trimmomatic), the remaining 200 bp have excellent base sequence quality all the way (FastQC). However, when I merge the resulting P1 and P2 outfiles (forward and reverse paired reads) in PANDAseq, a large section in the middle gets horrible base qualities, with very wide confidence intervals and subsequent programs throwing out most of the reads because they "contain at least one poor base". I would have thought that the central portion of the reads should be best because it gets information from both P1 and P2. Does anyone have an explanation/solution for this?
Thanks, Peter