How to generate all possible exonic BSJ events within transcripts taking 20 bp from each side of all exons and pasting them together in reverse order?
I would like to construct a library of Back splicing junctions (BSJ) from all the possible exons (reference file) from every human transcript for circRNA detection. For that, I need to extract 20-30 bp from each side and paste them together side by side in reverse order.
There are several reports which have done this.
I am attaching a paper that has used a similar approach.
Article Transcriptome-wide profiles of circular RNA and RNA-binding ...
I will be very grateful if someone can suggest ways to address this question.