How can I get redundant FASTA reads from a file containing collapsed reads?

More Giovanni Bubici's questions See All

Reliable transfection method for Jurkat cells?

Hi to everyone! Does any of you have experience in transfecting siRNA in Jurkat cells and can provide a reliable method? I have both Neon transfection system and RNAiMAX Lipo, but to date none of...

30 June 2024 6,901 4 View

How do I find shoreline's evolution throughout the years map using Landsat 8?

I'm doing my first research and I can't find a way to see the shoreline evolution.

09 May 2024 1,817 4 View

Adsorption of carboxylic acid on TiO2 (anatase) Nps surface, physisorption or chemisorption ?

I run an experiment to test the dispersing property of oxalic acid on TiO2 (anatase) colloids, only by mixting the carboxylic aqueous solution with the titania colloid. My results showed that a...

08 May 2024 4,667 0 View

Can anyone help me with the preparation of a solution of Methyl Jasmonate?

Hi, I would like to set up an experiment with some plants sprayed with an aqueous solution of 200 μM Methyl Jasmonate with 0.025% Zipper surfactant. From the Sigma/Merch website I see I can buy...

05 February 2024 8,534 1 View

Where and how can I get the EEG data (V-ERP or auditory-ERP) for Alzheimer's disease?

For my research, I need visual and/or auditory event related potential, EEG data from normal patients and patients with Alzheimer's disease. Can anyone suggest where I can find them?

29 September 2023 3,831 3 View

Which are the effects of carbon on the liquid phase sintering of W-Fe-Ni?

In the liquid phase sintering of Tungsten-Iron-Nickel powder compacts, could the presence of carbon (between 0.6% and 1.5%) significantly influence the final microstructure by altering the...

30 July 2023 4,368 1 View

What is the best way to transport biogas from anaerobic digestion plants to a centralised upgrading plant?

Good morning, everyone! We are doing a spatial analysis to identify the best sites to install centralised upgrading plants. These plants should collect the biogas produced in several anaerobic...

28 June 2023 1,811 2 View

What should i do with count microbiome data?

Hello everybody, I'm a master degree student. I'm working with 16S data on some environmental samples. After all the cleaning, denoising ecc... now I have an object that stores my sequences,...

25 May 2023 6,380 2 View

Dynamic light scattering: Volume plot shows one peak, while intensity plot shows two peaks. How can I quantify the two different size distributions?

I measured by DLS the size of a TiO2 nanoparticles suspension sample to compare the results against a TEM measurement (5-6nm). As I expected I got different results from the two experiments due to...

24 May 2023 7,592 8 View

Can I use IEEE 14 bus system as energy management systems for microgrid?

I want to use IEEE 14 bus system to integrate two microgrids in the generator buses. My research would be based on Model predictive control to schedule charge and discharge on on BESS of the...

26 March 2023 1,574 3 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

Does anyone have issues using Prepman Ultra reagent for MicroSeq ID bacterial, fungal and yeast sample preparation?

I have been attempting to extract DNA from Bacterial, Fungal and Yeast banked samples (>1e7 cells) using Prepman Ultra reagent and I seem to be struggling to obtain a sequence. Although the...

01 August 2024 2,079 0 View

Should the amount of DNA input used for ChIP-seq library preparation be matched between the control and experimental groups?

Hi all. As a beginner in ChIP-seq experiments, I hope you understand that the following questions might be somewhat basic. I am planning to perform ChIP-seq or MeDIP-seq analysis to investigate...

28 July 2024 6,938 1 View

Can we convert a thousand of FASTA sequence in numeric form in .csv format? If yes kindly send me the script for the same?

I have a .text file for various FASTA sequence , and i want to convert these sequences into a numeric file which will be in .csv format. OR I want to extract physiochemical properties(features)...

25 July 2024 3,650 2 View

If my gene of interest has high GC content can it be problematic in sequencing? What kind of error is expected with GC rich gene sequences??

Gene sequencing related trouble shooting

25 July 2024 4,149 2 View

Does post-translational protein modification cause devisions on observed pI verses calculated pI?

In running two-dimensional gel electrophoresis on bacterial protein, some spots that appear to match a protein sequence have a significantly more acidic isoelectric point than the calculated pI....

24 July 2024 8,076 3 View

Are there always been barcodes, apapters and primer sequences in the FASTQ files of NGS?

Hello researchers, Sorry for my stupid question. I am learning the QIIME2 workflow for analyzing some 16s amplicon NGS fastq data. I found a very nice paper with data and code public available...

20 July 2024 5,405 2 View

Promotor observation in region annotation of RNA RIP-seq?

Hello all, I extracted RNA from my samples and performed RIP-seq. After annotating the genomic regions using R, I obtained promoters, exons, introns, and UTRs. Given that my samples consist of...

18 July 2024 1,579 2 View

Full Enquiry On Biomass Energy In engineering Technology?

How is biomass energy essential in the power generation in a country.

08 July 2024 9,477 2 View

Achraf El Allali

A quick script would do it. are you looking for a ready tool?

Giovanni Bubici

Yes, I found some scripts, but a user-friendly tool would be better. FASTX-Toolkit should do it, but this function is not available in Galaxy, and a command line should be used, I think.

Any other suggestion?

Tyler Chafin

Here is a simple perl script to do it.

You can specify several options:

-i: Input file (fasta)
-o output file name
-m: Maximum depth to print [Default: not set], for example to truncate any stacks with depth above 200, and only print 200 copies
-n: Skip sequences with less than "n" reads [default not set], for example to skip any singleton sequences (only had one read)
-x: Skip sequences with depth greater than "x" [default= not set], for example to skip any sequences with depth greater than 100,

Call program like:

./splitStackedFasta.pl -i [and any additional options]

Example:

./splitStackedFasta.pl -i test.fasta -m 2 -n 2

On your example would only output:

>2-1

ATAT

>2-2

Whereas ./splitStackedFasta.pl -i test.fasta

would output:

>1-1

TGCG

>2-3

>2-4

>3-1

TGGC

>4-1

TGAG

>5-1

TTCA

Also, I had to put it in a .zip archive to upload to RG. So you need to unzip first. Let me know if you have any issues. I only wrote it in ~10 minutes so it isn't tested!

Fantastic! Thank you so much!

Dear Tyler,

do you think that a similar script could be written for SAM/BAM/BED mapping file? It could be very useful as one can map unique reads and go back to the redundant reads in orther to visualize to mapped reads with their coverage depth.

Thanks,

Giovanni

Yes it could be scripted

your script works fine! Thank you so much!

Best regards,