Assemble NGS data based on a single input sequence to build from?

14 February 2025 1 3K Report

Hey RG community,

I'm wondering if anyone knows of a good genome assembler software (and the code needed) to assemble NGS data (Illumina/ON/PacBio) based on a single sequence, building from this sequence, but ignoring any other possible contigs, to make the assembly fast and targeted?

Essentially, I want to give the assembler a single sequence (a gene perhaps) and ask the software to simply build out from what I provide to give me the longest possible contig., while ignoring any reads that don't associate.

The reason behind this, is that we have some huge datasets that would take weeks to assemble in their entirety. If I can get an assembler to focus on just one sequence input and build from that, and ignore the rest of the data, it might give me a faster way to pull out specific contigs of interest without completing a total assembly.

FYI: I do not have the full sequence I am looking for, only a short read that I want to build out from to create a contig. So mapping isn't quite the way to get the job done, unless I wanted to very slowly build on the edges. I'd like a quick way, if possible.

Thanks for your help,

Jamie

David Lázaro-Gimeno

Hello Jamie,

Assemblers works in two ways, de-novo or mapping. You do not want to use the mapping approach. Depending on your fragment size, which extension do you want to cover?

I have mapped in some specific cases my reference sequence in both strands as input to extend my sequence query.

I used MIRA at that specific project. Depending on the purpose, it is an option.

Badges
Science topic

Similar topics
Eutheria
Rodentia
Muridae
Murinae
Mice

More Jamie Bojko's questions See All

Epitope/Peptide mapping of mAbs?

I recently developed a few mAbs that are able to neutralize SARS2. By ELISA and WB, I found that they all bind to SARS2 RBD. I wanted to map their RBD epitopes so I ordered pre-coated plates with...

02 June 2024 2,877 2 View

What is the implications of the findings for construction management practices including land use planning and infrastructure management?

Analysing the fiscal implications of peripheral residential development including case studies

10 March 2024 1,754 0 View

Any UK leads on plastic pitfall trap suppliers?

Hi all, I'm starting an arachnological research project with Edge Hill University and have encountered a supply problem with pitfall traps of the required dimensions. The previous supplier has...

08 February 2023 4,685 0 View

Clarification on domain decomposition comparison in GROMACS and LAMMPS?

I was wondering if someone could clear up the difference between domain decomposition in GROMACS and LAMMPS. As I understand it LAMMPS creates ghost atom copies of atoms in surrounding domains and...

07 February 2023 9,041 0 View

Thromboelastography (TEG) or Calibrated Automated Thrombinography?

When assessing the contribution platelet concentrates make to coagulation, the above tests are commonly used within the platelet concentrate research field. However, are both nescisary for a...

21 November 2022 4,737 2 View

Most useful/appropriate GPS apps for research?

We're currently planning a pilot study to explore movement behaviour within urban and sub-urban greenspace, as well as the contribution of greenspace-based physical activity to total weekly...

23 November 2021 5,756 11 View

Desperate for SPSS/statistical help! I'm really starting to doubt that I chose the right statistical analysis. Can someone verify for me?

I feel like maybe this question is so easy that it's hard and I keep doubting every choice I make, so I thought I would just finally ask online. I have one dependent variable with three levels:...

03 July 2021 230 5 View

Does anyone know if the Quantstudio 3 qPCR machine requires a passive reference when Using SYBR? I ran a qPCR and got the plot in the attached file.?

I know that my target is in my sample as I extracted it from cell culture myself. These plots show some amplification. The first one seems O.K for the first 36 cycles but then collapses at the...

14 February 2021 3,229 2 View

EDTA storage time?

I need to do some PBMC/WBC FACS analysis. How long can my whole blood be stored in EDTA tubes at 4oC so that the lymphocytes are still good to be stained with a florescently-labeled mAb?

11 February 2021 4,766 3 View

Ferret PBMC Fc block?

Hi all, I'm going to run Flow Cytometery on ferret PBMCs but can't find an Fc block for ferret PBMCs. Does that even exist? Do I need ferret-specific Fc blocking antibodies or can I use:...

18 January 2021 4,757 5 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

Does anyone have issues using Prepman Ultra reagent for MicroSeq ID bacterial, fungal and yeast sample preparation?

I have been attempting to extract DNA from Bacterial, Fungal and Yeast banked samples (>1e7 cells) using Prepman Ultra reagent and I seem to be struggling to obtain a sequence. Although the...

01 August 2024 2,079 0 View

How is the bacterial genome's high protein count verified as genuine despite 800+ contigs and good metrics (98.55%completeness, 0.68% contamination)?

Given that the bacterial genome has over 800 contigs, but its quality metrics are good, with a completeness of 98.55% and a contamination of 0.68% as assessed by CheckM, what specific validation...

01 August 2024 1,514 1 View

What should a Mechanical Engineering PhD scholar focus on during their PhD to enhance their chances of securing a postdoctoral position?

29 July 2024 7,714 4 View

In terms of chaos, what is the necessary and sufficient condition for authoritarianism, permanent or temporary, to come to exist and persist?

Since 2016 Brexit, the world needed to change the thinking behind traditional democracy as the democratic landscape changed, yet traditional democratic thinkers and actors have been acting as if...

28 July 2024 6,515 1 View

Should the amount of DNA input used for ChIP-seq library preparation be matched between the control and experimental groups?

Hi all. As a beginner in ChIP-seq experiments, I hope you understand that the following questions might be somewhat basic. I am planning to perform ChIP-seq or MeDIP-seq analysis to investigate...

28 July 2024 6,938 1 View

If my gene of interest has high GC content can it be problematic in sequencing? What kind of error is expected with GC rich gene sequences??

Gene sequencing related trouble shooting

25 July 2024 4,149 2 View

Has the Affordable Connectivity Program Helped You/Your Community? And Would Bulk Tablets on a Similar Program Help Your Business?

Do You Believe VP Harris Would Show Empathy to Tech Companies in Her Administration as POTUS? I ask in light of the fact that, the ACP was not re-funded this round. So many and businesses need...

24 July 2024 4,732 3 View

Does post-translational protein modification cause devisions on observed pI verses calculated pI?

In running two-dimensional gel electrophoresis on bacterial protein, some spots that appear to match a protein sequence have a significantly more acidic isoelectric point than the calculated pI....

24 July 2024 8,076 3 View