- Can anyone tell me how to fill gaps in a draft genome assembly?

Seth Munholland

Depending on how your reads were sequenced (e.g. Illumina HiSeq, PacBio, Oxford Nanopore), the size and complexity of the target genome, the depth of coverage, and the amount of resources you spend on it, it may be possible to close the gaps.

You should be able to submit it without filling everything. Pretty much every genome I've seen has Ns in it, but I've only dealt with plant genomes so I may be wrong there.

As I understand it, with bacterial genomes the poly-N areas depict one of two things: either an actual gap in the sequencing or a separate piece of DNA (e.g. a plasmid). I don't know how the assembler would be able to differentiate between the two, so adding a large segment of Ns denotes the uncertainty of what's being assembled. I'm not familiar with CONTIGUator, so I can't comment on the specifics of that software.

Abhijeet Singh

Your genome will be incomplete with those gaps. Design primers in the regions flanking each gap, sequence the PCR product, and fill the gaps before submission.

Zara Rafaque

Hi Abhijeet!

Thanks for your kind reply. The sequencing company reported no gaps in the draft genome. The gaps appear only after I align the draft genome with the reference strain. Does that make sense?

I am sorry, I am just very new to this field.

Dongliang Yu

100 bp gaps between all the contigs? Are these gaps poly-N in the reference genome? They may be artificial junctions introduced when making the scaffold.
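One quick way to check this is to tabulate the length of every run of N in the scaffold. The snippet below is only a minimal sketch: the function name `n_run_lengths` and the toy sequence are made up for illustration, and a real scaffold would be read from your FASTA file instead.

```python
import re
from collections import Counter


def n_run_lengths(sequence):
    """Return the length of every run of N (or n) in a sequence, in order."""
    return [len(match.group()) for match in re.finditer(r"[Nn]+", sequence)]


# Hypothetical toy scaffold: three contigs joined by two 100-N spacers.
scaffold = "ACGT" * 5 + "N" * 100 + "GGCC" * 5 + "N" * 100 + "TTAA" * 5

lengths = n_run_lengths(scaffold)
# Counter maps each gap length to the number of gaps with that length.
print(Counter(lengths))
```

If every run has exactly the same length, the Ns are very likely placeholders added by the scaffolding software rather than measured gap sizes.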

Dear Dongliang!

Thanks for your reply.

Yes, these are poly-N, and the software (CONTIGUator) has an option "Do not use N to separate the contigs". When I check this option I see no Ns or gaps.

Shairul Izan

I think those 100 bp of N are used to separate contigs, so that means you can see where each contig starts and ends. You can also check whether the read coverage is high or low. If you see some reads overlapping the 100 bp of N but with low coverage, I don't think you should worry about them.

Dear Shairul Izan!

Thanks for your reply. I am very new to bioinformatics. I could not find any gaps in my contigs other than those 100 bp gaps between all contigs. Is it possible to have no gaps at all in your WGS? Can I submit my whole genome sequence with the above gaps, or do I need to fill them?

Seth Munholland

Angelo Joshua Victoria

It is normal for genomes to be published with gaps. Gaps of 100 Ns are short anyway and won't affect much of the downstream analyses you may want to do as long as you have sufficiently-large contigs.

The way to close these gaps is to do Sanger sequencing by designing primers to produce sequences that, ideally, will span the gaps. Another way is to use long reads (e.g. PacBio), usually complemented by short reads (e.g. Illumina).
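To design such primers you first need the sequence on either side of each gap. Here is a rough sketch of pulling out those flanks; `gap_flanks`, the 200 bp flank size and the toy scaffold are illustrative assumptions, not part of any particular tool. Keep in mind this is only useful if the two contigs really are neighbours in the genome.

```python
import re


def gap_flanks(sequence, flank=200, min_gap=10):
    """Yield (gap_start, gap_end, left_flank, right_flank) for each N run.

    Coordinates are 0-based, with gap_end exclusive. Runs of N shorter
    than min_gap are treated as ordinary ambiguous bases and skipped.
    """
    for match in re.finditer(r"[Nn]{%d,}" % min_gap, sequence):
        start, end = match.span()
        left = sequence[max(0, start - flank):start]
        right = sequence[end:end + flank]
        yield start, end, left, right


# Hypothetical toy scaffold with a single 100-N gap.
scaffold = "A" * 300 + "N" * 100 + "C" * 300

flanks = list(gap_flanks(scaffold))
for start, end, left, right in flanks:
    print("gap %d-%d: %d bp left flank, %d bp right flank"
          % (start, end, len(left), len(right)))
```

The forward primer would then be picked inside the left flank and the reverse primer inside the right flank, using whatever primer design program you prefer.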

Ralf Koebnik

Hello,

My impression is that these gaps of 100 bp (100 "N") were introduced artificially and do not say anything about the actual size of the gap. Some people insert 50 "N", others 100 "N". If it is always the same number, it surely comes from the assembly. The consequence is that the sequence before these "N" and the one after them correspond to individual contigs and are not necessarily next to each other in the genome. That said, PCR won't help, unless you want to try all combinations. Moreover, since you do not know the size and complexity of the gaps, you may not even be able to PCR-amplify the gap (e.g. if the gap is 10,000 bp in reality).

Well, if I were you, I would submit the genome sequence as it is, but I would first remove all these stretches of "N" and generate separate contigs.

BTW, could you please tell us how many of the N100 pieces you have?
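A minimal sketch of that splitting step, assuming the scaffold is already loaded as a plain string. The function name `split_at_gaps`, the `min_gap` threshold and the contig naming scheme are all illustrative choices, not a standard.

```python
import re


def split_at_gaps(name, sequence, min_gap=10):
    """Split a scaffold into contigs at every run of at least min_gap Ns.

    Returns a list of (contig_name, contig_sequence) tuples. Empty pieces,
    which appear when the scaffold starts or ends with Ns, are dropped.
    """
    pieces = re.split(r"[Nn]{%d,}" % min_gap, sequence)
    kept = [piece for piece in pieces if piece]
    return [("%s_contig%d" % (name, i), piece)
            for i, piece in enumerate(kept, start=1)]


# Hypothetical toy scaffold: three contigs joined by two 100-N spacers.
scaffold = "ACGT" * 3 + "N" * 100 + "GGCC" * 3 + "N" * 100 + "TTAA" * 3

contigs = split_at_gaps("scaffold1", scaffold)
# Print the result in FASTA format.
for contig_name, contig_seq in contigs:
    print(">" + contig_name)
    print(contig_seq)
```

Short runs of N below `min_gap` are left inside the contigs, since those are more likely to be single ambiguous base calls than scaffold spacers.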

Best wishes,

Ralf

Hi Ralf!

Thanks for your kind reply.

There are 50 ordered contigs, and roughly the same number of N stretches. Can you tell me whether we can use unmapped contigs to fill these gaps, and how?

Thanking you in anticipation.

I do not know what you mean by untapped contigs. I guess you work with Escherichia coli? You could check for the most closely related strain that has been completely sequenced, and then map all your contigs against this reference genome sequence.

Sorry, I meant unmapped.

Raymond Kiu

I assume that if you want to fill the gaps between contigs in the draft genome, you can 'scaffold' the contigs bioinformatically using a tool like SSPACE, or use GapFiller to fill the gaps, i.e. to get a better assembly of your draft genome. Not sure if this is helpful.

Manmohan Pandey

Well, during sequence submission at NCBI, one has to introduce 100 Ns artificially if the distance between two contigs is not known but they are supposed to be part of the same scaffold. I think you should align the contigs to the reference strain before and after the gaps are introduced.
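To make it concrete, this is roughly what a scaffolding tool does when it places such a spacer. It is only a sketch: `join_with_spacer` and the toy contigs are hypothetical, and the contigs are assumed to be already ordered and oriented.

```python
def join_with_spacer(contigs, spacer_length=100):
    """Join ordered contig sequences with a fixed-length run of Ns.

    The spacer marks a gap of unknown size, not a measured distance.
    """
    return ("N" * spacer_length).join(contigs)


# Hypothetical ordered contigs belonging to the same scaffold.
ordered_contigs = ["ACGT", "GGCC", "TTAA"]

joined = join_with_spacer(ordered_contigs)
print(len(joined), joined.count("N"))
```

Aligning the contigs once without the spacer and once with it shows which "gaps" exist only because of this joining step.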

Soma Marla

One option is to merge the available draft genomes and reassemble them. Secondly, removal of repeat elements significantly helps in gap removal and in improving coverage. We found assembly tools such as SPAdes really useful for reassembly after merging the draft genomes.