Should regulatory sequences be included in intergenic sequences?

Ravi Kanth Reddy Sathi @Ravi_Kanth_Reddy_Sathi

08 August 2013 1 2K Report

I am working on finding the functional signatures/remains of regulatory regions, RNAs and proteins in the intergenic sequences (IGs) of E. coli K12 MG1655. I have a few doubts regarding the demarcation of IGs and inclusion of regulatory regions (esp, promoters) in the IGs. Could you please help me to clarify these two aspects:

1. For my study, I have extracted IGs from the genome using the annotation provided by the NCBI as well as EcoCyc. I have extracted a total of 3718 sequences as IGs. The extraction was done by removing the gene regions from the genome, and what was left with were called IGs. Is this the right approach? If not, what else should I do to get the IGs?

2. My aim is to identify if any sequence or structural similarities are shown by the IGs or their translates with various regulatory RNAs or peptides.

My question is, should I remove the known regulatory regions in E. coli (available in RegulonDB) from the sequences which I extracted as IGs, before going ahead with search for functional signatures/remains of intergenic regions?

My doubt regarding retaining/removing known regulatory regions from IGs is due the following observation:

Regulatory regions have been detected often in regions annotated as a gene in E. coli genome i.e promoters/terminators within a gene. Therefore presence of regulatory regions within a sequence does not eliminate the possibility of finding a functional region in another reading frame, as is observed in the gene regions.

Shall I include the regulatory regions and continue my analysis of finding functional signatures/remains in IGs? Or remove them? If I have to remove them, what explanation shall I give to justify the removal?

Siva Kumar

You can identify the regulatory sequences already deposited in the database RegulonDB in E.Coli. Because, they were identified already. After removal of these known region, you will get the sequence whose role is not already annotated. Then you can start annotate these regions using bioinformatic tools liks BLAST, FASTA etc.,

Badges
Science topic

More Ravi Kanth Reddy Sathi's questions See All

What is the formula to calculate the critical value of correlation?

I am calculating the correlation values between two data sets of size 257. I want to know what is the critical value of correlation for a sample size of 257. I tried searching on the web, but...

11 December 2013 9,253 18 View

What is the minimum coverage and how to identity percentage for protein domains?

I am trying to find out existence of protein domains in a set of sequences. I am using BLASTX for the task. I have made a BLASTX of my sequences with the ProDom sequences. I used an e-value cutoff...

10 November 2013 4,708 3 View

Ambiguity with bacterial ITS regions?

ITS regions are used for identifications of bacterial species. But while observing bacterial genomes it is seen that tRNA sequences are present in the ITS regions present between 16S rRNA and 23S...

09 October 2013 4,130 5 View

How to get consensus at ambiguous sites?

I have a set of aligned sequences in fasta format. I want to get consensus out of the alignment. In case of most of the sites one of the base is showing maximum occurrence. In case of sites where...

09 October 2013 3,887 2 View

How to screen genomes for compositional studies?

I am working with around 2600+ genomes and wish to study the genome, gene and intergenic features among various groups. In case of taxonomical groups which have very few representatives, there is...

08 September 2013 9,603 6 View

Isn't a prokaryotic gene continuous?

I work on E. coli genomes and while going through the various genes present, I have seen (link) that in the coordinates area of the description it is suggested to join different regions of genome....

07 August 2013 9,166 7 View

Is there any free online access to a good system for researchers?

I am working at genomes level and I have a shortage of computational resources to do the tasks I want to. So could you please suggest where I can get free online access to a good system for my...

07 August 2013 9,290 5 View

How do I Rectify NCBI C++ Exception? ncbi::CMemoryFileSegment::CMemoryFileSegment()

I have tried running a command line blast. The query file is a multi fasta file containing 2600 sequences. It was made a BLASTX against a proteins sequences (ProDom) of size 2 GB (prodom.phr :...

07 August 2013 444 0 View

How efficient are PERL and PHP in designing bioinformatics tools? What are the pros and cons of each and which is most commonly used and why?

Out of my experience I think the basic difference among both of them lies in speed and usage. PHP can be used for creating online tools. PERL can also be used, but PHP is more easy to handle to...

06 July 2013 513 4 View

Are there any cloud computation facilities for Bioinformatics work?

Can some one please throw some light on cloud facilities available to carry on Bioinformatics work. Also is there a possibility of using such services for free or for a nominal fee, for academic...

06 July 2012 5,093 19 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

I can't see the ssDNA band after performing asymmetric PCR. Is there any way to do this?

After performing symmetric PCR, PCR purification was performed. Afterwards, asymmetric PCR was performed using the PCR purification product as a template, but no ssDNA band was confirmed in the...

08 August 2024 1,668 3 View

Does crude extraction using NaOH and Tris work well with Fungi?

I'm trying to find a DNA extraction method for fungi that does not require equipment and heating. Is there anyone who can suggest an alternative option? Thank you

08 August 2024 4,733 2 View

How to increase simulation box size?

We intend to study the interaction between peptides and polymer (like PP, PE and PS) through MD simulations using Martini force fields ( Martini 2 for PP and Martini 3 for PE, PS). We have...

08 August 2024 4,842 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View