How to determine 'Cluster of Orthologous Groups' for our proteins?

Dave Lee Popular answer

Hey Muhammad,

Instead of running everything yourself, you could use resources where COGs, NOGs (and the many other kinds of OGs out there) have already been computed and are available for download.

Examples include eggNOG from EMBL, which uses a BLAST-based approach for assigning OGs. This is the second approach you suggested, and I would not recommend running it yourself since it would take AGES.

http://eggnog.embl.de/version_4.0.beta/

Here, you could assign your ~6000 proteins (UniProt accessions?) to the OGs that they have already computed.

If you don't like the BLAST-type OGs, you could look for a resource where the OGs have been determined via trees instead. I have less experience with those, but there are bound to be some.

Dave

Dave Lee

Muhammad Sufian

Thank you, Dave. My proteins have NCBI GI accessions. Can any of the OG resources accept my 6000 NCBI GIs? eggNOG accepts a maximum of 30 records only.

You'll need to do a couple of things:

1) Convert the GI accessions to UniProt IDs

Try using the UniProt mapper (I've never used the website version, as I always write my own, but it looks fine).

http://www.uniprot.org/help/mapping

2) Map the uniprot IDs to OGs

You should probably stick with the version 3 release of eggNOG for this part, as version 4 doesn't seem to have the actual 'UniprotAC2eggNOG' file:

http://eggnog.embl.de/version_3.0/downloads.html

The first part doesn't require any coding, but the second one does. It should be quite straightforward though.

For 2), some pseudo-code would be to:

1. Read in the uniprot2OG file

2. Create a matrix with 2 columns and n rows where n = unique(c(uniprotOGs, yourUniprot)). These are also the rownames.

3. Then in: TotalMatrix[uniprotOGs, FirstColumn]
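The lookup in step 2) can also be sketched in Python using a dictionary instead of a matrix. This is only an illustration under stated assumptions: the mapping file is assumed to be tab-separated with the UniProt accession in the first column and the OG identifier in the second, which may not match the actual eggNOG download, and the function names `load_uniprot_to_og` and `assign_ogs` are made up for this example.

```python
import csv
from collections import defaultdict
from typing import Dict, Iterable, List, Set


def load_uniprot_to_og(lines: Iterable[str]) -> Dict[str, Set[str]]:
    """Build accession -> set of OG ids from tab-separated lines.

    Assumed (hypothetical) layout: column 1 = UniProt accession,
    column 2 = OG id. Lines starting with '#' and short lines are skipped.
    """
    mapping: Dict[str, Set[str]] = defaultdict(set)
    for row in csv.reader(lines, delimiter="\t"):
        if len(row) < 2 or row[0].startswith("#"):
            continue
        mapping[row[0].strip()].add(row[1].strip())
    return dict(mapping)


def assign_ogs(my_ids: Iterable[str],
               mapping: Dict[str, Set[str]]) -> Dict[str, List[str]]:
    """Return the sorted OG ids for each of your accessions.

    Accessions missing from the mapping get an empty list.
    """
    return {acc: sorted(mapping.get(acc, ())) for acc in my_ids}


if __name__ == "__main__":
    # Tiny in-memory example; with a real file, pass open(path) instead.
    example = ["#acc\tog", "P12345\tCOG0001", "Q67890\tCOG0002"]
    table = load_uniprot_to_og(example)
    for acc, ogs in assign_ogs(["P12345", "A00000"], table).items():
        print(acc, ",".join(ogs) or "unassigned")
```

A dictionary keeps memory use proportional to the mapping file and makes it easy to see which of your proteins have no OG at all.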

Snehal Karpe

Hi Muhammad,

If no other database gives you such information about already annotated COGs, you can try tools like Proteinortho (https://www.bioinf.uni-leipzig.de/Software/proteinortho/), OrthoMCL (http://orthomcl.org/orthomcl/), etc. (I am sure there are many more!). OrthoMCL has been used for many eukaryotic genomes, Proteinortho for a few. Proteinortho can be run with a single command after installation (an all-against-all BLAST is done in the back end), whereas OrthoMCL is very complex. You can give them a try.

Hope this helps.

Snehal

Stefano Levi Mortera

Hi Muhammad

I know your topic is a little out of date, but I came across it today and I have much the same problem to solve. I'm not a bioinformatician, so I'm not very comfortable with such issues, but I must face it somehow. I'd like to know how you carried out your COG analysis and, if you are still working on such problems, how you manage it now. I know MEGAN5 can be a tool for this, and I used it once with BLASTp outputs, but I'm looking for less CPU-intensive paths.

Thanks in advance for your help

Regards

Stefano

Dear Stefano,

Actually, I did not include this experiment in my previous study because it was taking too much time and would have delayed my publication. If I come across a way to do it, I will definitely share it in this thread.

Thanks.

Daniel Kurth

Using the NCBI Batch Web CD-Search Tool (http://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi) you can search up to 40,000 sequences at once against several databases, including the COG database. It takes roughly one day, but it runs on their servers, and you can submit several batches. However, I think it uses the old COG database from before the recent update (http://nar.oxfordjournals.org/content/43/D1/D261.abstract).
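If a protein set exceeds the per-submission limit mentioned above, the FASTA file has to be split before uploading. A minimal Python sketch of that splitting step follows; it only prepares batches and does not talk to the NCBI server. The helper names `read_fasta_records` and `batch_records` are made up for this example, and the default batch size of 40,000 is simply the limit quoted in this answer.

```python
from typing import Iterable, Iterator, List


def read_fasta_records(lines: Iterable[str]) -> Iterator[str]:
    """Yield one FASTA record (header plus sequence lines) at a time."""
    record: List[str] = []
    for line in lines:
        line = line.rstrip("\n")
        # A new header closes the record collected so far.
        if line.startswith(">") and record:
            yield "\n".join(record)
            record = []
        if line:
            record.append(line)
    if record:
        yield "\n".join(record)


def batch_records(records: Iterable[str],
                  batch_size: int = 40000) -> Iterator[List[str]]:
    """Group records into lists of at most batch_size entries."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    batch: List[str] = []
    for rec in records:
        batch.append(rec)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch
```

With a real file, something like `batch_records(read_fasta_records(open(path)))` would give one list of records per submission, which can then be written out to separate files.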

Robert Rentzsch

Also, scanning yourself is now much simpler with the eggNOG HMM library (currently version 4.5). Even simpler: use the new eggNOG mapper at http://beta-eggnogdb.embl.de/#/app/emapper