How to write short name for downloaded protein sequences quickly to save time?

More Sandeep Kumar's questions See All

What is the meaning of zero or undetectable gene expression ct value of RT-PCR?

I'm getting zero or no ct value in my gene expression analysis during RT-PCR analysis. I'm not understanding the meaning of zero ct value, how I should interpret the data now. Should I consider...

02 March 2020 3,209 3 View

Is it ok to use transparent 96 wells plate for h2dcfda assay or I should use black plate with transparent base?

I'm using transparent polystyrene 96 well plate for H2dcfda assay, is it ok or I should use black plate with transparent base.

01 February 2020 5,393 3 View

Which statistical test to perform my MTT dataperformed for different time intervals with different concentrations of a bacterial toxin?

I have treated THP1 and AGS cells for 12, 24 and 48 hours with a bacterial toxin concentrations 0, 5, 10, 20, 40, and 80 ug/ml. Now I want to prove my results with statistical methods but I'm...

10 November 2019 6,595 6 View

Problem during pET15b and pET22b restriction digestion with XhoI and NdeI?

Hello everyone, I'm digesting 2 ug of midiprep isolated pET15b and 22b with BamHI and XhoI at 37 oC for overnight. There is smearing in gel and size of the cut vectors also not appearing actual,...

05 June 2019 1,701 4 View

What would be the conductivity of Tris buffer having molar concentrations 1.5 and 0.5 M with respect to pH 8.5 and 6.5 respectively?

I'm preparing tris buffers of molarity 1.5 and 0.5 M with respect to pH 8.5 and 6.5 respectively. The conductivity of molar concentration 1.5 is 26.0 ms/cm and for 0.5 M is 33.6 ms/cm. Is it a...

03 April 2019 6,404 9 View

Is it good to strip gaps in a protein MSA file for phylogenetic analysis?

I'm analyzing 445 protein sequences of a single protein for phylogenetic analysis. Multiple sequence alignment of these sequences is showing gaps of different sizes. Is it ok to keep these gaps...

03 April 2019 8,617 3 View

Reason for long streaky colonies after transformation?

Hello every one, I'm getting long streaky colonies after transforming pET15b in DH5alpha. Kindly suggest the reason.

01 February 2019 6,732 7 View

How many colonies we should expect after transformation of ligation mixture during a cloning experiment?

I'm getting more than 200 colonies after transforming 2 ul of my ligation mixture from 1;3 ratio. I used 100 ng of vector and now it is going hectic to screen a positive clone because the colies I...

01 February 2019 8,886 4 View

What would be best competent cell to maintain and expression of a secreted Gram negative bacterial toxin gene?

I'm cloning a secreted Gram negative bacterial toxin and not finding any colonies with DH5alpha and pET15b. I'm just wondering what could be the best competent cell for keeping and expression of...

31 December 2018 9,075 8 View

Techniques to confirm bacterial outer membrane vesicles?

Are there techniques other than SEM to confirm bacterial outer membrane vesicles. Actually, our SEM is out of order for some time, so, I want to confirm them by some other techniques.

11 December 2018 9,608 4 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Brian Thomas Foley Popular answer

The Sequence Name Annotation-based Designer (SNAD) at:

http://veb.lumc.nl/SNAD/index.cgi

is very good for this. It can take an alignment, a tree (Newick format for example), or a list of names, and convert the names based on GenBank or UniProt database annotation.

I am attaching an output for your data, where I asked for genus and species information for each sequence, but given that they are all from H pylori, this was not a good choice.

The LANL HIV Databases has a tool for taking an alignment plus a spreadsheet of data to rename the sequences in the alignment:

https://www.hiv.lanl.gov/content/sequence/ALIGN_MULTITOOL/align_mt.html

use the RENAME tab at the top of this tool.

Abhijeet Singh

cat VacAFASTA1000.fasta | cut -d " " -f1 > shortname_VacAFASTA1000.fasta

This will trim the fasta header after the first space in fasta header

>tr|A0A1Y3E563|A0A1Y3E563_HELPX Toxin OS=Helicobacter pylori SS1 OX=102617 GN=X568_02390 PE=4 SV=1

>tr|A0A1Y3E563|A0A1Y3E563_HELPX

Or use delimiter you want instead of space

Brian Thomas Foley

Katharina Hoff

Both previous answers are helpful. If you in the end don’t care much about the sequence names anymore, you can also simply replace them by short string and a number or similar (https://github.com/Gaius-Augustus/Augustus/blob/master/scripts/simplifyFastaHeaders.pl). I do this if some software may be unhappy with the | character that will still be there using the first answer, or if I need even shorter headers.