If I have a .gff file or a .faa file, how can I use it to find the average length of all the sequences?
Or is there a program to visualize the distribution of sequence lengths?
Many NGS quality-analysis tools will do this; the PRINSEQ web service is probably the easiest option if you have a .faa file.
https://edwards.sdsu.edu/cgi-bin/prinseq/prinseq.cgi
Upload your .faa file and run only the basic statistics; you'll get numeric summary statistics and graphical distributions, including sequence length.
I agree with Joseph.
On Linux you can simply do:
awk '/^>/ {n++; next} {total += length($0)} END {if (n) print total / n}' file.faa
Cheers
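If you would also like to see the distribution rather than only the mean, the same idea as the awk one-liner above can be sketched in Python. This is a minimal sketch, not a definitive tool: the function names `fasta_lengths` and `text_histogram` and the 100-residue bin width are my own choices for illustration, and it assumes a plain FASTA file where a trailing `*` on a sequence line marks a stop codon that should not be counted.

```python
import sys
from collections import Counter
from statistics import mean, median


def fasta_lengths(lines):
    """Return a list with the length of each record in FASTA-formatted lines."""
    lengths = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            lengths.append(0)  # a header line starts a new record
        elif lengths:
            # Assumption: a trailing '*' is a stop-codon marker, not a residue.
            lengths[-1] += len(line.rstrip("*"))
    return lengths


def text_histogram(values, bin_width=100):
    """Return a plain-text histogram, one row per non-empty length bin."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    rows = []
    for start in sorted(counts):
        rows.append(f"{start}-{start + bin_width - 1}\t{'#' * counts[start]}")
    return "\n".join(rows)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python seq_lengths.py proteins.faa
    with open(sys.argv[1]) as handle:
        seq_lengths = fasta_lengths(handle)
    if seq_lengths:
        print(f"sequences: {len(seq_lengths)}")
        print(f"mean length: {mean(seq_lengths):.1f}")
        print(f"median length: {median(seq_lengths)}")
        print(text_histogram(seq_lengths))
```

The list of lengths can also be passed to any plotting library if you prefer a graphical histogram; the text version is only there to keep the sketch free of dependencies.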