Can anyone recommend a method to detect words in an audio file?

More Prakash Gavhane's questions See All

How the slurry is preparing for cathodes in batteries?

13 June 2024 4,314 3 View

Geology Doctorate. Available for Post-doc and guest faculty. Please give suggestions. Anywhere in world ?

Geology Doctorate. Available for Post-doc and guest faculty. Please give suggestions. Anywhere in world.

09 June 2024 1,805 4 View

How can analyses a split plot design by SPSS?

02 June 2024 5,946 2 View

What do you mean by meadow plant?

01 June 2024 7,248 2 View

What is the actual difference between Competency-Based Education (CBE) and Outcome-Based Education (OBE)?

In any professional education students' Competency is Evaluated and used as the measurement of Outcome of the teaching-learning process. Then how these two are differentiated in curriculum...

22 May 2024 8,488 25 View

Can I purify mononucleosome from T-cell using a kit followed by SEC superdex 200 10/300 column chromatography for CryoEM ?

I am planning to purify the mononucleosome for cryoEM study. I have seen, people assemble the nucleosome from recombinantly expressed histones and then providing DNA sequence to it. Isn't this...

19 May 2024 7,170 1 View

How can I prepare a loading dye that also act as a stopping buffer for an enzymatically degraded DNA, for analysis on a sequencing gel?

I have a recipe of loading dye (10 mM EDTA, 1X TBE, pinch of Xylene cynol and Orange G dye which is made upto 10 ml with 100% Formamide) but somehow the bands of the nucleotide product...

17 April 2024 8,025 6 View

How perform subpixel level measurement of deformation using image processing?

Hi, I am working on subpixel level measurement of deformation using FPGA with the help of image processing technique. Can anybody suggest me the steps to be followed? As per citation pixel level...

14 March 2024 4,078 0 View

What is the performance parameter for imblance dataset ?

I am curr research at phising detection Using URL . Using logistic Regression model . I have data set 1:10 ratio 20k legitimate and 2 k phishing .

10 March 2024 9,531 2 View

How much amount of zeolite 13x is required to mix with how much amount of water vapour to attain a temperature of 110 degree centigrade ?

I am trying to work with zeolite material instead of electrical heating element for drying, so how much amount of zeolite 13x is required to mix with how much amount of water vapour to attain a...

17 February 2024 3,177 0 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Geotechnical Engineering (Proceedings of the ICE) time review?

Hello everyone, I recently submitted an article to Geotechnical Engineering (Proceedings of the ICE), and the current status has been listed as "EiC Pre-assessment: Ready" for the past 20 days. I...

10 August 2024 6,493 1 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Sterling Bond

TTS is your best bet. Run a search for "dictation software" - there are a ton of options out there and some are more efficient than others / some are more affordable than others.

Prakash Gavhane

As I have mentioned earlier. I don't want to use TTS. as I have audio files with me.

Yoichi Ando

I hope that you will try to identify of speech in music background.

Autocorrelation-Based Features for Speech Representation.ACTA ACUSTICA UNITED WITH ACUSTICA Vol. 101 (2015) 1 – 1 to be published.

With best wishes,

No. not yet tried. Let me check.

but as I have mention there will be music with words in the audio file so I am not sure will it work with my scenario or not but for sure let me try :)

Thank you.

According to my volume, Auditory and Visual Sensations, Springer, NY, 2009,

I would like to suggest you that the minimum effective duration of ACF of speech is about 2 ms and 20 ms of music. To analyze ACF signal duration 2T is selected for speech about 40 -60 ms, but for music 2T should be 0.4 - 0.6 s.

Thus, you can set 2T ~ 40 ms for extracting ACF factors of speech signals.

And then you can try to reproduce speech signals by use of the ACF factors.

I was referring to "TTS" as a blanket term for all text-to-speech software. Unfortunately, I'm not particularly familiar with all of the options out there, but there are several very advanced pieces of software out there that pick up much more detail than the dictation programs included with mobile devices. Thus, I suggest you search this product market more extensively - I feel like it's the right track for you, if what you need exists. Another option would be to hire a talented software developer ;)