What is the best method to split a large piece of audio into several short utterances?

More Selma Kali Ali's questions See All

Geotechnical Engineering (Proceedings of the ICE) time review?

Hello everyone, I recently submitted an article to Geotechnical Engineering (Proceedings of the ICE), and the current status has been listed as "EiC Pre-assessment: Ready" for the past 20 days. I...

10 August 2024 6,493 1 View

How can we differentiate between calcite, dolomite, siderite, magnesite and ankerite minerals in carbonatite rocks in thin section under op microscop?

How can we differentiate between calcite, dolomite, siderite, magnesite and ankerite minerals in carbonatite rocks in thin section under optical microscope?

07 August 2024 2,132 3 View

Unusual intensity drop in some sections of chromatograms in DDA?

Hi, we have measured tryptic peptides using both DDA and DIA method on QExactive. In DDA replicates i saw unusual intensity drops occurring at the same sections of chromatograms in DDA replicates...

07 August 2024 3,218 4 View

Can you suggest reliable sources defining "3D mesh" and "3D city models"?

Dear fellow researchers, I am currently working on a paper where I need to provide a reliable reference that defines and distinguishes between 3D mesh models and 3D city models. Although I am...

06 August 2024 9,986 2 View

Absorption coefficient of methane?

Hello, Can anyone provide me with the absorption coefficient of methane gas at 7.7 um? Any reference?

06 August 2024 980 5 View

What is the best sampling strategy?

I am conducting a qualitative study that uses interviews to investigate the perceptions of teachers about a particular leadership practice and I am focusing on 3 schools which have a total number...

01 August 2024 8,457 10 View

Looking for help on sem image analysis?

Hello I am conducting a microstructural analysis of a soil treated with lime. The following sem images are of the untreated s1 and treated soil s3. The untreated soil contains quartz calcite...

01 August 2024 572 0 View

What is Random Audit?

HI there, I've came across several articles discuss about random audit an Non random to tax evasion or compliance. Most of the articles is relating about effect of audit (random or non random)...

31 July 2024 5,309 7 View

Can we patent a process flow diagram developed using a process simulator but no actual cases is carried out?

Can we patent a process flow diagram developed using a process simulator but no actual cases is carried out? For example consider a process for certain product manufacture where a new process flow...

31 July 2024 781 1 View

How can we calculate the percentage of configuration interaction (CI) in the UV output data of the Gaussian program?

How can we calculate the percentage of configuration interaction (CI) in the UV output data of the Gaussian program? for example: Excited State 17: Singlet-A 5.1359 eV 241.41 nm...

28 July 2024 9,165 2 View

Broca’s area must be intact for the learning of new movement sequences?

When the eyes of a person are damaged this causes complete blindness. Likewise, when Wernicke’s and Broca’s areas of neocortex are damaged this causes complete aphasia, losing the ability to...

01 August 2024 6,744 2 View

What is the best topics for a research topic for B.A. (Music) ?

specially when we talk about music related to our behavior..

29 July 2024 3,672 0 View

When you express a protein, why do we express not only the domain we want, but also the protein around it?

I want to express STK4, and I've searched the paper for reference. When I check the protein kinase domain sequence for that kinase on Uniprot, it's 30-281, but the paper expresses the protein...

20 July 2024 4,951 1 View

What is your preference regarding Artificial Intelligence apps/methods/platforms for image analysis?

Please, let me know the apps, platforms, or methodologies based on AI to analyze images, such as radiographic or histology images. Tell me your experience of using AI in assessing patterns,...

30 June 2024 4,430 4 View

What is the effectiveness of AI-powered language learning tools in improving language acquisition skills in children with speech and language delays?

The impact of AI-powered language learning tools in enhancing language acquisition skills of children with speech and language delays.

28 June 2024 3,105 2 View

Impact of nuclear-mitochondrial DNA segments (NUMTs) in phylogeny construction?

"Nuclear-mitochondrial DNA segments (NUMTs) are mitochondrial DNA (mtDNA) fragments that have been inserted into the nuclear genome." (Xue et al., 2023) I would like to know under this...

20 June 2024 603 1 View

Music Therapy used in several colleges.Which one that are great?

I'd trying the several countries with several religions on young and older. what are which books so far. Sound Medicine, Music Medicine on many religions working now.

18 June 2024 1,907 1 View

Is the pure phonemic content related to emotional valence?

Dear colleagues, After statistical processing of a large corpus of English utterances assessed for emotional valence, it turned out that the phonemic content of speech is tied to emotional...

17 June 2024 5,459 0 View

What are the challenges of developing technology for real-time speech translation?

What are the significant technical obstacles in the development of instantaneous speech translation tools? I would appreciate your insights on this question. Could you please share your thoughts?

13 June 2024 1,042 3 View

What makes 'Eka Ibid Isong' significant in Ibibio indigenous music?

Ibibio Traditional music compositions

10 June 2024 320 0 View

Amin Honarmandi Shandiz

Dear Selma Kali Ali

You can Simply search for Audio Segmentation which there are plenty of resources available.

Yaakov J Stein

If I understand correctly you want to extract segments with speech and disregard areas without speech - silence, noise, music, etc.

You are right that one way to go is to properties (i.e., features) that identify speech, but energy is not certainly not such a property (unless you are looking for energy contours).

You could work with any of the well-known speech-specfic feature sets (e.g., LPC cepstrum), but it will require a lot of tweaking (how do you differentiate between speaking and singing?)

Another possibility is to forget about features and work directly on samples. You really don't need a prepared dataset - just take a few recordings of speech and a few of music and mix segments together. You know where the speech is, so you can train. Interesting how much data you will need to make that work...

Karolina Klimiuk

Dear Yaakov,

maybe You could try Audacity od Praat - these are programs You can easily download on your computer.

Hope it was helpful.