What unsupervised powerful method is used for the phonetic segmentation of an unlabeled corpus to better detect the boundaries in the time domain?

More Fréjus A. A. Laleye's questions See All

Comment ajouter une recherche ?

En suivant toutes les étapes mentionnées sur (ajouter une recherche) il me signale erreur, ressayer, j'ai essayé plusieurs fois, sans succès, toujours il me signal erreur, ressayer

02 August 2024 9,719 1 View

Why practical assessments of dislocation density by XRD ignores the texture which apparently affects the result?

As one can read in numerous papers on this matter, the dislocation density is derived from the dependence of peak broadening on the diffraction angle, that is, on considered reflections. Thus, to...

28 July 2024 6,496 5 View

Is there any other efficient technique, as an alternative to multiple linear regression ?

with secondary data collected from central bank to know the impact of digitalization in banks profitability, how can its impact be more precisely measured through econometric tools.

21 July 2024 7,295 10 View

Hellow every one i have a question, please help me. How ESP car working?

I try to understand the ESP mechanism working. What kind of sensors it has, Is it necessary to modify or boost its mechanical ?

15 July 2024 4,772 1 View

Are diatom inclusions found in shellfish shells?

Mangroves were reported to eat into limestone atolls like so much cake - but could it have been diatom partnerships at mangrove roots that ate into the rock? If gypsum formation is a clue, how...

12 July 2024 159 2 View

How can I verify if an academic journal is a legitimate publication or a cloned version?

This question seeks to address the growing concern of cloned academic journals, which are fraudulent duplicates of legitimate publications. It aims to guide researchers, scholars, and academics on...

10 July 2024 5,966 2 View

Is there any guidelines about zooming out a chromatogram?

Hi, Just curious to know that in chromatography, a chromatogram should be zoomed to what scale? 10 times of output , 50X or ..? Is there any regulation or general chapter which describes this?...

03 July 2024 3,119 6 View

How to see whether my protein is aggregated using western blotting?

I am working on one protein which gets aggregated (similar to alpha-synuclein and tau) in certain neurodegenerative diseases. I am trying to confirm the aggregation using Western blotting....

30 June 2024 5,060 3 View

Do body proportions of sharks change after drying?

Dear colleagues, I would be grateful to anyone who can answer my two questions: 1. Do the proportions of the shark's body change after drying? In other words, is it possible to discriminate two...

30 June 2024 6,101 3 View

Graphene and graphene oxide as cancer factor?

Please, can any one inform us about risks of graphene and graphene oxide as a factor of cancer growth?

28 June 2024 6,815 4 View

Broca’s area must be intact for the learning of new movement sequences?

When the eyes of a person are damaged this causes complete blindness. Likewise, when Wernicke’s and Broca’s areas of neocortex are damaged this causes complete aphasia, losing the ability to...

01 August 2024 6,744 2 View

When you express a protein, why do we express not only the domain we want, but also the protein around it?

I want to express STK4, and I've searched the paper for reference. When I check the protein kinase domain sequence for that kinase on Uniprot, it's 30-281, but the paper expresses the protein...

20 July 2024 4,951 1 View

What are the current challenges and future prospects of integrating artificial intelligence into recognition systems for autonomous vehicles?

This question aims to explore the intersection of artificial intelligence and autonomous vehicle technology. It seeks to identify the key challenges faced in implementing AI for recognition...

20 July 2024 3,469 2 View

Help me download paper?

I have 2 papers below, but I can't access this, you can help me? Shuai Zhang, Xiaodi Li, Xingyu Zhou, Yuning Wang, Yue Hu, Cloud removal using SAR and optical images via attention mechanism-based...

18 July 2024 9,635 0 View

What is your preference regarding Artificial Intelligence apps/methods/platforms for image analysis?

Please, let me know the apps, platforms, or methodologies based on AI to analyze images, such as radiographic or histology images. Tell me your experience of using AI in assessing patterns,...

30 June 2024 4,430 4 View

What is the difference between opportunity recognition in entrepreneurship literature and sensing in dynamic capabilities theory?

While I do have some opinions on how to address this question based on my reading as student, I would like to know the opinions of more accomplished scholars.

28 June 2024 802 3 View

What is the effectiveness of AI-powered language learning tools in improving language acquisition skills in children with speech and language delays?

The impact of AI-powered language learning tools in enhancing language acquisition skills of children with speech and language delays.

28 June 2024 3,105 2 View

I am working on a network for facial expretion recognition and I have problem with the loss function can anyone help?

I am using dice loss and wing loss for loss function and my network outputs are heatmaps and landmarks and I am trying to train on both of them at a same time do you guys know how to solve this...

22 June 2024 10,013 2 View

Impact of nuclear-mitochondrial DNA segments (NUMTs) in phylogeny construction?

"Nuclear-mitochondrial DNA segments (NUMTs) are mitochondrial DNA (mtDNA) fragments that have been inserted into the nuclear genome." (Xue et al., 2023) I would like to know under this...

20 June 2024 603 1 View

Is the pure phonemic content related to emotional valence?

Dear colleagues, After statistical processing of a large corpus of English utterances assessed for emotional valence, it turned out that the phonemic content of speech is tied to emotional...

17 June 2024 5,459 0 View

Stevan Ostrogonac

To my knowledge, there are no unsupervised phonetic segmentation methods based only on energy. Energy can be used to help distinguish between voiced and unvoiced phonemes, but for this problem it is necessary to apply correlation calculation because voicing is manifested as periodicity in the speech waveform. For phonetic segmentation, a labeled corpus is needed for training and, besides energy, MFCCs are usually used, along with their derivatives (first and second).

Sri Harsha Dumpala

Though finding the boundaries is difficult, you can use the concept of landmarks in speech. Landmark refers to the process of representing each segment with a single point. Of course, even landmarks can also be used to get the phone or segment boundaries. Refer to the work of Carol Espy Wilson and Sharlene A Liu (1996 Jasa paper)

Fabian Santiago

The tool "Prosogram" could perform an automatic segmentation into local peaks in the intensity of the band-pass filtered speech signal. The tool does not need a labeled corpus. Yet, even if I have used several times, I do not know its accuracy when making an automatic segmentation without labels. You can try.

http://bach.arts.kuleuven.be/pmertens/prosogram/

Thomas Schatz

While it is notoriously hard to find phoneme boundaries without supervision, the location in time of syllable onsets is easier to obtain. I put some references for this below in case it is useful for your application:

Mermelstein, P. (1975). Automatic segmentation of speech into syllabic units. The Journal of the Acoustical Society of America, 58(4), 880-883.
Nagarajan, T., Murthy, H. A., & Hegde, R. M. (2003). Segmentation of speech into syllable-like units. Energy, 1(1.5), 2.
Hyafil, A., & Cernak, M. (2015). Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection. In Proceedings of Interspeech.
Räsänen, O., Doyle, G., & Frank, M. C. (2015). Unsupervised word discovery from speech using automatic segmentation into syllable-like units. In Proceedings of Interspeech.

Amir Harati

There are much better ways to do this: https://www.researchgate.net/publication/271849557_Speech_Acoustic_Unit_Segmentation_Using_Hierarchical_Dirichlet_Processes

Conference Paper Speech Acoustic Unit Segmentation Using Hierarchical Dirichl...