What is the most efficient method for detecting voiced/Unvoiced/silence regions in speech data based on signal processing techniques for clean speech?

More Mohammed Nasar Ibnu Ibrahim's questions See All

Is skin yellowness an numerical or ordinal variable?

I have a response variable called skin yellowness, which I will measure via a scored color chart, whereby 1 is pale yellow and 15 is orange. I'm not sure if this counts as an ordinal variable,...

11 August 2024 4,793 1 View

• What the possible Persistent Organic Pollutants and Heavy metals present in fluorspar, sediments, and water bodies around its mining area?

Approximate concentrations are require in compared with the WHO permissible limts

11 August 2024 2,723 1 View

How can I get old refernce in geomorphological mapping?

how can I read Waters, R. S. 1958. Morphological mapping. Geography 43 :10-17 from internet? note: not in google or resaearch gate

26 July 2024 7,813 3 View

How can the rubber fibres produced by the electrospinning device be removed from the fibre collect(aluminium foil) ?

How can the rubber fibres produced by the electrospinning device be removed from the fibre collector (aluminium foil) without affecting the orientation of the rubber fibres for use in reinforcing...

26 July 2024 8,281 0 View

My question concerns MTT cell viability assay?

Despite not having cells in the media, I am getting purple color. I have tried many troubleshooting methods, varying media types, and even different MTTs from different companies to figure out the...

21 July 2024 9,914 1 View

Guidance needed for preparing the hydrogel samples for the XRD instrument?

I'm working with a hydrogel sample and I'd like to perform XRD analysis. Can anyone offer guidance on preparing the hydrogel for the XRD instrument? Specifically, I'm unsure about the best method...

20 July 2024 3,611 4 View

What are the modern topics to work under bacteriocin and fermentation as PhD student?

As PhD student specialized as food and industrial microbiology need to further research on bacteriocin and fermented food.

16 July 2024 7,394 1 View

How to get a research question in educational administration and management in MPhil?

What are some of the areas to consider in undertaken research in educational administration in MPhil.

14 July 2024 8,394 2 View

How can I extract the mathematical equation from existing Neural Network Model?

There exists a neural network model designed to predict a specific output, detailed in a published article. The model comprises 14 inputs, each normalized with minimum and maximum parameters...

14 July 2024 2,714 3 View

How can i elucidate a compound using xrd?

XRD analysis

08 July 2024 977 4 View

Broca’s area must be intact for the learning of new movement sequences?

When the eyes of a person are damaged this causes complete blindness. Likewise, when Wernicke’s and Broca’s areas of neocortex are damaged this causes complete aphasia, losing the ability to...

01 August 2024 6,744 2 View

Simulation of metal drawing by Abaqus with UMAT?

Hello, colleagues. Recently, I have been working on a metal processing simulation with my UMAT in Abaqus. I have outlined the corresponding simulation, but I keep encountering issues that cause...

30 July 2024 7,062 1 View

Does post-translational protein modification cause devisions on observed pI verses calculated pI?

In running two-dimensional gel electrophoresis on bacterial protein, some spots that appear to match a protein sequence have a significantly more acidic isoelectric point than the calculated pI....

24 July 2024 8,076 3 View

Can a shoot-through event of a tri-state digital buffer cause momentary Hi-Z state?

// interested in the difference between floating events and short circuits.

22 July 2024 6,565 0 View

What are the current challenges and future prospects of integrating artificial intelligence into recognition systems for autonomous vehicles?

This question aims to explore the intersection of artificial intelligence and autonomous vehicle technology. It seeks to identify the key challenges faced in implementing AI for recognition...

20 July 2024 3,469 2 View

Trial exclusion in eye-tracking data?

Is it reasonable to exclude all trials with a blink or saccade in the 150 ms before stimulus onset? As an alternative, would it be better to exclude blinks (after extending them by about 100 ms...

19 July 2024 3,838 0 View

Help me download paper?

I have 2 papers below, but I can't access this, you can help me? Shuai Zhang, Xiaodi Li, Xingyu Zhou, Yuning Wang, Yue Hu, Cloud removal using SAR and optical images via attention mechanism-based...

18 July 2024 9,635 0 View

What are the future implications of quantum computing on image processing algorithms?

Image Processing Algorithms, Quantum Computing.

17 July 2024 7,958 2 View

Given the current advances in Super Computation and Quantum Computing, what are the missing link between the Applied AI and Ultra Smart Cyberspace?

In recent years, quantum computing has emerged as a groundbreaking technology with the potential to revolutionize various fields, including artificial intelligence (AI). AI has already made...

17 July 2024 1,398 3 View

What are the radiometric correction procedures that are applied in Agisoft Metashape?

Dear RG-Community, I have been using Agisoft Metashape for UAV imagery processing for quite a while now. Also a while ago, I stumbled upon the Micasense GitHub repository and saw that individual...

17 July 2024 8,580 2 View

Hossein Soleimani

If you only want to detect silence regions form others(voiced or unvoiced), it is better to use Voice Activity Detection(VAD) methods which has been investigated already.

If signal is not corrupted by noise, energy of each speech frame can be used to separate silence regions.

here, I attached a energy based VAD (Matlab code) which works in low SNR.

To separate the Voiced from Unvoiced, the frequency domain techniques are more effective.

Good Luck

Saeid Safavi

The question is not complete, as the answer to this question strongly depends on the application. For example for speaker recognition energy based activity detector works better than a pitch based, while it is exactly opposite for accent recognition.

Please read the following article for more details:

1.Contrasting the Effects of Different Frequency Bands on Speaker and Accent Identification,

Saeid Safavi, Abualsoud Hanani, Martin Russell, Peter Jancovic, M Carey, IEEE Signal processing letter.

Robert Wielgat

Dear Mohammed,

I do not know if you still need the answer for you question, if so you can try the method of landmarks. Pleas read the article from the attachement. You should use landmarks 'g' in order to detect voiced regions of speech. I obtained detection rate above 90% in my preliminary experiments.

Best regards