Why is ASR performance better for an automatically segmented dataset than a manually segmented dataset?

Jan Romportl

Maybe the manually segmented corpus isn't actually internally coherent: the human annotator performed differently in various parts of the database, hence the bias. An ANN then obviously cannot capture this incoherence. Forced alignment, on the other hand, is always coherent. It may not be correct, but at least its errors are systematic and perhaps predictable (unlike those of the manual annotator).
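To make the "systematic vs. incoherent" distinction concrete, here is a minimal sketch. It assumes you had some trusted reference boundaries to compare against, which is purely hypothetical; the function name and all boundary times below are made up for illustration. The mean offset captures a systematic shift, and the standard deviation captures how inconsistent the segmentation is.

```python
from statistics import mean, pstdev


def boundary_offset_stats(reference, candidate):
    """Return (mean, std) of the offsets candidate - reference, in ms.

    A large mean with a small std indicates a systematic (predictable) shift;
    a large std indicates inconsistent boundary placement.
    """
    if len(reference) != len(candidate):
        raise ValueError("boundary lists must have the same length")
    offsets = [c - r for r, c in zip(reference, candidate)]
    return mean(offsets), pstdev(offsets)


# Hypothetical phone boundary times in milliseconds (illustrative only).
reference = [100, 250, 400, 620, 800]
systematic = [110, 260, 410, 630, 810]  # always 10 ms late
irregular = [95, 262, 398, 640, 790]    # offsets vary from boundary to boundary

bias_sys, spread_sys = boundary_offset_stats(reference, systematic)
bias_irr, spread_irr = boundary_offset_stats(reference, irregular)

print(f"systematic: bias={bias_sys} ms, spread={spread_sys:.2f} ms")
print(f"irregular:  bias={bias_irr} ms, spread={spread_irr:.2f} ms")
```

A constant bias like the first case is something a model can absorb, while a large spread acts more like label noise.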

Mohammed Nasar Ibnu Ibrahim

Thanks a lot for your answer...

So, will an automatically segmented corpus always perform better than a manually segmented corpus for an ASR application using an ANN? My assumption was that ANN performance would go up by at least 3-4% when using the more accurate manually segmented TIMIT corpus rather than the forced-aligned corpus.

I wouldn't put it exactly like that... It's quite a complex issue, and I would also expect the performance to go up with more precise manual segmentation. But here it seems to me that the manual corpus contains more information than the forced-aligned data set, i.e. the manual segmentation has higher entropy and is more irregular, because it includes all the nuances added by a human who listens carefully, inspects the waveforms and makes well-informed, cognitively very complex judgements. In my opinion this can have various consequences, such as:

1) your ANN generalizes worse on the more irregular manual data set;

2) your feature vectors (MFCC?) aren't discriminative enough for the variability in the manual data set, and the ANN gets "confused" by being trained on more cases of very similar input vectors with very different outputs (this combines two problems: the feature vectors cannot properly describe the actual data, and the ANN itself cannot cope with such non-linearity).

And since the automatically segmented data set probably has significantly lower entropy, it happens (in your case) that the ANN performs better on it.
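One rough way to check the "similar inputs, different outputs" idea is to estimate the conditional entropy of the frame labels given coarsely quantized feature vectors. This is only a sketch under my own assumptions: the function name, the quantization step and the toy 2-dimensional "features" are hypothetical, and real MFCC frames would need a more careful quantization (e.g. a codebook).

```python
import math
from collections import Counter, defaultdict


def conditional_label_entropy(features, labels, step=1.0):
    """Estimate H(label | quantized feature vector) in bits.

    Frames whose features fall into the same quantization cell count as
    "very similar inputs"; the more their labels disagree, the higher
    the entropy.
    """
    cells = defaultdict(Counter)
    for vec, lab in zip(features, labels):
        key = tuple(round(x / step) for x in vec)
        cells[key][lab] += 1

    total = len(labels)
    entropy = 0.0
    for counts in cells.values():
        n = sum(counts.values())
        for c in counts.values():
            p = c / n
            entropy -= (n / total) * p * math.log2(p)
    return entropy


# Toy data (illustrative only): two pairs of near-identical frames.
features = [(0.1, 0.2), (0.1, 0.2), (3.0, 3.1), (3.0, 3.1)]
labels_regular = ["aa", "aa", "s", "s"]    # consistent labelling
labels_irregular = ["aa", "iy", "s", "s"]  # same input, different labels

h_regular = conditional_label_entropy(features, labels_regular)
h_irregular = conditional_label_entropy(features, labels_irregular)
print(h_regular, h_irregular)
```

If this number came out clearly higher for the manual labels than for the forced-aligned ones on the same frames, that would be consistent with the explanation above, though it would not prove it.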