How to formalise training and testing dataset for audio classification?

More Devan Govender's questions See All

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

After COVID-19 it has seen that EFL learners technological affiliation has raised. In addition, in the post-COVID period learners started to engage AI technologies like ChatGPT while learning...

08 August 2024 8,964 4 View

How to generate a citation of my paper from ResearchGate?

How we can cite the papers from ResearchGate. I am trying to create citations for this article, Quantum Machine Learning Algorithms for Optimization Problems: Theory, Implementation, and...

08 August 2024 6,690 3 View

Does Anyone have expertise in in vitro transcription and RNA pull down assay?

I am currently working on LncRNA; to know the lncRNA-protein interactions I want to do RNA pull down assay, so I need to design primers with T7 promoter. I need assistance in this regard.

07 August 2024 6,622 1 View

How to fix background error in rietveld refinement of one XRD peak using GSAS-II?

I want to refine one XRD peak of my in-situ xrd but the background is never working good which ultimately fails the refinement. How to refine and adjust the background using GSAS-II

05 August 2024 5,291 2 View

How can I add own Henry coefficients in Aspen Plus?

Hi, i would like to simulate an absorption process in Aspen Plus. I want to use the NRTL model und would like to add some individual Henry coefficients. Is that possible and how?

05 August 2024 2,333 2 View

Why might the impedance values for DI water and 0.1X PBS buffer solution exhibit a decreasing and increasing trend, respectively over time (HP 4194A)?

Hello everyone, I'm encountering an issue with my electrochemical impedance spectroscopy (EIS) measurements and would appreciate some insights. Experimental Setup: Electrodes: Gold interdigitated...

05 August 2024 3,783 2 View

Can usage of AI tools like chat GPT in research work is recommendable ?

AI tools like ChatGPT can enhance research work significantly when used responsibly and in conjunction with thorough human oversight.

05 August 2024 1,842 3 View

Usage of internal standards in LC-MS/MS analysis?

Have you ever seen a LC-MS/MS method uses both internal standards and external standards (in matrix matching purpose) but the concentrations of internal standards are outside the calibration curve...

05 August 2024 3,084 6 View

ANY free software for reconstructing neurons in the microscopic image?

Hi everyone, I am working on brain slices for visualizing a protein in the soma and dendrites, using a fluorescence tag. However, I need a tool (not paid) for reconstruction of the whole neuron,...

04 August 2024 4,725 2 View

How effective is the Citi Bloc standard basket in enhancing the accuracy and comparability of international construction cost assessments?

Citi BLOC Standard Basket Definitions: A standardized unit representing a fixed basket of construction materials, labor, and equipment costs priced in various cities. Purpose: To create a common...

04 August 2024 8,997 1 View

How do we pick data for determination of Validation Acceptance Criteria?

Hello, colleagues! There is commenting open for new upcoming edition of USP 1033. Validation target acceptance criteria is now different from what it used to be and it doesn't include Cpm....

23 July 2024 7,292 3 View

Which research tool for expert validation for our study?

I am a 3rd year Computer Science student currently writing our Bachelor's thesis about finding diverse k-shortest paths in pedestrian networks. We have chosen 3 local areas as our proposed...

15 July 2024 4,289 0 View

List of journals impact factors?

Dear colleagues, Is it possible to send me the list of journals impact factor for the year 2024 (classification is for the year 2023)? excel format if it is possible. Thank you in...

29 June 2024 2,102 3 View

After training an XGBoost model using K-fold cross-validation, Can I use SHAP to interpret this dataset?

In other words, I did not use the trained XGBoost model to make predictions on the test set and then use SHAP for interpretation. The reasons are as follows: Even with the best and most...

26 June 2024 7,332 1 View

What is the proper quality assessment tool of a comparative descriptive survey studies? Could I use the STROBE Checklist of cross-sectional studies?

Tile of study is "Comparing Health Information-Seeking Patterns in Exceptional versus Normal Conditions" Objective of the study is "to examine the influence of contextual differences and...

21 June 2024 7,618 5 View

Use of agency vs. google-translation for translating non-english qualitative data?

Hi, As part of a miltilevel study examining the impact of steroid toxicity in patients with different rheumatic diseases (see here: https://vasup.ndorms.ox.ac.uk/) we collected data from the UK...

17 June 2024 4,016 4 View

How does the application of (GANs) for data augmentation impact the robustness and accuracy of image classification models?

How does the application of generative adversarial networks (GANs) for data augmentation impact the robustness and accuracy of image classification models?

09 June 2024 2,923 2 View

How can attention mechanisms be integrated with convolutional neural networks to enhance performance in image classification tasks?

09 June 2024 2,432 3 View

Do you know some research papers about the relationship between the priming effect and illustration description behavior?

I am planning to research the relationship between the priming effects and human drawing behavior in the field of cognitive psychology. I want to know about those field research links or something...

08 June 2024 6,372 3 View

How spectral bands and indices like (NDVI, NDBI) together used as input before supervised classification? In ArcGIS pro or any other software?

How Satellite Bands (Landsat/Sentinal) and indices (NDVI/NDBI) were composite together (Layer stacked) (In a single layer) before performing supervised classification (MLC/SVM/RF etc)? How it...

06 June 2024 2,207 1 View

Theodoros Giannakopoulos

Yes the procedure you describe is correct: 10 folds, use each fold for testing and the rest of the data for training. Repeat for each fold and compute the average performance measures (f1, accuracy etc).

However, for the particular case of audio classification, fold definition should not be done using a random permutations. Instead one should define each fold in a way to include samples from the same recording or with similar characteristics in the same fold. In your case, for example, samples from the same speaker should be in the same fold, otherwise, you could end up in bias, since samples from the same speaker would be both in the training and testing datasets at the same time (which is kind of "cheating"...).

Amani Munis Mahmoud

Theodoros Giannakopoulos i just saw your comment and it actually discusses the exact problem facing now with my Baby Cry dataset, i split my 5 sec audios into 2 sec segments so my training set has segments from the same speaker and i am getting high accuracy from the bias happening , so how can i keep samples from the same speaker be in the same fold, would you please explain ?