TensorFlow to compare two audio files?

More Leopold Hamminger's questions See All

Which abcam myelin basic protein antibody for mouse penile dorsal nerve tissue?

Dear all, does somebody know the exact abcam antibody the scientists used in this study in Fig. 2? Implications for Differentiation of Endogenous Stem Cells: T... The stained myelin and...

26 November 2019 7,703 0 View

What are your experiences with Second Life to support eLearning?

A recent question on what online educational tools people use generated a lot of answers, among them a suggestion to use Second Life. I looked into it a few years ago and it appeared to me that...

22 September 2016 6,579 26 View

Will a standard plasmid mini prep contain enough bacterial 16s rDNA to amplify in PCR with general 16s primers? If so, could this be avoided?

I am preparing to use some plasmid DNA to spike into soil samples at a known concentration as an internal standard to account for variable DNA extraction efficiency. I will also be looking at...

08 January 2015 5,408 3 View

How to reduce resolution of MicroCT Data? Is such a function included in Mimics?

I would like to process MicroCT data with a file size of 16GB, but Mimics supports file sizes not bigger than 4 GB. Is there a possibility to reduce the resolution of DICOM images?

29 January 2014 603 3 View

Is a binary classifier based on Gaussian models resistant to the problem of training set imbalance?

A binary classifier based on multivariate Gaussian models, which estimates the mean vector and the variance-covariance matrix during the training phase and returns the class with the highest...

23 June 2024 10,114 1 View

Which standards, frameworks or best practices for AI governance do you know?

Frameworks for AI governance The use of artificial intelligence in the company requires its control and monitoring by the company's governing bodies due to legal and regulatory requirements....

17 June 2024 8,956 5 View

How can I calssify samples in multiclass EEG Motor Imagery dataset in MATLAB?

1) In EEG Motor Imagery multiclass dataset, e.g., BCI Competition IV dataset IIa, which is a 4-class dataset (Right Hand, Left Hand, Tongue and both Feet). How can I classify the samples with...

14 June 2024 3,570 1 View

In SPSS, how to generate P-value of corresponding odds-ratio with 95% CI using complex sample logistic regression option?

I plan to apply multinomial logistic regression using the complex sample option of SPSS. The dependent variables have 04 categories (low, moderate, high, and very high), and 05 independent...

02 June 2024 6,967 3 View

S there any classified research on the glass brush bush?

I need this research in PDF format

02 June 2024 661 0 View

How to do precise nutrient management with plant tissue and soil analysis data?

Currently, most of the fertilizer recommendations are based on crop requirements and the soil analysis value will be classified as low, medium, and high. if it is the medium recommendation and...

09 May 2024 1,179 2 View

What should I do if my research is stolen and republished? Note that it was previously published?

Please ... I need help from someone who has knowledge or an idea. I am a researcher from Syria in the field of remote sensing and photogrammetry. I have an international research published since...

09 May 2024 1,373 4 View

How to convert .pcap files to .csv files?

I am trying to apply a machine-learning classifier to a dataset. But the dataset is in the .pcap file extension. How can I apply classifiers to this dataset? Is there any process to convert the...

06 May 2024 8,197 4 View

Is the AAM cement can be classified as Geopolymer cement if we can tackle the efflorescence?

Unlike geopolymer cement, the AAM cement produced with sodium silicate/hydroxide will leach out the alkali Na+. Based on Davidovits' explanation, geopolymer cement will not leach out the sodium...

01 May 2024 9,110 3 View

What statistical tests should I use to analyze why a machine learning classifier outperforms other classifiers in IDS?

I am developing a machine-learning model for a Network Intrusion Detection System (IDS) and have experimented with several ensemble classifiers including Random Forest, Bagging, Stacking, and...

25 April 2024 5,123 4 View

Shafagat Mahmudova

Dear Leopold Hamminger,

Look the link, maybe helpful.

https://www.tensorflow.org/tutorials/sequences/audio_recognition

https://medium.com/iotforall/sound-classification-with-tensorflow-8209bdb03dfb

Regards, Shafagat

Leopold Hamminger

Thank you very much, @Shafagat Mahmudova, highly appreciated

Vinh T. Nguyen

Based on my understanding, when you're thinking about ML, you taught the machine what has happened in the past (or historical data), and then when new data comes it automatically classifies your new data into a given category.

Since your problem is only to compare between to any audio files and to detect the difference. I dont think ML would be the first choice (unless you have existing data that clearly classify and indicate the difference).

I think one easier approach is to read the two audio files as a matrix of numbers and you compare the similarity between these matrices in terms of pitch, speed..

One of my lab's mate is currently working on detecting the similarity between the cover song and its original one. The method I think would be the same.

Vinh T. Nguyen you are absolutely right, converting the audio files into matrices would be another way to go about it, and I thought about this first. However, I thought it would be a daunting task to isolate a particular phoneme.

It doesn't mean that an ML solution would be any easier. Again you are right in pointing out that existing data is needed, and I thought of collecting some from different native speakers. My hope is that existing data could be used to "machine learn" the phoneme j in Jack, for example, likewise the phonemes that follow. If I then present a learner's 'Jack' it would either recognize it as such (if pronounced like the native speakers did), or not.

Which may mean that a learner's 'Jack', that sounds like 'Chack' could possibly also be machine-learned... This needs some more thinking and any additional ideas are very much welcome.

Meanwhile again thanks a lot, Shafagat Mahmudova and Vinh T. Nguyen

Nhat Le

I think the approach mentioned in the paper maybe a right solution for your problem. https://arxiv.org/ftp/arxiv/papers/1601/1601.01577.pdf?fbclid=IwAR017Cf6SqvramF1BO0ZjLxXnBmTDqnivkFKbZw-2XEGZDhE5r2qF7HiC7s

They conducted the experiments to identify gender from audio files using a set of features known as MFCC (Mel-frequency cepstrum coeffcients). I think the issue is that you need to gather a proper amount of training data to perform the classification.

Nhat Le excellent literature link, it appears that Support Vector Machines are superior as classifiers. Very useful also the reference to Vapnik's Statistical Learning. Thank you!