If I start reading scientific papers, which are the must-read papers on "speech to text" conversion?

John Salatas Popular answer

This is a rather broad field. It would be helpful if you could be more specific.

In any case, I think it is worth looking at the publications of the CMU SCS Speech Group:

http://www.speech.cs.cmu.edu/paper_index/

Prakasam Periasamy

Find the attached paper for your information.

Lyes Demri

An approach that I have found to work quite well is to set yourself a fixed number of papers to read. For example, search Google for "speech to text" and read 10 papers. By the tenth paper, you will probably have figured out who the most prolific authors on the subject are; then read their papers.
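The tallying step in the approach above can be sketched in a few lines of Python. This is only an illustration: the author names are hypothetical placeholders, and the `most_prolific` helper is an assumed name, not part of any library. The idea is to count how many of the papers you have read each author appears on.

```python
from collections import Counter

# Hypothetical author lists, one per paper read; replace with real ones.
papers = [
    ["Author A", "Author B"],
    ["Author B", "Author C"],
    ["Author A", "Author B", "Author D"],
]

def most_prolific(author_lists, top_n=3):
    """Count the papers each author appears on and return the top_n authors."""
    # set() guards against a name being listed twice on the same paper.
    counts = Counter(name for authors in author_lists for name in set(authors))
    return counts.most_common(top_n)

top_authors = most_prolific(papers)
print(top_authors)
```

The authors at the top of the list are the ones whose other papers are probably worth reading next.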

A.G. Ramakrishnan

Start with L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE 77(2):257-286, February 1989.

To understand this paper, you may have to read other papers or background material. However, it is the basis of all current commercial speech recognition systems. The newer approach uses deep neural networks, and the latest paper on it is:

Deep Speech: Scaling up end-to-end speech recognition

Awni Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng.

Shuvanon Razik

Thank you, everyone!

Leonard Goeirmanto

These references are not really specific, but hopefully they can help you: