I am working on statistical parametric speech synthesis. I have extracted the fundamental frequency (F0) and MFCCs from speech waveforms. The next task is to invert the MFCCs back to speech waveforms. For this, I have read about sinusoidal synthesis methods, which need the amplitude, phase and frequency of each sinusoid to be determined from the extracted speech parameters. How can the amplitude and phase information be determined from the MFCC sequence and the fundamental frequency?
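To make the question concrete, here is a minimal sketch of the kind of harmonic sinusoidal synthesis I have in mind. It is only my own illustration under stated assumptions, not the method of the paper referenced in this post: I assume the MFCCs are an orthonormal DCT-II of log mel filterbank power, that the filter centres are equally spaced on the mel scale up to the Nyquist frequency, and that the phase of each harmonic is simply the accumulated F0 with zero initial phase. All function names (`dct_matrix`, `mfcc_to_log_mel`, `harmonic_amplitudes`, `synthesize`) are hypothetical helpers.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; its transpose is the inverse transform."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    d = np.sqrt(2.0 / n) * np.cos(np.pi * (m + 0.5) * k / n)
    d[0, :] = np.sqrt(1.0 / n)
    return d

def mfcc_to_log_mel(mfcc, n_mels):
    """Zero-pad truncated MFCCs and invert the DCT to get a smoothed
    log mel spectrum (assumes an orthonormal DCT-II was used in analysis)."""
    c = np.zeros(n_mels)
    c[:len(mfcc)] = mfcc
    return dct_matrix(n_mels).T @ c

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def harmonic_amplitudes(log_mel, f0, sr, n_harm):
    """Sample the smoothed envelope at the harmonics k * f0.
    Assumes log_mel holds log *power*, hence the factor 0.5."""
    n_mels = len(log_mel)
    # Assumed filter centres: equally spaced on the mel axis, 0 Hz to Nyquist.
    centres = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)[1:-1]
    freqs = f0 * np.arange(1, n_harm + 1)
    log_env = np.interp(hz_to_mel(freqs), centres, log_mel)
    amps = np.exp(0.5 * log_env)
    # Silence harmonics above Nyquist and all harmonics of unvoiced frames.
    amps[(freqs >= sr / 2.0) | (f0 <= 0)] = 0.0
    return amps

def synthesize(f0_frames, amp_frames, sr, hop):
    """Sum of harmonics; phase is the running sum of F0 (zero initial phase)."""
    f0 = np.repeat(np.asarray(f0_frames, dtype=float), hop)
    amps = np.repeat(np.asarray(amp_frames, dtype=float), hop, axis=0)
    phase = 2.0 * np.pi * np.cumsum(f0) / sr  # fundamental phase per sample
    k = np.arange(1, amps.shape[1] + 1)       # harmonic numbers
    return np.sum(amps * np.cos(phase[:, None] * k[None, :]), axis=1)
```

In this sketch the amplitudes come from the MFCC-derived envelope and the frequencies from F0, but the phase is a placeholder. What I cannot work out is how a better phase estimate is obtained from these parameters.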

I have referred to the following research paper. Can anyone please explain how phase synthesis and amplitude generation are done in this paper?

Article Speech Reconstruction From Mel Frequency Cepstral Coefficien...
