17 September 2012 13 3K Report

I have come to understand that both FFT (not specific to voice only) and LPC are meant to break down voice signals into its wave components. The FFT algorithm (representing DFT) deconvolutes perceived signals into a series of waveforms, fundamental and harmonics. Each of these peaks (representing a waveform of a particular frequency) has an amplitude representing energy, which when recombined will reproduce the initial signal. LPC also produces a fundamental and a series of formant frequencies with the same concept, that each of the formants represents a component waveform with significant energy. The fundamental and formant series can be convoluted back producing the perceived signals. Maybe I am wrong somewhere?

I have analyzed a piece of vowel signal, 320 ms (non-independent /a/ ) with FFT which produce the following output F0 134.3 Hz, H1 266.2 Hz, H2 398.8 Hz and H3 529.3 Hz. On the other hand when I analyzed this same piece of signals with PRAAT for pitch and formants, I got the following result; F0 132.77 Hz, F1 452.3 Hz, F2 1391.2 Hz and F3 2390 Hz.

I now have two sets of component waveforms of very different frequency values, derived through two different mathematical algorithms, representing the same thing, component waveform with significant energies. This can’t be right! Any help will be much appreciated, thanks.

More Abu Dzarr's questions See All
Similar questions and discussions