Speech Recognition — Feature Extraction MFCC & PLP

Requirement

Source
  • Remove vocal fold excitation (F0) — the pitch information.
  • Make the extracted features independent.
  • Adjust to how humans perceive loudness and frequency of sound.
  • Capture the dynamics of phones (the context).

Mel-frequency cepstral coefficients (MFCC)

Source
Jurafsky & Martin, fig. 9.9
Source Top right: a signal that composed of two frequency
Source
Source
Source
Source
Paul Taylor (2008)
Source
Source
Source

Perceptual Linear Prediction (PLP)

Source

Thoughts

Next

Credit and reference

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store