Specifics of Hidden Markov Model Modifications for Large Vocabulary Continuous Speech Recognition
Volume 15, Issue 1 (2004), pp. 93–110
Pub. online: 1 January 2004
Type: Research Article
Received
1 July 2003
1 July 2003
Published
1 January 2004
1 January 2004
Abstract
Specifics of hidden Markov model‐based speech recognition are investigated. Influence of modeling simple and context‐dependent phones, using simple Gaussian, two and three‐component Gaussian mixture probability density functions for modeling feature distribution, and incorporating language model are discussed. Word recognition rates and model complexity criteria are used for evaluating suitability of these modifications for practical applications. Development of large vocabulary continuous speech recognition system using HTK toolkit and WSJCAM0 English speech corpus is described. Results of experimental investigations are presented.