Context-Dependent Phonetic Hidden Markov Models for Speaker-Independent Continuous Speech Recognition

Lee, Kay-Fu

doi:10.1007/978-3-642-76626-8_15

Kay-Fu Lee³

Part of the book series: NATO ASI Series ((NATO ASI F,volume 75))

282 Accesses
14 Citations

Abstract

The effectiveness of context-dependent phone modeling for speaker-dependent continuous speech recognition has recently been demonstrated. In this study, we apply context-dependent phone models to speaker-independent continuous speech recognition, and show that they are equally effective in this domain. In addition to evaluating several previously proposed context-dependent models, we also introduce two new context-dependent phonetic units: 1) function-word-dependent phone models, which focus on the most difficult subvocabulary, and 2) generalized triphones, which combine similar triphones together based on an information-theoretic measure. The subword clustering procedure used for generalized triphones can find the optimal number of models given a fixed amount of training data. We demonstrate that context-dependent modeling reduces the error rate by as much as 60%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

School of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, 15213, UK
Kay-Fu Lee

Authors

Kay-Fu Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Automatica e Informatica, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129, Torino, Italy
Pietro Laface
School of Computer Science, 3480 University St., Montreal, Quebec, H3A 2A7, Canada
Renato De Mori

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lee, KF. (1992). Context-Dependent Phonetic Hidden Markov Models for Speaker-Independent Continuous Speech Recognition. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-76626-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76628-2
Online ISBN: 978-3-642-76626-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics