Unsupervised learning of acoustic sub-word units

Authors:
Balakrishnan Varadarajan, Johns Hopkins University, Baltimore, MD
Sanjeev Khudanpur, Johns Hopkins University, Baltimore, MD
Emmanuel Dupoux, Laboratoire de Science Cognitive, Paris, France
HLT-Short '08: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, June 2008, Pages 165–168

Published: 16 June 2008

ABSTRACT

Accurate unsupervised learning of phonemes of a language directly from speech is demonstrated via an algorithm for joint unsupervised learning of the topology and parameters of a hidden Markov model (HMM); states and short state-sequences through this HMM correspond to the learnt sub-word units. The algorithm, originally proposed for unsupervised learning of allophonic variations within a given phoneme set, has been adapted to learn without any knowledge of the phonemes. An evaluation methodology is also proposed, whereby the state-sequence that aligns to a test utterance is transduced in an automatic manner to a phoneme-sequence and compared to its manual transcription. Over 85% phoneme recognition accuracy is demonstrated for speaker-dependent learning from fluent, large-vocabulary speech.
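
The evaluation idea in the abstract (map the aligned state sequence to phonemes, then compare against the manual transcription) can be illustrated with a minimal sketch. The abstract does not say how the transduction is built, so this example assumes a simple frame-level majority-vote mapping from states to phonemes and scores accuracy as one minus the edit distance divided by the reference length. All function names (`learn_state_to_phone`, `transduce`, `phoneme_accuracy`) are hypothetical and are not taken from the paper.

```python
from collections import Counter


def learn_state_to_phone(state_frames, phone_frames):
    """Assumed mapping: each HMM state is labeled with the phoneme it
    co-occurs with most often in frame-aligned data."""
    counts = {}
    for state, phone in zip(state_frames, phone_frames):
        counts.setdefault(state, Counter())[phone] += 1
    return {state: c.most_common(1)[0][0] for state, c in counts.items()}


def transduce(state_frames, state_to_phone):
    """Map each frame's state to a phoneme and merge consecutive repeats.
    Simplification: a genuinely repeated phoneme would also be merged."""
    phones = []
    for state in state_frames:
        phone = state_to_phone[state]
        if not phones or phones[-1] != phone:
            phones.append(phone)
    return phones


def edit_distance(ref, hyp):
    """Levenshtein distance with unit costs for substitution, insertion, deletion."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]


def phoneme_accuracy(ref, hyp):
    """Accuracy = 1 - (edit distance / reference length)."""
    return 1.0 - edit_distance(ref, hyp) / len(ref)


# Toy data: frame-level state ids and reference phoneme labels.
mapping = learn_state_to_phone([0, 0, 1, 1, 1, 2, 2],
                               ["a", "a", "b", "b", "b", "a", "a"])
hypothesis = transduce([0, 0, 1, 2, 2], mapping)
print(hypothesis, phoneme_accuracy(["a", "b", "a"], hypothesis))
```

In this toy case, states 0 and 2 both map to the same phoneme, which mirrors the abstract's remark that individual states and short state sequences can both correspond to a sub-word unit.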

References

T. Fukada, M. Bacchiani, K. K. Paliwal, and Y. Sagisaka. 1996. Speech recognition based on acoustically derived segment units. In ICSLP, pages 1077–1080.
K. Maekawa. 2003. Corpus of Spontaneous Japanese: its design and evaluation. In ISCA/IEEE Workshop on Spontaneous Speech Processing and Recognition.
K. K. Paliwal and A. M. Kulkarni. 1987. Segmentation and labeling using vector quantization and its application in isolated word recognition. Journal of the Acoustical Society of India, 15:102–110.
H. Singer and M. Ostendorf. 1996. Maximum likelihood successive state splitting. In ICASSP, pages 601–604.
J. Takami and S. Sagayama. 1992. A successive state splitting algorithm for efficient allophone modeling. In ICASSP, pages 573–576.
J. G. Wilpon, B. H. Juang, and L. R. Rabiner. 1987. An investigation on the use of acoustic sub-word units for automatic speech recognition. In ICASSP, pages 821–824.

Index Terms

Unsupervised learning of acoustic sub-word units
1. Computing methodologies

Published in

HLT-Short '08: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, June 2008, 307 pages
Publisher: Association for Computational Linguistics, United States
Acceptance Rates
Overall Acceptance Rate: 240 of 768 submissions, 31%