Abstract
We consider the matching function in vector quantization based speaker identification system. The model of a speaker is a codebook generated from the set of feature vectors from the speakers voice sample. The matching is performed by evaluating the similarity of the unknown speaker and the models in the database. In this paper, we propose to use weighted matching method that takes into account the correlations between the known models in the database. Larger weights are assigned to vectors that have high discriminating power between the speakers and vice versa. Experiments show that the new method provides significantly higher identification accuracy and it can detect the correct speaker from shorter speech samples more reliable than the unweighted matching method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Deller Jr. J.R., Hansen J.H.L., and Proakis J.G.: Discrete-time Processing of Speech Signals. Macmillan Publishing Company, New York, 2000.
Fränti P. and Kivijärvi J.: “Randomized local search algorithm for the clustering problem”, Pattern Analysis and Applications, 3(4): 358–369, 2000.
Furui S.: “Cepstral analysis technique for automatic speaker verification”. IEEE Transactions on Acoustics, Speech and Signal Processing, 29(2): 254–272, 1981.
He J., Liu L., and Palm G.: “A discriminative training algorithm for VQbased speaker identification”, IEEE Transactions on Speech and Audio Processing, 7(3): 353–356, 1999.
Kinnunen T., Kilpeläinen T., and Fränti P.: “Comparison of clustering algorithms in speaker identification”, Proc. IASTED Int. Conf. Signal Processing and Communications (SPC): 222–227. Marbella, Spain, 2000.
Kyung Y.J. and Lee H.S.: “Bootstrap and aggregating VQ classifier for speaker recognition”. Electronics Letters, 35(12): 973–974, 1999.
Pham T. and Wagner M., “Information based speaker identification”, Proc. Int. Conf. Pattern Recognition (ICPR), 3: 282–285, Barcelona, Spain, 2000.
Soong F.K., Rosenberg A.E., Juang B-H., and Rabiner L.R.: “A vector quantization approach to speaker recognition”, AT&T Technical Journal, 66: 14–26, 1987.
Zhen B., Wu X., Liu Z., and Chi H.: “On the use of bandpass liftering in speaker recognition”, Proc. 6th Int. Conf. of Spoken Lang. Processing (ICSLP), Beijing, China, 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kinnunen, T., Fränti, P. (2001). Speaker Discriminative Weighting Method for VQ-Based Speaker Identification. In: Bigun, J., Smeraldi, F. (eds) Audio- and Video-Based Biometric Person Authentication. AVBPA 2001. Lecture Notes in Computer Science, vol 2091. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45344-X_22
Download citation
DOI: https://doi.org/10.1007/3-540-45344-X_22
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42216-7
Online ISBN: 978-3-540-45344-4
eBook Packages: Springer Book Archive