Perception of Synthetic Visual Speech

Cohen, Michael M.; Walker, Rachel L.; Massaro, Dominic W.

doi:10.1007/978-3-662-13015-5_11

Michael M. Cohen³,
Rachel L. Walker³ &
Dominic W. Massaro³

Part of the book series: NATO ASI Series ((NATO ASI F,volume 150))

235 Accesses
10 Citations

Abstract

We report here on an experiment comparing visual recognition of monosyllabic words produced either by our computer-animated talker or a human talker. Recognition of the synthetic talker is reasonably close to that of the human talker, but a significant distance remains to be covered and we discuss improvements to the synthetic phoneme specifications. In an additional experiment using the same paradigm, we compare perception of our animated talker with a similarly generated point-light display, finding significantly worse performance for the latter for a number of viseme classes. We conclude with some ideas for future progress and briefly describe our new animated tongue.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Hardcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

University of California, Santa Cruz, USA
Michael M. Cohen, Rachel L. Walker & Dominic W. Massaro

Authors

Michael M. Cohen
View author publications
You can also search for this author in PubMed Google Scholar
Rachel L. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Dominic W. Massaro
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Ricoh California Research Center, 2882 Sand Hill Road #115, 94025-7022, Menlo Park, CA, USA
David G. Stork & Marcus E. Hennecke &
Department of Electrical Engineering, Stanford University, 94305, Stanford, CA, USA
David G. Stork & Marcus E. Hennecke &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Cohen, M.M., Walker, R.L., Massaro, D.W. (1996). Perception of Synthetic Visual Speech. In: Stork, D.G., Hennecke, M.E. (eds) Speechreading by Humans and Machines. NATO ASI Series, vol 150. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-13015-5_11

Download citation

DOI: https://doi.org/10.1007/978-3-662-13015-5_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-08252-8
Online ISBN: 978-3-662-13015-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics