Abstract
Lipreading has become a hot research topic in recent years, since the visual information extracted from lip movements has been shown to improve the performance of automatic speech recognition (ASR) systems, especially in noisy environments [1]-[3], [5]. There are two important issues in lipreading: 1) how to extract the most efficient features from lip image sequences, and 2) how to build lipreading models. This paper mainly focuses on how to choose more efficient features for lipreading.
Keywords
- Linear Discriminant Analysis
- Discrete Cosine Transform
- Discrete Fourier Transform
- Local Binary Pattern
- Automatic Speech Recognition
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
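As a concrete illustration of the two feature families named above (not the paper's exact pipeline, which the abstract does not specify), the sketch below computes a holistic 2-D DCT descriptor and a local-texture LBP histogram from a toy lip-region image; the `roi` array, block sizes, and neighborhood ordering are all illustrative assumptions.

```python
# Hedged sketch: a "global" descriptor via the 2-D DCT and a "local"
# descriptor via 8-neighbor LBP histograms, two feature families from
# the keyword list. Array sizes and parameters are illustrative only.
import numpy as np

def dct2(img):
    """2-D DCT-II built from two separable 1-D transforms (global features)."""
    def dct1(x):
        # DCT-II along the last axis: C[k, n] = cos(pi*k*(2n+1)/(2N))
        N = x.shape[-1]
        n = np.arange(N)
        k = n.reshape(-1, 1)
        C = np.cos(np.pi * k * (2 * n + 1) / (2 * N))
        return 2.0 * (x @ C.T)
    return dct1(dct1(img).T).T

def lbp_histogram(img):
    """Normalized histogram of 8-neighbor local binary patterns (local features)."""
    h, w = img.shape
    center = img[1:-1, 1:-1]
    codes = np.zeros(center.shape, dtype=int)
    # Clockwise 8-neighborhood; each neighbor contributes one bit of the code.
    neighbors = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
                 (1, 1), (1, 0), (1, -1), (0, -1)]
    for bit, (dy, dx) in enumerate(neighbors):
        shifted = img[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        codes |= (shifted >= center).astype(int) << bit
    hist, _ = np.histogram(codes, bins=256, range=(0, 256))
    return hist / hist.sum()

# Toy stand-in for a cropped lip region of interest.
roi = np.random.default_rng(0).random((16, 16))
global_feat = dct2(roi)[:4, :4].ravel()  # 16 low-frequency DCT coefficients
local_feat = lbp_histogram(roi)          # 256-bin texture histogram
```

Low-order DCT coefficients summarize the overall mouth shape in a few numbers, while LBP histograms capture fine texture around the lips; a real system would typically apply a dimensionality reduction such as LDA on top of either representation.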
References
Morishima, S., Ogata, S., Murai, K., Nakamura, S.: Audio-visual speech translation with automatic lip synchronization and face tracking based on 3D head model. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 2, pp. 2117–2120 (2002)
Potamianos, G., Graf, H.P., Cosatto, E.: An image transform approach for HMM based automatic lipreading. In: Proc. Int. Conf. Image Process, Chicago, pp. 173–177 (1998)
Dupont, S., Luettin, J.: Audio-visual speech modeling for continuous speech recognition. IEEE Trans. on Multimedia 2, 141–151 (2000)
Shen, L., Bai, L.: Gabor feature based face recognition using kernel methods. In: Proc. IEEE Int. Conf. Automatic Face and Gesture Recognition (AFGR), pp. 170–176 (2004)
Matthews, I., et al.: Extraction of visual features for lipreading. IEEE Trans. on Pattern Analysis and Machine Intelligence 24(2) (2002)
Duchnowski, P., et al.: Toward movement-invariant automatic lip-reading and speech recognition. In: Proc. Int. Conf. Acoust. Speech Signal Process., Detroit, pp. 109–111 (1995)
Navon, D.: Forest before the trees: the precedence of global features in visual perception. Cognitive Psychology 9, 353–383 (1977)
Biederman, I.: On the semantics of a glance at a scene. In: Kubovy, M., Pomerantz, J. (eds.) Perceptual organization, pp. 213–253. Erlbaum (1981)
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
Cite this paper
Zhang, S., Yao, H., Wan, Y., Wang, D. (2007). Combining Global and Local Classifiers for Lipreading. In: Paiva, A.C.R., Prada, R., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2007. Lecture Notes in Computer Science, vol 4738. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74889-2_73
DOI: https://doi.org/10.1007/978-3-540-74889-2_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74888-5
Online ISBN: 978-3-540-74889-2
eBook Packages: Computer Science, Computer Science (R0)