Employing The Complete Face in AVSR to Recover by Ben @VideoLectures

English

Employing The Complete Face in AVSR to Recover from Facial Occlusions

Existing Audio-Visual Speech Recognition (AVSR) systems visually focus intensely on a small region of the face, centred on the immediate mouth area. This is poor design for a variety reasons in real world situations because any occlusion to this small area renders all visual advantage null and void. This is poorby design because it is well known that humans use the complete face to speechread. We demonstrate a new application of a novel visual algorithm, the Multi-Channel Gradient Model, the deploys information from the complete face to perform AVSR. Our MCGM model performs near to the performance of Discrete Cosine Transforms in the case where a small region of interest around the lips, but in the case of an occluded face we can achieve results that match nearly 70% of the performance that DCTs can achieve on the DCT best case, lips centeric approach.

Find OpenCourseWare Online Exams!

Attribution: The Open Education Consortium
http://www.ocwconsortium.org/courses/view/e1f220d1b851ac65f6172d6012c4cc13/
Course Home http://videolectures.net/wapa2011_hall_occlusions/

	5 Physiotherapy Flashcards Set 5 By Rhodes Start Flashcards
	Art History ARTH209 20th Century By Rebecca Butterfield Start Quiz
©flickr: Justin	Music Appreciation Final Practice By Madison Christian Start Exam
	41 Biology 41 Osmotic Regulation and Excretion MCQ By OpenStax Start Quiz
	Anthropology Economic System By Richley Crapo Start Assignment
	Western Political Thought MCQ By Saylor Foundation Start Quiz
	Macroeconomics By OpenStax Read Online Course
	SCJP Online Exam 310-065 By Prateek Ashtikar Start Quiz
	10 Lec:10 Sampling Confidence Intervals By Janet Forrester Start Quiz
	1 AP 01 Human Body Anatomy Physiology Essay By OpenStax Start Flashcards