Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing

Roland GOECKE, QN Tran, J. Bruce Millar, Alexander Zelinsky, Jordi Robert-Ribes

Research output: A Conference proceeding or a Chapter in BookConference contributionpeer-review

Abstract

We have recently proposed a new algorithm for the automatic extraction of lip feature points. Based on their positions, parameters describing the shape of the mouth are derived. Since the algorithm is based on a stereo vision face tracking system, all measurements are in real-world distances. In this paper, we evaluate the accuracy of the automatic feature extraction algorithm by comparing its results with a manual feature extraction process. The results show an average error of about 1-2mm for the internal mouth width and height. In the second part of the paper, we present the design of an AV speech database for Australian English for future experiments on the correlation of audio and video speech signals.
Original languageEnglish
Title of host publicationProceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000
EditorsMichael Barlow, Philip Rose
Place of PublicationCanberra
PublisherAustralian Speech Science and Technology Association (ASSTA)
Pages92-97
Number of pages6
ISBN (Print)0958857989
Publication statusPublished - 4 Dec 2000
Event8th Australian International Conference on Speech Science and Technology SST-2000 - Canberra, Australia
Duration: 4 Dec 20007 Dec 2000

Conference

Conference8th Australian International Conference on Speech Science and Technology SST-2000
Abbreviated titleSST-2000
Country/TerritoryAustralia
CityCanberra
Period4/12/007/12/00

Fingerprint

Dive into the research topics of 'Validation of an automatic lip-tracking algorithm and design of a database for audio-video speech processing'. Together they form a unique fingerprint.

Cite this