Abstract
We have recently proposed a new algorithm for the automatic extraction of lip feature points. Based on their positions, parameters describing the shape of the mouth are derived. Since the algorithm is based on a stereo vision face tracking system, all measurements are in real-world distances. In this paper, we evaluate the accuracy of the automatic feature extraction algorithm by comparing its results with a manual feature extraction process. The results show an average error of about 1-2mm for the internal mouth width and height. In the second part of the paper, we present the design of an AV speech database for Australian English for future experiments on the correlation of audio and video speech signals.
Original language | English |
---|---|
Title of host publication | Proceedings of the 8th Australian International Conference on Speech Science and Technology SST-2000 |
Editors | Michael Barlow, Philip Rose |
Place of Publication | Canberra |
Publisher | Australian Speech Science and Technology Association (ASSTA) |
Pages | 92-97 |
Number of pages | 6 |
ISBN (Print) | 0958857989 |
Publication status | Published - 4 Dec 2000 |
Event | 8th Australian International Conference on Speech Science and Technology SST-2000 - Canberra, Australia Duration: 4 Dec 2000 → 7 Dec 2000 |
Conference
Conference | 8th Australian International Conference on Speech Science and Technology SST-2000 |
---|---|
Abbreviated title | SST-2000 |
Country/Territory | Australia |
City | Canberra |
Period | 4/12/00 → 7/12/00 |