Transmission channel effects on human speaker identification in multiparty conference calls

Laura FERNANDEZ GALLARDO, Michael WAGNER, Sebastian Moeller

Research output: A Conference proceeding or a Chapter in BookConference contribution

Abstract

Telecommunication systems available today allow efficient voice transmission through channels of different audio bandwidths and terminated with different user interfaces. However, the sending and receiving user interface, the bandwidth limitation and the effects of lossy signal compression degrade the quality of the received signal and impede an unequivocal recognition of the talker. In the present work the performance of a group of human listeners identifying speakers under different conditions is assessed by transmitting a set of multi-speaker conversations through networks and user interfaces of diverse characteristics. After learning the voices of the participants of the phone calls in clean conditions, listeners identified the talkers while listening to the degraded conversations, being more accurate for particular transmission scenarios. The results show that human speaker identification can be considered as an additional criterion when judging the benefits of enhanced bandwidths
Original languageEnglish
Title of host publicationProceedings 8th International Conference on Information Technology in Asia (CITA), 2013
EditorsJane Labadin, Jacey-Lynn Minoi, Dayang NurFatimah Awang Iskandar, Azman Bujang Masli
Place of PublicationPiscataway, New Jersey, USA
PublisherIEEE
Pages38-43
Number of pages6
Volume1
ISBN (Print)9781479910922
DOIs
Publication statusPublished - 2013
Event8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle - Kuching, Kuching, Malaysia
Duration: 1 Jul 20134 Jul 2013

Conference

Conference8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle
CountryMalaysia
CityKuching
Period1/07/134/07/13

Fingerprint

Teleconferencing
User interfaces
Bandwidth
Speech transmission
Telecommunication systems
Interfaces (computer)

Cite this

FERNANDEZ GALLARDO, L., WAGNER, M., & Moeller, S. (2013). Transmission channel effects on human speaker identification in multiparty conference calls. In J. Labadin, J-L. Minoi, D. N. A. Iskandar, & A. B. Masli (Eds.), Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013 (Vol. 1, pp. 38-43). Piscataway, New Jersey, USA: IEEE. https://doi.org/10.1109/CITA.2013.6637569
FERNANDEZ GALLARDO, Laura ; WAGNER, Michael ; Moeller, Sebastian. / Transmission channel effects on human speaker identification in multiparty conference calls. Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013. editor / Jane Labadin ; Jacey-Lynn Minoi ; Dayang NurFatimah Awang Iskandar ; Azman Bujang Masli. Vol. 1 Piscataway, New Jersey, USA : IEEE, 2013. pp. 38-43
@inproceedings{d2727cd98e9a4390b2bbc1ffc12208ab,
title = "Transmission channel effects on human speaker identification in multiparty conference calls",
abstract = "Telecommunication systems available today allow efficient voice transmission through channels of different audio bandwidths and terminated with different user interfaces. However, the sending and receiving user interface, the bandwidth limitation and the effects of lossy signal compression degrade the quality of the received signal and impede an unequivocal recognition of the talker. In the present work the performance of a group of human listeners identifying speakers under different conditions is assessed by transmitting a set of multi-speaker conversations through networks and user interfaces of diverse characteristics. After learning the voices of the participants of the phone calls in clean conditions, listeners identified the talkers while listening to the degraded conversations, being more accurate for particular transmission scenarios. The results show that human speaker identification can be considered as an additional criterion when judging the benefits of enhanced bandwidths",
keywords = "human speaker identification, listening test, multiparty conference call",
author = "{FERNANDEZ GALLARDO}, Laura and Michael WAGNER and Sebastian Moeller",
year = "2013",
doi = "10.1109/CITA.2013.6637569",
language = "English",
isbn = "9781479910922",
volume = "1",
pages = "38--43",
editor = "Jane Labadin and Jacey-Lynn Minoi and Iskandar, {Dayang NurFatimah Awang} and Masli, {Azman Bujang}",
booktitle = "Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013",
publisher = "IEEE",

}

FERNANDEZ GALLARDO, L, WAGNER, M & Moeller, S 2013, Transmission channel effects on human speaker identification in multiparty conference calls. in J Labadin, J-L Minoi, DNA Iskandar & AB Masli (eds), Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013. vol. 1, IEEE, Piscataway, New Jersey, USA, pp. 38-43, 8th International Conference on Information Technology in Asia - Smart Devices Trend: Technologising Future Lifestyle, Kuching, Malaysia, 1/07/13. https://doi.org/10.1109/CITA.2013.6637569

Transmission channel effects on human speaker identification in multiparty conference calls. / FERNANDEZ GALLARDO, Laura; WAGNER, Michael; Moeller, Sebastian.

Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013. ed. / Jane Labadin; Jacey-Lynn Minoi; Dayang NurFatimah Awang Iskandar; Azman Bujang Masli. Vol. 1 Piscataway, New Jersey, USA : IEEE, 2013. p. 38-43.

Research output: A Conference proceeding or a Chapter in BookConference contribution

TY - GEN

T1 - Transmission channel effects on human speaker identification in multiparty conference calls

AU - FERNANDEZ GALLARDO, Laura

AU - WAGNER, Michael

AU - Moeller, Sebastian

PY - 2013

Y1 - 2013

N2 - Telecommunication systems available today allow efficient voice transmission through channels of different audio bandwidths and terminated with different user interfaces. However, the sending and receiving user interface, the bandwidth limitation and the effects of lossy signal compression degrade the quality of the received signal and impede an unequivocal recognition of the talker. In the present work the performance of a group of human listeners identifying speakers under different conditions is assessed by transmitting a set of multi-speaker conversations through networks and user interfaces of diverse characteristics. After learning the voices of the participants of the phone calls in clean conditions, listeners identified the talkers while listening to the degraded conversations, being more accurate for particular transmission scenarios. The results show that human speaker identification can be considered as an additional criterion when judging the benefits of enhanced bandwidths

AB - Telecommunication systems available today allow efficient voice transmission through channels of different audio bandwidths and terminated with different user interfaces. However, the sending and receiving user interface, the bandwidth limitation and the effects of lossy signal compression degrade the quality of the received signal and impede an unequivocal recognition of the talker. In the present work the performance of a group of human listeners identifying speakers under different conditions is assessed by transmitting a set of multi-speaker conversations through networks and user interfaces of diverse characteristics. After learning the voices of the participants of the phone calls in clean conditions, listeners identified the talkers while listening to the degraded conversations, being more accurate for particular transmission scenarios. The results show that human speaker identification can be considered as an additional criterion when judging the benefits of enhanced bandwidths

KW - human speaker identification

KW - listening test

KW - multiparty conference call

U2 - 10.1109/CITA.2013.6637569

DO - 10.1109/CITA.2013.6637569

M3 - Conference contribution

SN - 9781479910922

VL - 1

SP - 38

EP - 43

BT - Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013

A2 - Labadin, Jane

A2 - Minoi, Jacey-Lynn

A2 - Iskandar, Dayang NurFatimah Awang

A2 - Masli, Azman Bujang

PB - IEEE

CY - Piscataway, New Jersey, USA

ER -

FERNANDEZ GALLARDO L, WAGNER M, Moeller S. Transmission channel effects on human speaker identification in multiparty conference calls. In Labadin J, Minoi J-L, Iskandar DNA, Masli AB, editors, Proceedings 8th International Conference on Information Technology in Asia (CITA), 2013. Vol. 1. Piscataway, New Jersey, USA: IEEE. 2013. p. 38-43 https://doi.org/10.1109/CITA.2013.6637569