Abstract:
The Speakers in the Wild (SITW) speaker recognition database contains hand-annotated speech samples from open-source media for the purpose of benchmarking text-independent speaker recognition technology on single and multi-speaker audio acquired across unconstrained or "wild" conditions. The database consists of recordings of 299 speakers, with an average of eight different sessions per person. Unlike existing databases for speaker recognition, this data was not collected under controlled conditions and thus contains real noise, reverberation, intraspeaker variability and compression artifacts. These factors are often convolved in the real world, as the SITW data shows, and they make SITW a challenging database for single-and multispeaker recognition Copyright ©2016 ISCA.
Registro:
Documento: |
Conferencia
|
Título: | The speakers in the wild (SITW) speaker recognition database |
Autor: | McLaren, M.; Ferrer, L.; Castan, D.; Lawson, A.; Morgan N.; Georgiou P.; Morgan N.; Narayanan S.; Metze F.; Amazon Alexa; Apple; eBay; et al.; Google; Microsoft |
Filiación: | Speech Technology and Research Laboratory, SRI InternationalCA, United States Departamento de Computación, FCEN, Universidad de Buenos Aires, CONICET, Argentina
|
Palabras clave: | Database; Real-world data; Speaker recognition; Character recognition; Database systems; Speech communication; Speech processing; Compression artifacts; Controlled conditions; Open sources; Real-world; Speaker recognition; Text independents; Speech recognition |
Año: | 2016
|
Volumen: | 08-12-September-2016
|
Página de inicio: | 818
|
Página de fin: | 822
|
DOI: |
http://dx.doi.org/10.21437/Interspeech.2016-1129 |
Título revista: | 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
|
Título revista abreviado: | Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
|
ISSN: | 2308457X
|
Registro: | https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v08-12-September-2016_n_p818_McLaren |
Referencias:
- Morrison, G., Zhang, C., Enzinger, E., Ochoa, F., Bleach, D., Johnson, M., Folkes, B., Chow, D., (2015) Forensic Database of Voice Recordings of 500+ Australian English Speakers, , http://databases.forensicvoice-comparison.net
- Millar, J.B., Vonwiller, J.P., Harrington, J.M., Dermody, P.J., The Australian national database of spoken language (1994) Proc IEEE ICASSP
- McCool, C., Marcel, S., MOBIO database for the ICPR 2010 face and speech competition (2009) Idiap, Tech. Rep
- Vloed Der DVan, Bouten, J., Van Leeuwen, D.A., NFI-FRITS: A forensic speaker recognition database and some first experiments (2014) Proceedings of Odyssey: The Speaker and Language Recognition Workshop, , Joensuu, Finland
- Bell, P., Gales, M., Hain, T., Kilgour, J., Lanchantin, P., Liu, X., Mc-Parland, A., Webster, M., The MGB challenge: Evaluating multi-genre broadcast media transcription (2015) Proc. IEEE ASRU
- Janin, A., Baron, D., Edwards, J., Ellis, D., Gelbart, D., Morgan, N., Peskin, B., Stolcke, A., The icsi meeting corpus (2003) Proc. IEEE ICASSP, , IEEE
- The NIST Year, (2010) Speaker Recognition Evaluation Plan, 2010, , http://www.nist.gov/itl/iad/mig/upload/NISTSRE10evalplan-r6.pdf
- Manning, C.D., Raghavan, P., Schütze, H., (2008) Introduction to Information Retrieval, 1. , Cambridge University Press Cambridge
- Brummer, N., Preez Du, J., Application independent evaluation of speaker detection (2006) Computer Speech and Language, 20 (2-3), pp. 230-275A4 - Amazon Alexa; Apple; eBay; et al.; Google; Microsoft
Citas:
---------- APA ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., Morgan N.,..., Amazon Alexa; Apple; eBay; et al.; Google; Microsoft
(2016)
. The speakers in the wild (SITW) speaker recognition database. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016, 08-12-September-2016, 818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129---------- CHICAGO ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., et al.
"The speakers in the wild (SITW) speaker recognition database"
. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016 08-12-September-2016
(2016) : 818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129---------- MLA ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., et al.
"The speakers in the wild (SITW) speaker recognition database"
. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016, vol. 08-12-September-2016, 2016, pp. 818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129---------- VANCOUVER ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., et al. The speakers in the wild (SITW) speaker recognition database. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2016;08-12-September-2016:818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129