Conferencia

McLaren, M.; Ferrer, L.; Castan, D.; Lawson, A.; Morgan N.; Georgiou P.; Morgan N.; Narayanan S.; Metze F.; Amazon Alexa; Apple; eBay; et al.; Google; Microsoft "The speakers in the wild (SITW) speaker recognition database" (2016) 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016. 08-12-September-2016:818-822
Estamos trabajando para incorporar este artículo al repositorio
Consulte el artículo en la página del editor
Consulte la política de Acceso Abierto del editor

Abstract:

The Speakers in the Wild (SITW) speaker recognition database contains hand-annotated speech samples from open-source media for the purpose of benchmarking text-independent speaker recognition technology on single and multi-speaker audio acquired across unconstrained or "wild" conditions. The database consists of recordings of 299 speakers, with an average of eight different sessions per person. Unlike existing databases for speaker recognition, this data was not collected under controlled conditions and thus contains real noise, reverberation, intraspeaker variability and compression artifacts. These factors are often convolved in the real world, as the SITW data shows, and they make SITW a challenging database for single-and multispeaker recognition Copyright ©2016 ISCA.

Registro:

Documento: Conferencia
Título:The speakers in the wild (SITW) speaker recognition database
Autor:McLaren, M.; Ferrer, L.; Castan, D.; Lawson, A.; Morgan N.; Georgiou P.; Morgan N.; Narayanan S.; Metze F.; Amazon Alexa; Apple; eBay; et al.; Google; Microsoft
Filiación:Speech Technology and Research Laboratory, SRI InternationalCA, United States
Departamento de Computación, FCEN, Universidad de Buenos Aires, CONICET, Argentina
Palabras clave:Database; Real-world data; Speaker recognition; Character recognition; Database systems; Speech communication; Speech processing; Compression artifacts; Controlled conditions; Open sources; Real-world; Speaker recognition; Text independents; Speech recognition
Año:2016
Volumen:08-12-September-2016
Página de inicio:818
Página de fin:822
DOI: http://dx.doi.org/10.21437/Interspeech.2016-1129
Título revista:17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
Título revista abreviado:Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
ISSN:2308457X
Registro:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v08-12-September-2016_n_p818_McLaren

Referencias:

  • Morrison, G., Zhang, C., Enzinger, E., Ochoa, F., Bleach, D., Johnson, M., Folkes, B., Chow, D., (2015) Forensic Database of Voice Recordings of 500+ Australian English Speakers, , http://databases.forensicvoice-comparison.net
  • Millar, J.B., Vonwiller, J.P., Harrington, J.M., Dermody, P.J., The Australian national database of spoken language (1994) Proc IEEE ICASSP
  • McCool, C., Marcel, S., MOBIO database for the ICPR 2010 face and speech competition (2009) Idiap, Tech. Rep
  • Vloed Der DVan, Bouten, J., Van Leeuwen, D.A., NFI-FRITS: A forensic speaker recognition database and some first experiments (2014) Proceedings of Odyssey: The Speaker and Language Recognition Workshop, , Joensuu, Finland
  • Bell, P., Gales, M., Hain, T., Kilgour, J., Lanchantin, P., Liu, X., Mc-Parland, A., Webster, M., The MGB challenge: Evaluating multi-genre broadcast media transcription (2015) Proc. IEEE ASRU
  • Janin, A., Baron, D., Edwards, J., Ellis, D., Gelbart, D., Morgan, N., Peskin, B., Stolcke, A., The icsi meeting corpus (2003) Proc. IEEE ICASSP, , IEEE
  • The NIST Year, (2010) Speaker Recognition Evaluation Plan, 2010, , http://www.nist.gov/itl/iad/mig/upload/NISTSRE10evalplan-r6.pdf
  • Manning, C.D., Raghavan, P., Schütze, H., (2008) Introduction to Information Retrieval, 1. , Cambridge University Press Cambridge
  • Brummer, N., Preez Du, J., Application independent evaluation of speaker detection (2006) Computer Speech and Language, 20 (2-3), pp. 230-275A4 - Amazon Alexa; Apple; eBay; et al.; Google; Microsoft

Citas:

---------- APA ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., Morgan N.,..., Amazon Alexa; Apple; eBay; et al.; Google; Microsoft (2016) . The speakers in the wild (SITW) speaker recognition database. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016, 08-12-September-2016, 818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129
---------- CHICAGO ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., et al. "The speakers in the wild (SITW) speaker recognition database" . 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016 08-12-September-2016 (2016) : 818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129
---------- MLA ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., et al. "The speakers in the wild (SITW) speaker recognition database" . 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016, vol. 08-12-September-2016, 2016, pp. 818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129
---------- VANCOUVER ----------
McLaren, M., Ferrer, L., Castan, D., Lawson, A., Morgan N., Georgiou P., et al. The speakers in the wild (SITW) speaker recognition database. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2016;08-12-September-2016:818-822.
http://dx.doi.org/10.21437/Interspeech.2016-1129