Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform

Valor Miró, Juan Daniel; Pérez González de Martos, Alejandro Manuel; Civera Saiz, Jorge; Juan Císcar, Alfonso

doi:10.1007/978-3-642-35292-8_25

Files in this item

Name: paper.pdf

Size: 279.1Kb

Format: PDF

Description: Author's version.

Name: Valor, J.A. - ...

Size: 506.2Kb

Format: PDF

Description: Publisher's version

dc.contributor.author	Valor Miró, Juan Daniel	es_ES
dc.contributor.author	Pérez González de Martos, Alejandro Manuel	es_ES
dc.contributor.author	Civera Saiz, Jorge	es_ES
dc.contributor.author	Juan Císcar, Alfonso	es_ES
dc.date.accessioned	2014-01-28T07:29:51Z
dc.date.issued	2012
dc.identifier.isbn	978-3-642-35291-1 (print)
dc.identifier.isbn	978-3-642-35292-8 (on line)
dc.identifier.issn	1865-0929
dc.identifier.uri	http://hdl.handle.net/10251/35190
dc.description.abstract	[EN] In this paper we present the integration of a state-of-the-art ASR system into the Opencast Matterhorn platform, a free, open-source platform to support the management of educational audio and video content. The ASR system was trained on a novel large speech corpus, known as poliMedia, that was manually transcribed for the European project transLectures. This novel corpus contains more than 115 hours of transcribed speech that will be available for the research community. Initial results on the poliMedia corpus are also reported to compare the performance of different ASR systems based on the linear interpolation of language models. To this purpose, the in-domain poliMedia corpus was linearly interpolated with an external large-vocabulary dataset, the well-known Google N-Gram corpus. WER figures reported denote the notable improvement over the baseline performance as a result of incorporating the vast amount of data represented by the Google N-Gram corpus.	es_ES
dc.description.sponsorship	The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 287755. Also supported by the Spanish Government (MIPRCV "Consolider Ingenio 2010" and iTrans2 TIN2009-14511) and the Generalitat Valenciana (Prometeo/2009/014).
dc.language	English	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Communications in Computer and Information Science	es_ES
dc.rights	All rights reserved	es_ES
dc.subject	Speech Recognition	es_ES
dc.subject	Linear Combination	es_ES
dc.subject	Language Modeling	es_ES
dc.subject	Google N-Gram	es_ES
dc.subject	Opencast Matterhorn	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform	es_ES
dc.type	Article	es_ES
dc.embargo.lift	10000-01-01
dc.embargo.terms	forever	es_ES
dc.identifier.doi	10.1007/978-3-642-35292-8_25
dc.relation.projectID	info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/Generalitat Valenciana//PROMETEO09%2F2009%2F014/ES/Adaptive learning and multimodality in pattern recognition (Almapater)/	es_ES
dc.rights.accessRights	Open	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Instituto Universitario Mixto Tecnológico de Informática - Institut Universitari Mixt Tecnològic d'Informàtica	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Valor Miró, JD.; Pérez González De Martos, AM.; Civera Saiz, J.; Juan Císcar, A. (2012). Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform. Communications in Computer and Information Science. 328:237-246. https://doi.org/10.1007/978-3-642-35292-8_25	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.conferencename	Spanish Speech Technology Workshop/Iberian SLTech Workshop	es_ES
dc.relation.conferencedate	NOV 21-23, 2012	es_ES
dc.relation.conferenceplace	Univ Autonoma Madrid, ATVS Biometr Res Grp, Madrid, SPAIN	es_ES
dc.relation.publisherversion	http://dx.doi.org/10.1007/978-3-642-35292-8_25	es_ES
dc.description.upvformatpinicio	237	es_ES
dc.description.upvformatpfin	246	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	328	es_ES
dc.relation.senia	234198
dc.contributor.funder	European Commission
dc.contributor.funder	Ministerio de Ciencia e Innovación
dc.contributor.funder	Generalitat Valenciana

RiuNet: Institutional Repository of the Universidad Politécnica de Valencia