
Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform

RiuNet: Institutional Repository of the Universitat Politècnica de València

dc.contributor.author Valor Miró, Juan Daniel es_ES
dc.contributor.author Pérez González de Martos, Alejandro Manuel es_ES
dc.contributor.author Civera Saiz, Jorge es_ES
dc.contributor.author Juan Císcar, Alfonso es_ES
dc.date.accessioned 2014-01-28T07:29:51Z
dc.date.issued 2012
dc.identifier.isbn 978-3-642-35291-1 (print)
dc.identifier.isbn 978-3-642-35292-8 (on line)
dc.identifier.issn 1865-0929
dc.identifier.uri http://hdl.handle.net/10251/35190
dc.description.abstract [EN] In this paper we present the integration of a state-of-the-art ASR system into the Opencast Matterhorn platform, a free, open-source platform to support the management of educational audio and video content. The ASR system was trained on a novel large speech corpus, known as poliMedia, that was manually transcribed for the European project transLectures. This novel corpus contains more than 115 hours of transcribed speech that will be made available to the research community. Initial results on the poliMedia corpus are also reported to compare the performance of different ASR systems based on the linear interpolation of language models. To this end, the in-domain poliMedia corpus was linearly interpolated with an external large-vocabulary dataset, the well-known Google N-Gram corpus. The reported WER figures show a notable improvement over the baseline performance as a result of incorporating the vast amount of data represented by the Google N-Gram corpus. es_ES
dc.description.sponsorship The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 287755. Also supported by the Spanish Government (MIPRCV “Consolider Ingenio 2010” and iTrans2 TIN2009-14511) and the Generalitat Valenciana (Prometeo/2009/014).
dc.language English es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation.ispartof Communications in Computer and Information Science es_ES
dc.rights All rights reserved es_ES
dc.subject Speech Recognition es_ES
dc.subject Linear Combination es_ES
dc.subject Language Modeling es_ES
dc.subject Google N-Gram es_ES
dc.subject Opencast Matterhorn es_ES
dc.subject.classification COMPUTER LANGUAGES AND SYSTEMS es_ES
dc.title Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform es_ES
dc.type Article es_ES
dc.embargo.lift 10000-01-01
dc.embargo.terms forever es_ES
dc.identifier.doi 10.1007/978-3-642-35292-8_25
dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/Generalitat Valenciana//PROMETEO09%2F2009%2F014/ES/Adaptive learning and multimodality in pattern recognition (Almapater)/ es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Instituto Universitario Mixto Tecnológico de Informática - Institut Universitari Mixt Tecnològic d'Informàtica es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Valor Miró, JD.; Pérez González De Martos, AM.; Civera Saiz, J.; Juan Císcar, A. (2012). Integrating a State-of-the-Art ASR System into the Opencast Matterhorn Platform. Communications in Computer and Information Science. 328:237-246. https://doi.org/10.1007/978-3-642-35292-8_25 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename Spanish Speech Technology Workshop/Iberian SLTech Workshop es_ES
dc.relation.conferencedate NOV 21-23, 2012 es_ES
dc.relation.conferenceplace Univ Autonoma Madrid, ATVS Biometr Res Grp, Madrid, SPAIN es_ES
dc.relation.publisherversion http://dx.doi.org/10.1007/978-3-642-35292-8_25 es_ES
dc.description.upvformatpinicio 237 es_ES
dc.description.upvformatpfin 246 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 328 es_ES
dc.relation.senia 234198
dc.contributor.funder European Commission
dc.contributor.funder Ministerio de Ciencia e Innovación
dc.contributor.funder Generalitat Valenciana
dc.description.references UPVLC, XEROX, JSI-K4A, RWTH, EML, DDS: transLectures: Transcription and Translation of Video Lectures. In: Proc. of EAMT, p. 204 (2012) es_ES
dc.description.references Zhan, P., Ries, K., Gavalda, M., Gates, D., Lavie, A., Waibel, A.: JANUS-II: towards spontaneous Spanish speech recognition 4, 2285–2288 (1996) es_ES
dc.description.references Nogueiras, A., Fonollosa, J.A.R., Bonafonte, A., Mariño, J.B.: RAMSES: El sistema de reconocimiento del habla continua y gran vocabulario desarrollado por la UPC. In: VIII Jornadas de I+D en Telecomunicaciones, pp. 399–408 (1998) es_ES
dc.description.references Huang, X., Alleva, F., Hon, H.W., Hwang, M.Y., Rosenfeld, R.: The SPHINX-II Speech Recognition System: An Overview. Computer, Speech and Language 7, 137–148 (1992) es_ES
dc.description.references Speech and Language Technology Group. Sumat: An online service for subtitling by machine translation (May 2012), http://www.sumat-project.eu es_ES
dc.description.references Broman, S., Kurimo, M.: Methods for combining language models in speech recognition. In: Proc. of Interspeech, pp. 1317–1320 (2005) es_ES
dc.description.references Liu, X., Gales, M., Hieronymus, J., Woodland, P.: Use of contexts in language model interpolation and adaptation. In: Proc. of Interspeech (2009) es_ES
dc.description.references Liu, X., Gales, M., Hieronymus, J., Woodland, P.: Language model combination and adaptation using weighted finite state transducers (2010) es_ES
dc.description.references Goodman, J.T.: Putting it all together: Language model combination. In: Proc. of ICASSP, pp. 1647–1650 (2000) es_ES
dc.description.references Lööf, J., Gollan, C., Hahn, S., Heigold, G., Hoffmeister, B., Plahl, C., Rybach, D., Schlüter, R., Ney, H.: The RWTH 2007 TC-STAR evaluation system for European English and Spanish. In: Proc. of Interspeech, pp. 2145–2148 (2007) es_ES
dc.description.references Rybach, D., Gollan, C., Heigold, G., Hoffmeister, B., Lööf, J., Schlüter, R., Ney, H.: The RWTH Aachen University open source speech recognition system. In: Proc. of Interspeech, pp. 2111–2114 (2009) es_ES
dc.description.references Stolcke, A.: SRILM - An Extensible Language Modeling Toolkit. In: Proc. of ICSLP (2002) es_ES
dc.description.references Michel, J.B., et al.: Quantitative analysis of culture using millions of digitized books. Science 331(6014), 176–182 es_ES
dc.description.references Turro, C., Cañero, A., Busquets, J.: Video learning objects creation with polimedia. In: 2010 IEEE International Symposium on Multimedia (ISM), December 13-15, pp. 371–376 (2010) es_ES
dc.description.references Barras, C., Geoffrois, E., Wu, Z., Liberman, M.: Transcriber: development and use of a tool for assisting speech corpora production. Speech Communication Special Issue on Speech Annotation and Corpus Tools 33(1-2) (2000) es_ES
dc.description.references Apache. Apache Felix (May 2012), http://felix.apache.org/site/index.html es_ES
dc.description.references OSGi Alliance. OSGi R4 Service Platform (May 2012), http://www.osgi.org/Main/HomePage es_ES
dc.description.references Sahidullah, M., Saha, G.: Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition 54(4), 543–565 (2012) es_ES
dc.description.references Gascó, G., Rocha, M.-A., Sanchis-Trilles, G., Andrés-Ferrer, J., Casacuberta, F.: Does more data always yield better translations? In: Proc. of EACL, pp. 152–161 (2012) es_ES
dc.description.references Sánchez-Cortina, I., Serrano, N., Sanchis, A., Juan, A.: A prototype for interactive speech transcription balancing error and supervision effort. In: Proc. of IUI, pp. 325–326 (2012) es_ES
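
The abstract above refers to the linear interpolation of an in-domain language model with an external one. As a rough illustration only, the following Python sketch mixes two toy unigram distributions and picks the interpolation weight that minimizes perplexity on a small held-out word list. All names (`interpolate`, `perplexity`, `p_in_domain`, `p_external`) and all probabilities are hypothetical and invented for the example; the paper's actual n-gram models, toolkit, and weights are not reproduced here.

```python
import math


def interpolate(p_in, p_out, lam):
    """Mix two word->probability maps: lam * p_in + (1 - lam) * p_out."""
    vocab = set(p_in) | set(p_out)
    return {
        w: lam * p_in.get(w, 0.0) + (1.0 - lam) * p_out.get(w, 0.0)
        for w in vocab
    }


def perplexity(model, words):
    """Perplexity of a unigram model on a list of words."""
    log_sum = sum(math.log(model[w]) for w in words)
    return math.exp(-log_sum / len(words))


# Toy unigram distributions (made-up numbers; each sums to 1).
p_in_domain = {"lecture": 0.5, "speech": 0.4, "the": 0.1}
p_external = {"lecture": 0.1, "speech": 0.1, "the": 0.8}

# Hypothetical held-out text used only to tune the weight.
held_out = ["the", "lecture", "the", "speech"]

# Simple grid search over the interpolation weight.
candidates = [i / 10 for i in range(1, 10)]
best_lam = min(
    candidates,
    key=lambda lam: perplexity(
        interpolate(p_in_domain, p_external, lam), held_out
    ),
)
best_model = interpolate(p_in_domain, p_external, best_lam)

print("best weight:", best_lam)
print("held-out perplexity:", perplexity(best_model, held_out))
```

A grid search keeps the sketch short; in practice the weight is commonly estimated with an EM-style procedure on held-out data, and the models are higher-order n-grams rather than unigrams.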

