The TransLectures-UPV Toolkit

Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Serrano Martinez Santos, Nicolas; Andrés Ferrer, Jesús; Civera Saiz, Jorge; Sanchis Navarro, José Alberto; Juan Císcar, Alfonso

doi:10.1007/978-3-319-13623-3_28

Files in this item

Name: IberSpeech2014-TL ...
Size: 144.0Kb
Format: PDF
Description: Author's version.

Name: editor-version.pdf
Size: 401.2Kb
Format: PDF
Description: Publisher's version

dc.contributor.author	Del Agua Teba, Miguel Angel	es_ES
dc.contributor.author	Giménez Pastor, Adrián	es_ES
dc.contributor.author	Serrano Martinez Santos, Nicolas	es_ES
dc.contributor.author	Andrés Ferrer, Jesús	es_ES
dc.contributor.author	Civera Saiz, Jorge	es_ES
dc.contributor.author	Sanchis Navarro, José Alberto	es_ES
dc.contributor.author	Juan Císcar, Alfonso	es_ES
dc.date.accessioned	2015-05-19T10:02:58Z
dc.date.available	2015-05-19T10:02:58Z
dc.date.issued	2014
dc.identifier.isbn	978-3-319-13622-6
dc.identifier.issn	0302-9743
dc.identifier.uri	http://hdl.handle.net/10251/50452
dc.description	The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-13623-3_28	es_ES
dc.description.abstract	Over the past few years, online multimedia educational repositories have increased in number and popularity. The main aim of the transLectures project is to develop cost-effective solutions for producing accurate transcriptions and translations for large video lecture repositories, such as VideoLectures.NET or the Universitat Politècnica de València's repository, poliMedia. In this paper, we present the transLectures-UPV toolkit (TLK), which has been specifically designed to meet the requirements of the transLectures project, but can also be used as a conventional ASR toolkit. The main features of the current release include HMM training and decoding with speaker adaptation techniques (fCMLLR). TLK has been tested on the VideoLectures.NET and poliMedia repositories, yielding very competitive results. TLK has been released under the permissive open source Apache License v2.0 and can be directly downloaded from the transLectures website.	es_ES
dc.description.sponsorship	The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no 287755 (transLectures) and ICT Policy Support Programme (ICT PSP/2007-2013) as part of the Competitiveness and Innovation Framework Programme (CIP) under grant agreement no 621030 (EMMA), and the Spanish MINECO Active2Trans (TIN2012-31723) research project.	es_ES
dc.language	English	es_ES
dc.publisher	Springer International Publishing	es_ES
dc.relation.ispartof	Advances in Speech and Language Technologies for Iberian Languages: Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014. Proceedings	es_ES
dc.relation.ispartofseries	Lecture Notes in Computer Science;8854
dc.rights	All rights reserved	es_ES
dc.subject	TLK	es_ES
dc.subject	ASR toolkit	es_ES
dc.subject	transLectures	es_ES
dc.subject	HMM	es_ES
dc.subject.classification	STATISTICS AND OPERATIONS RESEARCH	es_ES
dc.subject.classification	COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE	es_ES
dc.subject.classification	COMPUTER LANGUAGES AND SYSTEMS	es_ES
dc.title	The TransLectures-UPV Toolkit	es_ES
dc.type	Book chapter	es_ES
dc.identifier.doi	10.1007/978-3-319-13623-3_28
dc.relation.projectID	info:eu-repo/grantAgreement/EC/FP7/287755/EU/Transcription and Translation of Video Lectures/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/CIP/621030/EU/European Multiple MOOC Aggregator/EMMA/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MINECO//TIN2012-31723/ES/INTERACCION ACTIVA PARA TRANSCRIPCION DE HABLA Y TRADUCCION/	es_ES
dc.rights.accessRights	Open	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat	es_ES
dc.description.bibliographicCitation	Del Agua Teba, MA.; Giménez Pastor, A.; Serrano Martinez Santos, N.; Andrés Ferrer, J.; Civera Saiz, J.; Sanchis Navarro, JA.; Juan Císcar, A. (2014). The TransLectures-UPV Toolkit. In Advances in Speech and Language Technologies for Iberian Languages: Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014. Proceedings. Springer International Publishing. 269-278. https://doi.org/10.1007/978-3-319-13623-3_28	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	http://link.springer.com/chapter/10.1007/978-3-319-13623-3_28	es_ES
dc.description.upvformatpinicio	269	es_ES
dc.description.upvformatpfin	278	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.relation.senia	276983
dc.contributor.funder	European Commission	es_ES
dc.contributor.funder	Ministerio de Economía y Competitividad	es_ES
dc.description.references	Final report on massive adaptation (M36). To be delivered on October 2014 (2014)	es_ES
dc.description.references	First report on massive adaptation (M12), https://www.translectures.eu/wp-content/uploads/2013/05/transLectures-D3.1.1-18Nov2012.pdf	es_ES
dc.description.references	Opencast Matterhorn, http://opencast.org/matterhorn/	es_ES
dc.description.references	sclite - Score speech recognition system output, http://www1.icsi.berkeley.edu/Speech/docs/sctk-1.2/sclite.htm	es_ES
dc.description.references	Second report on massive adaptation (M24), https://www.translectures.eu//wp-content/uploads/2014/01/transLectures-D3.1.2-15Nov2013.pdf	es_ES
dc.description.references	TLK: The transLectures-UPV Toolkit, https://www.translectures.eu/tlk/	es_ES
dc.description.references	Baum, L.E., Petrie, T., Soules, G., Weiss, N.: A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains. The Annals of Mathematical Statistics 41(1), 164–171 (1970)	es_ES
dc.description.references	Dahl, G.E., Yu, D., Deng, L., Acero, A.: Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing 20(1), 30–42 (2012)	es_ES
dc.description.references	Digalakis, V., Rtischev, D., Neumeyer, L., Sa, E.: Speaker Adaptation Using Constrained Estimation of Gaussian Mixtures. IEEE Transactions on Speech and Audio Processing 3, 357–366 (1995)	es_ES
dc.description.references	Huang, J.T., Li, J., Yu, D., Deng, L., Gong, Y.: Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers. In: Proc. of ICASSP (2013)	es_ES
dc.description.references	Munteanu, C., Baecker, R., Penn, G., Toms, E., James, D.: The Effect of Speech Recognition Accuracy Rates on the Usefulness and Usability of Webcast Archives. In: Proc. of CHI, pp. 493–502 (2006)	es_ES
dc.description.references	Ney, H., Ortmanns, S.: Progress in dynamic programming search for LVCSR. Proceedings of the IEEE 88(8), 1224–1240 (2000)	es_ES
dc.description.references	Ortmanns, S., Ney, H., Eiden, A.: Language-model look-ahead for large vocabulary speech recognition. In: Proc. of ICSLP, vol. 4, pp. 2095–2098 (1996)	es_ES
dc.description.references	Ortmanns, S., Ney, H., Aubert, X.: A word graph algorithm for large vocabulary continuous speech recognition. Computer Speech and Language 11(1), 43–72 (1997)	es_ES
dc.description.references	Povey, D., et al.: The Kaldi Speech Recognition Toolkit. In: Proc. of ASRU (2011)	es_ES
dc.description.references	Rumelhart, D., Hinton, G., Williams, R.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)	es_ES
dc.description.references	Rybach, D., et al.: The RWTH Aachen University Open Source Speech Recognition System. In: Proc. Interspeech, pp. 2111–2114 (2009)	es_ES
dc.description.references	Seide, F., Li, G., Chen, X., Yu, D.: Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription. In: Proc. of ASRU, pp. 24–29 (2011)	es_ES
dc.description.references	Viterbi, A.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13(2), 260–269 (1967)	es_ES
dc.description.references	Young, S., et al.: The HTK Book. Cambridge University Engineering Department (1995)	es_ES
dc.description.references	Young, S.J., Odell, J.J., Woodland, P.C.: Tree-based state tying for high accuracy acoustic modelling. In: Proc. of HLT, pp. 307–312 (1994)	es_ES
