- -

Multimodal Crowdsourcing for Transcribing Handwritten Documents

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Multimodal Crowdsourcing for Transcribing Handwritten Documents

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Granell Romero, Emilio es_ES
dc.contributor.author Martínez Hinarejos, Carlos David es_ES
dc.date.accessioned 2017-05-30T12:40:36Z
dc.date.available 2017-05-30T12:40:36Z
dc.date.issued 2017-02
dc.identifier.issn 2329-9290
dc.identifier.uri http://hdl.handle.net/10251/82027
dc.description © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. es_ES
dc.description.abstract [EN] Transcription of handwritten documents is an important research topic for multiple applications, such as document classification or information extraction. In the case of historical documents, their transcription allows to preserve cultural heritage because of the amount of historical data contained in those documents. The transcription process can employ state-of-the-art handwritten text recognition systems in order to obtain an initial transcription. This transcription is usually not good enough for the quality standards, but that may speed up the final transcription of the expert. In this framework, the use of collaborative transcription applications (crowdsourcing) has risen in the recent years, but these platforms are mainly limited by the use of non-mobile devices. Thus, the recruiting initiatives get reduced to a smaller set of potential volunteers. In this paper, an alternative that allows the use of mobile devices is presented. The proposal consists of using speech dictation of handwritten text lines. Then, by using multimodal combination of speech and handwritten text images, a draft transcription can be obtained, presenting more quality than that obtained by only using handwritten text recognition. The speech dictation platform is implemented as a mobile device application, which allows for a wider range of population for recruiting volunteers. A real acquisition on the contents of a Spanish historical handwritten book was obtained with the platform. This data was used to perform experiments on the behaviour of the proposed framework. Some experiments were performed to study how to optimise the collaborators effort in terms of number of collaborations, including how many lines and which lines should be selected for the speech dictation. es_ES
dc.description.sponsorship This work was supported in part by projects READ-674943 (European Union's H2020), SmartWays-RTC-2014-1466-4 (MINECO), CoMUN-HaT-TIN2015-70924-C2-1-R (MINECO/FEDER), and ALMAMATER-PROMETEOII/2014/030 (Generalitat Valenciana). en_EN
dc.language Inglés es_ES
dc.publisher Institute of Electrical and Electronics Engineers (IEEE) es_ES
dc.relation.ispartof IEEE/ACM Transactions on Audio, Speech and Language Processing es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Crowdsourcing es_ES
dc.subject Handwritten text transcription es_ES
dc.subject Multimodal combination es_ES
dc.subject Speech recognition es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title Multimodal Crowdsourcing for Transcribing Handwritten Documents es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1109/TASLP.2016.2634123
dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/674943/EU/Recognition and Enrichment of Archival Documents/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//RTC-2014-1466-4Q4618002BC.VALENCIANA/ES/SMART WAYS - DESARROLLO DE UNA PLATAFORMA TECNOLÓGICA ORIENTADA A LA EFICIENCIA DE LOS RECURSOS EN EL CAMPO DE LAS NUEVAS TECNOLOGÍAS INTERNET OF THINGS/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//TIN2015-70924-C2-1-R/ES/CONTEXTO, MULTIMODALIDAD Y COLABORACION DEL USUARIO EN PROCESADO DE TEXTO MANUSCRITO/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/GVA//PROMETEOII%2F2014%2F030/ES/ Adaptive learning and multimodality in machine translation and text transcription/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica es_ES
dc.description.bibliographicCitation Granell Romero, E.; Martínez Hinarejos, CD. (2017). Multimodal Crowdsourcing for Transcribing Handwritten Documents. IEEE/ACM Transactions on Audio, Speech and Language Processing. 25(2):409-419. https://doi.org/10.1109/TASLP.2016.2634123 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://ieeexplore.ieee.org/document/7762772/ es_ES
dc.description.upvformatpinicio 409 es_ES
dc.description.upvformatpfin 419 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 25 es_ES
dc.description.issue 2 es_ES
dc.relation.senia 324594 es_ES
dc.contributor.funder European Commission
dc.contributor.funder Ministerio de Economía y Competitividad
dc.contributor.funder Generalitat Valenciana


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem