Mostrar el registro sencillo del ítem
dc.contributor.author | Granell, Emilio | es_ES |
dc.contributor.author | Romero, Verónica | es_ES |
dc.contributor.author | Martínez-Hinarejos, Carlos-D. | es_ES |
dc.date.accessioned | 2019-05-25T20:38:43Z | |
dc.date.available | 2019-05-25T20:38:43Z | |
dc.date.issued | 2019 | es_ES |
dc.identifier.issn | 1077-3142 | es_ES |
dc.identifier.uri | http://hdl.handle.net/10251/121074 | |
dc.description.abstract | [EN] Handwritten document transcription aims to obtain the contents of a document to provide efficient information access to, among other, digitised historical documents. The increasing number of historical documents published by libraries and archives makes this an important task. In this context, the use of image processing and understanding techniques in conjunction with assistive technologies reduces the time and human effort required for obtaining the final perfect transcription. The assistive transcription system proposes a hypothesis, usually derived from a recognition process of the handwritten text image. Then, the professional transcriber feedback can be used to obtain an improved hypothesis and speed-up the final transcription. In this framework, a speech signal corresponding to the dictation of the handwritten text can be used as an additional source of information. This multimodal approach, that combines the image of the handwritten text with the speech of the dictation of its contents, could make better the hypotheses (initial and improved) offered to the transcriber. In this paper we study the feasibility of a multimodal interactive transcription system for an assistive paradigm known as Computer Assisted Transcription of Text Images. Different techniques are tested for obtaining the multimodal combination in this framework. The use of the proposed multimodal approach reveals a significant reduction of transcription effort with some multimodal combination techniques, allowing for a faster transcription process. | es_ES |
dc.description.sponsorship | Work partially supported by projects READ-674943 (European Union's H2020), SmartWays-RTC-2014-1466-4 (MINECO, Spain), and CoMUN-HaT-TIN2015-70924-C2-1-R (MINECO/FEDER), and by Generalitat Valenciana (GVA), Spain under reference PROMETEOII/2014/030. | es_ES |
dc.language | Inglés | es_ES |
dc.publisher | Elsevier | es_ES |
dc.relation.ispartof | Computer Vision and Image Understanding | es_ES |
dc.rights | Reconocimiento - No comercial - Sin obra derivada (by-nc-nd) | es_ES |
dc.subject | Historical handwritten text transcription | es_ES |
dc.subject | Speech recognition | es_ES |
dc.subject | Multimodal combination | es_ES |
dc.subject | Interactive framework | es_ES |
dc.subject | Assistive technology | es_ES |
dc.subject.classification | ESTADISTICA E INVESTIGACION OPERATIVA | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Image speech combination for interactive computer assisted transcription of handwritten documents | es_ES |
dc.type | Artículo | es_ES |
dc.identifier.doi | 10.1016/j.cviu.2019.01.009 | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/674943/EU/Recognition and Enrichment of Archival Documents/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//RTC-2014-1466-4Q4618002BC.VALENCIANA/ES/SMART WAYS - DESARROLLO DE UNA PLATAFORMA TECNOLÓGICA ORIENTADA A LA EFICIENCIA DE LOS RECURSOS EN EL CAMPO DE LAS NUEVAS TECNOLOGÍAS INTERNET OF THINGS/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//TIN2015-70924-C2-1-R/ES/CONTEXTO, MULTIMODALIDAD Y COLABORACION DEL USUARIO EN PROCESADO DE TEXTO MANUSCRITO/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/GVA//PROMETEOII%2F2014%2F030/ES/ Adaptive learning and multimodality in machine translation and text transcription/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.description.bibliographicCitation | Granell, E.; Romero, V.; Martínez-Hinarejos, C. (2019). Image speech combination for interactive computer assisted transcription of handwritten documents. Computer Vision and Image Understanding. 180:74-83. https://doi.org/10.1016/j.cviu.2019.01.009 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | https://doi.org/10.1016/j.cviu.2019.01.009 | es_ES |
dc.description.upvformatpinicio | 74 | es_ES |
dc.description.upvformatpfin | 83 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 180 | es_ES |
dc.relation.pasarela | S\381015 | es_ES |
dc.contributor.funder | European Commission | es_ES |
dc.contributor.funder | Ministerio de Economía y Empresa | es_ES |
dc.contributor.funder | European Regional Development Fund | es_ES |
dc.contributor.funder | Generalitat Valenciana |