Image speech combination for interactive computer assisted transcription of handwritten documents

Granell, Emilio; Romero, Verónica; Martínez-Hinarejos, Carlos-D.

doi:10.1016/j.cviu.2019.01.009

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Image speech combination for interactive computer assisted transcription of handwritten documents

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: paper.pdf

Tamaño: 2.020Mb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: CVIU - Editor ...

Tamaño: 1.743Mb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Granell, Emilio	es_ES
dc.contributor.author	Romero, Verónica	es_ES
dc.contributor.author	Martínez-Hinarejos, Carlos-D.	es_ES
dc.date.accessioned	2019-05-25T20:38:43Z
dc.date.available	2019-05-25T20:38:43Z
dc.date.issued	2019	es_ES
dc.identifier.issn	1077-3142	es_ES
dc.identifier.uri	http://hdl.handle.net/10251/121074
dc.description.abstract	[EN] Handwritten document transcription aims to obtain the contents of a document to provide efficient information access to, among other, digitised historical documents. The increasing number of historical documents published by libraries and archives makes this an important task. In this context, the use of image processing and understanding techniques in conjunction with assistive technologies reduces the time and human effort required for obtaining the final perfect transcription. The assistive transcription system proposes a hypothesis, usually derived from a recognition process of the handwritten text image. Then, the professional transcriber feedback can be used to obtain an improved hypothesis and speed-up the final transcription. In this framework, a speech signal corresponding to the dictation of the handwritten text can be used as an additional source of information. This multimodal approach, that combines the image of the handwritten text with the speech of the dictation of its contents, could make better the hypotheses (initial and improved) offered to the transcriber. In this paper we study the feasibility of a multimodal interactive transcription system for an assistive paradigm known as Computer Assisted Transcription of Text Images. Different techniques are tested for obtaining the multimodal combination in this framework. The use of the proposed multimodal approach reveals a significant reduction of transcription effort with some multimodal combination techniques, allowing for a faster transcription process.	es_ES
dc.description.sponsorship	Work partially supported by projects READ-674943 (European Union's H2020), SmartWays-RTC-2014-1466-4 (MINECO, Spain), and CoMUN-HaT-TIN2015-70924-C2-1-R (MINECO/FEDER), and by Generalitat Valenciana (GVA), Spain under reference PROMETEOII/2014/030.	es_ES
dc.language	Inglés	es_ES
dc.publisher	Elsevier	es_ES
dc.relation.ispartof	Computer Vision and Image Understanding	es_ES
dc.rights	Reconocimiento - No comercial - Sin obra derivada (by-nc-nd)	es_ES
dc.subject	Historical handwritten text transcription	es_ES
dc.subject	Speech recognition	es_ES
dc.subject	Multimodal combination	es_ES
dc.subject	Interactive framework	es_ES
dc.subject	Assistive technology	es_ES
dc.subject.classification	ESTADISTICA E INVESTIGACION OPERATIVA	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Image speech combination for interactive computer assisted transcription of handwritten documents	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.1016/j.cviu.2019.01.009	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/H2020/674943/EU/Recognition and Enrichment of Archival Documents/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MINECO//RTC-2014-1466-4Q4618002BC.VALENCIANA/ES/SMART WAYS - DESARROLLO DE UNA PLATAFORMA TECNOLÓGICA ORIENTADA A LA EFICIENCIA DE LOS RECURSOS EN EL CAMPO DE LAS NUEVAS TECNOLOGÍAS INTERNET OF THINGS/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MINECO//TIN2015-70924-C2-1-R/ES/CONTEXTO, MULTIMODALIDAD Y COLABORACION DEL USUARIO EN PROCESADO DE TEXTO MANUSCRITO/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/GVA//PROMETEOII%2F2014%2F030/ES/ Adaptive learning and multimodality in machine translation and text transcription/	es_ES
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Granell, E.; Romero, V.; Martínez-Hinarejos, C. (2019). Image speech combination for interactive computer assisted transcription of handwritten documents. Computer Vision and Image Understanding. 180:74-83. https://doi.org/10.1016/j.cviu.2019.01.009	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	https://doi.org/10.1016/j.cviu.2019.01.009	es_ES
dc.description.upvformatpinicio	74	es_ES
dc.description.upvformatpfin	83	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	180	es_ES
dc.relation.pasarela	S\381015	es_ES
dc.contributor.funder	European Commission	es_ES
dc.contributor.funder	Ministerio de Economía y Empresa	es_ES
dc.contributor.funder	European Regional Development Fund	es_ES
dc.contributor.funder	Generalitat Valenciana

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Image speech combination for interactive computer assisted transcription of handwritten documents

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Image speech combination for interactive computer assisted transcription of handwritten documents

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)