Granell, E.; Romero, V.; Martínez-Hinarejos, C. (2019). Image speech combination for interactive computer assisted transcription of handwritten documents. Computer Vision and Image Understanding. 180:74-83. https://doi.org/10.1016/j.cviu.2019.01.009
Please use this identifier to cite or link to this item: http://hdl.handle.net/10251/121074
Title:
|
Image speech combination for interactive computer assisted transcription of handwritten documents
|
Authors:
|
Granell, Emilio
Romero, Verónica
Martínez-Hinarejos, Carlos-D.
|
UPV entity:
|
Universitat Politècnica de València. Departamento de Estadística e Investigación Operativa Aplicadas y Calidad - Departament d'Estadística i Investigació Operativa Aplicades i Qualitat
Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
|
Release date:
|
|
Abstract:
|
[EN] Handwritten document transcription aims to obtain the contents of a document in order to provide efficient information access to, among others, digitised historical documents. The increasing number of historical documents published by libraries and archives makes this an important task. In this context, the use of image processing and understanding techniques in conjunction with assistive technologies reduces the time and human effort required to obtain the final, perfect transcription. The assistive transcription system proposes a hypothesis, usually derived from a recognition process applied to the handwritten text image. The professional transcriber's feedback can then be used to obtain an improved hypothesis and speed up the final transcription. In this framework, a speech signal corresponding to the dictation of the handwritten text can be used as an additional source of information. This multimodal approach, which combines the image of the handwritten text with the speech of the dictation of its contents, could improve the hypotheses (initial and improved) offered to the transcriber. In this paper we study the feasibility of a multimodal interactive transcription system for an assistive paradigm known as Computer Assisted Transcription of Text Images. Different techniques are tested for obtaining the multimodal combination in this framework. The proposed multimodal approach yields a significant reduction of transcription effort with some multimodal combination techniques, allowing for a faster transcription process.
|
Keywords:
|
Historical handwritten text transcription, Speech recognition, Multimodal combination, Interactive framework, Assistive technology
|
Usage rights:
|
Attribution - NonCommercial - NoDerivatives (by-nc-nd)
|
Source:
|
Computer Vision and Image Understanding (ISSN: 1077-3142)
|
DOI:
|
10.1016/j.cviu.2019.01.009
|
Publisher:
|
Elsevier
|
Publisher's version:
|
https://doi.org/10.1016/j.cviu.2019.01.009
|
Project code:
|
info:eu-repo/grantAgreement/EC/H2020/674943/EU/Recognition and Enrichment of Archival Documents/
info:eu-repo/grantAgreement/MINECO//RTC-2014-1466-4Q4618002BC.VALENCIANA/ES/SMART WAYS - DESARROLLO DE UNA PLATAFORMA TECNOLÓGICA ORIENTADA A LA EFICIENCIA DE LOS RECURSOS EN EL CAMPO DE LAS NUEVAS TECNOLOGÍAS INTERNET OF THINGS/
info:eu-repo/grantAgreement/MINECO//TIN2015-70924-C2-1-R/ES/CONTEXTO, MULTIMODALIDAD Y COLABORACION DEL USUARIO EN PROCESADO DE TEXTO MANUSCRITO/
info:eu-repo/grantAgreement/GVA//PROMETEOII%2F2014%2F030/ES/ Adaptive learning and multimodality in machine translation and text transcription/
|
Acknowledgements:
|
Work partially supported by projects READ-674943 (European Union's H2020), SmartWays-RTC-2014-1466-4 (MINECO, Spain), and CoMUN-HaT-TIN2015-70924-C2-1-R (MINECO/FEDER), and by Generalitat Valenciana (GVA), Spain under reference PROMETEOII/2014/030.
|
Type:
|
Article
|