Alabau, V.; Martínez Hinarejos, CD.; Romero Gómez, V.; Lagarda Arroyo, AL. (2014). An iterative multimodal framework for the transcription of handwritten historical documents. Pattern Recognition Letters. 35:195-203. https://doi.org/10.1016/j.patrec.2012.11.007
Please use this identifier to cite or link to this item: http://hdl.handle.net/10251/49463
Title:
|
An iterative multimodal framework for the transcription of handwritten historical documents
|
Author:
|
Alabau, Vicent
Martínez Hinarejos, Carlos David
Romero Gómez, Verónica
Lagarda Arroyo, Antonio Luís
|
UPV Unit:
|
Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
|
Issued date:
|
|
Abstract:
|
[EN] The transcription of historical documents is one of the most interesting tasks in which Handwritten Text
Recognition can be applied, due to its interest in humanities research. One alternative for transcribing the
ancient manuscripts is the use of speech dictation by using Automatic Speech Recognition techniques. In
the two alternatives similar models (Hidden Markov Models and n-grams) and decoding processes (Viterbi
decoding) are employed, which allows a possible combination of the two modalities with little
difficulties. In this work, we explore the possibility of using recognition results of one modality to restrict
the decoding process of the other modality, and apply this process iteratively. Results of these multimodal
iterative alternatives are significantly better than the baseline uni-modal systems and better than
the non-iterative alternatives.
© 2012 Elsevier B.V. All rights reserved.
|
Subjects:
|
Ancient text transcription
,
Handwritten text recognition
,
Speech dictation
,
Multimodal systems
,
Iterative systems
,
Language modelling
|
Copyrights:
|
All rights reserved
|
Source:
|
Pattern Recognition Letters. (issn:
0167-8655
)
|
DOI:
|
10.1016/j.patrec.2012.11.007
|
Publisher:
|
Elsevier
|
Publisher version:
|
http://dx.doi.org/10.1016/j.patrec.2012.11.007
|
Conference name:
|
12th International Conference on Frontiers in Handwriting Recognition (ICFHR)
|
Conference place:
|
Kolkata, India
|
Conference date:
|
November 15-17, 2010
|
Project ID:
|
info:eu-repo/grantAgreement/MEC//CSD2007-00018/ES/Multimodal Intraction in Pattern Recognition and Computer Visionm/
info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/
info:eu-repo/grantAgreement/MICINN//TIN2009-14633-C03-01/ES/Multimodal Interaction For Text Transcription With Adaptive Learning/
info:eu-repo/grantAgreement/MITURCO//TSI-020110-2009-0439/ES/ERUDITO.COM/
info:eu-repo/grantAgreement/GVA//GV%2F2010%2F067/
info:eu-repo/grantAgreement/UPV//PAID-05-11-2779/
info:eu-repo/grantAgreement/UPV//UPV%2F2009%2F2851/
|
Thanks:
|
Work supported by the EC (FEDER/FSE) and the Spanish MEC/MICINN under the MIPRCV "Consolider Ingenio 2010" program (CSD2007-00018), iTrans2 (TIN2009-14511) and MITTRAL (TIN2009-14633-C03-01) projects. Also supported by the Spanish MITyC under the erudito.com (TSI-020110-2009-439) project and by the Generalitat Valenciana under grant GV/2010/067, and by the UPV under project PAID-05-11-2779 and grant UPV/2009/2851.
|
Type:
|
Article
Conference paper
|