Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction

Martín-Albo Simón, Daniel; Romero Gómez, Verónica; Toselli, Alejandro Héctor; Vidal Ruiz, Enrique

doi:10.1142/S0218001412630037

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: Martín-Albo;Romer ...

Tamaño: 416.1Kb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Martín-Albo Simón, Daniel	es_ES
dc.contributor.author	Romero Gómez, Verónica	es_ES
dc.contributor.author	Toselli, Alejandro Héctor	es_ES
dc.contributor.author	Vidal Ruiz, Enrique	es_ES
dc.date.accessioned	2014-09-15T12:06:13Z
dc.date.issued	2012-10-19
dc.identifier.issn	0218-0014
dc.identifier.uri	http://hdl.handle.net/10251/39658
dc.description.abstract	Currently, automatic handwriting recognition systems are ineffectual in unconstrained handwriting documents. Therefore, to obtain perfect transcriptions, heavy human intervention is required to validate and correct the results of such systems. Given that this post-editing process is inefficient and uncomfortable, a multimodal interactive approach has been proposed in previous works, which aims at obtaining correct transcriptions with the minimum human effort. In this approach, the user interacts with the system by means of an e-pen and/or more traditional methods such as keyboard or mouse. This user's feedback allows to improve system accuracy and multimodality increases system ergonomics and user acceptability. Until now, multimodal interaction has been studied only at whole-word level. In this work, multimodal interaction at character-level is studied, that may lead to more effective interactivity, since it is faster and easier to write only one character rather than a whole word. Here we study this kind of fine-grained multimodal interaction and present developments that allow taking advantage of interaction-derived context to significantly improve feedback decoding accuracy. Empirical tests on three cursive handwritten tasks suggest that, despite losing the deterministic accuracy of traditional peripherals, this approach can save significant amounts of user effort with respect to fully manual transcription as well as to non-interactive post-editing correction.	es_ES
dc.description.sponsorship	Work supported by the Spanish Government (MICINN and "Plan E") under the MITTRAL (TIN2009-14633-C03-01) research project and under the research programme Consolider Ingenio 2010: MIPRCV (CSD2007-00018), by the Spanish MITyC under the erudito.com (TSI-020110-2009-439) project, by the FPU (AP2010-0575) grant, by the Generalitat Valenciana under grant Prometeo/2009/014 and by Universitat Politecnica de Valencia under "Programa de Apoyo a la Investigacion y Desarrollo" (PAID-05-11).	en_EN
dc.language	Inglés	es_ES
dc.publisher	World Scientific Publishing	es_ES
dc.relation.ispartof	International Journal of Pattern Recognition and Artificial Intelligence	es_ES
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Multimodal interactive pattern recognition	es_ES
dc.subject	Computer assisted transcription	es_ES
dc.subject	Handwritten text recognition	es_ES
dc.subject	Character-level interaction.	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction	es_ES
dc.type	Artículo	es_ES
dc.embargo.lift	10000-01-01
dc.embargo.terms	forever	es_ES
dc.identifier.doi	10.1142/S0218001412630037
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2009-14633-C03-01/ES/Multimodal Interaction For Text Transcription With Adaptive Learning/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MITURCO//TSI-020110-2009-0439/ES/ERUDITO.COM/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/ME//AP2010-0575/ES/AP2010-0575/ /	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/GVA//PROMETEO09%2F2009%2F014/ES/Adaptive learning and multimodality in pattern recognition (Almapater)/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/UPV//PAID-05-11/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MEC//CSD2007-00018/ES/Multimodal Intraction in Pattern Recognition and Computer Visionm/ /	es_ES
dc.rights.accessRights	Cerrado	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Instituto Universitario Mixto Tecnológico de Informática - Institut Universitari Mixt Tecnològic d'Informàtica	es_ES
dc.description.bibliographicCitation	Martín-Albo Simón, D.; Romero Gómez, V.; Toselli, AH.; Vidal Ruiz, E. (2012). Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction. International Journal of Pattern Recognition and Artificial Intelligence. 26(5):1263003-1-1263003-19. https://doi.org/10.1142/S0218001412630037	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	http://dx.doi.org/10.1142/S0218001412630037	es_ES
dc.description.upvformatpinicio	1263003-1	es_ES
dc.description.upvformatpfin	1263003-19	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	26	es_ES
dc.description.issue	5	es_ES
dc.relation.senia	234244
dc.contributor.funder	Ministerio de Educación	es_ES
dc.contributor.funder	Generalitat Valenciana	es_ES
dc.contributor.funder	Universitat Politècnica de València	es_ES
dc.contributor.funder	Ministerio de Industria, Turismo y Comercio	es_ES
dc.description.references	Civera, J., Vilar, J. M., Cubel, E., Lagarda, A. L., Barrachina, S., Casacuberta, F., … González, J. (2004). A Syntactic Pattern Recognition Approach to Computer Assisted Translation. Structural, Syntactic, and Statistical Pattern Recognition, 207-215. doi:10.1007/978-3-540-27868-9_21	es_ES
dc.description.references	MARTI, U.-V., & BUNKE, H. (2001). USING A STATISTICAL LANGUAGE MODEL TO IMPROVE THE PERFORMANCE OF AN HMM-BASED CURSIVE HANDWRITING RECOGNITION SYSTEM. International Journal of Pattern Recognition and Artificial Intelligence, 15(01), 65-90. doi:10.1142/s0218001401000848	es_ES
dc.description.references	Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257-286. doi:10.1109/5.18626	es_ES
dc.description.references	TOSELLI, A. H., JUAN, A., GONZÁLEZ, J., SALVADOR, I., VIDAL, E., CASACUBERTA, F., … NEY, H. (2004). INTEGRATED HANDWRITING RECOGNITION AND INTERPRETATION USING FINITE-STATE MODELS. International Journal of Pattern Recognition and Artificial Intelligence, 18(04), 519-539. doi:10.1142/s0218001404003344	es_ES
dc.description.references	Toselli, A. H., Romero, V., Pastor, M., & Vidal, E. (2010). Multimodal interactive transcription of text images. Pattern Recognition, 43(5), 1814-1825. doi:10.1016/j.patcog.2009.11.019	es_ES
dc.description.references	Zimmermann, M., Chappelier, J.-C., & Bunke, H. (2006). Offline grammar-based recognition of handwritten sentences. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(5), 818-821. doi:10.1109/tpami.2006.103	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [47097]

Mostrar el registro sencillo del ítem

Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)