- -

Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Martín-Albo Simón, Daniel es_ES
dc.contributor.author Romero Gómez, Verónica es_ES
dc.contributor.author Toselli, Alejandro Héctor es_ES
dc.contributor.author Vidal Ruiz, Enrique es_ES
dc.date.accessioned 2014-09-15T12:06:13Z
dc.date.issued 2012-10-19
dc.identifier.issn 0218-0014
dc.identifier.uri http://hdl.handle.net/10251/39658
dc.description.abstract Currently, automatic handwriting recognition systems are ineffectual in unconstrained handwriting documents. Therefore, to obtain perfect transcriptions, heavy human intervention is required to validate and correct the results of such systems. Given that this post-editing process is inefficient and uncomfortable, a multimodal interactive approach has been proposed in previous works, which aims at obtaining correct transcriptions with the minimum human effort. In this approach, the user interacts with the system by means of an e-pen and/or more traditional methods such as keyboard or mouse. This user's feedback allows to improve system accuracy and multimodality increases system ergonomics and user acceptability. Until now, multimodal interaction has been studied only at whole-word level. In this work, multimodal interaction at character-level is studied, that may lead to more effective interactivity, since it is faster and easier to write only one character rather than a whole word. Here we study this kind of fine-grained multimodal interaction and present developments that allow taking advantage of interaction-derived context to significantly improve feedback decoding accuracy. Empirical tests on three cursive handwritten tasks suggest that, despite losing the deterministic accuracy of traditional peripherals, this approach can save significant amounts of user effort with respect to fully manual transcription as well as to non-interactive post-editing correction. es_ES
dc.description.sponsorship Work supported by the Spanish Government (MICINN and "Plan E") under the MITTRAL (TIN2009-14633-C03-01) research project and under the research programme Consolider Ingenio 2010: MIPRCV (CSD2007-00018), by the Spanish MITyC under the erudito.com (TSI-020110-2009-439) project, by the FPU (AP2010-0575) grant, by the Generalitat Valenciana under grant Prometeo/2009/014 and by Universitat Politecnica de Valencia under "Programa de Apoyo a la Investigacion y Desarrollo" (PAID-05-11). en_EN
dc.language Inglés es_ES
dc.publisher World Scientific Publishing es_ES
dc.relation.ispartof International Journal of Pattern Recognition and Artificial Intelligence es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Multimodal interactive pattern recognition es_ES
dc.subject Computer assisted transcription es_ES
dc.subject Handwritten text recognition es_ES
dc.subject Character-level interaction. es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction es_ES
dc.type Artículo es_ES
dc.embargo.lift 10000-01-01
dc.embargo.terms forever es_ES
dc.identifier.doi 10.1142/S0218001412630037
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2009-14633-C03-01/ES/Multimodal Interaction For Text Transcription With Adaptive Learning/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MITURCO//TSI-020110-2009-0439/ES/ERUDITO.COM/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/ME//AP2010-0575/ES/AP2010-0575/ / es_ES
dc.relation.projectID info:eu-repo/grantAgreement/GVA//PROMETEO09%2F2009%2F014/ES/Adaptive learning and multimodality in pattern recognition (Almapater)/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/UPV//PAID-05-11/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MEC//CSD2007-00018/ES/Multimodal Intraction in Pattern Recognition and Computer Visionm/ / es_ES
dc.rights.accessRights Cerrado es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.contributor.affiliation Universitat Politècnica de València. Instituto Universitario Mixto Tecnológico de Informática - Institut Universitari Mixt Tecnològic d'Informàtica es_ES
dc.description.bibliographicCitation Martín-Albo Simón, D.; Romero Gómez, V.; Toselli, AH.; Vidal Ruiz, E. (2012). Multimodal Computer-Assisted Transcription of Text Images at Character-Level Interaction. International Journal of Pattern Recognition and Artificial Intelligence. 26(5):1263003-1-1263003-19. https://doi.org/10.1142/S0218001412630037 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://dx.doi.org/10.1142/S0218001412630037 es_ES
dc.description.upvformatpinicio 1263003-1 es_ES
dc.description.upvformatpfin 1263003-19 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 26 es_ES
dc.description.issue 5 es_ES
dc.relation.senia 234244
dc.contributor.funder Ministerio de Educación es_ES
dc.contributor.funder Generalitat Valenciana es_ES
dc.contributor.funder Universitat Politècnica de València es_ES
dc.contributor.funder Ministerio de Industria, Turismo y Comercio es_ES
dc.description.references Civera, J., Vilar, J. M., Cubel, E., Lagarda, A. L., Barrachina, S., Casacuberta, F., … González, J. (2004). A Syntactic Pattern Recognition Approach to Computer Assisted Translation. Structural, Syntactic, and Statistical Pattern Recognition, 207-215. doi:10.1007/978-3-540-27868-9_21 es_ES
dc.description.references MARTI, U.-V., & BUNKE, H. (2001). USING A STATISTICAL LANGUAGE MODEL TO IMPROVE THE PERFORMANCE OF AN HMM-BASED CURSIVE HANDWRITING RECOGNITION SYSTEM. International Journal of Pattern Recognition and Artificial Intelligence, 15(01), 65-90. doi:10.1142/s0218001401000848 es_ES
dc.description.references Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257-286. doi:10.1109/5.18626 es_ES
dc.description.references TOSELLI, A. H., JUAN, A., GONZÁLEZ, J., SALVADOR, I., VIDAL, E., CASACUBERTA, F., … NEY, H. (2004). INTEGRATED HANDWRITING RECOGNITION AND INTERPRETATION USING FINITE-STATE MODELS. International Journal of Pattern Recognition and Artificial Intelligence, 18(04), 519-539. doi:10.1142/s0218001404003344 es_ES
dc.description.references Toselli, A. H., Romero, V., Pastor, M., & Vidal, E. (2010). Multimodal interactive transcription of text images. Pattern Recognition, 43(5), 1814-1825. doi:10.1016/j.patcog.2009.11.019 es_ES
dc.description.references Zimmermann, M., Chappelier, J.-C., & Bunke, H. (2006). Offline grammar-based recognition of handwritten sentences. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(5), 818-821. doi:10.1109/tpami.2006.103 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem