A Spanish dataset for reproducible benchmarked offline handwriting recognition

España Boquera, Salvador; Castro-Bleda, Maria Jose

doi:10.1007/s10579-022-09587-3

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

A Spanish dataset for reproducible benchmarked offline handwriting recognition

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: EspanaCastro-Bleda ...

Tamaño: 507.8Kb

Formato: PDF

Descripción: Versión editorial

Abrir

dc.contributor.author	España Boquera, Salvador	es_ES
dc.contributor.author	Castro-Bleda, Maria Jose	es_ES
dc.date.accessioned	2023-10-19T18:02:15Z
dc.date.available	2023-10-19T18:02:15Z
dc.date.issued	2022-09	es_ES
dc.identifier.issn	1574-020X	es_ES
dc.identifier.uri	http://hdl.handle.net/10251/198426
dc.description.abstract	[EN] In this paper, a public dataset for Offline Handwriting Recognition, along with an appropriate evaluation method to provide benchmark indicators at sentence level, is presented. This dataset, called SPA-Sentences, consists of offline handwritten Spanish sentences extracted from 1617 forms produced by the same number of writers. A total of 13,691 sentences comprising around 100,000 word instances out of a vocabulary of 3288 words occur in the collection. Careful attention has been paid to make the baseline experiments both reproducible and competitive. To this end, experiments are based on state-of-the-art recognition techniques combining convolutional blocks with one-dimensional Bidirectional Long Short Term Memory (LSTM) networks using Connectionist Temporal Classification (CTC) decoding. The scripts with the entire experimental setting have been made available. The SPA-Sentences dataset and its baseline evaluation are freely available for research purposes via the institutional University repository. We expect the research community to include this corpus, as is usually done with English IAM and French RIMES datasets, in their battery of experiments when reporting novel handwriting recognition techniques.	es_ES
dc.language	Inglés	es_ES
dc.publisher	Springer-Verlag	es_ES
dc.relation.ispartof	Language Resources and Evaluation	es_ES
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Handwriting recognition	es_ES
dc.subject	Offline handwriting recognition	es_ES
dc.subject	Datasets	es_ES
dc.subject	Evaluation	es_ES
dc.subject	Benchmarking	es_ES
dc.subject	Experimental reproducibility	es_ES
dc.subject	Spanish resources	es_ES
dc.subject	Deep learning	es_ES
dc.subject	Convolutional neural networks (CNN)	es_ES
dc.subject	Long short term memory (LSTM) networks	es_ES
dc.subject	Connectionist temporal classification (CTC)	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	A Spanish dataset for reproducible benchmarked offline handwriting recognition	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.1007/s10579-022-09587-3	es_ES
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica	es_ES
dc.description.bibliographicCitation	España Boquera, S.; Castro-Bleda, MJ. (2022). A Spanish dataset for reproducible benchmarked offline handwriting recognition. Language Resources and Evaluation. 56(3):1009-1022. https://doi.org/10.1007/s10579-022-09587-3	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	https://doi.org/10.1007/s10579-022-09587-3	es_ES
dc.description.upvformatpinicio	1009	es_ES
dc.description.upvformatpfin	1022	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	56	es_ES
dc.description.issue	3	es_ES
dc.relation.pasarela	S\491455	es_ES
dc.description.references	Amengual, J. C., Benedí, J. M., Casacuberta, F., Castaño, A., Castellanos, A., Jiménez, V. M., Llorens, D., Marzal, A., Prat, F., Vilar, J.M., Benedí, J.M., Casacuberta, F., Pastor, M., & Vidal. E. (2000). The EUTRANS-I speech translation system. Machine Translation Journal, 15, 75–103.	es_ES
dc.description.references	Amodei, D., Anubhai, R., Battenberg, E., Case, C., Casper, J., Catanzaro, B., Chen, J., Chrzanowski, M., Coates, A., Diamos, G., Elsen, E., Engel, J., Fan, L., Fougner, C., Han, T., Hannun, A., Jun, B., LeGresley, P., Lin, L., Narang, S., Ng, A., Ozair, S., Prenger, R., Raiman, J., Satheesh, S., Seetapun, D., Sengupta, S., Wang, Y., Wang, Z., Wang, C., Xiao, B., Yogatama, D., Zhan, J., & Zhu. Z. (2016). Deep speech 2: End-to-end speech recognition in English and Mandarin. In Proceedings of the 33rd international conference on international conference on machine learning (ICML) (Vol. 48, pp. 173–182). JMLR.org.	es_ES
dc.description.references	Chetlur, S., Woolley, C., Vandermersch, P., Cohen, J., Tran, J., Catanzaro, B., & Shelhamer, E. (2014). cuDNN: Efficient primitives for deep learning. CoRR abs/1410.0759. http://arxiv.org/abs/1410.0759.	es_ES
dc.description.references	Collobert, R., Kavukcuoglu, K., & Farabet, C. (2011). Torch7: A Matlab-like environment for machine learning. In Proceedings of big learning 2011: NIPS 2011 workshop on algorithms, systems, and tools for learning at scale.	es_ES
dc.description.references	Díaz-Verdejo, J. E., Peinado, A. M., Rubio, A. J., Segarra, E., Prieto, N., & Casacuberta, F. (1998). ALBAYZIN: A task-oriented Spanish speech corpus. In Proceedings of the first international conference on language resources and evaluation (LREC) (pp. 497–501). Granada, Spain.	es_ES
dc.description.references	Doetsch, P., Kozielski, M., & Ney, H. (2014). Fast and robust training of recurrent neural networks for offline handwriting recognition. In Proceedings of the 14th international conference on frontiers in handwriting recognition (ICFHR) (pp. 279–284). IEEE.	es_ES
dc.description.references	España Boquera, S., Castro Bleda, M. J., & Hidalgo, J. L. (2004). The SPARTACUS-Database: A Spanish sentence database for offline handwriting recognition. In Proceedings of the fourth international conference on language resources and evaluation (LREC) (pp. 227–230). Lisbon, Portugal.	es_ES
dc.description.references	Fischer, A., Baechler, M., Garz, A., Liwicki, M., & Ingold, R. (2014). A combined system for text line extraction and handwriting recognition in historical documents. In Proceedings of the 11th IAPR international workshop on document analysis systems (DAS) (pp. 71–75). IEEE.	es_ES
dc.description.references	Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G., & Stolz, M. (2010). Ground Truth Creation for Handwriting Recognition in Historical Documents. In Proceedings of the 9th IAPR international workshop on document analysis systems (DAS) (pp. 3–10). ACM, New York, NY, USA. https://doi.org/10.1145/1815330.1815331.	es_ES
dc.description.references	Gers, F. A., Schraudolph, N. N., & Schmidhuber, J. (2002). Learning precise timing with LSTM recurrent networks. Journal of machine learning research, 3(Aug), 115–143.	es_ES
dc.description.references	Graves, A., Fernández, S., Gomez, F., & Schmidhuber, J. (2006). Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In Proceedings of the 23rd international conference on machine learning (ICML) (pp. 369–376). ACM.	es_ES
dc.description.references	Graves, A., Liwicki, M., Fernández, S., Bertolami, R., Bunke, H., & Schmidhuber, J. (2008). A novel connectionist system for unconstrained handwriting recognition. IEEE Transaction on Pattern Analysis and Machine Intelligence, 31(5), 855–868.	es_ES
dc.description.references	Graves, A., & Schmidhuber, J. (2005). Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, 18(5–6), 602–610.	es_ES
dc.description.references	Graves, A., & Schmidhuber, J. (2009). Offline handwriting recognition with multidimensional recurrent neural networks. In Advances in neural information processing systems, pp. 545–552.	es_ES
dc.description.references	Grosicki, E., Carré, M., Brodin, J. M., & Geoffrois, E. (2008). RIMES evaluation campaign for handwritten mail processing. In Proceedings of the 11th international conference on frontiers in handwriting recognition (ICFHR), pp. 1–6. Concordia University, Montreal, Canada. https://hal.archives-ouvertes.fr/hal-01395332.	es_ES
dc.description.references	Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780.	es_ES
dc.description.references	Hull, J. J. (1994). A database for handwritten text recognition research. IEEE Transaction on Pattern Analysis and Machine Intelligence, 16(5), 550–554.	es_ES
dc.description.references	Hussain, R., Raza, A., Siddiqi, I., Khurshid, K., & Djeddi, C. (2015). A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation (p. 46). Image and Video Processing: EURASIP J.	es_ES
dc.description.references	Juan, A., Toselli, A. H., Domnech, J., González, J., Salvador, I., Vidal, E., & Casacuberta, F. (2004). Integrated handwriting recognition and interpretation via finite-state models. International Journal of Pattern Recognition and Artificial Intelligence, 18(04), 519–539.	es_ES
dc.description.references	LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. In Proceedings of the IEEE, 86(11), 2278–2324	es_ES
dc.description.references	Maas, A. L., Hannun, A. Y., & Ng, A. Y. (2013). Rectifier nonlinearities improve neural network acoustic models. In Proceedings of the international conference on international conference on machine learning (ICML) (Vol. 30, p. 3).	es_ES
dc.description.references	Marti, U. V., & Bunke, H. (2002). The IAM-database: An English sentence database for offline handwriting recognition. International Journal on Document Analysis and Recognition, 5, 39–46.	es_ES
dc.description.references	Mocholí Calvo, C., Mocholí-Calvo Mocholí-Calvo, C. Tutored by E. VIdal and J. Puigcerver. (2017–2018). Development and experimentation of a deep learning system for convolutional and recurrent neural networks. Master’s thesis, ETSINF Universitat Politècnica de València, Valencia (Spain).	es_ES
dc.description.references	Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., & Lerer, A. (2017). Automatic differentiation in PyTorch. In Proceedings of the 31st conference on neural information processing systems (NIPS). Long Beach, CA, USA.	es_ES
dc.description.references	Pérez, D., Tarazón, L., Serrano, N., Castro, F., Terrades, O.R., & Juan-Císcar, A. (2009). The GERMANA database. In 10th International conference on document analysis and recognition (pp. 301–305).	es_ES
dc.description.references	Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Hannemann, M., Motlicek, P., Qian, Y., Schwarz, P., Silovsky, J., Stemmer, G., & Vesely, K. (2011). The Kaldi speech recognition toolkit. Technical report: IEEE signal processing society.	es_ES
dc.description.references	Puigcerver, J. (2017). Are multidimensional recurrent layers really necessary for handwritten text recognition? In Proceedings of the 14th IAPR international conference on document analysis and recognition (ICDAR) (Vol. 01, pp. 67–72). https://doi.org/10.1109/ICDAR.2017.20.	es_ES
dc.description.references	Puigcerver, J., Martin-Albo, D., & Villegas, M. (2016). Laia: A deep learning toolkit for HTR.	es_ES
dc.description.references	Sabir, E., Rawls, S., & Natarajan, P. (2017). Implicit language model in LSTM for OCR. In Proceedings of the 14th IAPR international conference on document analysis and recognition (ICDAR) (Vol. 7, pp. 27–31). IEEE.	es_ES
dc.description.references	Sanchez, J. A., Toselli, A. H., Romero, V., & Vidal, E. (2015). ICDAR 2015 competition HTRtS: Handwritten text recognition on the tranScriptorium dataset. In Proceedings of the 13th international conference on document analysis and recognition (ICDAR).	es_ES
dc.description.references	Shi, B., Bai, X., & Yao, C. (2016). An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Transaction on Pattern Analysis and Machine Intelligence, 39(11), 2298–2304.	es_ES
dc.description.references	Slavik, P., & Govindaraju, V. (2001). Equivalence of Different Methods for Slant and Skew Corrections in Word Recognition Applications. IEEE Transaction on Pattern Analysis and Machine Intelligence, 23(3), 323–326.	es_ES
dc.description.references	Suen, C. Y., Nadal, C., Legault, R., Mai, T. A., & Lam, L. (1992). Computer recognition of unconstrained handwritten numerals. Special Issue of Proceedings of IEEE, 7(80), 1162–1180.	es_ES
dc.description.references	Toselli, A. H., Romero, V., & Vidal, E. (2007). Viterbi based alignment between text images and their transcripts. In Proceedings of the workshop on language technology for cultural heritage data (LaTeCH) (pp. 9–16).	es_ES
dc.description.references	Viard-Gaudin, C., Lallican, P. M., Knerr, S., & Binter, P. (1999). The IRESTE on/off (IRONOFF) dual handwriting database. In Proceedings of the fifth international conference on document analysis and recognition (ICDAR) (pp. 455–458). Bangalore, India.	es_ES
dc.description.references	Wilkinson, R., Geist, J., Janet, S., Grother, P., Burges, C., Creecy, R., Hammond, B., Hull, J., Larsen, N., Vogl, T., & Wilson, C. (1992). The first census optical character recognition systems conference. In #NISTIR 4912. The U.S. Bureau of Census and the National Institute of Standards and Technology, Gaithersburg, MD.	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [46085]

Mostrar el registro sencillo del ítem

A Spanish dataset for reproducible benchmarked offline handwriting recognition

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

A Spanish dataset for reproducible benchmarked offline handwriting recognition

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)