Text Baseline Detection, a single page trained system

Pastor Gadea, Moisés

doi:10.1016/j.patcog.2019.05.031

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Text Baseline Detection, a single page trained system

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: Pastor - Text ...

Tamaño: 11.03Mb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: ArticlePublicat.pdf

Tamaño: 6.591Mb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Pastor Gadea, Moisés	es_ES
dc.date.accessioned	2020-12-04T04:31:58Z
dc.date.available	2020-12-04T04:31:58Z
dc.date.issued	2019-10	es_ES
dc.identifier.issn	0031-3203	es_ES
dc.identifier.uri	http://hdl.handle.net/10251/156420
dc.description.abstract	[EN] Nowadays, there are a lot of page images available and the scanning process is quite well resolved and can be done industrially. On the other hand, HTR systems can only deal with single text line images. Segmenting pages into single text line images is a very expensive process which has traditionally been done manually. This is a bottleneck which is holding back any massive industrial document processing. A baseline detection method will be presented here'. The initial problem is reformulated as a clustering problem over a set of interest points. Its design aim is to be fast and to resist the noise artifacts that usually appear in historical manuscripts: variable interline spacing, the overlapping and touching of words in adjacent lines, humidity spots, etc. Results show that this system can be used to massively detect where the text lines are in pages. Highlight: This system reached second place in the ICDAR 2017 Competition on Baseline Detection (see Table 1). (C) 2019 Elsevier Ltd. All rights reserved.	es_ES
dc.description.sponsorship	This work was partially supported by the project Carabela (PR[17]_HUM_D4_0059), sponsored by the programme "Ayudas a Equipos de Investigacion en Humanidades Digitales" of the BBVA Foundacion.	es_ES
dc.language	Inglés	es_ES
dc.publisher	Elsevier	es_ES
dc.relation.ispartof	Pattern Recognition	es_ES
dc.rights	Reconocimiento - No comercial - Sin obra derivada (by-nc-nd)	es_ES
dc.subject	Empirical performance Evaluation	es_ES
dc.subject	Segmentation	es_ES
dc.subject	Documents	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Text Baseline Detection, a single page trained system	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.1016/j.patcog.2019.05.031	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/fBBVA//PR[17]_HUM_D4_0059/	es_ES
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Pastor Gadea, M. (2019). Text Baseline Detection, a single page trained system. Pattern Recognition. 94:149-161. https://doi.org/10.1016/j.patcog.2019.05.031	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	https://doi.org/10.1016/j.patcog.2019.05.031	es_ES
dc.description.upvformatpinicio	149	es_ES
dc.description.upvformatpfin	161	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	94	es_ES
dc.relation.pasarela	S\388426	es_ES
dc.contributor.funder	Fundación BBVA	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [45942]

Mostrar el registro sencillo del ítem

Text Baseline Detection, a single page trained system

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Text Baseline Detection, a single page trained system

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)