
Text Baseline Detection, a single page trained system

RiuNet: Institutional Repository of the Universitat Politècnica de València



dc.contributor.author Pastor Gadea, Moisés es_ES
dc.date.accessioned 2020-12-04T04:31:58Z
dc.date.available 2020-12-04T04:31:58Z
dc.date.issued 2019-10 es_ES
dc.identifier.issn 0031-3203 es_ES
dc.identifier.uri http://hdl.handle.net/10251/156420
dc.description.abstract [EN] Large numbers of page images are now available, and scanning is a well-solved process that can be carried out at industrial scale. Handwritten text recognition (HTR) systems, however, can only deal with single text line images. Segmenting pages into single text line images is a very expensive process that has traditionally been done manually, and this bottleneck is holding back massive industrial document processing. A baseline detection method is presented here. The initial problem is reformulated as a clustering problem over a set of interest points. The method is designed to be fast and to resist the noise and artifacts that usually appear in historical manuscripts: variable interline spacing, overlapping and touching words in adjacent lines, humidity spots, etc. Results show that this system can be used to detect text lines in pages at massive scale. Highlight: This system reached second place in the ICDAR 2017 Competition on Baseline Detection (see Table 1). © 2019 Elsevier Ltd. All rights reserved. es_ES
dc.description.sponsorship This work was partially supported by the project Carabela (PR[17]_HUM_D4_0059), sponsored by the programme "Ayudas a Equipos de Investigación en Humanidades Digitales" of the Fundación BBVA. es_ES
dc.language English es_ES
dc.publisher Elsevier es_ES
dc.relation.ispartof Pattern Recognition es_ES
dc.rights Attribution - NonCommercial - NoDerivatives (by-nc-nd) es_ES
dc.subject Empirical performance Evaluation es_ES
dc.subject Segmentation es_ES
dc.subject Documents es_ES
dc.subject.classification Computer Languages and Systems es_ES
dc.title Text Baseline Detection, a single page trained system es_ES
dc.type Article es_ES
dc.identifier.doi 10.1016/j.patcog.2019.05.031 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/fBBVA//PR[17]_HUM_D4_0059/ es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Pastor Gadea, M. (2019). Text Baseline Detection, a single page trained system. Pattern Recognition. 94:149-161. https://doi.org/10.1016/j.patcog.2019.05.031 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1016/j.patcog.2019.05.031 es_ES
dc.description.upvformatpinicio 149 es_ES
dc.description.upvformatpfin 161 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 94 es_ES
dc.relation.pasarela S\388426 es_ES
dc.contributor.funder Fundación BBVA es_ES
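
The abstract describes baseline detection as a clustering problem over a set of interest points, but this record gives no algorithmic detail. The following is only a toy sketch of that general idea, not the method of the paper: it assumes interest points are already available as (x, y) pixel coordinates, groups them by vertical proximity, and approximates each baseline as a horizontal segment. The function names and the `max_gap` parameter are hypothetical, introduced purely for illustration.

```python
def cluster_points_into_lines(points, max_gap):
    """Group (x, y) points into text lines by vertical proximity.

    Toy assumption: lines are roughly horizontal, so a jump in y
    larger than max_gap between consecutive points starts a new line.
    """
    if not points:
        return []
    ordered = sorted(points, key=lambda p: p[1])  # top to bottom
    clusters = [[ordered[0]]]
    for point in ordered[1:]:
        previous_y = clusters[-1][-1][1]
        if point[1] - previous_y > max_gap:
            clusters.append([point])      # gap too large: new text line
        else:
            clusters[-1].append(point)    # same text line
    # Order each cluster left to right so it reads like a polyline.
    return [sorted(cluster) for cluster in clusters]


def baseline_of(cluster):
    """Approximate a baseline as a horizontal segment at the mean y."""
    mean_y = sum(y for _, y in cluster) / len(cluster)
    return (cluster[0][0], mean_y), (cluster[-1][0], mean_y)


if __name__ == "__main__":
    pts = [(10, 100), (50, 102), (90, 98), (12, 150), (60, 151)]
    for line in cluster_points_into_lines(pts, max_gap=20):
        print(baseline_of(line))
```

A one-dimensional gap rule like this would break down on exactly the cases the abstract names (variable interline spacing, touching words in adjacent lines), which is why a real system needs a more robust clustering criterion.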

