Mostrar el registro sencillo del ítem
dc.contributor.author | Pastor Gadea, Moisés | es_ES |
dc.date.accessioned | 2020-12-04T04:31:58Z | |
dc.date.available | 2020-12-04T04:31:58Z | |
dc.date.issued | 2019-10 | es_ES |
dc.identifier.issn | 0031-3203 | es_ES |
dc.identifier.uri | http://hdl.handle.net/10251/156420 | |
dc.description.abstract | [EN] Nowadays, there are a lot of page images available and the scanning process is quite well resolved and can be done industrially. On the other hand, HTR systems can only deal with single text line images. Segmenting pages into single text line images is a very expensive process which has traditionally been done manually. This is a bottleneck which is holding back any massive industrial document processing. A baseline detection method will be presented here'. The initial problem is reformulated as a clustering problem over a set of interest points. Its design aim is to be fast and to resist the noise artifacts that usually appear in historical manuscripts: variable interline spacing, the overlapping and touching of words in adjacent lines, humidity spots, etc. Results show that this system can be used to massively detect where the text lines are in pages. Highlight: This system reached second place in the ICDAR 2017 Competition on Baseline Detection (see Table 1). (C) 2019 Elsevier Ltd. All rights reserved. | es_ES |
dc.description.sponsorship | This work was partially supported by the project Carabela (PR[17]_HUM_D4_0059), sponsored by the programme "Ayudas a Equipos de Investigacion en Humanidades Digitales" of the BBVA Foundacion. | es_ES |
dc.language | Inglés | es_ES |
dc.publisher | Elsevier | es_ES |
dc.relation.ispartof | Pattern Recognition | es_ES |
dc.rights | Reconocimiento - No comercial - Sin obra derivada (by-nc-nd) | es_ES |
dc.subject | Empirical performance Evaluation | es_ES |
dc.subject | Segmentation | es_ES |
dc.subject | Documents | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Text Baseline Detection, a single page trained system | es_ES |
dc.type | Artículo | es_ES |
dc.identifier.doi | 10.1016/j.patcog.2019.05.031 | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/fBBVA//PR[17]_HUM_D4_0059/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.description.bibliographicCitation | Pastor Gadea, M. (2019). Text Baseline Detection, a single page trained system. Pattern Recognition. 94:149-161. https://doi.org/10.1016/j.patcog.2019.05.031 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | https://doi.org/10.1016/j.patcog.2019.05.031 | es_ES |
dc.description.upvformatpinicio | 149 | es_ES |
dc.description.upvformatpfin | 161 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 94 | es_ES |
dc.relation.pasarela | S\388426 | es_ES |
dc.contributor.funder | Fundación BBVA | es_ES |