Álvaro Muñoz, Francisco; Cruz Fernández, Francisco; Sánchez Peiró, Joan Andreu; Ramos Terrades, Oriol; Benedí Ruiz, José Miguel(Springer, 2013)
n this paper we define a bidimensional extension of Stochastic Context-Free Grammars for page segmentation of structured documents. Two sets of text classification features are used to perform an initial classification of ...
Álvaro Muñoz, Francisco; Cruz Fernández, Francisco; Sánchez Peiró, Joan Andreu; Ramos Terrades, Oriol; Benedí Ruiz, José Miguel(Elsevier, 2015-02-20)
[EN] In this paper we define a bidimensional extension of stochastic context-free grammars for structure detection and segmentation of images of documents. Two sets of text classification features are used to perform an ...