- -

Using the words/leafs ratio in the DOM tree for content extraction

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by

Statistics

Using the words/leafs ratio in the DOM tree for content extraction

Show full item record

Insa Cabrera, D.; Silva Galiana, JF.; Tamarit, S. (2013). Using the words/leafs ratio in the DOM tree for content extraction. Journal of Logic and Algebraic Programming. 82(8):311-325. doi:10.1016/j.jlap.2013.01.002

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10251/37664

Files in this item

Item Metadata

Title: Using the words/leafs ratio in the DOM tree for content extraction
Author:
UPV Unit: Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
Issued date:
Abstract:
The main content in a webpage is usually centered and visible without the need to scroll. It is often rounded by the navigation menus of the website and it can include advertisements, panels, banners, and other not ...[+]
Subjects: Content extraction , Block detection , DOM , Information retrieval
Copyrigths: Reserva de todos los derechos
Source:
Journal of Logic and Algebraic Programming. (issn: 1567-8326 )
DOI: 10.1016/j.jlap.2013.01.002
Publisher:
Elsevier
Publisher version: http://dx.doi.org/10.1016/j.jlap.2013.01.002
Thanks:
This work has been partially supported by the Spanish Ministerio de Economia y Competitividad (Secretaria de Estado de Investigacion, Desarrollo e Innovacion) under Grant TIN2008-06622-C03-02 and by the Generalitat Valenciana ...[+]
Type: Artículo

This item appears in the following Collection(s)

Show full item record