Using the words/leafs ratio in the DOM tree for content extraction

Insa Cabrera, David; Silva Galiana, Josep Francesc; Tamarit, Salvador

doi:10.1016/j.jlap.2013.01.002

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Using the words/leafs ratio in the DOM tree for content extraction

Mostrar el registro completo del ítem

Insa Cabrera, D.; Silva Galiana, JF.; Tamarit, S. (2013). Using the words/leafs ratio in the DOM tree for content extraction. Journal of Logic and Algebraic Programming. 82(8):311-325. https://doi.org/10.1016/j.jlap.2013.01.002

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10251/37664

Ficheros en el ítem

Nombre: paper.pdf

Tamaño: 1.528Mb

Formato: PDF

Descripción: Versión del Autor.

Abrir/Preview

Nombre: Insa,D; Silva,J;T ...

Tamaño: 2.215Mb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

Metadatos del ítem

Título:

Using the words/leafs ratio in the DOM tree for content extraction

Autor:

Insa Cabrera, David

Silva Galiana, Josep Francesc Tamarit, Salvador

Entidad UPV:

Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació

Fecha difusión:

2013-11

Resumen:

The main content in a webpage is usually centered and visible without the need to scroll. It is often rounded by the navigation menus of the website and it can include advertisements, panels, banners, and other not ...[+]

Palabras clave:

Content extraction , Block detection , DOM , Information retrieval

Derechos de uso:

Reserva de todos los derechos

Fuente:

Journal of Logic and Algebraic Programming. (issn: 1567-8326 )

DOI:

10.1016/j.jlap.2013.01.002

Editorial:

Elsevier

Versión del editor:

http://dx.doi.org/10.1016/j.jlap.2013.01.002

Código del Proyecto:

info:eu-repo/grantAgreement/MICINN//TIN2008-06622-C03-02/ES/VERIFICACION Y DEPURACION AGILES ORIENTADAS A MEJORAR LA SEGURIDAD DEL SOFTWARE/
info:eu-repo/grantAgreement/GVA//PROMETEO%2F2011%2F052/ES/LOGICEXTREME: TECNOLOGIA LOGICA Y SOFTWARE SEGURO/
info:eu-repo/grantAgreement/MICINN//BES-2009-015019/ES/BES-2009-015019/
info:eu-repo/grantAgreement/ME//AP2010-4415/ES/AP2010-4415/

Agradecimientos:

This work has been partially supported by the Spanish Ministerio de Economia y Competitividad (Secretaria de Estado de Investigacion, Desarrollo e Innovacion) under Grant TIN2008-06622-C03-02 and by the Generalitat Valenciana ...[+]

Tipo:

Artículo

recommendations

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [48357]

Mostrar el registro completo del ítem

Using the words/leafs ratio in the DOM tree for content extraction

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Using the words/leafs ratio in the DOM tree for content extraction

Ficheros en el ítem

Metadatos del ítem

recommendations

Este ítem aparece en la(s) siguiente(s) colección(ones)