
Automatic detection of webpages that share the same web template

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia



dc.contributor.author Alarte, Julián es_ES
dc.contributor.author Insa Cabrera, David es_ES
dc.contributor.author Silva Galiana, Josep Francesc es_ES
dc.contributor.author Tamarit Muñoz, Salvador es_ES
dc.date.accessioned 2015-05-18T13:54:10Z
dc.date.available 2015-05-18T13:54:10Z
dc.date.issued 2014-09
dc.identifier.issn 2075-2180
dc.identifier.uri http://hdl.handle.net/10251/50402
dc.description.abstract [EN] Template extraction is the process of isolating the template of a given webpage. It is widely used in several disciplines, including webpage development, content extraction, block detection, and webpage indexing. One of the main goals of template extraction is identifying a set of webpages with the same template without having to load and analyze too many webpages prior to identifying the template. This work introduces a new technique to automatically discover a reduced set of webpages in a website that implement the template. This set is computed with a hyperlink analysis that yields a very small set with a high level of confidence. es_ES
dc.description.sponsorship This work has been partially supported by the Spanish Ministerio de Economía y Competitividad (Secretaría de Estado de Investigación, Desarrollo e Innovación) under grant TIN2013-44742-C4-1-R and by the Generalitat Valenciana under grant PROMETEO/2011/052. David Insa was partially supported by the Spanish Ministerio de Educación under FPU grant AP2010-4415. Salvador Tamarit was partially supported by research project POLCA, Programming Large Scale Heterogeneous Infrastructures (610686), funded by the European Union, STREP FP7.
dc.language English es_ES
dc.relation.ispartof Electronic Proceedings in Theoretical Computer Science es_ES
dc.rights Attribution (by) es_ES
dc.subject.classification COMPUTER LANGUAGES AND SYSTEMS es_ES
dc.title Automatic detection of webpages that share the same web template es_ES
dc.type Article es_ES
dc.identifier.doi 10.4204/EPTCS.163.2
dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/610686/EU/Programming Large Scale Heterogeneous Infrastructures/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//TIN2013-44742-C4-1-R/ES/VALIDACION ASISTIDA DE PROGRAMAS MEDIANTE METODOS PRECISOS Y RIGUROSOS PARA UNA INGENIERIA DEL SOFTWARE ROBUSTA/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/GVA//PROMETEO%2F2011%2F052/ES/LOGICEXTREME: TECNOLOGIA LOGICA Y SOFTWARE SEGURO/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MECD//AP2010-4415/ES/AP2010-4415/ es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Alarte, J.; Insa Cabrera, D.; Silva Galiana, JF.; Tamarit Muñoz, S. (2014). Automatic detection of webpages that share the same web template. Electronic Proceedings in Theoretical Computer Science. 163:2-15. https://doi.org/10.4204/EPTCS.163.2 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://dx.doi.org/10.4204/EPTCS.163.2 es_ES
dc.description.upvformatpinicio 2 es_ES
dc.description.upvformatpfin 15 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 163 es_ES
dc.relation.senia 272407
dc.contributor.funder European Commission
dc.contributor.funder Ministerio de Economía y Competitividad
dc.contributor.funder Generalitat Valenciana
dc.contributor.funder Ministerio de Educación, Cultura y Deporte es_ES

