dc.contributor.author | Alarte, Julián | es_ES |
dc.contributor.author | Insa Cabrera, David | es_ES |
dc.contributor.author | Silva Galiana, Josep Francesc | es_ES |
dc.contributor.author | Tamarit Muñoz, Salvador | es_ES |
dc.date.accessioned | 2015-05-18T13:54:10Z | |
dc.date.available | 2015-05-18T13:54:10Z | |
dc.date.issued | 2014-09 | |
dc.identifier.issn | 2075-2180 | |
dc.identifier.uri | http://hdl.handle.net/10251/50402 | |
dc.description.abstract | [EN] Template extraction is the process of isolating the template of a given webpage. It is widely used in several disciplines, including webpage development, content extraction, block detection, and webpage indexing. One of the main goals of template extraction is to identify a set of webpages that share the same template without having to load and analyze too many webpages first. This work introduces a new technique to automatically discover a reduced set of webpages in a website that implement the template. This set is computed with a hyperlink analysis that yields a very small set with a high level of confidence. | es_ES |
dc.description.sponsorship | This work has been partially supported by the Spanish Ministerio de Economía y Competitividad (Secretaría de Estado de Investigación, Desarrollo e Innovación) under grant TIN2013-44742-C4-1-R and by the Generalitat Valenciana under grant PROMETEO/2011/052. David Insa was partially supported by the Spanish Ministerio de Educación under FPU grant AP2010-4415. Salvador Tamarit was partially supported by research project POLCA, Programming Large Scale Heterogeneous Infrastructures (610686), funded by the European Union, STREP FP7. | |
dc.language | English | es_ES |
dc.relation.ispartof | Electronic Proceedings in Theoretical Computer Science | es_ES |
dc.rights | Attribution (by) | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Automatic detection of webpages that share the same web template | es_ES |
dc.type | Article | es_ES |
dc.identifier.doi | 10.4204/EPTCS.163.2 | |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/FP7/610686/EU/Programming Large Scale Heterogeneous Infrastructures/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//TIN2013-44742-C4-1-R/ES/VALIDACION ASISTIDA DE PROGRAMAS MEDIANTE METODOS PRECISOS Y RIGUROSOS PARA UNA INGENIERIA DEL SOFTWARE ROBUSTA/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/GVA//PROMETEO%2F2011%2F052/ES/LOGICEXTREME: TECNOLOGIA LOGICA Y SOFTWARE SEGURO/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/MECD//AP2010-4415/ES/AP2010-4415/ | es_ES |
dc.rights.accessRights | Open | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.description.bibliographicCitation | Alarte, J.; Insa Cabrera, D.; Silva Galiana, JF.; Tamarit Muñoz, S. (2014). Automatic detection of webpages that share the same web template. Electronic Proceedings in Theoretical Computer Science. 163:2-15. https://doi.org/10.4204/EPTCS.163.2 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | http://dx.doi.org/10.4204/EPTCS.163.2 | es_ES |
dc.description.upvformatpinicio | 2 | es_ES |
dc.description.upvformatpfin | 15 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 163 | es_ES |
dc.relation.senia | 272407 | |
dc.contributor.funder | European Commission | |
dc.contributor.funder | Ministerio de Economía y Competitividad | |
dc.contributor.funder | Generalitat Valenciana | |
dc.contributor.funder | Ministerio de Educación, Cultura y Deporte | es_ES |