- -

A Systematic Study of Knowledge Graph Analysis for Cross-language Plagiarism Detection

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by

Statistics

A Systematic Study of Knowledge Graph Analysis for Cross-language Plagiarism Detection

Show simple item record

Files in this item

dc.contributor.author Franco-Salvador, Marc es_ES
dc.contributor.author Rosso, Paolo es_ES
dc.contributor.author Montes Gomez, Manuel es_ES
dc.date.accessioned 2017-05-31T08:50:35Z
dc.date.available 2017-05-31T08:50:35Z
dc.date.issued 2016-07
dc.identifier.issn 0306-4573
dc.identifier.uri http://hdl.handle.net/10251/82079
dc.description This is the author’s version of a work that was accepted for publication in Information Processing and Management. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Information Processing and Management 52 (2016) 550–570. DOI 10.1016/j.ipm.2015.12.004 es_ES
dc.description.abstract Cross-language plagiarism detection aims to detect plagiarised fragments of text among documents in different languages. In this paper, we perform a systematic examination of Cross-language Knowledge Graph Analysis; an approach that represents text fragments using knowledge graphs as a language independent content model. We analyse the contributions to cross-language plagiarism detection of the different aspects covered by knowledge graphs: word sense disambiguation, vocabulary expansion, and representation by similarities with a collection of concepts. In addition, we study both the relevance of concepts and their relations when detecting plagiarism. Finally, as a key component of the knowledge graph construction, we present a new weighting scheme of relations between concepts based on distributed representations of concepts. Experimental results in Spanish–English and German–English plagiarism detection show state-of-the-art performance and provide interesting insights on the use of knowledge graphs. © 2015 Elsevier Ltd. All rights reserved. es_ES
dc.description.sponsorship This research has been carried out in the framework of the European Commission WIQ-EI IRSES (No. 269180) and DIANA APPLICATIONS - Finding Hidden Knowledge in Texts: Applications (TIN2012-38603-C02-01) projects. We would like to thank Tomas Mikolov, Martin Potthast, and Luis A. Leiva for their support and comments during this research. en_EN
dc.language Inglés es_ES
dc.publisher Elsevier es_ES
dc.relation European Commission/ WIQ-EI IRSES/ 269180 es_ES
dc.relation DIANA APPLICATIONS - Finding Hidden Knowledge in Texts: Applications/ TIN2012-38603-C02-01 es_ES
dc.relation.ispartof Information Processing and Management es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Cross-language es_ES
dc.subject Plagiarism detection es_ES
dc.subject Knowledge graphs es_ES
dc.subject Multilingual semantic network es_ES
dc.subject Distributed representations es_ES
dc.subject Evaluation es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title A Systematic Study of Knowledge Graph Analysis for Cross-language Plagiarism Detection es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1016/j.ipm.2015.12.004
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.contributor.affiliation Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica es_ES
dc.description.bibliographicCitation Franco-Salvador, M.; Rosso, P.; Montes Gomez, M. (2016). A Systematic Study of Knowledge Graph Analysis for Cross-language Plagiarism Detection. Information Processing and Management. 52(4):550-570. doi:10.1016/j.ipm.2015.12.004 es_ES
dc.description.accrualMethod Senia es_ES
dc.relation.publisherversion http://dx.doi.org/10.1016/j.ipm.2015.12.004 es_ES
dc.description.upvformatpinicio 550 es_ES
dc.description.upvformatpfin 570 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 52 es_ES
dc.description.issue 4 es_ES
dc.relation.senia 326672 es_ES


This item appears in the following Collection(s)

Show simple item record