Towards the detection of cross-language source code reuse

Flores Sáez, Enrique; Barrón Cedeño, Luis Alberto; Rosso, Paolo; Moreno Boronat, Lidia Ana

doi:10.1007/978-3-642-22327-3_31

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Towards the detection of cross-language source code reuse

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: nldbdelivery.pdf

Tamaño: 209.3Kb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: Towards the Detection ...

Tamaño: 99.64Kb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Flores Sáez, Enrique	es_ES
dc.contributor.author	Barrón Cedeño, Luis Alberto	es_ES
dc.contributor.author	Rosso, Paolo	es_ES
dc.contributor.author	Moreno Boronat, Lidia Ana	es_ES
dc.date.accessioned	2014-02-27T12:12:26Z
dc.date.issued	2011
dc.identifier.isbn	978-3-642-22326-6
dc.identifier.issn	0302-9743
dc.identifier.uri	http://hdl.handle.net/10251/36010
dc.description.abstract	Internet has made available huge amounts of information, also source code. Source code repositories and, in general, programming related websites, facilitate its reuse. In this work, we propose a simple approach to the detection of cross-language source code reuse, a nearly investigated problem. Our preliminary experiments, based on character n-grams comparison, show that considering different sections of the code (i.e., comments, code, reserved words, etc.), leads to different results. When considering three programming languages: C++, Java, and Python, the best result is obtained when comments are discarded and the entire source code is considered.	es_ES
dc.description.sponsorship	This work has been developed with the support of the project TEXT-ENTERPRISE 2.0: Text comprehension techniques applied to the needs of the Enterprise 2.0 (MICINN, Spain TIN2009-13391-C04-03 (PlanI+D+i)).	es_ES
dc.format.extent	4	es_ES
dc.language	Inglés	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Natural Language Processing and Information Systems	es_ES
dc.relation.ispartofseries	Lecture Notes in Computer Science;vol. 6716
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Source code reuse	es_ES
dc.subject	Cross-language source code reuse analysis	es_ES
dc.subject	Plagiarism detection	es_ES
dc.subject.classification	CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Towards the detection of cross-language source code reuse	es_ES
dc.type	Capítulo de libro	es_ES
dc.embargo.lift	10000-01-01
dc.embargo.terms	forever	es_ES
dc.identifier.doi	10.1007/978-3-642-22327-3_31
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2009-13391-C04-03/ES/Text-Enterprise 2.0: Tecnicas De Comprension De Textos Aplicadas A Las Necesidades De La Empresa 2.0/	es_ES
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Flores Sáez, E.; Barrón Cedeño, LA.; Rosso, P.; Moreno Boronat, LA. (2011). Towards the detection of cross-language source code reuse. En Natural Language Processing and Information Systems. Springer Verlag (Germany). 6716:250-253. https://doi.org/10.1007/978-3-642-22327-3_31	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.conferencename	16th International Conference on Applications of Natural Language to Information Systems, NLDB 2011	es_ES
dc.relation.conferencedate	June 28-30, 2011	es_ES
dc.relation.conferenceplace	Alicante, Spain	es_ES
dc.relation.publisherversion	http://link.springer.com/chapter/10.1007/978-3-642-22327-3_31	es_ES
dc.description.upvformatpinicio	250	es_ES
dc.description.upvformatpfin	253	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	6716	es_ES
dc.relation.senia	214034
dc.contributor.funder	Ministerio de Ciencia e Innovación	es_ES
dc.description.references	Arwin, C., Tahaghoghi, S.M.M.: Plagiarism Detection across Programming Languages. In: Proceedings of the 29th Australasian Computer Science Conference, vol. 48, pp. 277–286 (2006)	es_ES
dc.description.references	Faidhi, J., Robinson, S.: An empirical approach for detecting program similarity and plagiarism within a university programming environment. Comput. Educ. 11, 11–19 (1987)	es_ES
dc.description.references	Jankowitz, H.T.: Detecting plagiarism in student pascal programs. The Computer Journal 31(1) (1988)	es_ES
dc.description.references	Pinto, D., Civera, J., Barrón-Cedeño, A., Juan, A., Rosso, P.: A statistical approach to crosslingual natural language tasks. Journal of Algorithms 64(1), 51–60 (2009)	es_ES
dc.description.references	Potthast, M., Barrón-Cedeño, A., Stein, B., Rosso, P.: Cross-Language Plagiarism Detection. Languages Resources and Evaluation. Special Issue on Plagiarism and Authorship Analysis 45(1) (2011)	es_ES
dc.description.references	Rosales, F., García, A., Rodríguez, S., Pedraza, J.L., Méndez, R., Nieto, M.M.: Detection of plagiarism in programming assignments. IEEE Transactions on Education 51(2), 174–183 (2008)	es_ES
dc.description.references	Stamatatos, E.: Intrinsic Plagiarism Detection Using Character n-gram Profiles. In: Proc. SEPLN 2009, Donostia, Spain, pp. 38–46 (2009)	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [47171]

Mostrar el registro sencillo del ítem

Towards the detection of cross-language source code reuse

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Towards the detection of cross-language source code reuse

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)