Towards the detection of cross-language source code reuse

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

dc.contributor.author Flores Sáez, Enrique es_ES
dc.contributor.author Barrón Cedeño, Luis Alberto es_ES
dc.contributor.author Rosso, Paolo es_ES
dc.contributor.author Moreno Boronat, Lidia Ana es_ES
dc.date.accessioned 2014-02-27T12:12:26Z
dc.date.issued 2011
dc.identifier.isbn 978-3-642-22326-6
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/36010
dc.description.abstract The Internet has made huge amounts of information available, including source code. Source code repositories and, more generally, programming-related websites facilitate its reuse. In this work, we propose a simple approach to the detection of cross-language source code reuse, a barely investigated problem. Our preliminary experiments, based on character n-gram comparison, show that considering different sections of the code (i.e., comments, code, reserved words, etc.) leads to different results. When considering three programming languages (C++, Java, and Python), the best result is obtained when comments are discarded and the entire source code is considered. es_ES
dc.description.sponsorship This work has been developed with the support of the project TEXT-ENTERPRISE 2.0: Text comprehension techniques applied to the needs of the Enterprise 2.0 (MICINN, Spain TIN2009-13391-C04-03 (PlanI+D+i)). es_ES
dc.format.extent 4 es_ES
dc.language English es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation.ispartof Natural Language Processing and Information Systems es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;vol. 6716
dc.rights All rights reserved es_ES
dc.subject Source code reuse es_ES
dc.subject Cross-language source code reuse analysis es_ES
dc.subject Plagiarism detection es_ES
dc.subject.classification COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE es_ES
dc.subject.classification COMPUTER LANGUAGES AND SYSTEMS es_ES
dc.title Towards the detection of cross-language source code reuse es_ES
dc.type Book chapter es_ES
dc.embargo.lift 10000-01-01
dc.embargo.terms forever es_ES
dc.identifier.doi 10.1007/978-3-642-22327-3_31
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2009-13391-C04-03/ES/Text-Enterprise 2.0: Tecnicas De Comprension De Textos Aplicadas A Las Necesidades De La Empresa 2.0/ es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Flores Sáez, E.; Barrón Cedeño, LA.; Rosso, P.; Moreno Boronat, LA. (2011). Towards the detection of cross-language source code reuse. In Natural Language Processing and Information Systems. Springer Verlag (Germany). 6716:250-253. https://doi.org/10.1007/978-3-642-22327-3_31 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 16th International Conference on Applications of Natural Language to Information Systems, NLDB 2011 es_ES
dc.relation.conferencedate June 28-30, 2011 es_ES
dc.relation.conferenceplace Alicante, Spain es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007/978-3-642-22327-3_31 es_ES
dc.description.upvformatpinicio 250 es_ES
dc.description.upvformatpfin 253 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 6716 es_ES
dc.relation.senia 214034
dc.contributor.funder Ministerio de Ciencia e Innovación es_ES
dc.description.references Arwin, C., Tahaghoghi, S.M.M.: Plagiarism Detection across Programming Languages. In: Proceedings of the 29th Australasian Computer Science Conference, vol. 48, pp. 277–286 (2006) es_ES
dc.description.references Faidhi, J., Robinson, S.: An empirical approach for detecting program similarity and plagiarism within a university programming environment. Comput. Educ. 11, 11–19 (1987) es_ES
dc.description.references Jankowitz, H.T.: Detecting plagiarism in student Pascal programs. The Computer Journal 31(1) (1988) es_ES
dc.description.references Pinto, D., Civera, J., Barrón-Cedeño, A., Juan, A., Rosso, P.: A statistical approach to crosslingual natural language tasks. Journal of Algorithms 64(1), 51–60 (2009) es_ES
dc.description.references Potthast, M., Barrón-Cedeño, A., Stein, B., Rosso, P.: Cross-Language Plagiarism Detection. Language Resources and Evaluation. Special Issue on Plagiarism and Authorship Analysis 45(1) (2011) es_ES
dc.description.references Rosales, F., García, A., Rodríguez, S., Pedraza, J.L., Méndez, R., Nieto, M.M.: Detection of plagiarism in programming assignments. IEEE Transactions on Education 51(2), 174–183 (2008) es_ES
dc.description.references Stamatatos, E.: Intrinsic Plagiarism Detection Using Character n-gram Profiles. In: Proc. SEPLN 2009, Donostia, Spain, pp. 38–46 (2009) es_ES
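
The abstract describes comparing source code through character n-grams, with the best result obtained when comments are discarded and the rest of the code is kept. The following is a minimal sketch of that idea, not the authors' implementation. The record does not state the n-gram size, the weighting, or the similarity measure, so the trigram size, raw frequency counts, lowercasing, whitespace removal, and cosine similarity used here are all assumptions, and the function names are hypothetical.

```python
import math
import re
from collections import Counter


def strip_comments(code: str, lang: str) -> str:
    """Naively remove comments. Assumption: string literals that contain
    comment markers are not handled, so this is only an approximation."""
    if lang == "python":
        return re.sub(r"#[^\n]*", " ", code)
    # C++ and Java share /* ... */ and // comment syntax.
    code = re.sub(r"/\*.*?\*/", " ", code, flags=re.DOTALL)
    return re.sub(r"//[^\n]*", " ", code)


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count character n-grams after lowercasing and dropping whitespace
    (both normalisation steps are assumptions of this sketch)."""
    text = re.sub(r"\s+", "", text.lower())
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two n-gram frequency profiles."""
    dot = sum(a[g] * b[g] for g in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity(code_a: str, lang_a: str, code_b: str, lang_b: str, n: int = 3) -> float:
    """Compare two programs, possibly in different languages, comments discarded."""
    profile_a = char_ngrams(strip_comments(code_a, lang_a), n)
    profile_b = char_ngrams(strip_comments(code_b, lang_b), n)
    return cosine(profile_a, profile_b)


if __name__ == "__main__":
    java_code = "int total = 0; // sum\nfor (int i = 0; i < n; i++) { total += i; }"
    python_code = "total = 0  # sum\nfor i in range(n):\n    total += i"
    print(similarity(java_code, "java", python_code, "python"))
```

A score near 1 suggests heavily overlapping character sequences (shared identifiers, literals, operators), while a score near 0 suggests little overlap. Any threshold for flagging reuse would have to be chosen empirically and is not given in this record.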

