Franco Salvador, Marc (Universitat Politècnica de València, 2017-07-03)
Natural Language Processing (NLP) is a field of computer science, artificial intelligence, and computational linguistics concerned with the interactions between computers and human languages. One of its most challenging ...
Rangel-Pardo, Francisco Manuel; Franco-Salvador, Marc; Rosso, Paolo (Springer-Verlag, 2018)
[EN] Language variety identification aims at labelling texts in a native language (e.g. Spanish, Portuguese, English) with its specific variation (e.g. Argentina, Chile, Mexico, Peru, Spain; Brazil, Portugal; UK, US). In ...
[EN] Recognizing semantically similar sentences or paragraphs across languages is beneficial for many tasks, ranging from cross-lingual information retrieval and plagiarism detection to machine translation. Recently proposed ...
Franco-Salvador, Marc; Rosso, Paolo; Montes Gomez, Manuel (Elsevier, 2016-07)
Cross-language plagiarism detection aims to detect plagiarised fragments of text among documents in different languages. In this paper, we perform a systematic examination of Cross-language Knowledge Graph Analysis; an ...
Giménez Pérez, Rosa María (Universitat Politècnica de València, 2019-01-15)
In this master's thesis, string kernels will be used to address the problem of polarity classification both within the same domain and across different domains.
In this paper, we propose the use of meta-learning to combine and enrich those approaches by also adding other knowledge-based features. In addition to the aforementioned classical approaches, our system uses the BabelNet ...
Cross-language (CL) plagiarism detection aims at detecting plagiarised fragments of text among documents in different languages. The main research question of this work is whether knowledge graph representations and ...
Cross-language plagiarism refers to the type of plagiarism where the source and suspicious documents are in different languages. Plagiarism detection across languages is still in its infancy. In this article, we ...
[EN] In recent years there have been important advances in the field of automatic plagiarism detection. One variant is cross-language plagiarism detection, which tries to detect plagiarism between documents in different ...
Franco Salvador, Marc (Universitat Politècnica de València, 2014-11-24)
[EN] Plagiarism is defined as the unauthorized use of the original content of other authors. It is a phenomenon that is difficult to detect, and the problem has worsened in recent years because of the Internet: a vast source of ...
Franco Salvador, Marc; Rangel, Francisco; Rosso, Paolo; Taulé, Mariona; Martí, M. Antònia (Springer International Publishing, 2015-11-20)
In this work we focus on the use of distributed representations of words and documents using the continuous Skip-gram model. We compare this model with three recent approaches: Information Gain Word-Patterns, TF-IDF graphs ...
Sarvazyan, Areg Mikael (Universitat Politècnica de València, 2023-09-27)
[ES] The strong linguistic capabilities of current Large Language Models (LLMs) are driving their large-scale adoption in the workflows of companies and individuals. These LLMs have the potential ...
Sarvazyan, Areg Mikael; González, José Ángel; Franco-Salvador, Marc; Rangel, Francisco; Chulvi-Ferriols, María Alberta; Rosso, Paolo (Sociedad Española para el Procesamiento del Lenguaje Natural, 2023-09)
[EN] This paper presents the overview of the AuTexTification shared task as part of the IberLEF 2023 Workshop in Iberian Languages Evaluation Forum, within the framework of the SEPLN 2023 conference. AuTexTification consists ...
Sarvazyan, Areg Mikael; González, Jose Angel; Rangel, Francisco; Rosso, Paolo; Franco-Salvador, Marc (Sociedad Española para el Procesamiento del Lenguaje Natural, 2024-09)
[ES] This paper presents an overview of the IberAuTexTification task as part of the IberLEF 2023 workshop at the Iberian Languages Evaluation Forum, within the framework of the SEPLN 2024 conference. IberAuTexTification ...
[EN] Paraphrase plagiarism identification represents a very complex task given that plagiarized texts are intentionally modified through several rewording techniques. Accordingly, this paper introduces two new measures for ...
[EN] This paper presents the contributions of the UPV-Symanto team, a collaboration between Symanto Research and the PRHLT Center, in the eRisk 2021 shared tasks on gambling addiction, self-harm detection and prediction ...