Wikipedia vandalism detection

Mola Velasco, Santiago Moisés


Files in this item

Name: Mola-Velasco-2011.pdf

Size: 7.282 MB

Format: PDF

dc.contributor.advisor	Rosso, Paolo	es_ES
dc.contributor.author	Mola Velasco, Santiago Moisés	es_ES
dc.date.accessioned	2012-05-25T09:51:40Z
dc.date.available	2012-05-25T09:51:40Z
dc.date.created	2011-09
dc.date.issued	2012-05-25
dc.identifier.uri	http://hdl.handle.net/10251/15871
dc.description.abstract	Wikipedia is an online encyclopedia that anyone can edit. The fact that there are almost no restrictions on contributing content is at the core of its success. However, it also attracts pranksters, lobbyists, spammers and other people who degrade Wikipedia's contents. One of the most frequent kinds of damage is vandalism, which is defined as any bad-faith attempt to damage Wikipedia's integrity. For some years, the Wikipedia community has been fighting vandalism using automatic detection systems. In this work, we develop one such system, which won the 1st International Competition on Wikipedia Vandalism Detection. This system consists of a feature set exploiting the textual content of Wikipedia articles. We performed a study of different supervised classification algorithms for this task, concluding that ensemble methods such as Random Forest and LogitBoost are clearly superior. After that, we combine this system with two other leading approaches based on different kinds of features: metadata analysis and reputation. This joint system obtains one of the best results reported in the literature. We also conclude that our approach is mostly language independent, so we can adapt it to languages other than English with minor changes.	es_ES
dc.format.extent	75	es_ES
dc.language	English	es_ES
dc.publisher	Universitat Politècnica de València	es_ES
dc.rights	All rights reserved	es_ES
dc.subject	Wikipedia	es_ES
dc.subject	Text classification	es_ES
dc.subject	Machine learning	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.subject.other	Máster Universitario en Inteligencia Artificial, Reconocimiento de Formas e Imagen Digital-Màster Universitari en Intel·Ligència Artificial: Reconeixement de Formes i Imatge Digital	es_ES
dc.title	Wikipedia vandalism detection	es_ES
dc.type	Master's thesis	es_ES
dc.rights.accessRights	Open	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Servicio de Alumnado - Servei d'Alumnat	es_ES
dc.description.bibliographicCitation	Mola Velasco, SM. (2011). Wikipedia vandalism detection. http://hdl.handle.net/10251/15871	es_ES
dc.description.accrualMethod	Delegated archiving	es_ES

This item appears in the following collection(s)

Servicio de alumnado - Trabajos académicos [7391]


RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia
