
Toxic language detection: A systematic review of Arabic datasets

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia




dc.contributor.author Bensalem, Imene es_ES
dc.contributor.author Rosso, Paolo es_ES
dc.contributor.author Zitouni, Hanane es_ES
dc.date.accessioned 2024-06-06T18:16:19Z
dc.date.available 2024-06-06T18:16:19Z
dc.date.issued 2024-01 es_ES
dc.identifier.issn 0266-4720 es_ES
dc.identifier.uri http://hdl.handle.net/10251/204777
dc.description.abstract [EN] The detection of toxic language in the Arabic language has emerged as an active area of research in recent years, and reviewing the existing datasets employed for training the developed solutions has become a pressing need. This paper offers a comprehensive survey of Arabic datasets focused on online toxic language. We systematically gathered a total of 54 available datasets and their corresponding papers and conducted a thorough analysis, considering 18 criteria across four primary dimensions: availability details, content, annotation process, and reusability. This analysis enabled us to identify existing gaps and make recommendations for future research works. For the convenience of the research community, the list of the analysed datasets is maintained in a GitHub repository. es_ES
dc.description.sponsorship Qatar National Research Fund, Grant/Award Number: 13S-0206-200281; CRUE-Universitat Politecnica de Valencia es_ES
dc.language English es_ES
dc.publisher Blackwell Publishing es_ES
dc.relation.ispartof Expert Systems es_ES
dc.rights All rights reserved es_ES
dc.subject Annotation es_ES
dc.subject Arabic datasets es_ES
dc.subject Dataset accessibility es_ES
dc.subject Dataset reusability es_ES
dc.subject Hate speech es_ES
dc.subject Offensive language es_ES
dc.subject Toxic language es_ES
dc.subject.classification COMPUTER LANGUAGES AND SYSTEMS es_ES
dc.title Toxic language detection: A systematic review of Arabic datasets es_ES
dc.type Article es_ES
dc.identifier.doi 10.1111/exsy.13551 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/QNRF//13S-0206-200281/ es_ES
dc.rights.accessRights Embargoed es_ES
dc.date.embargoEndDate 2025-01-31 es_ES
dc.contributor.affiliation Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica es_ES
dc.description.bibliographicCitation Bensalem, I.; Rosso, P.; Zitouni, H. (2024). Toxic language detection: A systematic review of Arabic datasets. Expert Systems. https://doi.org/10.1111/exsy.13551 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1111/exsy.13551 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.pasarela S\513743 es_ES
dc.contributor.funder Qatar National Research Fund es_ES

