- -

Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Magnossao de Paula, Angel Felipe es_ES
dc.contributor.author Fray da Silva, Roberto es_ES
dc.contributor.author Baris Schlicht, Ipek es_ES
dc.date.accessioned 2022-12-12T08:08:44Z
dc.date.available 2022-12-12T08:08:44Z
dc.date.issued 2021-09-21 es_ES
dc.identifier.issn 1613-0073 es_ES
dc.identifier.uri http://hdl.handle.net/10251/190561
dc.description.abstract [EN] The popularity of social media has created problems such as hate speech and sexism. The identification and classification of sexism in social media are very relevant tasks, as they would allow building a healthier social environment. Nevertheless, these tasks are considerably challenging. This work proposes a system to use multilingual and monolingual BERT and data points translation and ensemble strategies for sexism identification and classification in English and Spanish. It was conducted in the context of the sEXism Identification in Social neTworks shared 2021 (EXIST 2021) task, proposed by the Iberian Languages Evaluation Forum (IberLEF). The proposed system and its main components are described, and an in-depth hyperparameters analysis is conducted. The main results observed were: (i) the system obtained better results than the baseline model (multilingual BERT); (ii) ensemble models obtained better results than monolingual models; and (iii) an ensemble model considering all individual models and the best standardized values obtained the best accuracies and F1-scores for both tasks. This work obtained first place in both tasks at EXIST, with the highest accuracies (0.780 for task 1 and 0.658 for task 2) and F1-scores (F1-binary of 0.780 for task 1 and F1-macro of 0.579 for task 2). es_ES
dc.language Inglés es_ES
dc.publisher CEUR Workshop es_ES
dc.relation.ispartof Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2021) es_ES
dc.rights Reconocimiento (by) es_ES
dc.subject Sexism identification es_ES
dc.subject Sexism classification es_ES
dc.subject BERT es_ES
dc.subject Deep learning es_ES
dc.title Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models es_ES
dc.type Comunicación en congreso es_ES
dc.type Artículo es_ES
dc.rights.accessRights Abierto es_ES
dc.description.bibliographicCitation Magnossao De Paula, AF.; Fray Da Silva, R.; Baris Schlicht, I. (2021). Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models. CEUR Workshop. 356-373. http://hdl.handle.net/10251/190561 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename Iberian Languages Evaluation Forum (IberLEF 2021) es_ES
dc.relation.conferencedate Septiembre 21-21,2021 es_ES
dc.relation.conferenceplace Online es_ES
dc.relation.publisherversion https://ceur-ws.org/Vol-2943/ es_ES
dc.description.upvformatpinicio 356 es_ES
dc.description.upvformatpfin 373 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.pasarela S\450755 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem