
Several approaches for tweet topic classification in COSET - IberEval 2017

RiuNet: Institutional Repository of the Universidad Politécnica de Valencia



dc.contributor.author Villar Lafuente, Carlos es_ES
dc.contributor.author Garcés Díaz-Munío, Gonçal es_ES
dc.date.accessioned 2021-05-14T11:27:12Z
dc.date.available 2021-05-14T11:27:12Z
dc.date.issued 2017-09-19 es_ES
dc.identifier.issn 1613-0073 es_ES
dc.identifier.uri http://hdl.handle.net/10251/166361
dc.description.abstract [EN] These working notes summarize the different approaches we have explored in order to classify a corpus of tweets related to the 2015 Spanish General Election (COSET 2017 task from IberEval 2017). Two approaches were tested during the COSET 2017 evaluations: Neural Networks with Sentence Embeddings (based on TensorFlow) and N-gram Language Models (based on SRILM). Our results with these approaches were modest: both ranked above the "Most frequent baseline", but below the "Bag-of-words + SVM" baseline. A third approach was tried after the COSET 2017 evaluation phase was over: Advanced Linear Models (based on fastText). Results measured over the COSET 2017 Dev and Test show that this approach is well above the "TF-IDF+RF" baseline. es_ES
dc.language English es_ES
dc.publisher CEUR Workshop Proceedings es_ES
dc.relation.ispartof Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017) es_ES
dc.rights Attribution (by) es_ES
dc.subject COSET2017 es_ES
dc.subject Language models es_ES
dc.subject Linear models es_ES
dc.subject Neural networks es_ES
dc.subject Sentence embeddings es_ES
dc.subject Text classification es_ES
dc.subject.classification COMPUTER LANGUAGES AND SYSTEMS es_ES
dc.title Several approaches for tweet topic classification in COSET - IberEval 2017 es_ES
dc.type Conference paper es_ES
dc.type Article es_ES
dc.rights.accessRights Open es_ES
dc.description.bibliographicCitation Villar Lafuente, C.; Garcés Díaz-Munío, G. (2017). Several approaches for tweet topic classification in COSET - IberEval 2017. CEUR Workshop Proceedings. 36-42. http://hdl.handle.net/10251/166361 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 2nd Workshop on Human Language Technologies for Iberian languages (IberEval 2017) es_ES
dc.relation.conferencedate September 19, 2017 es_ES
dc.relation.conferenceplace Murcia, Spain es_ES
dc.relation.publisherversion http://ceur-ws.org/Vol-1881/ es_ES
dc.description.upvformatpinicio 36 es_ES
dc.description.upvformatpfin 42 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.pasarela S\342074 es_ES

