
A Particle Swarm Optimizer to Cluster Parallel Spanish-English Short-text Corpora

RiuNet: Institutional repository of the Polytechnic University of Valencia


dc.contributor.author Ingaramo, Diego Alejandro es_ES
dc.contributor.author Errecalde, Marcelo Luis es_ES
dc.contributor.author Cagnina, Leticia es_ES
dc.contributor.author Rosso, Paolo es_ES
dc.date.accessioned 2013-11-12T13:17:47Z
dc.date.issued 2011
dc.identifier.issn 1613-0073
dc.identifier.uri http://hdl.handle.net/10251/33475
dc.description.abstract Short-text clustering is currently an important research area because of its applicability to web information retrieval, text summarization and text mining. These texts are often available in different languages and in parallel multilingual corpora. Previous works have demonstrated the effectiveness of a discrete Particle Swarm Optimizer algorithm, named CLUDIPSO, for clustering monolingual corpora containing very short documents. In all the considered cases, CLUDIPSO outperformed different algorithms representative of the state of the art in the area. This paper presents a preliminary study of the performance of CLUDIPSO on parallel Spanish-English corpora. The idea is to analyze how this bilingual information can be incorporated into the CLUDIPSO algorithm and to what extent it can improve the clustering results. To adapt CLUDIPSO to a bilingual environment, some alternatives are proposed and evaluated. The results of CLUDIPSO in the bilingual and monolingual environments were compared. The experimental work shows that bilingual information yields results that are only comparable to those obtained with monolingual corpora. More work is required to make effective use of this kind of information.
dc.language English es_ES
dc.publisher CEUR Workshop Proceedings es_ES
dc.relation.ispartof CEUR Workshop Proceedings es_ES
dc.rights All rights reserved es_ES
dc.subject Clustering of Short Texts es_ES
dc.subject Parallel Spanish-English Corpora es_ES
dc.subject Particle Swarm Optimizer es_ES
dc.subject Agrupamiento de Textos Cortos es_ES
dc.subject Optimizador basado en Cúmulo de Partículas es_ES
dc.subject Colecciones paralelas en Español-Inglés es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title A Particle Swarm Optimizer to Cluster Parallel Spanish-English Short-text Corpora es_ES
dc.type Article es_ES
dc.embargo.lift 10000-01-01
dc.embargo.terms forever es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Ingaramo, DA.; Errecalde, ML.; Cagnina, L.; Rosso, P. (2011). A Particle Swarm Optimizer to Cluster Parallel Spanish-English Short-text Corpora. CEUR Workshop Proceedings. 824:43-48. http://hdl.handle.net/10251/33475 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename Iberian Cross-Language Natural Language Processing Tasks es_ES
dc.relation.conferencedate 07/09/2011 es_ES
dc.relation.conferenceplace Huelva, Spain es_ES
dc.relation.publisherversion http://ceur-ws.org/Vol-824/ es_ES
dc.description.upvformatpinicio 43 es_ES
dc.description.upvformatpfin 48 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 824 es_ES
dc.relation.senia 217557

