Mostrar el registro sencillo del ítem
dc.contributor.author | Ingaramo, Diego Alejandro | es_ES |
dc.contributor.author | Errecalde, Marcelo Luis | es_ES |
dc.contributor.author | Cagnina, Leticia | es_ES |
dc.contributor.author | Rosso, Paolo | es_ES |
dc.date.accessioned | 2013-11-12T13:17:47Z | |
dc.date.issued | 2011 | |
dc.identifier.issn | 1613-0073 | |
dc.identifier.uri | http://hdl.handle.net/10251/33475 | |
dc.description.abstract | Short-texts clustering is currently an important research area because of its applicability to web information retrieval, text summarization and text mining. These texts are often available in different languages and parallel multilingual corpora. Some previous works have demonstrated the effectiveness of a discrete Particle Swarm Optimizer algorithm, named CLUDIPSO, for clustering monolingual corpora containing very short documents. In all the considered cases, CLUDIPSO outperformed different algorithms representative of the state-of-the-art in the area. This paper presents a preliminary study showing the performance of CLUDIPSO on parallel Spanish-English corpora. The idea is to analyze how this bilingual information can be incorporated in the CLUDIPSO algorithm and to what extent this information can improve the clustering results. In order to adapt CLUDIPSO to a bilingual environment, some alternatives are proposed and evaluated. The results were compared considering CLUDIPSO in both environments, bilingual and monolingual. The experimental work shows that bilingual information allows to obtain just comparable results to those obtained with monolingual corpora. More work is required to make an effective use of this kind of information. | |
dc.language | Inglés | es_ES |
dc.publisher | CEUR Workshop Proceedings | es_ES |
dc.relation.ispartof | CEUR Workshop Proceedings | es_ES |
dc.rights | Reserva de todos los derechos | es_ES |
dc.subject | Clustering of Short Texts | es_ES |
dc.subject | Parallel Spanish-English Corpora | es_ES |
dc.subject | Particle Swarm Optimizer | es_ES |
dc.subject | Agrupamiento de Textos Cortos | es_ES |
dc.subject | Optimizador basado en C'umulo de Partículas | es_ES |
dc.subject | Colecciones paralelas en Español-Inglés | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | A Particle Swarm Optimizer to Cluster Parallel Spanish-English Short-text Corpora | es_ES |
dc.type | Artículo | es_ES |
dc.embargo.lift | 10000-01-01 | |
dc.embargo.terms | forever | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.description.bibliographicCitation | Ingaramo, DA.; Errecalde, ML.; Cagnina, L.; Rosso, P. (2011). A Particle Swarm Optimizer to Cluster Parallel Spanish-English Short-text Corpora. CEUR Workshop Proceedings. 824:43-48. http://hdl.handle.net/10251/33475 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.conferencename | Iberian Cross-Language Natural Language Processing Tasks | es_ES |
dc.relation.conferencedate | 07/09/2011 | es_ES |
dc.relation.conferenceplace | Huelva, España | es_ES |
dc.relation.publisherversion | http://ceur-ws.org/Vol-824/ | es_ES |
dc.description.upvformatpinicio | 43 | es_ES |
dc.description.upvformatpfin | 48 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 824 | es_ES |
dc.relation.senia | 217557 |