Cross-language high similarity search using a conceptual thesaurus

RiuNet: Institutional repository of the Polytechnic University of Valencia

dc.contributor.author Gupta, Parth es_ES
dc.contributor.author Barrón Cedeño, Luis Alberto es_ES
dc.contributor.author Rosso, Paolo es_ES
dc.date.accessioned 2014-03-07T12:01:40Z
dc.date.issued 2012
dc.identifier.isbn 978-3-642-33247-0
dc.identifier.issn 0302-9743
dc.identifier.uri http://hdl.handle.net/10251/36280
dc.description.abstract This work addresses cross-language high-similarity and near-duplicate search: given a document, the task is to identify a highly similar one in a large cross-language collection of documents. We propose a concept-based similarity model for the problem that is very light in both computation and memory. We evaluate the model on three corpora of different nature and two language pairs, English-German and English-Spanish, using the Eurovoc conceptual thesaurus. We compare our model with two state-of-the-art models and find that, although the proposed model is very generic, it produces competitive results and is notably stable and consistent across the corpora. es_ES
dc.description.sponsorship This work was done in the framework of the VLC/CAMPUS Microcluster on Multimodal Interaction in Intelligent Systems and has been partially funded by the European Commission as part of the WIQ-EI IRSES project (grant no. 269180) within the FP7 Marie Curie People Framework, and by the Text-Enterprise 2.0 research project (TIN2009-13391-C04-03). The research work of the second author is supported by the CONACyT 192021/302009 grant.
dc.format.extent 9 es_ES
dc.language English es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation MICINN/TIN2009-13391-C04-03 es_ES
dc.relation CONACYT/192021/302009 es_ES
dc.relation.ispartof Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;vol. 7488
dc.rights All rights reserved es_ES
dc.subject Language translation and linguistics es_ES
dc.subject Artificial Intelligence es_ES
dc.subject.classification Computer Languages and Systems es_ES
dc.title Cross-language high similarity search using a conceptual thesaurus es_ES
dc.type Book chapter es_ES
dc.embargo.lift 10000-01-01
dc.embargo.terms forever es_ES
dc.identifier.doi 10.1007/978-3-642-33247-0_8
dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/269180 es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Gupta, P.; Barrón Cedeño, LA.; Rosso, P. (2012). Cross-language high similarity search using a conceptual thesaurus. In Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics. Springer Verlag (Germany). 7488:67-75. https://doi.org/10.1007/978-3-642-33247-0_8 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename Third International Conference of the CLEF Initiative, CLEF 2012. es_ES
dc.relation.conferencedate September 17-20, 2012 es_ES
dc.relation.conferenceplace Rome, Italy es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007/978-3-642-33247-0_8 es_ES
dc.description.upvformatpinicio 67 es_ES
dc.description.upvformatpfin 75 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 7488 es_ES
dc.relation.senia 232474
dc.contributor.funder European Commission
dc.contributor.funder Ministerio de Ciencia e Innovación
dc.contributor.funder Consejo Nacional de Ciencia y Tecnología, México

