- -

DEXTER: A workbench for automatic term extraction with specialized corpora

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

DEXTER: A workbench for automatic term extraction with specialized corpora

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Periñán-Pascual, Carlos es_ES
dc.date.accessioned 2018-11-08T05:32:56Z
dc.date.available 2018-11-08T05:32:56Z
dc.date.issued 2018 es_ES
dc.identifier.issn 1351-3249 es_ES
dc.identifier.uri http://hdl.handle.net/10251/112083
dc.description.abstract [EN] Automatic term extraction has become a priority area of research within corpus processing. Despite the extensive literature in this field, there are still some outstanding issues that should be dealt with during the construction of term extractors, particularly those oriented to support research in terminology and terminography. In this regard, this article describes the design and development of DEXTER, an online workbench for the extraction of simple and complex terms from domain-specific corpora in English, French, Italian and Spanish. In this framework, three issues contribute to placing the most important terms in the foreground. First, unlike the elaborate morphosyntactic patterns proposed by most previous research, shallow lexical filters have been constructed to discard term candidates. Second, a large number of common stopwords are automatically detected by means of a method that relies on the IATE database together with the frequency distribution of the domain-specific corpus and a general corpus. Third, the term-ranking metric, which is grounded on the notions of salience, relevance and cohesion, is guided by the IATE database to display an adequate distribution of terms. es_ES
dc.description.sponsorship Financial support for this research has been provided by the DGI, Spanish Ministry of Education and Science, grant FFI2014-53788-C3-1-P. en_EN
dc.language Inglés es_ES
dc.publisher Cambridge University Press es_ES
dc.relation.ispartof Natural Language Engineering es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Terminology es_ES
dc.subject Terminography es_ES
dc.subject Automatic term extraction es_ES
dc.subject DEXTER es_ES
dc.subject.classification FILOLOGIA INGLESA es_ES
dc.title DEXTER: A workbench for automatic term extraction with specialized corpora es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1017/S1351324917000365 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//FFI2014-53788-C3-1-P/ES/DESARROLLO DE UN LABORATORIO VIRTUAL PARA EL PROCESAMIENTO COMPUTACIONAL DEL LENGUAJE NATURAL DESDE UN PARADIGMA FUNCIONAL/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Lingüística Aplicada - Departament de Lingüística Aplicada es_ES
dc.description.bibliographicCitation Periñán-Pascual, C. (2018). DEXTER: A workbench for automatic term extraction with specialized corpora. Natural Language Engineering. 24(2):163-198. https://doi.org/10.1017/S1351324917000365 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1017/S1351324917000365 es_ES
dc.description.upvformatpinicio 163 es_ES
dc.description.upvformatpfin 198 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 24 es_ES
dc.description.issue 2 es_ES
dc.relation.pasarela S\352341 es_ES
dc.contributor.funder Ministerio de Economía y Competitividad es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem