Mostrar el registro sencillo del ítem
dc.contributor.author | Outahajala, Mohamed | es_ES |
dc.contributor.author | Benajiba, Yassine | es_ES |
dc.contributor.author | Rosso, Paolo | es_ES |
dc.contributor.author | Zenkouar, Lahbib | es_ES |
dc.date.accessioned | 2016-05-11T12:39:55Z | |
dc.date.available | 2016-05-11T12:39:55Z | |
dc.date.issued | 2015 | |
dc.identifier.issn | 1064-1246 | |
dc.identifier.uri | http://hdl.handle.net/10251/63906 | |
dc.description.abstract | Amazigh is used by tens of millions of people mainly for oral communication. However, and like all the newly investigated languages in natural language processing, it is resource-scarce. The main aim of this paper is to present our POS taggers results based on two state of the art sequence labeling techniques, namely Conditional Random Fields and Support Vector Machines, by making use of a small manually annotated corpus of only 20k tokens. Since creating labeled data is very time-consuming task while obtaining unlabeled data is less so, we have decided to gather a set of unlabeled data of Amazigh language that we have preprocessed and tokenized. The paper is also meant to address using semi-supervised techniques to improve POS tagging accuracy. An adapted self training algorithm, combining confidence measure with a function of Out Of Vocabulary words to select data for self training, has been used. Using this language independent method, we have managed to obtain encouraging results. | es_ES |
dc.description.sponsorship | The first author wants to grant CODESRIA. The work of the third author was carried out in the framework of the VLC/CAMPUS Microcluster on Multimodal Interaction in Intelligent Systems and the European Commission WIQ-EI IRSES (no. 269180) and DIANA-APPLICATIONS-Finding Hidden Knowledge in Texts: Applications(TIN2012-38603- C02-01) research projects. | en_EN |
dc.language | Inglés | es_ES |
dc.publisher | IOS Press | es_ES |
dc.relation | European Commission WIQ-EI IRSES (no. 269180) | es_ES |
dc.relation.ispartof | Journal of Intelligent and Fuzzy Systems | es_ES |
dc.rights | Reserva de todos los derechos | es_ES |
dc.subject | POS-tagging | es_ES |
dc.subject | Amazigh | es_ES |
dc.subject | Conditional random fields | es_ES |
dc.subject | Support vector machines | es_ES |
dc.subject | Out of vocabulary | es_ES |
dc.subject | Self training | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Using confidence and informativeness criteria to improve POS-tagging in amazigh | es_ES |
dc.type | Artículo | es_ES |
dc.identifier.doi | 10.3233/IFS-141417 | |
dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//TIN2012-38603-C02-01/ES/DIANA-APPLICATIONS: FINDING HIDDEN KNOWLEDGE IN TEXTS: APPLICATIONS/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació | es_ES |
dc.description.bibliographicCitation | Outahajala, M.; Benajiba, Y.; Rosso, P.; Zenkouar, L. (2015). Using confidence and informativeness criteria to improve POS-tagging in amazigh. Journal of Intelligent and Fuzzy Systems. 28(3):1319-1330. https://doi.org/10.3233/IFS-141417 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | http://content.iospress.com/articles/journal-of-intelligent-and-fuzzy-systems/ifs1417 | es_ES |
dc.description.upvformatpinicio | 1319 | es_ES |
dc.description.upvformatpfin | 1330 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 28 | es_ES |
dc.description.issue | 3 | es_ES |
dc.relation.senia | 306222 | es_ES |
dc.contributor.funder | VLC/CAMPUS | es_ES |