- -

Does more data always yield better translations?

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by


Does more data always yield better translations?

Show full item record

Gascó Mora, G.; Rocha Sánchez, MA.; Sanchis Trilles, G.; Andrés Ferrer, J.; Casacuberta Nolla, F. (2012). Does more data always yield better translations?. Association for Computational Linguistics. 152-161. http://hdl.handle.net/10251/35214

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10251/35214

Files in this item

Item Metadata

Title: Does more data always yield better translations?
Author: Gascó Mora, Guillem Rocha Sánchez, Martha Alicia Sanchis Trilles, Germán Andrés Ferrer, Jesús Casacuberta Nolla, Francisco
UPV Unit: Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
Issued date:
Nowadays, there are large amounts of data available to train statistical machine translation systems. However, it is not clear whether all the training data actually help or not. A system trained on a subset of such huge ...[+]
Subjects: Bilingual corpora , Training data selection techniques , Probability of an indomain corpus , Infrequent n-gram occurrence
Copyrigths: Reserva de todos los derechos
ISBN: 978-1-937284-19-0
Association for Computational Linguistics
Publisher version: http://www.aclweb.org/anthology/E12-1016
Conference name: 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012)
Conference place: Avignon, Francia
Conference date: 2012-04-23
Project ID:
MICINN/TIN2009- 14511
MITYC/TSI-020110- 2009-439
The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement nr. 287755. This work was also supported by the Spanish MEC/MICINN under ...[+]
Type: Comunicación en congreso

This item appears in the following Collection(s)

Show full item record