- -

Does more data always yield better translations?

RiuNet: Institutional repository of the Polithecnic University of Valencia

Share/Send to

Cited by


Does more data always yield better translations?

Show full item record

Gascó Mora, G.; Rocha Sánchez, MA.; Sanchis Trilles, G.; Andrés Ferrer, J.; Casacuberta Nolla, F. (2012). Does more data always yield better translations?. Association for Computational Linguistics. 152-161. http://hdl.handle.net/10251/35214

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10251/35214

Files in this item

Item Metadata

Title: Does more data always yield better translations?
UPV Unit: Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
Issued date:
Nowadays, there are large amounts of data available to train statistical machine translation systems. However, it is not clear whether all the training data actually help or not. A system trained on a subset of such huge ...[+]
Subjects: Bilingual corpora , Training data selection techniques , Probability of an indomain corpus , Infrequent n-gram occurrence
Copyrigths: Reserva de todos los derechos
ISBN: 978-1-937284-19-0
Association for Computational Linguistics
Publisher version: http://www.aclweb.org/anthology/E12-1016
Conference name: 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012)
Conference place: Avignon, Francia
Conference date: 2012-04-23
Project ID: info:eu-repo/grantAgreement/EC/FP7/287755
Type: Comunicación en congreso

This item appears in the following Collection(s)

Show full item record