Minimum Bayes’ risk subsequence combination for machine translation

Gonzalez Rubio, Jesus; Casacuberta Nolla, Francisco

doi:10.1007/s10044-014-0387-5

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Minimum Bayes’ risk subsequence combination for machine translation

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: 2015-PAA-Gonzalez ...

Tamaño: 496.6Kb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: 2015-PAAA-Gonzale ...

Tamaño: 1.007Mb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Gonzalez Rubio, Jesus	es_ES
dc.contributor.author	Casacuberta Nolla, Francisco	es_ES
dc.date.accessioned	2016-05-11T13:16:15Z
dc.date.available	2016-05-11T13:16:15Z
dc.date.issued	2015-08
dc.identifier.issn	1433-7541
dc.identifier.uri	http://hdl.handle.net/10251/63924
dc.description	The final publication is available at Springer via http://dx.doi.org/10.1007/s10044-014-0387-5	es_ES
dc.description.abstract	System combination has proved to be a successful technique in the pattern recognition field. However, several difficulties arise when combining the outputs of tasks, e.g. machine translation, that generate structured patterns. So far, machine translation system combination approaches either implement sophisticated classifiers to select one of the provided translations, or generate new sentences by combining the "best" subsequences of the provided translations. We present minimum Bayes' risk system combination (MBRSC), a system combination method for machine translation that gathers together the advantages of sentence-selection and subsequence-combination methods. MBRSC is able to detect and utilize the "best" subsequences of the provided translations to generate the optimal consensus translation with respect to a particular performance met- ric. Experiments show that MBRSC yields significant improvements in translation quality.	es_ES
dc.description.sponsorship	Work supported by the EC (FEDER/FSE) and the Spanish MEC/MICINN under the MIPRCV "Consolider Ingenio 2010'' program (CSD2007-00018), the iTrans2 (TIN2009-14511) project, the UPV under Grant 20091027, the Spanish MITyC under the erudito.com (TSI-020110-2009-439) project and by the General-itat Valenciana under grant Prometeo/2009/014.	en_EN
dc.language	Inglés	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Pattern Analysis and Applications	es_ES
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Minimum Bayes’ risk	es_ES
dc.subject	System combination	es_ES
dc.subject	Statistical machine translation	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Minimum Bayes’ risk subsequence combination for machine translation	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.1007/s10044-014-0387-5
dc.relation.projectID	info:eu-repo/grantAgreement/MEC//CSD2007-00018/ES/Multimodal Intraction in Pattern Recognition and Computer Visionm/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/UPV//20091027/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MITURCO//TSI-020110-2009-0439/ES/ERUDITO.COM/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/GVA//PROMETEO09%2F2009%2F014/ES/Adaptive learning and multimodality in pattern recognition (Almapater)/	es_ES
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Gonzalez Rubio, J.; Casacuberta Nolla, F. (2015). Minimum Bayes’ risk subsequence combination for machine translation. Pattern Analysis and Applications. 18(3):523-533. https://doi.org/10.1007/s10044-014-0387-5	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	http://dx.doi.org/10.1007/s10044-014-0387-5	es_ES
dc.description.upvformatpinicio	523	es_ES
dc.description.upvformatpfin	533	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	18	es_ES
dc.description.issue	3	es_ES
dc.relation.senia	283749	es_ES
dc.identifier.eissn	1433-755X
dc.contributor.funder	Ministerio de Educación y Ciencia	es_ES
dc.contributor.funder	Ministerio de Ciencia e Innovación	es_ES
dc.contributor.funder	Generalitat Valenciana	es_ES
dc.contributor.funder	Universitat Politècnica de València	es_ES
dc.contributor.funder	Ministerio de Industria, Turismo y Comercio	es_ES
dc.description.references	Bangalore S (2001) Computing consensus translation from multiple machine translation systems. In: IEEE automatic speech recognition and understanding workshop, pp 351–354	es_ES
dc.description.references	Becker MA (2008) Active learning - an explicit treatment of unreliable parameters. Ph.D. thesis, University of Edinburgh	es_ES
dc.description.references	Bellman R (1957) Dynamic programming. Princeton University Press, Princeton	es_ES
dc.description.references	Bickel PJ, Doksum KA (1977) Mathematical statistics : basic ideas and selected topics. Holden-Day, San Francisco	es_ES
dc.description.references	Callison-burch C, Flournoy RS (2001) A program for automatically selecting the best output from multiple machine translation engines. In: Proceedings of the VIII machine translation summit, pp 63–66	es_ES
dc.description.references	Callison-Burch C, Fordyce C, Koehn P, Monz C, Schroeder J (2008) Further meta-evaluation of machine translation. In: Proceedings of the 3rd Workshop on statistical machine translation, Association for Computational Linguistics, pp 70–106	es_ES
dc.description.references	Callison-Burch C, Koehn P, Monz C, Schroeder J (2009) Findings of the 2009 workshop on statistical machine translation. In: Proceedings of the 4th workshop on statistical machine translation, Association for Computational Linguistics, Athens, pp 1–28	es_ES
dc.description.references	Callison-Burch C, Koehn P, Monz C, Zaidan OF (eds) (2011) Proceedings of the 6th workshop on statistical machine translation. Association for Computational Linguistics, Edinburgh	es_ES
dc.description.references	Chinchor N (1992) The statistical significance of the muc-4 results. In: Proceedings of the conference on message understanding, pp 30–50	es_ES
dc.description.references	DeNero J, Chiang D, Knight K (2009) Fast consensus decoding over translation forests. In: Proceedings of the 47th annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp 567–575	es_ES
dc.description.references	DeNero J, Kumar S, Chelba C, Och F (2010) Model combination for machine translation. In: Proceedings of the 11th conference of the North American chapter of the Association for Computational Linguistics, Association for Computational Linguistics, pp 975–983	es_ES
dc.description.references	Dietterich TG (2000) Ensemble methods in machine learning. In: Proceedings of the 1st International workshop on multiple classifier systems, MCS ’00, Springer, pp 1–15	es_ES
dc.description.references	Duan N, Li M, Zhang D, Zhou M (2010) Mixture model-based minimum bayes risk decoding using multiple machine translation systems. In: Proceedings of the 23rd conference on Computational Linguistics, pp 313–321	es_ES
dc.description.references	Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York	es_ES
dc.description.references	Ehling N, Zens R, Ney H (2007) Minimum bayes risk decoding for bleu. In: Proceedings of the 45th annual aeeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp 101–104	es_ES
dc.description.references	Fiscus JG (1997) A post-processing system to yield reduced Word error rates: recogniser output voting error reduction (ROVER). In: Proceedings IEEE Workshop on automatic speech recognition and understanding, pp 347–352	es_ES
dc.description.references	González-Rubio J, Juan A, Casacuberta F (2011) Minimum bayes-risk system combination. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics, pp 1268–1277	es_ES
dc.description.references	González-Rubio J, Casacuberta F (2011) The UPV-PRHLT combinatio nsystem for WMT 2011. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics, pp 1268–1277	es_ES
dc.description.references	He X, Toutanova K (2009) Joint optimization for machine translation system combination. In: Proceedings of the 2009 conference on empirical methods in natural language processing, Association for Computational Linguistics, pp 1202–1211	es_ES
dc.description.references	He X, Yang M, Gao J, Nguyen P, Moore R (2008) Indirect-hmm-based hypothesis alignment for combining outputs from machine translation systems. In: Proceedings of the 2008 conference on empirical methods in natural language processing, Association for Computational Linguistics, pp 98–107	es_ES
dc.description.references	Heafield K, Lavie A (2011) Cmu system combination in wmt 2011. In: Proceedings of the 6th workshop on statistical machine translation, Association for Computational Linguistics, Edinburgh, pp 145–151	es_ES
dc.description.references	Jayaraman S, Lavie A (2005) Multi-engine machine translation guided by explicit word matching. In: Proceeding of the 10th conference of the European Association for Machine Translation, pp 143–152	es_ES
dc.description.references	Jelinek F (1997) Statistical methods for speech recognition. MIT Press, Cambridge	es_ES
dc.description.references	Kittler J, Hatef M, Duin RPW, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20:226–239. doi: 10.1109/34.667881.	es_ES
dc.description.references	Knight K (1999) Decoding complexity in word-replacement translation models. Comput Linguist 25(4):607–615. http://dl.acm.org/citation.cfm?id=973226.973232	es_ES
dc.description.references	Kumar S, Macherey W, Dyer C, Och F (2009) Efficient minimum error rate training and minimum bayes-risk decoding for translation hypergraphs and lattices. In: Proceedings of the 47th annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp 163–171	es_ES
dc.description.references	Land AH, Doig AG (1960) An automatic method of solving discrete programming problems. Econometrica 28(3):497–520	es_ES
dc.description.references	Larkey LS, Croft BW (1996) Combining classifiers in text categorization. In: Frei HP, Harman D, Schäuble P, Wilkinson R (eds) Proceedings of the 19th ACM International Conference on Research and Development in Information Retrieval. ACM Press, New York, pp 289–297	es_ES
dc.description.references	Leusch G, Freitag M, Ney H (2011) The rwth system combination system for wmt 2011. In: Proceedings of the 6th workshop on Statistical Machine Translation, Association for Computational Linguistics, Edinburgh, pp 152–158	es_ES
dc.description.references	Matusov E, Leusch G, Banchs RE, Bertoldi N, Dechelotte D, Federico M, Kolss M, suk Lee Y, no JBM, Paulik M, Roukos S, Schwenk H, Ney H (2008) System combination for machine translation of spoken and written language. IEEE Trans Audio Speech Lang Process 16:1222–1237	es_ES
dc.description.references	Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J 7(4):308–313	es_ES
dc.description.references	NIST (2006) NIST 2006 machine translation evaluation official results. http://www.itl.nist.gov/iad/mig/tests/mt/	es_ES
dc.description.references	Nomoto T (2004) Multi-engine machine translation with voted language model. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp 494–501	es_ES
dc.description.references	Noreen E (1989) Computer-intensive methods for testing hypotheses: an introduction. A wiley interscience publication. Wiley, New York	es_ES
dc.description.references	Och FJ (2003) Minimum error rate training in statistical machine translation. In: Proceedings of the 41st annual meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp 160–167	es_ES
dc.description.references	Papineni K, Roukos S, Ward T, Zhu WJ (2002) BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp 311–318	es_ES
dc.description.references	Paul M, Doi T, Hwang Y, Imamura K, Okuma H, Sumita E (2005) Nobody is perfect: atr’s hybrid approach to spoken language translation. In: Proceedings of the 2005 International Workshop on spoken language translation, pp 55–62	es_ES
dc.description.references	Rosti A, Ayan NF, Xiang B, Matsoukas S, Schwartz R, Dorr B (2007) Combining outputs from multiple machine translation systems. In: Proceedings of the 6th conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, pp 228–235	es_ES
dc.description.references	Rosti A, Zhang B, Matsoukas S, Schwartz R (2011) Expected bleu training for graphs: Bbn system description for wmt11 system combination task. In: Proceedings of the 6th workshop on statistical machine translation, Association for Computational Linguistics, pp 159–165	es_ES
dc.description.references	Roth D, Zelenko D (1998) Part of speech tagging using a network of linear separators. In: Proceedings of the 17th international conference on Computational linguistics - Volume 2, COLING ’98, Association for Computational Linguistics, pp 1136–1142	es_ES
dc.description.references	Snover M, Dorr B, Schwartz R, Micciulla L, Weischedel R (2006) A study of translation error rate with targeted human annotation. In: Proceedings of the 7th conference of the Association for Machine Transaltion in the Americas, pp 223–231	es_ES
dc.description.references	Stanley R (2002) Enumerative combinatorics. Cambridge studies in advanced mathematics. Cambridge University Press, Cambridge	es_ES
dc.description.references	Udupa R, Maji HK (2006) Computational complexity of statistical machine translation. In: McCarthy D, Wintner S (eds) Proceedings of the European Chapter of the Association for Computational Linguistics. The Association for Computer Linguistics. http://acl.ldc.upenn.edu/E/E06/E06-1004	es_ES
dc.description.references	Xu D, Cao Y, Karakos D (2011) Description of the jhu system combination scheme for wmt 2011. In: Proceedings of the 6th workshop on Statistical Machine Translation, Association for Computational Linguistics, pp 171–176	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [47243]

Mostrar el registro sencillo del ítem

Minimum Bayes’ risk subsequence combination for machine translation

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Minimum Bayes’ risk subsequence combination for machine translation

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)