- -

Minimum Bayes’ risk subsequence combination for machine translation

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Minimum Bayes’ risk subsequence combination for machine translation

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Gonzalez Rubio, Jesus es_ES
dc.contributor.author Casacuberta Nolla, Francisco es_ES
dc.date.accessioned 2016-05-11T13:16:15Z
dc.date.available 2016-05-11T13:16:15Z
dc.date.issued 2015-08
dc.identifier.issn 1433-7541
dc.identifier.uri http://hdl.handle.net/10251/63924
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/s10044-014-0387-5 es_ES
dc.description.abstract System combination has proved to be a successful technique in the pattern recognition field. However, several difficulties arise when combining the outputs of tasks, e.g. machine translation, that generate structured patterns. So far, machine translation system combination approaches either implement sophisticated classifiers to select one of the provided translations, or generate new sentences by combining the "best" subsequences of the provided translations. We present minimum Bayes' risk system combination (MBRSC), a system combination method for machine translation that gathers together the advantages of sentence-selection and subsequence-combination methods. MBRSC is able to detect and utilize the "best" subsequences of the provided translations to generate the optimal consensus translation with respect to a particular performance met- ric. Experiments show that MBRSC yields significant improvements in translation quality. es_ES
dc.description.sponsorship Work supported by the EC (FEDER/FSE) and the Spanish MEC/MICINN under the MIPRCV "Consolider Ingenio 2010'' program (CSD2007-00018), the iTrans2 (TIN2009-14511) project, the UPV under Grant 20091027, the Spanish MITyC under the erudito.com (TSI-020110-2009-439) project and by the General-itat Valenciana under grant Prometeo/2009/014. en_EN
dc.language Inglés es_ES
dc.publisher Springer Verlag (Germany) es_ES
dc.relation.ispartof Pattern Analysis and Applications es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Minimum Bayes’ risk es_ES
dc.subject System combination es_ES
dc.subject Statistical machine translation es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title Minimum Bayes’ risk subsequence combination for machine translation es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1007/s10044-014-0387-5
dc.relation.projectID info:eu-repo/grantAgreement/MEC//CSD2007-00018/ES/Multimodal Intraction in Pattern Recognition and Computer Visionm/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MICINN//TIN2009-14511/ES/Traduccion De Textos Y Transcripcion De Voz Interactivas/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/UPV//20091027/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MITURCO//TSI-020110-2009-0439/ES/ERUDITO.COM/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/GVA//PROMETEO09%2F2009%2F014/ES/Adaptive learning and multimodality in pattern recognition (Almapater)/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Gonzalez Rubio, J.; Casacuberta Nolla, F. (2015). Minimum Bayes’ risk subsequence combination for machine translation. Pattern Analysis and Applications. 18(3):523-533. https://doi.org/10.1007/s10044-014-0387-5 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://dx.doi.org/10.1007/s10044-014-0387-5 es_ES
dc.description.upvformatpinicio 523 es_ES
dc.description.upvformatpfin 533 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 18 es_ES
dc.description.issue 3 es_ES
dc.relation.senia 283749 es_ES
dc.identifier.eissn 1433-755X
dc.contributor.funder Ministerio de Educación y Ciencia es_ES
dc.contributor.funder Ministerio de Ciencia e Innovación es_ES
dc.contributor.funder Generalitat Valenciana es_ES
dc.contributor.funder Universitat Politècnica de València es_ES
dc.contributor.funder Ministerio de Industria, Turismo y Comercio es_ES
dc.description.references Bangalore S (2001) Computing consensus translation from multiple machine translation systems. In: IEEE automatic speech recognition and understanding workshop, pp 351–354 es_ES
dc.description.references Becker MA (2008) Active learning - an explicit treatment of unreliable parameters. Ph.D. thesis, University of Edinburgh es_ES
dc.description.references Bellman R (1957) Dynamic programming. Princeton University Press, Princeton es_ES
dc.description.references Bickel PJ, Doksum KA (1977) Mathematical statistics : basic ideas and selected topics. Holden-Day, San Francisco es_ES
dc.description.references Callison-burch C, Flournoy RS (2001) A program for automatically selecting the best output from multiple machine translation engines. In: Proceedings of the VIII machine translation summit, pp 63–66 es_ES
dc.description.references Callison-Burch C, Fordyce C, Koehn P, Monz C, Schroeder J (2008) Further meta-evaluation of machine translation. In: Proceedings of the 3rd Workshop on statistical machine translation, Association for Computational Linguistics, pp 70–106 es_ES
dc.description.references Callison-Burch C, Koehn P, Monz C, Schroeder J (2009) Findings of the 2009 workshop on statistical machine translation. In: Proceedings of the 4th workshop on statistical machine translation, Association for Computational Linguistics, Athens, pp 1–28 es_ES
dc.description.references Callison-Burch C, Koehn P, Monz C, Zaidan OF (eds) (2011) Proceedings of the 6th workshop on statistical machine translation. Association for Computational Linguistics, Edinburgh es_ES
dc.description.references Chinchor N (1992) The statistical significance of the muc-4 results. In: Proceedings of the conference on message understanding, pp 30–50 es_ES
dc.description.references DeNero J, Chiang D, Knight K (2009) Fast consensus decoding over translation forests. In: Proceedings of the 47th annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp 567–575 es_ES
dc.description.references DeNero J, Kumar S, Chelba C, Och F (2010) Model combination for machine translation. In: Proceedings of the 11th conference of the North American chapter of the Association for Computational Linguistics, Association for Computational Linguistics, pp 975–983 es_ES
dc.description.references Dietterich TG (2000) Ensemble methods in machine learning. In: Proceedings of the 1st International workshop on multiple classifier systems, MCS ’00, Springer, pp 1–15 es_ES
dc.description.references Duan N, Li M, Zhang D, Zhou M (2010) Mixture model-based minimum bayes risk decoding using multiple machine translation systems. In: Proceedings of the 23rd conference on Computational Linguistics, pp 313–321 es_ES
dc.description.references Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York es_ES
dc.description.references Ehling N, Zens R, Ney H (2007) Minimum bayes risk decoding for bleu. In: Proceedings of the 45th annual aeeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp 101–104 es_ES
dc.description.references Fiscus JG (1997) A post-processing system to yield reduced Word error rates: recogniser output voting error reduction (ROVER). In: Proceedings IEEE Workshop on automatic speech recognition and understanding, pp 347–352 es_ES
dc.description.references González-Rubio J, Juan A, Casacuberta F (2011) Minimum bayes-risk system combination. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics, pp 1268–1277 es_ES
dc.description.references González-Rubio J, Casacuberta F (2011) The UPV-PRHLT combinatio nsystem for WMT 2011. In: Proceedings of the 49th annual meeting of the Association for Computational Linguistics, pp 1268–1277 es_ES
dc.description.references He X, Toutanova K (2009) Joint optimization for machine translation system combination. In: Proceedings of the 2009 conference on empirical methods in natural language processing, Association for Computational Linguistics, pp 1202–1211 es_ES
dc.description.references He X, Yang M, Gao J, Nguyen P, Moore R (2008) Indirect-hmm-based hypothesis alignment for combining outputs from machine translation systems. In: Proceedings of the 2008 conference on empirical methods in natural language processing, Association for Computational Linguistics, pp 98–107 es_ES
dc.description.references Heafield K, Lavie A (2011) Cmu system combination in wmt 2011. In: Proceedings of the 6th workshop on statistical machine translation, Association for Computational Linguistics, Edinburgh, pp 145–151 es_ES
dc.description.references Jayaraman S, Lavie A (2005) Multi-engine machine translation guided by explicit word matching. In: Proceeding of the 10th conference of the European Association for Machine Translation, pp 143–152 es_ES
dc.description.references Jelinek F (1997) Statistical methods for speech recognition. MIT Press, Cambridge es_ES
dc.description.references Kittler J, Hatef M, Duin RPW, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20:226–239. doi: 10.1109/34.667881. es_ES
dc.description.references Knight K (1999) Decoding complexity in word-replacement translation models. Comput Linguist 25(4):607–615. http://dl.acm.org/citation.cfm?id=973226.973232 es_ES
dc.description.references Kumar S, Macherey W, Dyer C, Och F (2009) Efficient minimum error rate training and minimum bayes-risk decoding for translation hypergraphs and lattices. In: Proceedings of the 47th annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp 163–171 es_ES
dc.description.references Land AH, Doig AG (1960) An automatic method of solving discrete programming problems. Econometrica 28(3):497–520 es_ES
dc.description.references Larkey LS, Croft BW (1996) Combining classifiers in text categorization. In: Frei HP, Harman D, Schäuble P, Wilkinson R (eds) Proceedings of the 19th ACM International Conference on Research and Development in Information Retrieval. ACM Press, New York, pp 289–297 es_ES
dc.description.references Leusch G, Freitag M, Ney H (2011) The rwth system combination system for wmt 2011. In: Proceedings of the 6th workshop on Statistical Machine Translation, Association for Computational Linguistics, Edinburgh, pp 152–158 es_ES
dc.description.references Matusov E, Leusch G, Banchs RE, Bertoldi N, Dechelotte D, Federico M, Kolss M, suk Lee Y, no JBM, Paulik M, Roukos S, Schwenk H, Ney H (2008) System combination for machine translation of spoken and written language. IEEE Trans Audio Speech Lang Process 16:1222–1237 es_ES
dc.description.references Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J 7(4):308–313 es_ES
dc.description.references NIST (2006) NIST 2006 machine translation evaluation official results. http://www.itl.nist.gov/iad/mig/tests/mt/ es_ES
dc.description.references Nomoto T (2004) Multi-engine machine translation with voted language model. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp 494–501 es_ES
dc.description.references Noreen E (1989) Computer-intensive methods for testing hypotheses: an introduction. A wiley interscience publication. Wiley, New York es_ES
dc.description.references Och FJ (2003) Minimum error rate training in statistical machine translation. In: Proceedings of the 41st annual meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp 160–167 es_ES
dc.description.references Papineni K, Roukos S, Ward T, Zhu WJ (2002) BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp 311–318 es_ES
dc.description.references Paul M, Doi T, Hwang Y, Imamura K, Okuma H, Sumita E (2005) Nobody is perfect: atr’s hybrid approach to spoken language translation. In: Proceedings of the 2005 International Workshop on spoken language translation, pp 55–62 es_ES
dc.description.references Rosti A, Ayan NF, Xiang B, Matsoukas S, Schwartz R, Dorr B (2007) Combining outputs from multiple machine translation systems. In: Proceedings of the 6th conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, pp 228–235 es_ES
dc.description.references Rosti A, Zhang B, Matsoukas S, Schwartz R (2011) Expected bleu training for graphs: Bbn system description for wmt11 system combination task. In: Proceedings of the 6th workshop on statistical machine translation, Association for Computational Linguistics, pp 159–165 es_ES
dc.description.references Roth D, Zelenko D (1998) Part of speech tagging using a network of linear separators. In: Proceedings of the 17th international conference on Computational linguistics - Volume 2, COLING ’98, Association for Computational Linguistics, pp 1136–1142 es_ES
dc.description.references Snover M, Dorr B, Schwartz R, Micciulla L, Weischedel R (2006) A study of translation error rate with targeted human annotation. In: Proceedings of the 7th conference of the Association for Machine Transaltion in the Americas, pp 223–231 es_ES
dc.description.references Stanley R (2002) Enumerative combinatorics. Cambridge studies in advanced mathematics. Cambridge University Press, Cambridge es_ES
dc.description.references Udupa R, Maji HK (2006) Computational complexity of statistical machine translation. In: McCarthy D, Wintner S (eds) Proceedings of the European Chapter of the Association for Computational Linguistics. The Association for Computer Linguistics. http://acl.ldc.upenn.edu/E/E06/E06-1004 es_ES
dc.description.references Xu D, Cao Y, Karakos D (2011) Description of the jhu system combination scheme for wmt 2011. In: Proceedings of the 6th workshop on Statistical Machine Translation, Association for Computational Linguistics, pp 171–176 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem