ABSTRACT
Human evaluations of machine translation are extensive but expensive. They can take months to finish and involve human labor that cannot be reused. We propose a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run. We present this method as an automated understudy to skilled human judges, which substitutes for them when there is a need for quick or frequent evaluations.
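The abstract does not spell out the scoring procedure itself. For orientation, the sketch below illustrates the standard BLEU formulation (modified n-gram precision up to 4-grams combined with a brevity penalty). The function names, the absence of smoothing, and the sentence-level usage are simplifications chosen here for brevity, not details stated in the abstract.

```python
# Minimal sketch of a BLEU-style score, assuming the standard formulation:
# clipped (modified) n-gram precision for n = 1..4, geometric mean, brevity penalty.
from collections import Counter
import math

def ngrams(tokens, n):
    """Return a Counter of all n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(candidate, references, max_n=4):
    """Score one tokenized candidate sentence against tokenized reference sentences."""
    weights = [1.0 / max_n] * max_n
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        # Clip each candidate n-gram count by its maximum count in any single reference.
        max_ref_counts = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref_counts[gram] = max(max_ref_counts[gram], count)
        clipped = sum(min(count, max_ref_counts[gram])
                      for gram, count in cand_counts.items())
        total = max(sum(cand_counts.values()), 1)
        if clipped == 0:
            return 0.0  # no smoothing in this sketch
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: compare against the reference length closest to the candidate length.
    c = len(candidate)
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    bp = 1.0 if c > r else math.exp(1.0 - r / c)
    return bp * math.exp(sum(w * lp for w, lp in zip(weights, log_precisions)))

# Example: a candidate that exactly matches one of two references scores 1.0.
refs = [["the", "cat", "is", "on", "the", "mat"],
        ["there", "is", "a", "cat", "on", "the", "mat"]]
print(round(bleu(["the", "cat", "is", "on", "the", "mat"], refs), 3))  # 1.0
```

In practice, corpus-level BLEU aggregates clipped counts and lengths over all sentences before taking the geometric mean, which is what makes the score stable for system comparison.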