Article

Free Access

A polynomial-time algorithm for statistical machine translation

Author:
Dekai Wu

University of Science and Technology, Clear Water Bay, Hong Kong

University of Science and Technology, Clear Water Bay, Hong Kong
View Profile

ACL '96: Proceedings of the 34th annual meeting on Association for Computational LinguisticsJune 1996Pages 152–158https://doi.org/10.3115/981863.981884

Published:24 June 1996Publication History

ACL '96: Proceedings of the 34th annual meeting on Association for Computational Linguistics

Pages 152–158

ABSTRACT

We introduce a polynomial-time algorithm for statistical machine translation. This algorithm can be used in place of the expensive, slow best-first search strategies in current statistical translation architectures. The approach employs the stochastic bracketing transduction grammar (SBTG) model we recently introduced to replace earlier word alignment channel models, while retaining a bigram language model. The new algorithm in our experience yields major speed improvement with no significant loss of accuracy.

References

Barton, G. Edward, Robert C. Berwick, and Eric Sven Ristad. 1987. Computational Complexity and Natural Language. MIT Press, Cambridge, MA. Google ScholarDigital Library
Brown, Peter F., John Cocke, Stephen A. DellaPietra, Vincent J. DellaPietra, Frederick Jelinek, John D. Lafferty, Robert L. Mercer, and Paul S. Roossin. 1990. A statistical approach to machine translation. Computational Linguistics, 16(2):29--85. Google ScholarDigital Library
Brown, Peter F., Stephen A. DellaPietra, Vincent J. DellaPietra, and Robert L. Mercer. 1993. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics, 19(2):263--311. Google ScholarDigital Library
Dagan, Ido, Kenneth W. Church, and William A. Gale. 1993. Robust bilingual word alignment for machine aided translation. In Proceedings of the Workshop on Very Large Corpora, pages 1--8, Columbus, OH, June.Google Scholar
Earley, Jay. 1970. An efficient context-free parsing algorithm. Communications of the Association for Computing Machinery, 13(2):94--102. Google ScholarDigital Library
Fung, Pascale and Dekai Wu. 1994. Statistical augmentation of a Chinese machine-readable dictionary. In Proceedings of the Second Annual Workshop on Very Large Corpora, pages 69--85, Kyoto, August.Google Scholar
Joshi, Aravind K. 1987. Word-order variation in natural language generation. In Proceedings of AAAI-87, Sixth National Conference on Artificial Intelligence, pages 550--555.Google Scholar
Shieber, Stuart M. 1984. Direct parsing of ID/LP grammars. Linguistics and Philosophy, 7:135--154.Google ScholarCross Ref
Viterbi, Andrew J. 1967. Error bounds for convolutional codes and an asymptotically optimal decoding algorithm. IEEE Transactions on Information Theory, 13:260--269.Google ScholarDigital Library
Wu, Dekai. 1994. Aligning a parallel English-Chinese corpus statistically with lexical criteria. In Proceedings of the 32nd Annual Conference of the Association for Computational Linguistics, pages 80--87, Las Cruces, New Mexico, June. Google ScholarDigital Library
Wu, Dekai. 1995a. An algorithm for simultaneously bracketing parallel texts by aligning words. In Proceedings of the 33rd Annual Conference of the Association for Computational Linguistics, pages 244--251, Cambridge, Massachusetts, June. Google ScholarDigital Library
Wu, Dekai. 1995b. Grammarless extraction of phrasal translation examples from parallel texts. In TMI-95, Proceedings of the Sixth International Conference on Theoretical and Methodological Issues in Machine Translation, volume 2, pages 354--372, Leuven, Belgium, July.Google Scholar
Wu, Dekai. 1995c. Stochastic inversion transduction grammars, with application to segmentation, bracketing, and alignment of parallel corpora. In Proceedings of IJCAI-95, Fourteenth International Joint Conference on Artificial Intelligence, pages 1328--1334, Montreal, August. Google ScholarDigital Library
Wu, Dekai. 1995d. Trainable coarse bilingual grammars for parallel text bracketing. In Proceedings of the Third Annual Workshop on Very Large Corpora, pages 69--81, Cambridge, Massachusetts, June.Google Scholar
Wu, Dekai and Pascale Fung. 1994. Improving Chinese tokenization with linguistic filters on statistical lexical acquisition. In Proceedings of the Fourth Conference on Applied Natural Language Processing, pages 180--181, Stuttgart, October. Google ScholarDigital Library
Wu, Dekai and Cindy Ng. 1995. Using brackets to improve search for statistical machine translation. In PACLIC-10, Pacific Asia Conference on Language, Information and Computation, pages 195--204, Hong Kong, December.Google Scholar
Wu, Dekai and Xuanyin Xia. 1995. Large-scale automatic extraction of an English-Chinese lexicon. Machine Translation, 9(3--4):285--313.Google Scholar

A polynomial-time algorithm for statistical machine translation
1. Computing methodologies
  1. Artificial intelligence
2. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination
EACL '09: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics

In this paper we compare and contrast two approaches to Machine Translation (MT): the CMU-UKA Syntax Augmented Machine Translation system (SAMT) and UPC-TALP N-gram-based Statistical Machine Translation (SMT). SAMT is a hierarchical syntax-driven ...
Read More
Linguistically annotated BTG for statistical machine translation
COLING '08: Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1

Bracketing Transduction Grammar (BTG) is a natural choice for effective integration of desired linguistic knowledge into statistical machine translation (SMT). In this paper, we propose a Linguistically Annotated BTG (LABTG) for SMT. It conveys ...
Read More
Dependency-Based Chinese-English Statistical Machine Translation
CICLing '07: Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing

We present a Chinese-English Statistical Machine Translation (SMT) system based on dependency tree mappings. We use a state-of-the-art dependency parser to parse the English translation of the Penn Chinese Treebank to make it bilingual and then learn a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ACL '96: Proceedings of the 34th annual meeting on Association for Computational Linguistics
June 1996
399 pages
Program Chairs:
Aravind Joshi
University of Pennsylvania, Philadelphia, PA
,
Martha Palmer
University of Pennsylvania, Philadelphia, PA
Sponsors
In-Cooperation
Publisher
Association for Computational Linguistics
United States
Publication History
- Published: 24 June 1996
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate85of443submissions,19%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 45
  Total Citations
  View Citations
- 632
  Total Downloads
- Downloads (Last 12 months)33
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

A polynomial-time algorithm for statistical machine translation

ACL '96: Proceedings of the 34th annual meeting on Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination

Linguistically annotated BTG for statistical machine translation

Dependency-Based Chinese-English Statistical Machine Translation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

A polynomial-time algorithm for statistical machine translation

ACL '96: Proceedings of the 34th annual meeting on Association for Computational Linguistics

ABSTRACT

References

Cited By

Recommendations

N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination

Linguistically annotated BTG for statistical machine translation

Dependency-Based Chinese-English Statistical Machine Translation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media