skip to main content
10.3115/981863.981884dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free Access

A polynomial-time algorithm for statistical machine translation

Authors Info & Claims
Published:24 June 1996Publication History

ABSTRACT

We introduce a polynomial-time algorithm for statistical machine translation. This algorithm can be used in place of the expensive, slow best-first search strategies in current statistical translation architectures. The approach employs the stochastic bracketing transduction grammar (SBTG) model we recently introduced to replace earlier word alignment channel models, while retaining a bigram language model. The new algorithm in our experience yields major speed improvement with no significant loss of accuracy.

References

  1. Barton, G. Edward, Robert C. Berwick, and Eric Sven Ristad. 1987. Computational Complexity and Natural Language. MIT Press, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Brown, Peter F., John Cocke, Stephen A. DellaPietra, Vincent J. DellaPietra, Frederick Jelinek, John D. Lafferty, Robert L. Mercer, and Paul S. Roossin. 1990. A statistical approach to machine translation. Computational Linguistics, 16(2):29--85. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Brown, Peter F., Stephen A. DellaPietra, Vincent J. DellaPietra, and Robert L. Mercer. 1993. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics, 19(2):263--311. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dagan, Ido, Kenneth W. Church, and William A. Gale. 1993. Robust bilingual word alignment for machine aided translation. In Proceedings of the Workshop on Very Large Corpora, pages 1--8, Columbus, OH, June.Google ScholarGoogle Scholar
  5. Earley, Jay. 1970. An efficient context-free parsing algorithm. Communications of the Association for Computing Machinery, 13(2):94--102. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Fung, Pascale and Dekai Wu. 1994. Statistical augmentation of a Chinese machine-readable dictionary. In Proceedings of the Second Annual Workshop on Very Large Corpora, pages 69--85, Kyoto, August.Google ScholarGoogle Scholar
  7. Joshi, Aravind K. 1987. Word-order variation in natural language generation. In Proceedings of AAAI-87, Sixth National Conference on Artificial Intelligence, pages 550--555.Google ScholarGoogle Scholar
  8. Shieber, Stuart M. 1984. Direct parsing of ID/LP grammars. Linguistics and Philosophy, 7:135--154.Google ScholarGoogle ScholarCross RefCross Ref
  9. Viterbi, Andrew J. 1967. Error bounds for convolutional codes and an asymptotically optimal decoding algorithm. IEEE Transactions on Information Theory, 13:260--269.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Wu, Dekai. 1994. Aligning a parallel English-Chinese corpus statistically with lexical criteria. In Proceedings of the 32nd Annual Conference of the Association for Computational Linguistics, pages 80--87, Las Cruces, New Mexico, June. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Wu, Dekai. 1995a. An algorithm for simultaneously bracketing parallel texts by aligning words. In Proceedings of the 33rd Annual Conference of the Association for Computational Linguistics, pages 244--251, Cambridge, Massachusetts, June. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Wu, Dekai. 1995b. Grammarless extraction of phrasal translation examples from parallel texts. In TMI-95, Proceedings of the Sixth International Conference on Theoretical and Methodological Issues in Machine Translation, volume 2, pages 354--372, Leuven, Belgium, July.Google ScholarGoogle Scholar
  13. Wu, Dekai. 1995c. Stochastic inversion transduction grammars, with application to segmentation, bracketing, and alignment of parallel corpora. In Proceedings of IJCAI-95, Fourteenth International Joint Conference on Artificial Intelligence, pages 1328--1334, Montreal, August. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Wu, Dekai. 1995d. Trainable coarse bilingual grammars for parallel text bracketing. In Proceedings of the Third Annual Workshop on Very Large Corpora, pages 69--81, Cambridge, Massachusetts, June.Google ScholarGoogle Scholar
  15. Wu, Dekai and Pascale Fung. 1994. Improving Chinese tokenization with linguistic filters on statistical lexical acquisition. In Proceedings of the Fourth Conference on Applied Natural Language Processing, pages 180--181, Stuttgart, October. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Wu, Dekai and Cindy Ng. 1995. Using brackets to improve search for statistical machine translation. In PACLIC-10, Pacific Asia Conference on Language, Information and Computation, pages 195--204, Hong Kong, December.Google ScholarGoogle Scholar
  17. Wu, Dekai and Xuanyin Xia. 1995. Large-scale automatic extraction of an English-Chinese lexicon. Machine Translation, 9(3--4):285--313.Google ScholarGoogle Scholar
  1. A polynomial-time algorithm for statistical machine translation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image DL Hosted proceedings
        ACL '96: Proceedings of the 34th annual meeting on Association for Computational Linguistics
        June 1996
        399 pages
        • Program Chairs:
        • Aravind Joshi,
        • Martha Palmer

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        • Published: 24 June 1996

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate85of443submissions,19%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader