Article

Free Access

A SNoW based supertagger with application to NP chunking

Authors:
Libin Shen

University of Pennsylvania, Philadelphia, PA

University of Pennsylvania, Philadelphia, PA
View Profile

,
Aravind K. Joshi

University of Pennsylvania, Philadelphia, PA

University of Pennsylvania, Philadelphia, PA
View Profile

ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1July 2003Pages 505–512https://doi.org/10.3115/1075096.1075160

Published:07 July 2003Publication History

ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1

Pages 505–512

ABSTRACT

Supertagging is the tagging process of assigning the correct elementary tree of LTAG, or the correct supertag, to each word of an input sentence. In this paper we propose to use supertags to expose syntactic dependencies which are unavailable with POS tags. We first propose a novel method of applying Sparse Network of Winnow (SNoW) to sequential models. Then we use it to construct a supertagger that uses long distance syntactical dependencies, and the supertagger achieves an accuracy of 92.41%. We apply the supertagger to NP chunking. The use of supertags in NP chunking gives rise to almost 1% absolute increase (from 92.03% to 92.95%) in F-score under Transformation Based Learning(TBL) frame. The surpertagger described here provides an effective and efficient way to exploit syntactic information.

References

S. Abney. 1991. Parsing by chunks. In Principle-Based Parsing. Kluwer Academic Publishers.Google Scholar
E. Brill. 1995. Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Computational Linguistics, 21(4):543--565. Google ScholarDigital Library
J. Chen, B. Srinivas, and K. Vijay-Shanker. 1999. New models for improving supertag disambiguation. In Proceedings of the 9th EACL. Google ScholarDigital Library
J. Chen. 2001. Towards Efficient Statistical Parsing using Lexicalized Grammatical Information. Ph.D. thesis, University of Delaware. Google ScholarDigital Library
M. Collins. 2002. Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In EMNLP 2002. Google ScholarDigital Library
A. Joshi and Y. Schabes. 1997. Tree-adjoining grammars. In G. Rozenberg and A. Salomaa, editors, Handbook of Formal Languages, volume 3, pages 69--124. Springer. Google ScholarDigital Library
A. Joshi and B. Srinivas. 1994. Disambiguation of super parts of speech (or supertags): Almost parsing. In COLING'94. Google ScholarDigital Library
T. Kudo and Y. Matsumoto. 2001. Chunking with support vector machines. In Proceedings of NAACL 2001. Google ScholarDigital Library
J. Lafferty, A. McCallum, and F. Pereira. 2001. Conditional random fields: Probabilistic models for stgmentation and labeling sequence data. In Proceedings of ICML 2001. Google ScholarDigital Library
M. P. Marcus, B. Santorini, and M. A. Marcinkiewicz. 1994. Building a large annotated corpus of english: the penn treebank. Computational Linguistics, 19(2):313--330. Google ScholarDigital Library
M. Muñoz, V. Punyakanok, D. Roth, and D. Zimak. 1999. A learning approach to shallow parsing. In Proceedings of EMNLP-WVLC'99.Google Scholar
G. Ngai and R. Florian. 2001. Transformation-based learning in the fast lane. In Proceedings of NAACL-2001, pages 40--47. Google ScholarDigital Library
V. Punyakanok and D. Roth. 2000. The use of classifiers in sequential inference. In NIPS'00.Google Scholar
L. Ramshaw and M. Marcus. 1995. Text chunking using transformation-based learning. In Proceedings of the 3rd WVLC.Google Scholar
A. Ratnaparkhi. 1996. A maximum entropy part-of-speech tagger. In Proceedings of EMNLP 96.Google Scholar
D. Roth. 1998. Learning to resolve natural language ambiguities: A unified approach. In AAAI'98. Google ScholarDigital Library
Erik F. Tjong Kim Sang. 2002. Memory-based shallow parsing. Journal of Machine Learning Research, 2:559--594. Google ScholarDigital Library
F. Sha and F. Pereira. 2003. Shallow parsing with conditional random fields. In Proceedings of NAACL 2003. Google ScholarDigital Library
B. Srinivas and A. Joshi. 1999. Supertagging: An approach to almost parsing. Computational Linguistics, 25(2). Google ScholarDigital Library
B. Srinivas. 1997. Performance evaluation of supertagging for partial parsing. In IWPT 1997.Google Scholar
H. van Halteren, J. Zavrel, and W. Daelmans. 1998. Improving data driven wordclass tagging by system combination. In Proceedings of COLING-ACL 98. Google ScholarDigital Library
F. Xia. 2001. Automatic Grammar Generation From Two Different Perspectives. Ph.D. thesis, University of Pennsylvania. Google ScholarDigital Library
XTAG-Group. 2001. A lexicalized tree adjoining grammar for english. Technical Report 01-03, IRCS, Univ. of Pennsylvania.Google Scholar
T. Zhang, F. Damerau, and D. Johnson. 2001. Text chunking using regularized winnow. In Proceedings of ACL 2001. Google ScholarDigital Library

A SNoW based supertagger with application to NP chunking
1. Computing methodologies
  1. Artificial intelligence
2. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

Structure-guided supertagger learning

As described in this paper, we specifically examine the structural learning problem of a supertagging task. Supertagging is a task to assign the most probable lexical entry to each word in a sentence. A supertagger is extremely important for a ...
Read More
Faster parsing by supertagger adaptation
ACL '10: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics

We propose a novel self-training method for a parser which uses a lexicalised grammar and supertagger, focusing on increasing the speed of the parser rather than its accuracy. The idea is to train the supertagger on large amounts of parser output, so ...
Read More
Forest-guided supertagger training
COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics

Supertagging is an important technique for deep syntactic analysis. A supertagger is usually trained independently of the parser using a sequence labeling method. This presents an inconsistent training objective between the supertagger and the parser. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
July 2003
571 pages
Program Chairs:
Erhard W. Hinrichs,
Dan Roth
Sponsors
In-Cooperation
Publisher
Association for Computational Linguistics
United States
Publication History
- Published: 7 July 2003
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate85of443submissions,19%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 6
  Total Citations
  View Citations
- 232
  Total Downloads
- Downloads (Last 12 months)16
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

A SNoW based supertagger with application to NP chunking

ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1

ABSTRACT

References

Cited By

Recommendations

Structure-guided supertagger learning

Faster parsing by supertagger adaptation

Forest-guided supertagger training

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

A SNoW based supertagger with application to NP chunking

ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1

ABSTRACT

References

Cited By

Recommendations

Structure-guided supertagger learning

Faster parsing by supertagger adaptation

Forest-guided supertagger training

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media