Shallow Parsing Based on Comma Values

  • Conference paper
Advances in Artificial Intelligence - IBERAMIA-SBIA 2006 (IBERAMIA 2006, SBIA 2006)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4140)

Abstract

In the belief that punctuation can aid sentence structure analysis, our work focuses on a prior assignment of values to commas in Spanish texts. Supervised machine learning techniques are applied to learn comma classifiers, using positional information and part-of-speech tags as input attributes. One of these comma classifiers is combined with a rule-based analyzer in order to recognize and label text structures. The prior assignment of values to commas allowed the recognition rules to be simplified, with very encouraging results.
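
As an illustration of the kind of attribute representation the abstract describes, the sketch below builds one attribute dictionary per comma from positional information and part-of-speech tags. The function name `comma_features`, the window size, the tag labels, and the example sentence are all assumptions made for illustration; the attribute set and tagger actually used in the paper are not given on this page.

```python
from typing import Dict, List, Tuple

# (word form, part-of-speech tag); the tag names are illustrative only
Token = Tuple[str, str]


def comma_features(sentence: List[Token], window: int = 2) -> List[Dict[str, object]]:
    """Build one attribute dictionary per comma in a POS-tagged sentence."""
    comma_positions = [i for i, (form, _) in enumerate(sentence) if form == ","]
    rows: List[Dict[str, object]] = []
    for order, idx in enumerate(comma_positions):
        # Positional attributes: where the comma sits in the sentence
        row: Dict[str, object] = {
            "token_index": idx,
            "relative_position": idx / len(sentence),
            "comma_order": order + 1,
            "commas_in_sentence": len(comma_positions),
        }
        # Part-of-speech attributes: tags of the neighbours inside the window
        for offset in range(-window, window + 1):
            if offset == 0:
                continue
            j = idx + offset
            tag = sentence[j][1] if 0 <= j < len(sentence) else "NONE"
            row[f"pos_{offset:+d}"] = tag
        rows.append(row)
    return rows


# Hypothetical tagged sentence: "María, mi hermana, vive en Montevideo."
example = [
    ("María", "NOUN"), (",", "PUNCT"), ("mi", "DET"), ("hermana", "NOUN"),
    (",", "PUNCT"), ("vive", "VERB"), ("en", "ADP"), ("Montevideo", "NOUN"),
    (".", "PUNCT"),
]
features = comma_features(example)
```

Dictionaries of this kind, paired with hand-assigned comma labels, could then be passed to any supervised learner; which learner and which labels to use is left open here.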

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Garat, D. (2006). Shallow Parsing Based on Comma Values. In: Sichman, J.S., Coelho, H., Rezende, S.O. (eds) Advances in Artificial Intelligence - IBERAMIA-SBIA 2006. Lecture Notes in Computer Science, vol 4140. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11874850_53

  • DOI: https://doi.org/10.1007/11874850_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45462-5

  • Online ISBN: 978-3-540-45464-9

  • eBook Packages: Computer Science, Computer Science (R0)
