Abstract
In the belief that punctuation can aid in the process of sentence structure analysis, our work focuses on a prior assignment of values to commas in Spanish texts. Supervised machine learning techniques are applied for learning commas classifiers, taking as input attributes positional information and part of speech tags. One of these comma classifiers and a rule-based analyzer are combined in order to recognize and label text structures. The prior assignment of values to commas allowed the simplification of recognition rules, with very encouraging results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bayraktar, M., Say, B., Akman, V.: An analysis of english punctuation: The special case of comma. International Journal of Corpus Linguistics 3(1), 33–57 (1998)
Briscoe, T., Carroll, J.: Developing and evaluating a probabilistic lr parser of part-of-speech and punctuation labels. In: Proceedings of the ACL/SIGPARSE 4th International Workshop on Parsing Technologies, Prague/Kqrlovy Vary, Czech Republic, pp. 48–58 (1995)
Carreras, X., Chao, I., Padró, L., Padró, M.: Freeling: An open-source suite of language analyzers. In: Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal (June 2004)
Jones, B.: Can punctuation help parsing? Technical Report 29, Cambridge University, Cambridge, UK (1994)
Ross Quinlan, J.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993)
Real Academia Española. Diccionario Panhispánico de Dudas. Santillana, Bogotá, Colombia (2005)
Schapire, R.E., Singer, Y.: Boostexter: A boosting-based system for text categorization. Machine Learning 39(2/3), 135–168 (2000)
van Delden, S., Gómez, F.: A finite state comma tagger. International Journal on Artificial Intelligence Tools 13(3), 449–468 (2004)
White, M.: Presenting punctuation. In: Proceedings of the Fifth European Workshop on Natural Language Generation, Leiden, the Netherlands, May 1995, pp. 107–125 (1995) Faculty of Social and Behavioural Sciences, University of Leiden
Wonsever, D., Minel, J.-L.: Contextual rules for text analysis. In: Gelbukh, A. (ed.) CICLing 2001. LNCS, vol. 2004, pp. 509–521. Springer, Heidelberg (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Garat, D. (2006). Shallow Parsing Based on Comma Values. In: Sichman, J.S., Coelho, H., Rezende, S.O. (eds) Advances in Artificial Intelligence - IBERAMIA-SBIA 2006. IBERAMIA SBIA 2006 2006. Lecture Notes in Computer Science(), vol 4140. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11874850_53
Download citation
DOI: https://doi.org/10.1007/11874850_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45462-5
Online ISBN: 978-3-540-45464-9
eBook Packages: Computer ScienceComputer Science (R0)