Skip to main content

A Cascaded Syntactic Analyser for Basque

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2004)

Abstract

This article presents a robust syntactic analyser for Basque and the different modules it contains. Each module is structured in different analysis layers for which each layer takes the information provided by the previous layer as its input; thus creating a gradually deeper syntactic analysis in cascade. This analysis is carried out using the Constraint Grammar (CG) formalism. Moreover, the article describes the standardisation process of the parsing formats using XML.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Aduriz, I., Agirre, E., Aldezabal, I., Alegria, I., Ansa, O., Arregi, X., Arriola, J.M., Artola, X., de Ilarraza, A.D., Ezeiza, N., Gojenola, K., Maritxalar, A., Maritxalar, M., Oronoz, M., Sarasola, K., Soroa, A., Urizar, R., Urkia, M.: A Framework for the Automatic Processing of Basque. In: Proceedings of the First International Conference on Language Resources and Evaluation, Granada (1998)

    Google Scholar 

  • Aduriz, I., Aldezabal, I., Aranzabe, M., Arrieta, B., Arriola, J., Atutxa, A., de Ilarraza, A.D., Gojenola, K., Oronoz, M., Sarasola, K.: Construcción de un corpus etiquetado sintácticamente para el euskera. Actas del XVIII Congreso de la SEPLN, Valladolid, Spain (2002)

    Google Scholar 

  • Aduriz, I., de Ilarraza, A.D.: Morphosyntactic Disambiguation and Shallow Parsing in Computational Processing of Basque. In: Oyharçabal, B. (ed.) Inquiries into the lexiconsyntax relations in Basque (2003) (forthcoming)

    Google Scholar 

  • Ait-Mokhtar, S., Chanod, J.-P., Roux, C.: Robustness beyond shallowness: incremental deep parsing. Natural Language Engineering 8, 121–144 (2002)

    Article  Google Scholar 

  • Aldezabal, I., Ansa, O., Arrieta, B., Artola, X., Ezeiza, A., Hernández, G., Lersundi, M.: EDBL: a General Lexical Basis for the Automatic Processing of Basque. In: IRCS Workshop on Linguistic Databases, Philadelphia (USA)(2001)

    Google Scholar 

  • Alegria, I., Balza, I., Ezeiza, N., Fernandez, I., Urizar, R.: Named Entity Recognition and Classification for texts in Basque. II Jornadas de Tratamiento y Recuperación de Información, JOTRI, Madrid, Spain (2003)

    Google Scholar 

  • Artola, X., de Ilarraza, A.D., Ezeiza, N., Gojenola, K., Hernández, G., Soroa, A.: A Class Library for the Integration of NLP Tools: Definition and implementation of an Abstract Data Type Collection for the manipulation of SGML documents in a context of stand-off linguistic annotation. In: Proceedings of the Third International Conference on Language Resources and Evaluation. Las Palmas de Gran Canaria, Spain (2002)

    Google Scholar 

  • Carroll, J.: P̀arsing’. In: Mitkov, R. (ed.) The Oxford Handbook of Computational Linguistics, pp. 233–248. OUP, Oxford (2003)

    Google Scholar 

  • Ezeiza, N.: Corpusak ustiatzeko tresna linguistikoak. Euskararen etiketatzaile sintaktiko sendo eta malgua. PhD thesis, University of the Basque Country (2003)

    Google Scholar 

  • Karlsson, F., Voutilainen, A., Heikkila, J., Anttila, A.: Constraint Grammar: Languageindependent System for Parsing Unrestricted Text. Mouton de Gruyter, Berlin (1995)

    Google Scholar 

  • Karttunen, L., Chanod, J.-P., Grefenstette, G., Schiller, A.: Regular Expressions For Language Engineering. Journal of Natural Language Engineering (1997)

    Google Scholar 

  • Koskenniemi, K.: Two-level Morphology: A general Computational Model for Word-Form Recognition and Production. University of Helsinki, Department of General Linguistics. Publications 11 (1983)

    Google Scholar 

  • Sperberg-McQueen, C.M., Burnard, L.: Guidelines for Electronic Text Encoding and Interchange. TEI P3 Text Encoding Initiative (1994)

    Google Scholar 

  • Tapanainen, P., Voutilainen, A.: Tagging Accurately-Dońt guess if you know. In: Proceedings of the 4th Conference on Applied Natural Language Processing, Washington (1994)

    Google Scholar 

  • Verdejo, M.F., Gonzalo, J., Màrquez, L., Padró, L., Rodríguez, H., Agirre, E.: HERMES, Hemerotecas electrónicas: Recuperación multilingüe y extracción semántica, TIC2000- 0335-C03. Jornada de Seguimiento de Proyectos en Tecnologías del Software. Programa Nacional de Tecnologías de la Información y las Comunicaciones (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Aduriz, I. et al. (2004). A Cascaded Syntactic Analyser for Basque. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24630-5_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21006-1

  • Online ISBN: 978-3-540-24630-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics