Shallow language processing architecture for Bulgarian

Authors:
Hristo Tanev, Centro per la Ricerca Scientifica e Tecnologica, Trento, Italy
Ruslan Mitkov, Languages and Social Studies, Wolverhampton, UK

COLING '02: Proceedings of the 19th international conference on Computational linguistics - Volume 1, August 2002, pages 1–7. https://doi.org/10.3115/1072228.1072255

Published: 24 August 2002

ABSTRACT

This paper describes LINGUA, an architecture for text processing in Bulgarian. First, it outlines the pre-processing modules for tokenisation, sentence splitting, paragraph segmentation, part-of-speech tagging, clause chunking and noun phrase extraction. It then describes the anaphora resolution module in more detail. Evaluation results are reported for each processing task.
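
To make the idea of a modular pre-processing architecture concrete, here is a minimal Python sketch of a pipeline in which each stage annotates a shared document. It is not LINGUA's implementation: the stage names, the `Document` dictionary, the regular expressions and the stage order are all assumptions made for illustration, and only three of the listed pre-processing steps (paragraph segmentation, sentence splitting, tokenisation) are mimicked, with deliberately naive rules.

```python
import re
from typing import Any, Callable, Dict, List

# Hypothetical shared data structure: every stage reads and extends one dict.
Document = Dict[str, Any]
Stage = Callable[[Document], Document]


def segment_paragraphs(doc: Document) -> Document:
    # Assumption: one or more blank lines mark a paragraph boundary.
    parts = re.split(r"\n\s*\n", doc["text"])
    doc["paragraphs"] = [p.strip() for p in parts if p.strip()]
    return doc


def split_sentences(doc: Document) -> Document:
    # Naive rule: a sentence ends at '.', '!' or '?' followed by whitespace.
    doc["sentences"] = [
        s
        for p in doc["paragraphs"]
        for s in re.split(r"(?<=[.!?])\s+", p)
        if s
    ]
    return doc


def tokenise(doc: Document) -> Document:
    # A token is a run of word characters (Unicode-aware, so Cyrillic
    # works) or a single punctuation mark.
    doc["tokens"] = [re.findall(r"\w+|[^\w\s]", s) for s in doc["sentences"]]
    return doc


def run_pipeline(text: str, stages: List[Stage]) -> Document:
    # Apply the stages in order; later stages rely on earlier annotations.
    doc: Document = {"text": text}
    for stage in stages:
        doc = stage(doc)
    return doc


PREPROCESSING: List[Stage] = [segment_paragraphs, split_sentences, tokenise]

if __name__ == "__main__":
    result = run_pipeline("Иван чете книга. Тя е нова.\n\nКрай!", PREPROCESSING)
    print(result["tokens"])
```

Keeping each stage as a plain function over a shared document means further modules, such as a tagger or an anaphora resolver, could be appended to the list without changing the earlier ones.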

Publisher: Association for Computational Linguistics, United States (proceedings volume: 1184 pages)