A Shallow Parsing Model for Hindi Using Conditional Random Field

Asopa, Sneha; Asopa, Pooja; Mathur, Iti; Joshi, Nisheeth

doi:10.1007/978-981-13-2354-6_31

Sneha Asopa⁷,
Pooja Asopa⁷,
Iti Mathur⁷ &
…
Nisheeth Joshi⁷

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 56))

859 Accesses

Abstract

In Natural Language Parsing, in order to perform sequential labeling and segmenting tasks, a probabilistic framework named Conditional Random Field (CRF) have an advantage over Hidden Markov Models (HMMs) and Maximum Entropy Markov Models (MEMMs). This research work is an attempt to develop an efficient model for shallow parsing which is based on CRF. For training the model, around 1,000 handcrafted chunked sentences of Hindi language were used. The developed model is tested on 864 sentences and evaluation is done by comparing the results with gold data. The accuracy is measured by precision, recall, and F-measure and is found to be 98.04, 98.04, and 98.04, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://taku910.github.io/crfpp/.

References

Asopa S, Asopa P, Mathur I, Joshi N (2016) Rule based chunker for Hindi. In: 2016 2nd international conference on contemporary computing and informatics (IC3I), 14 Dec 2006. IEEE, pp 442–445
Google Scholar
Lafferty J, McCallum A, Pereira F (2011) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the eighteenth international conference on machine learning, ICML, vol 1, pp 282–289
Google Scholar
Rao D, Yarowsky D (2007) Part of speech tagging and shallow parsing of Indian languages. Shallow Parsing South Asian Lang 8:17
Google Scholar
Gahlot H, Krishnarao AA, Kushwaha DS (2009) Shallow parsing for Hindi—an extensive analysis of sequential learning algorithms using a large annotated corpus. In: IEEE international advance computing conference, 2009, IACC 2009, 6 Mar 2009. IEEE, pp 1158–1163
Google Scholar
Sha F, Pereira F (2003) Shallow parsing with conditional random fields. In: Proceedings of the 2003 conference of the North American chapter of the association for computational linguistics on human language technology, vol 1. Association for Computational Linguistics, pp 134–141
Google Scholar
Nongmeikapam K, Chingangbam C, Keisham N, Varte B, Bandopadhyay S (2014) Chunking in Manipuri using CRF. Int J Nat Lang Comput (IJNLC) 3(3)
Article Google Scholar
Nivre J, Scholz M (2004) Deterministic dependency parsing of english text. In: Proceedings of the 20th international conference on computational linguistics, 23 Aug 2004. Association for Computational Linguistics, p 64
Google Scholar
Ghosh A, Das A, Bhaskar P, Bandyopadhyay S (2009) Dependency parser for Bengali: the JU system at ICON 2009. NLP tool contest ICON
Google Scholar
Bharati A, Sangal R (1990) A karaka based approach to parsing of Indian languages. In: Proceedings of the 13th conference on computational linguistics, vol 3, 20 Aug 1990. Association for Computational Linguistics, pp 25–29
Google Scholar

Download references

Author information

Authors and Affiliations

Banasthali Vidyapith, P.O. Banasthali Vidyapith, Newai, 304022, Rajasthan, India
Sneha Asopa, Pooja Asopa, Iti Mathur & Nisheeth Joshi

Authors

Sneha Asopa
View author publications
You can also search for this author in PubMed Google Scholar
Pooja Asopa
View author publications
You can also search for this author in PubMed Google Scholar
Iti Mathur
View author publications
You can also search for this author in PubMed Google Scholar
Nisheeth Joshi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sneha Asopa .

Editor information

Editors and Affiliations

Department of Computer Application, RCC Institute of Information Technology, Kolkata, West Bengal, India
Siddhartha Bhattacharyya
Faculty of Computers and Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien
Department of Computer Science and Engineering, Maharaja Agrasen Institute of Technology, New Delhi, Delhi, India
Deepak Gupta
Department of Computer Science and Engineering, Maharaja Agrasen Institute of Technology, New Delhi, Delhi, India
Ashish Khanna
Department of Information Technology, RCC Institute of Information Technology, Kolkata, West Bengal, India
Indrajit Pan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Asopa, S., Asopa, P., Mathur, I., Joshi, N. (2019). A Shallow Parsing Model for Hindi Using Conditional Random Field. In: Bhattacharyya, S., Hassanien, A., Gupta, D., Khanna, A., Pan, I. (eds) International Conference on Innovative Computing and Communications. Lecture Notes in Networks and Systems, vol 56. Springer, Singapore. https://doi.org/10.1007/978-981-13-2354-6_31

Download citation

DOI: https://doi.org/10.1007/978-981-13-2354-6_31
Published: 20 November 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2353-9
Online ISBN: 978-981-13-2354-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics