Skip to main content

An Enhanced Cos-Neuro Bio-Inspired Approach for Document Clustering

  • Conference paper
  • First Online:
Intelligent Computing and Innovation on Data Science

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 248))

Abstract

Data mining is a dynamic and attractive research domain that has become known to discover information from the vast amount of constantly created data. Clustering is an unsupervised approach to data mining in which a group of similar items is assembled in one cluster. The quality of documents retrieved within a lesser amount of time has always been a fundamental problem in web document clustering. The authors introduce similarity technique-based K-means clustering using bee swarm optimization and artificial neural networks in this work. The artificial neural network helps classify the best centroid location based on the similarity index of the document and according to the trained structure of ANN to organize the best cluster number to test queries. The quality of papers returned is improved significantly with lesser execution time and improved efficiency through the projected method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Vijayalakshmi B et al (2020) An attention based deep learning model for traffic flow prediction using spatio temporal features towards sustainable smart city. IJCS, Wiley, Hoboken, pp 1–14

    Google Scholar 

  2. Batra I et al (2020) Hybrid logical security framework for privacy preservation in the green internet of things. MDPI-Sustainability 12(14):1–15

    Google Scholar 

  3. Batra I et al (2019) A lightweight IoT based security framework for inventory automation using wireless sensor network. IJCS, Wiley, Hoboken, pp 1–16

    Google Scholar 

  4. Hearst MA (1999, June) Untangling text data mining. In: Proceedings of the 37th annual meeting of the association for computational linguistics, pp 3–10

    Google Scholar 

  5. MacQueen J (1967, June) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1, no 14, pp 281–297

    Google Scholar 

  6. Nazeer KA, Sebastian MP (2009, July) Improving the accuracy and efficiency of the k-means clustering algorithm. In: Proceedings of the world congress on engineering, vol 1. Association of Engineers, London, pp 1–3

    Google Scholar 

  7. Kapil S, Chawla M (2016, July) Performance evaluation of K-means clustering algorithm with various distance metrics. In: 2016 IEEE 1st international conference on power electronics, intelligent control and energy systems (ICPEICES), pp 1–4, IEEE

    Google Scholar 

  8. Shafeeq A, Hareesha KS (2012) Dynamic clustering of data with modified k-means algorithm. In: Proceedings of the 2012 conference on information and computer networks, pp 221–225

    Google Scholar 

  9. Tan S (2005) Neighbor-weighted k-nearest neighbor for unbalanced text corpus. Expert Syst Appl 28(4):667–671

    Article  Google Scholar 

  10. Zheng Z, Wu X, Srihari R (2004) Feature selection for text categorization on imbalanced data. ACM SIGKDD Explor Newsl 6(1):80–89

    Article  Google Scholar 

  11. Del Castillo MD, Serrano JI (2004) A multistrategy approach for digital text categorization from imbalanced documents. ACM SIGKDD Explor Newsl 6(1):70–79

    Article  Google Scholar 

  12. Cui X, Potok TE, Palathingal P (2005, June) Document clustering using particle swarm optimization. In: Proceedings 2005 IEEE swarm intelligence symposium, 2005. SIS 2005, pp 185–191, IEEE

    Google Scholar 

  13. Bharathi A, Deepan kumar E (2014) Survey on classification techniques in data mining. Int J Recent Innov Trends Comput Commun 2(7):1983–1986

    Google Scholar 

  14. Djenouri Y, Belhadi A, Belkebir R (2018) Bees swarm optimization guided by data mining techniques for document information retrieval. Expert Syst Appl 94:126–136

    Article  Google Scholar 

  15. Karaboga D, Ozturk C (2011) A novel clustering approach: Artificial Bee Colony (ABC) algorithm. Appl Soft Comput 11(1):652–657

    Article  Google Scholar 

  16. Lenc L, Král P (2016, April) Deep neural networks for Czech multi-label document classification. In: International conference on intelligent text processing and computational linguistics. Springer, Cham, pp 460–471

    Google Scholar 

  17. Zheng J, Guo Y, Feng C, Chen H (2018) A hierarchical neural-network-based document representation approach for text classification. Math Probl Eng

    Google Scholar 

  18. Datta D et al (2020) UAV environment in FANET: an overview. Applications of cloud computing: approaches and practices. CRC Press, Taylor & Francis Group, Boca Raton, pp 1–16

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sahil Verma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Madaan, V., Munjal, K., Verma, S., Jhanjhi, N.Z., Singh, A. (2021). An Enhanced Cos-Neuro Bio-Inspired Approach for Document Clustering. In: Peng, SL., Hsieh, SY., Gopalakrishnan, S., Duraisamy, B. (eds) Intelligent Computing and Innovation on Data Science. Lecture Notes in Networks and Systems, vol 248. Springer, Singapore. https://doi.org/10.1007/978-981-16-3153-5_54

Download citation

Publish with us

Policies and ethics