Skip to main content

Parallel Ants Colony Optimization Algorithm for Dimensionality Reduction of Scientific Documents

  • Conference paper
  • First Online:
Rising Threats in Expert Applications and Solutions

Abstract

Dimensionality reduction is crucial in Machine Learning, to obtain main characteristics. The method of selecting characteristics that we will use is a multivariate filter, where we will jointly evaluate the relevance between the characteristics; using unsupervised learning. For which we will use information from Institute of Education Sciences, and application of TF-IDF to obtain the weights of each word in each document. To perform the dimensionality reduction, the PUFSACO (Parallelization Unsupervised future selection based on Ant Colony Optimization) algorithm will be applied, due to the large amount of information that will be processed. The output of PUFSACO will be the input of the classification algorithm. The present work proposes to parallelize the UFSACO algorithm (Unsupervised future selection based on Ant Colony Optimization). Being the basis of PUFSACO, comparing the computational time to validate the improvement of the proposed algorithm, the results show that applying parallelization improves 117% than the original algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. K. Dnuggets, IDC Study: Digital Universe in 2020. https://www.kdnuggets.com/2012/12/idc-digital-universe-2020.html

  2. F. Herrera, F. Charte, A.J. Rivera, M.J. Del Jesus, Multilabel Classification (Springer, 2016), pp. 17–31

    Google Scholar 

  3. M.E. Celebi, K. Aydin, Unsupervised Learning Algorithms (Springer, 2016)

    Google Scholar 

  4. S. García, J. Luengo, F. Herrera, Data Preprocessing in Data Mining (Springer, 2015)

    Google Scholar 

  5. S. Solorio-Fernández, J.A. Carrasco-Ochoa, J.F. Martínez-Trinidad, A review of unsupervised feature selection methods. , Artifi. Intell. Rev. (2019). http://dx.doi.org/10.1007/s10462-019-09682-y

  6. S. Alelyani, On feature selection stability: a data perspective. Citeseer (2013)

    Google Scholar 

  7. G. Dong, H. Liu, Feature Engineering for Machine Learning and Data Analytics (CRC Press, 2018)

    Google Scholar 

  8. S. Tabakhi, A. Najafi, R. Ranjbar, P. Moradi, Gene selection for microarray data classification using a novel ant colony optimization. Neurocomputing 168 (2015)

    Google Scholar 

  9. Z.A. Zhao, H. Liu, Spectral Feature Selection for Data Mining (Chapman and Hall/CRC, 2011)

    Google Scholar 

  10. G. Beni, J. Wang, Swarm intelligence in cellular robotic systems, in Robots and Biological Systems: Towards a New Bionics? (Springer, 1993), pp. 703–712

    Google Scholar 

  11. S. Agarwal, P. Ranjan, R. Rajesh, Dimensionality reduction methods classical and recent trends: a survey (2016)

    Google Scholar 

  12. Education Resources Information Center, ERIC. https://eric.ed.gov/

  13. Wikipedia contributors, tf–idf, (2019). https://en.wikipedia.org/wiki/Tf%E2%80%93idf

  14. S. Tabakhi, P. Moradi, F. Akhlaghian, An unsupervised feature selection algorithm based on ant colony optimization (2014). http://dx.doi.org/10.1016/j.engappai.2014.03.007

  15. S. Tabakhi, A. Najafi, R. Ranjbar, P. Moradi, Gene selection for microarray data classification using a novel ant colony optimization. Neurocomputing (2015). https://doi.org/10.1016/j.neucom.2015.05.022

  16. B.Z. Dadaneh, H.Y. Markid, A. Zakerolhos-seini, Unsupervised probabilistic feature selection using ant colony optimization. Expert Syst. Appl. (2016). https://doi.org/10.1016/j.eswa.2016.01.021

Download references

Acknowledgements

The research work was developed thanks to the research project IBA-0029-2016. “Servicios de Vigilancia Tecnológica para centros de investigación y Aula de Innovación Tecnológica, Orientadas al Desarrollo de Proyectos I+D+I en TICs y Educación” We thank the “Universidad Nacional de San Agustín de Arequipa” for making possible the realization of the research article.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rosario Nery Huanca-Gonza .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Huanca-Gonza, R.N., Vera-Sancho, J., Hinojosa-Cárdenas, E., Arbieto-Batallanos, C.E., Córdova-Martinez, M.D.C. (2021). Parallel Ants Colony Optimization Algorithm for Dimensionality Reduction of Scientific Documents. In: Rathore, V.S., Dey, N., Piuri, V., Babo, R., Polkowski, Z., Tavares, J.M.R.S. (eds) Rising Threats in Expert Applications and Solutions. Advances in Intelligent Systems and Computing, vol 1187. Springer, Singapore. https://doi.org/10.1007/978-981-15-6014-9_53

Download citation

Publish with us

Policies and ethics