Abstract
Dimensionality reduction is crucial in Machine Learning, to obtain main characteristics. The method of selecting characteristics that we will use is a multivariate filter, where we will jointly evaluate the relevance between the characteristics; using unsupervised learning. For which we will use information from Institute of Education Sciences, and application of TF-IDF to obtain the weights of each word in each document. To perform the dimensionality reduction, the PUFSACO (Parallelization Unsupervised future selection based on Ant Colony Optimization) algorithm will be applied, due to the large amount of information that will be processed. The output of PUFSACO will be the input of the classification algorithm. The present work proposes to parallelize the UFSACO algorithm (Unsupervised future selection based on Ant Colony Optimization). Being the basis of PUFSACO, comparing the computational time to validate the improvement of the proposed algorithm, the results show that applying parallelization improves 117% than the original algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
K. Dnuggets, IDC Study: Digital Universe in 2020. https://www.kdnuggets.com/2012/12/idc-digital-universe-2020.html
F. Herrera, F. Charte, A.J. Rivera, M.J. Del Jesus, Multilabel Classification (Springer, 2016), pp. 17–31
M.E. Celebi, K. Aydin, Unsupervised Learning Algorithms (Springer, 2016)
S. García, J. Luengo, F. Herrera, Data Preprocessing in Data Mining (Springer, 2015)
S. Solorio-Fernández, J.A. Carrasco-Ochoa, J.F. Martínez-Trinidad, A review of unsupervised feature selection methods. , Artifi. Intell. Rev. (2019). http://dx.doi.org/10.1007/s10462-019-09682-y
S. Alelyani, On feature selection stability: a data perspective. Citeseer (2013)
G. Dong, H. Liu, Feature Engineering for Machine Learning and Data Analytics (CRC Press, 2018)
S. Tabakhi, A. Najafi, R. Ranjbar, P. Moradi, Gene selection for microarray data classification using a novel ant colony optimization. Neurocomputing 168 (2015)
Z.A. Zhao, H. Liu, Spectral Feature Selection for Data Mining (Chapman and Hall/CRC, 2011)
G. Beni, J. Wang, Swarm intelligence in cellular robotic systems, in Robots and Biological Systems: Towards a New Bionics? (Springer, 1993), pp. 703–712
S. Agarwal, P. Ranjan, R. Rajesh, Dimensionality reduction methods classical and recent trends: a survey (2016)
Education Resources Information Center, ERIC. https://eric.ed.gov/
Wikipedia contributors, tf–idf, (2019). https://en.wikipedia.org/wiki/Tf%E2%80%93idf
S. Tabakhi, P. Moradi, F. Akhlaghian, An unsupervised feature selection algorithm based on ant colony optimization (2014). http://dx.doi.org/10.1016/j.engappai.2014.03.007
S. Tabakhi, A. Najafi, R. Ranjbar, P. Moradi, Gene selection for microarray data classification using a novel ant colony optimization. Neurocomputing (2015). https://doi.org/10.1016/j.neucom.2015.05.022
B.Z. Dadaneh, H.Y. Markid, A. Zakerolhos-seini, Unsupervised probabilistic feature selection using ant colony optimization. Expert Syst. Appl. (2016). https://doi.org/10.1016/j.eswa.2016.01.021
Acknowledgements
The research work was developed thanks to the research project IBA-0029-2016. “Servicios de Vigilancia Tecnológica para centros de investigación y Aula de Innovación Tecnológica, Orientadas al Desarrollo de Proyectos I+D+I en TICs y Educación” We thank the “Universidad Nacional de San Agustín de Arequipa” for making possible the realization of the research article.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Huanca-Gonza, R.N., Vera-Sancho, J., Hinojosa-Cárdenas, E., Arbieto-Batallanos, C.E., Córdova-Martinez, M.D.C. (2021). Parallel Ants Colony Optimization Algorithm for Dimensionality Reduction of Scientific Documents. In: Rathore, V.S., Dey, N., Piuri, V., Babo, R., Polkowski, Z., Tavares, J.M.R.S. (eds) Rising Threats in Expert Applications and Solutions. Advances in Intelligent Systems and Computing, vol 1187. Springer, Singapore. https://doi.org/10.1007/978-981-15-6014-9_53
Download citation
DOI: https://doi.org/10.1007/978-981-15-6014-9_53
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6013-2
Online ISBN: 978-981-15-6014-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)