To read this content please select one of the options below:

Cancer data classification by quantum-inspired immune clone optimization-based optimal feature selection using gene expression data: deep learning approach

Nageswara Rao Eluri (Department of Computer Science and Engineering, Acharya Nagarjuna University, Guntur, India)
Gangadhara Rao Kancharla (Department of Computer Science and Engineering, Acharya Nagarjuna University, Guntur, India)
Suresh Dara (Department of Computer Science and Engineering, Padmasri Dr BV Raju Institute of Technology, Narsapur, India)
Venkatesulu Dondeti (Department of Computer Science and Engineering, VFSTR, Guntur, India)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 28 September 2021

Issue publication date: 15 March 2022

202

Abstract

Purpose

Gene selection is considered as the fundamental process in the bioinformatics field. The existing methodologies pertain to cancer classification are mostly clinical basis, and its diagnosis capability is limited. Nowadays, the significant problems of cancer diagnosis are solved by the utilization of gene expression data. The researchers have been introducing many possibilities to diagnose cancer appropriately and effectively. This paper aims to develop the cancer data classification using gene expression data.

Design/methodology/approach

The proposed classification model involves three main phases: “(1) Feature extraction, (2) Optimal Feature Selection and (3) Classification”. Initially, five benchmark gene expression datasets are collected. From the collected gene expression data, the feature extraction is performed. To diminish the length of the feature vectors, optimal feature selection is performed, for which a new meta-heuristic algorithm termed as quantum-inspired immune clone optimization algorithm (QICO) is used. Once the relevant features are selected, the classification is performed by a deep learning model called recurrent neural network (RNN). Finally, the experimental analysis reveals that the proposed QICO-based feature selection model outperforms the other heuristic-based feature selection and optimized RNN outperforms the other machine learning methods.

Findings

The proposed QICO-RNN is acquiring the best outcomes at any learning percentage. On considering the learning percentage 85, the accuracy of the proposed QICO-RNN was 3.2% excellent than RNN, 4.3% excellent than RF, 3.8% excellent than NB and 2.1% excellent than KNN for Dataset 1. For Dataset 2, at learning percentage 35, the accuracy of the proposed QICO-RNN was 13.3% exclusive than RNN, 8.9% exclusive than RF and 14.8% exclusive than NB and KNN. Hence, the developed QICO algorithm is performing well in classifying the cancer data using gene expression data accurately.

Originality/value

This paper introduces a new optimal feature selection model using QICO and QICO-based RNN for effective classification of cancer data using gene expression data. This is the first work that utilizes an optimal feature selection model using QICO and QICO-RNN for effective classification of cancer data using gene expression data.

Keywords

Citation

Eluri, N.R., Kancharla, G.R., Dara, S. and Dondeti, V. (2022), "Cancer data classification by quantum-inspired immune clone optimization-based optimal feature selection using gene expression data: deep learning approach", Data Technologies and Applications, Vol. 56 No. 2, pp. 247-282. https://doi.org/10.1108/DTA-05-2020-0109

Publisher

:

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited

Related articles