Abstract
Humongous amount of data is continuously generated by thousands of data sources, which simultaneously send data records that include a wide variety of elements of mixed types such as electronic purchases, information from social networks. They should be processed sequentially and incrementally over flexible time windows and then used for different analyzes. On account on these new instances that includes new attributes which have to be learned as development proceeds, called data stream, this paper tackles the incremental attribute learning task for mixed data using k-prototypes algorithm. Firstly, we propose a novel Incremental k-prototypes algorithm based on the merge technique to ensure the attribute learning task. Subsequently, we present the experiments to evaluate our new method using several real mixed data sets. The results show an improvement of the performance of the k-prototypes algorithm used for incremental attribute learning where the proposed Incremental k-prototypes method gives better results regarding the k-prototypes algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahmad, A., Khan, S.S.: Survey of state-of-the-art mixed data clustering algorithms. IEEE Access 7, 31883–31902 (2019)
Macqueen, J., et al.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)
Huang, Z.: A fast clustering algorithm to cluster very large categorical data sets in data mining. In: DMKD 1997, vol. 3, no. 8, pp. 34–39 (1997)
Huang, Z.: Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Min. Knowl. Disc. 2(3), 283–304 (1998)
Wang, T., Zhou, W., Zhu, X., et al.: Integrated feature preprocessing for classification based on neural incremental attribute learning. In: 2016 19th International Conference on Information Fusion (FUSION), pp. 386–393. IEEE (2016)
Chao, S., Wong, F.: An incremental decision tree learning methodology regarding attributes in medical data mining. In: 2009 International Conference on Machine Learning and Cybernetics, pp. 1694–1699. IEEE (2009)
Guan, S.-U., Zhu, F.: An incremental approach to genetic-algorithms-based classification. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 35(2), 227–239 (2005)
Guan, S.-U., Liu, J.: Incremental neural network training with an increasing input dimension. J. Intell. Syst. 13(1), 71–94 (2004)
Liu, X., Zhang, G., Zhan, Y., et al.: An incremental feature learning algorithm based on least square support vector machine. In: International Workshop on Frontiers in Algorithmics, pp. 330–338. Springer, Heidelberg (2008)
Agrawal, R.K., Bala, R.: Incremental Bayesian classification for multivariate normal distribution data. Pattern Recogn. Lett. 29(13), 1873–1876 (2008)
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI–1(2), 224–227 (1979)
Caliński, T., Harabasz, J.: A Dendrite Method for Cluster Analysis. Communications in Statistics-Theory and Methods, vol. 3, no 1, pp. 1–27 (1974)
Lichman, M., et al.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml
Rijn, J.V.: https://www.openml.org/d/268 (2014)
Ounali, C., Ben Rejab, F., Ferchichi, K.N.: Incremental algorithm based on split technique. In: International Conference on Intelligent Systems Design and Applications, pp. 567–576. Springer, Cham (2018)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Gorrab, S., Rejab, F.B. (2021). IK-prototypes: Incremental Mixed Attribute Learning Based on K-Prototypes Algorithm, a New Method. In: Abraham, A., Piuri, V., Gandhi, N., Siarry, P., Kaklauskas, A., Madureira, A. (eds) Intelligent Systems Design and Applications. ISDA 2020. Advances in Intelligent Systems and Computing, vol 1351. Springer, Cham. https://doi.org/10.1007/978-3-030-71187-0_81
Download citation
DOI: https://doi.org/10.1007/978-3-030-71187-0_81
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71186-3
Online ISBN: 978-3-030-71187-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)