An Enhanced Support Vector Machines Model for Classification and Rule Generation

Pai, Ping-Feng; Hsu, Ming-Fu

doi:10.1007/978-3-642-20859-1_11

Ping-Feng Pai⁴ &
Ming-Fu Hsu⁵

Part of the book series: Studies in Computational Intelligence ((SCI,volume 356))

4158 Accesses
7 Citations

Abstract

Based on statistical learning theory, support vector machines (SVM) model is an emerging machine learning technique solving classification problems with small sampling, non-linearity and high dimension. Data preprocessing, parameter selection, and rule generation influence performance of SVM models a lot. Thus, the main purpose of this chapter is to propose an enhanced support vector machines (ESVM) model which can integrate the abilities of data preprocessing, parameter selection and rule generation into a SVM model; and apply the ESVM model to solve real world problems. The structure of this chapter is organized as follows. Section 11.1 presents the purpose of classification and the basic concept of SVM models. Sections 11.2 and 11.3 introduce data preprocessing techniques, metaheuristics for selecting SVM models. Rule extraction of SVM models is addressed in Section 11.4. An enhanced SVM scheme and numerical results are illustrated in Section 11.5 and 11.6. Conclusions are made in Section 11.7.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agarwal, S., Agrawal, R., Deshpande, P.M., Gupta, A., Naughton, J.F., Ramakrishnan, R., Sarawagi, S.: On the computation of multidimensional aggregates. In: Proc. Int. Conf. Very Large Data Bases, pp. 506–521 (1996)
Google Scholar
Barbar’a, D., DuMouchel, W., Faloutos, C., Haas, P.J., Hellerstein, J.H., Ioannidis, Y., Jagadish, H.V., Johnson, T., Ng, R., Poosala, V., Ross, K.A., Servcik, K.C.: The New Jersey data reduction report. Bull. Technical Committee on Data Engineering 20, 3–45 (1997)
Google Scholar
Ballou, D.P., Tayi, G.K.: Enhancing data quality in data warehouse environments. Comm. ACM 78, 42–73 (1999)
Google Scholar
Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classifcation and Regression Trees, Wadsworth International Group (1984)
Google Scholar
Chakrabart, S., Cox, E., Frank, E., Guiting, R.H., Han, J., Jiang, X., Kamber, M., Lightstone, S.S., Nadeau, T.P., Neapolitan, R.E., Pyle, D., Refaat, M., Schneider, M., Teorey, T.J.I., Witten, H.: Data Mining: Know It All. Morgan Kaufmann, San Francisco (2008)
Google Scholar
Taylor, J.S., Cristianini, N.: Support Vector Machines and other kernel-based learning methods. Cambridge University Press, Cambridge (2000)
Google Scholar
Dash, M., Liu, H.: Feature selection methods for classification. Intell. Data Anal. (1), 131–156 (1997)
Article Google Scholar
Dwyer, D.W., Kocagil, A.E., Stein, R.M.: Moody’s kmv riskcalc v3.1 model (2004)
Google Scholar
English, L.: Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing. John Wiley & Sons, Chichester (1999)
Google Scholar
Farmer, J.D., Packard, N.H., Perelson, A.: The immune system, adaptation, and machine learning. Physica. D 22(1–3), 187–204 (1986)
Article MathSciNet Google Scholar
Glover, F., Kelly, J.P., Laguna, M.: Genetic algorithms and tabu search: hybrids for optimization. Comput. Oper. Res. 22, 111–134 (1995)
Article MATH Google Scholar
Hamel, L.H.: Knowledge Discovery with Support Vector Machines. Wiley, Chichester (2009)
Book Google Scholar
Holland, J.H.: Adaptation in Natural and Artificial Systems. University of Michigan Press, Ann Arbor (1975)
Google Scholar
Huang, C.L., Chen, M.C., Wang, C.J.: Credit scoring with a data mining approach based on support vector machines. Expert Systems with Applications 33(4), 847–856 (2007)
Article MathSciNet Google Scholar
Kennedy, R.L., Lee, Y., Van Roy, B., Reed, C.D., Lippman, R.P.: Solving Data Mining Problems Through Pattern Recognition. Prentice-Hall, Englewood Cliffs (1998)
Google Scholar
Kennedy, J., Eberhart, R.: Particle swarm optimization, In Proceedings of IEEE conference on neural network, vol. 4, pp. 1942–1948 (1995)
Google Scholar
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. Intell. 97, 273–324 (1997)
Article MATH Google Scholar
Langley, P., Simon, H.A., Bradshaw, G.L., Zytkow, J.M.: Scientific Discovery: Computational Explorations of the Creative Processes. MIT Press, Cambridge (1987)
Google Scholar
Liu, H., Motoda, H.: Feature Extraction, Construction, and Selection: A Data Mining Perspective. Kluwer Academic Publishers, Dordrecht (1998)
MATH Google Scholar
Lin, S.W., Shiue, Y.R., Chen, S.C., Cheng, H.M.: Applying enhanced data mining approaches in predicting bank performance: A case of Taiwanese commercial banks. Expert Syst. Appl. (36), 11543–11551 (2009)
Article Google Scholar
Loshin, D.: Enterprise Knowledge Management: The Data Quality Approach. Morgan Kaufmann, San Francisco (2001)
Google Scholar
Lopez, F.G., Torres, G.M., Batista, B.M.: Solving feature subset selection problem by parallel scatter search. Eur. J. Oper. Res. (169), 477–489 (2006)
Article MATH Google Scholar
Martens, D., Baesens, B., Gestel, T.V., Vanthienen, J.: Comprehensible credit scoring models using rule extraction from support vector machines. Eur. J. Oper. Res. 183(3), 1466–1476 (2007)
Article MATH Google Scholar
Martin, D.: Early warning of bank failure a logit regression approach. J. Bank. Financ. (1), 249–276 (1977)
Article Google Scholar
Nunez, H., Angulo, C., Catala, A.: Rule extraction from support vector machines. In: European Symposium on Artificial Neural Networks Proceedings, pp. 107–112 (2002)
Google Scholar
Nunez, H., Angulo, C., Catala, A.: Rule based learning systems from SVM and RBFNN. Tendencias de la mineria de datos en espana, Red Espaola de Minera de Datos (2004)
Google Scholar
Neter, J., Kutner, M.H., Nachtsheim, C.J., Wasserman, L.: Applied Linear Statistical Models. Irwin (1996)
Google Scholar
Olson, J.E.: Data Quality: The Accuracy Dimension. Morgan Kaufmann, San Francisco (2003)
Google Scholar
Pai, P.F., Hong, W.C.: Forecasting regional electricity load based on recurrent support vector machines with genetic algorithms. Electr. Pow. Syst. Res. 74(3), 417–425 (2005)
Article MathSciNet Google Scholar
Pai, P.F., Lin, C.S.: A hybrid ARIMA and support vector machines model in stock price forecasting. Omega 33(6), 497–505 (2005)
Article Google Scholar
Pai, P.F.: System reliability forecasting by support vector machines with genetic algorithms. Math. Comput. Model. 433(3-4), 262–274 (2006)
Article MathSciNet Google Scholar
Pai, P.F., Chen, S.Y., Huang, C.W., Chang, Y.H.: Analyzing foreign exchange rates by rough set theory and directed acyclic graph support vector machines. Expert Syst. Appl. 37(8), 5993–5998 (2010)
Article Google Scholar
Pai, P.F., Chang, Y.H., Hsu, M.F., Fu, J.C., Chen, H.H.: A hybrid kernel principal component analysis and support vector machines model for analyzing sonographic parotid gland in Sjogren’s Syndrome. International Journal of Mathematical Modelling and Numerical Optimisation (2010) (in press)
Google Scholar
Pai, P.F., Hsu, M.F., Wang, M.C.: A support vector machine-based model for detecting top management fraud. Knowl.-Based Syst. 24(2), 314–321 (2011)
Article Google Scholar
Pyle, D.: Data Preparation for Data Mining. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Quinlan, J.R.: Unknown attribute values in induction. In: Proc. 1989 Int. Conf. Machine Learning (ICML 1989), Ithaca, NY, pp. 164–168 (1989)
Google Scholar
Redman, T.: Data Quality: Management and Technology. Bantam Books (1992)
Google Scholar
Ross, K., Srivastava, D.: Fast computation of sparse datacubes. In: Proc Int. Conf. Very Large Data Bases, pp. 116–125 (1997)
Google Scholar
Sarawagi, S., Stonebraker, M.: Efficient organization of large multidimensional arrays. In: Proc. Int. Conf. Data Engineering, ICDE 1994 (1994)
Google Scholar
Siedlecki, W., Sklansky, J.: On automatic feature selection. Int. J. Pattern Recognition and Artificial Intelligence (2), 197–220 (1988)
Article Google Scholar
Scholkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2001)
Google Scholar
Vapnik, V.: Statistical learning theory. John Wiley and Sons, New York (1998)
MATH Google Scholar
Vapnik, V., Golowich, S., Smola, A.: Support vector machine for function approximation, regression estimation, and signal processing. Advances in Neural Information processing System (9), 281–287 (1996)
Google Scholar
Wang, R., Storey, V., Firth, C.: A framework for analysis of data quality research. IEEE Trans. Knowledge and Data Engineering (7), 623–640 (1995)
Article Google Scholar
Zhao, Y., Deshpande, P.M., Naughton, J.F.: An array-based algorithm for simultaneous multi-dimensional aggregates. In: Proc. 1997 ACM-SIGMOD Int. Conf. Management of Data, pp. 159–170 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Management, National Chi Nan University, Taiwan, ROC
Ping-Feng Pai
Department of International Business Studies, National Chi Nan University, Taiwan, ROC
Ming-Fu Hsu

Authors

Ping-Feng Pai
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Fu Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Science and Engineering Engineering Optimization & Modeling Center, Reykjavik University, Menntavegur 1, 101, Reykjavik, Iceland
Slawomir Koziel
Mathematics and Scientific Computing, National Physical Laboratory, TW11 0LW, Teddington, UK
Xin-She Yang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Pai, PF., Hsu, MF. (2011). An Enhanced Support Vector Machines Model for Classification and Rule Generation. In: Koziel, S., Yang, XS. (eds) Computational Optimization, Methods and Algorithms. Studies in Computational Intelligence, vol 356. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20859-1_11

Download citation

DOI: https://doi.org/10.1007/978-3-642-20859-1_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20858-4
Online ISBN: 978-3-642-20859-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics