Abstract
This paper presents an algorithm that permits the search for dependencies among sets of data (univariate or multivariate time-series, or cross-sectional observations). The procedure is modeled after genetic theories and Darwinian concepts, such as natural selection and survival of the fittest. It permits the discovery of equations of the data-generating process in symbolic form. The genetic algorithm that is described here uses parts of equations as building blocks to breed ever better formulas. Apart from furnishing a deeper understanding of the dynamics of a process, the method also permits global predictions and forecasts. The algorithm is successfully tested with artificial and with economic time-series and also with cross-sectional data on the performance and salaries of NBA players during the 94–95 season.
Similar content being viewed by others
References
Allen, F. and Karjalainen, R. (1993). Using genetic algorithms to find technical trading rules, working paper, Rodney L. White Center for Financial Research, The Wharton School of the University of Pennsylvania.
Brock, W.A., Lakonishok, J. and LeBaron, B. (1992). Simple technical trading rules and the stochastic properties of stock returns, J. of Finance, 47, 1731-1764.
Farmer, J.D. and Sidorovich, J.J. (1987). Predicting Chaotic Time Series, Phys. Rev. Lett., 59, 845-848.
Goldberg, D.E. (1989). Genetic algorithms in search, optimization and machine learning, Addison-Wesley, Reading, MA.
Hénon, M. (1976). ‘A two-dimensional mapping with a strange attractor’, Comm. Math. Phys. 50, 69-77.
Holland, J.H. (1975). Adaptation in natural and artificial systems, University of Michigan Press, Ann Arbor. 2nd ed 1992, MIT Press.
Hsieh, D.A. (1991). ‘Chaos and nonlinear dynamics: Applications to financial markets’, J. of Finance, 46, 1839-1877.
Koza, J.R. (1992). Genetic programming, MIT Press, Cambridge.
LeBaron, B. (1992). ‘Nonlinear forecasts for the S&P stock index’, in Casdagli and Eubanks (eds), Nonlinear modeling and forecasting, Santa Fe Institute, Addison-Wesley, Reading, Mass.
LeBaron, B. (1993). ‘Nonlinear diagnostics and simple trading rules for high-frequency foreign exchange rates’, in Weigend and Gerschenfeld (eds.), Time series prediction: Forecasting the future and understanding the past, Santa Fe Institute, Addison-Wesley, Reading, Mass.
Palmer, R.G., Arthur, W.B., Holland, J.H., LeBaron, B. and Taylor, P. (1994). ‘Artificial economic life: a simple model of a stockmarket’, Physica D, 75, 264.
Scheinkman, J.A. and LeBaron, B. (1989). ‘Nonlinear dynamics and stock returns’, J. of Business, 62, 311-338.
Szpiro, G.G. (1997). ‘Forecasting chaotic time series with genetic algorithms’, Phys. Rev. E, 55, 2557-2568.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
SZPIRO, G.G. A Search for Hidden Relationships: Data Mining with Genetic Algorithms. Computational Economics 10, 267–277 (1997). https://doi.org/10.1023/A:1008673309609
Issue Date:
DOI: https://doi.org/10.1023/A:1008673309609