Skip to main content
Log in

A Search for Hidden Relationships: Data Mining with Genetic Algorithms

  • Published:
Computational Economics Aims and scope Submit manuscript

Abstract

This paper presents an algorithm that permits the search for dependencies among sets of data (univariate or multivariate time-series, or cross-sectional observations). The procedure is modeled after genetic theories and Darwinian concepts, such as natural selection and survival of the fittest. It permits the discovery of equations of the data-generating process in symbolic form. The genetic algorithm that is described here uses parts of equations as building blocks to breed ever better formulas. Apart from furnishing a deeper understanding of the dynamics of a process, the method also permits global predictions and forecasts. The algorithm is successfully tested with artificial and with economic time-series and also with cross-sectional data on the performance and salaries of NBA players during the 94–95 season.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Allen, F. and Karjalainen, R. (1993). Using genetic algorithms to find technical trading rules, working paper, Rodney L. White Center for Financial Research, The Wharton School of the University of Pennsylvania.

  • Brock, W.A., Lakonishok, J. and LeBaron, B. (1992). Simple technical trading rules and the stochastic properties of stock returns, J. of Finance, 47, 1731-1764.

    Google Scholar 

  • Farmer, J.D. and Sidorovich, J.J. (1987). Predicting Chaotic Time Series, Phys. Rev. Lett., 59, 845-848.

    Google Scholar 

  • Goldberg, D.E. (1989). Genetic algorithms in search, optimization and machine learning, Addison-Wesley, Reading, MA.

    Google Scholar 

  • Hénon, M. (1976). ‘A two-dimensional mapping with a strange attractor’, Comm. Math. Phys. 50, 69-77.

    Google Scholar 

  • Holland, J.H. (1975). Adaptation in natural and artificial systems, University of Michigan Press, Ann Arbor. 2nd ed 1992, MIT Press.

    Google Scholar 

  • Hsieh, D.A. (1991). ‘Chaos and nonlinear dynamics: Applications to financial markets’, J. of Finance, 46, 1839-1877.

    Google Scholar 

  • Koza, J.R. (1992). Genetic programming, MIT Press, Cambridge.

    Google Scholar 

  • LeBaron, B. (1992). ‘Nonlinear forecasts for the S&P stock index’, in Casdagli and Eubanks (eds), Nonlinear modeling and forecasting, Santa Fe Institute, Addison-Wesley, Reading, Mass.

    Google Scholar 

  • LeBaron, B. (1993). ‘Nonlinear diagnostics and simple trading rules for high-frequency foreign exchange rates’, in Weigend and Gerschenfeld (eds.), Time series prediction: Forecasting the future and understanding the past, Santa Fe Institute, Addison-Wesley, Reading, Mass.

    Google Scholar 

  • Palmer, R.G., Arthur, W.B., Holland, J.H., LeBaron, B. and Taylor, P. (1994). ‘Artificial economic life: a simple model of a stockmarket’, Physica D, 75, 264.

    Google Scholar 

  • Scheinkman, J.A. and LeBaron, B. (1989). ‘Nonlinear dynamics and stock returns’, J. of Business, 62, 311-338.

    Google Scholar 

  • Szpiro, G.G. (1997). ‘Forecasting chaotic time series with genetic algorithms’, Phys. Rev. E, 55, 2557-2568.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to GEORGE G. SZPIRO.

Rights and permissions

Reprints and permissions

About this article

Cite this article

SZPIRO, G.G. A Search for Hidden Relationships: Data Mining with Genetic Algorithms. Computational Economics 10, 267–277 (1997). https://doi.org/10.1023/A:1008673309609

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008673309609

Navigation