Skip to main content

Advertisement

Log in

A Genetic Selection Algorithm for OLAP Data Cubes

  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Multidimensional data analysis, as supported by OLAP (online analytical processing) systems, requires the computation of many aggregate functions over a large volume of historically collected data. To decrease the query time and to provide various viewpoints for the analysts, these data are usually organized as a multidimensional data model, called data cubes. Each cell in a data cube corresponds to a unique set of values for the different dimensions and contains the metric of interest. The data cube selection problem is, given the set of user queries and a storage space constraint, to select a set of materialized cubes from the data cubes to minimize the query cost and/or the maintenance cost. This problem is known to be an NP-hard problem. In this study, we examined the application of genetic algorithms to the cube selection problem. We proposed a greedy-repaired genetic algorithm, called the genetic greedy method. According to our experiments, the solution obtained by our genetic greedy method is superior to that found using the traditional greedy method. That is, within the same storage constraint, the solution can greatly reduce the amount of query cost as well as the cube maintenance cost.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. 1996AK96 Aytug H, Koehler GJ (1996) Stopping criteria for finite length genetic algorithms. INFORMS Journal on Computing 8(2):183–191

  2. 1997Bar97 Baralis E, Paraboschi S, Teniente E (1997) Materialized view selection in a multidimensional database. In Jarke M, Carey MJ, Dittrich KR, et al. (eds). Proceedings of the 23rd international conference on very large data bases, Athens, Greece, August 1997, pp 156–165

  3. 1997CD97 Chaudhuri S, Dayal U (1997) An overview of data warehouse and OLAP technology. ACM SIGMOD Record 26(1):65–74

  4. 1997Eze97 Ezeife CI (1997) A uniform approach for selecting views and indexes in a data warehouse. In Proceedings of the 2nd international database engineering and applications symposium, Montreal, Canada, August 1997, pp 151–160

  5. 1989Gol89 Goldberg DE (1989) Genetic algorithms in search, optimization, and machine learning. Addison-Wesley, Reading, MA

  6. 1991GD91 Goldberg DE, Deb K (1991) A comparative analysis of selection schemes used in genetic algorithms. In Whitley LD (ed). Foundations of genetic algorithms. Morgan Kaufmann, San Mateo, CA, pp 69–93

  7. 1997Gra97 Gray J, Chaudhuri S, Bosworth A, et al. (1997) Data cube: a relational aggregation operator generalizing group-by, cross-tabs and subtotals. Data Mining and Knowledge Discovery 1(1):29–53

  8. 2000GM00 Greenhalgh D, Marshall S (2000) Convergence criteria for genetic algorithms. SIAM Journal on Computing 30(1):269–282

  9. 1997Gup97a Gupta H (1997) Selection of views to materialize in a data warehouse. In Afrati FN, Kolaitis P (eds). Proceedings of the 5th international conference on database theory, Delphi, Greece, January 1997, pp 98–112

  10. 1997Gup97b Gupta H, Harinarayan V, Rajaraman A, Ullman JD (1997) Index selection for OLAP. In Gray A, Larson P (eds). Proceedings of the 13th international conference on data engineering, Birmingham, UK, April 1997. Lecture Notes in Computer Science 1186, Springer, Berlin, pp 208–219

  11. 1999GM99 Gupta H, Mumick IS (1999) Selection of views to materialize under a maintenance cost constraint. In Beeri C, Buneman P (eds). Proceedings of the 7th international conference on database theory, Jerusalam, Israel, January 1999. Lecture Notes in Computer Science 1540, Springer, Berlin, pp 453–470

  12. 1996Har96 Harinarayan V, Rajaraman A, Ullman JD (1996) Implementing data cubes efficiently. In Jagadish HV, Mumick IS (eds). Proceedings of ACM SIGMOD international conference on management of data, Montreal, Canada, June 1996, pp 205–216

  13. 1992Hol92 Holland JH (1992) Adaptation in natural and artificial systems (2nd edn). MIT Press, Cambridge,~MA

  14. 1999Hor99 Horng JT, Chang YJ, Liu BJ, Kao CY (1999) Materialized view selection using genetic algorithms in a data warehouse. In Proceedings of world congress on evolutionary computation, Washington, DC, July 1999, pp 2221–2227

  15. 1993IK93 Inmon WH, Kelley C (1993) Rdb/VMS: developing the data warehouse. QED, Boston, MA

  16. 2001JM01 Jamil HM, Modica GA (2001) A view selection tool for multidimensional databases. In Monostori L, Váncza J, Ali M (eds). Proceedings of the 14th international conference on industrial and engineering applications of artificial intelligence and expert systems, Budapest, Hungary, June 2001. Lecture Notes in Computer Science 2070, Springer, Berlin, pp 237–246

  17. 2001Lia01 Liang W, Wang H, Orlowska ME (2001) Materalized view selection under the maintenance time constraint. Data and Knowledge Engineering 37(2):203–216

  18. 1994Mic94 Michalewicz Z (1994) Genetic algorithms + data structures = evolution programs. Springer, Berlin

  19. 1996Mit96 Mitchell M (1996) An introduction to genetic algorithms. MIT Press, Cambridge, MA

  20. hlenbein1992Muh92 M

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wen-Yang Lin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, WY., Kuo, IC. A Genetic Selection Algorithm for OLAP Data Cubes. Knowledge and Information Systems 6, 83–102 (2004). https://doi.org/10.1007/s10115-003-0093-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-003-0093-x

Keywords

Navigation