Skip to main content
Log in

A novel coral reefs optimization algorithm for materialized view selection in data warehouse environments

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

High response time of analytical queries is one of the most challenging issues of data warehouses. Complicated nature of analytical queries and enormous volume of data are the most important reasons of this high response time. The aim of materialized view selection is to reduce the response time of these analytical queries. For this purpose, the search space is firstly constructed by producing the set of all possible views based on given queries and then, the (semi-) optimal set of materialized views will be selected so that the queries can be answered at the lowest cost using them. Various materialized view selection methods have been proposed in the literature, most of which are randomized methods due to the time-consuming nature of this problem. Randomized view selection methods choose a semi-optimal set of proper views for materialization in an appropriate time using one or a combination of some meta-heuristic(s). In this paper, a novel coral reefs optimization-based method is introduced for materialized view selection in a data warehouse. Coral reefs optimization algorithm is an optimization method that solves problems by simulating the coral behaviors for placement and growth in reefs. In the proposed method, each solution of the problem is considered as a coral, which is always trying to be placed and grow in the reefs. In each step, special operators of the coral reefs optimization algorithm are applied on the solutions. After several steps, better solutions are more likely to survive and grow on the reefs. The best solution is finally chosen as the final solution of the problem. The practical evaluations of the proposed method show that this method offers higher quality solutions than other similar random methods in terms of coverage rate of queries.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27

Similar content being viewed by others

References

  1. Chandra P, Gupta MK (2018) Comprehensive survey on data warehousing research. Int J Inf Technol 10(2):217–224

    Google Scholar 

  2. Inmon WH (2005) Building the data warehouse. Wiley, New York

    Google Scholar 

  3. Sohrabi MK, Ghods V (2016) Materialized view selection for a data warehouse using frequent Itemset mining. JCP 11(2):140–148

    Google Scholar 

  4. Dhote CA, Ali MS (2009) Materialized view selection in data warehousing: a survey. J Appl Sci 9(3):401–414

    Google Scholar 

  5. Harinarayan V, Rajaraman A, Ullman JD (1996) Implementing data cubes efficiently. ACM SIGMOD Rec 25(2):205–216

    Google Scholar 

  6. Roy P, Seshadri S, Sudarshan S, Bhobe S (2000) Efficient and extensible algorithms for multi query optimization. In: ACM SIGMOD Record (Vol. 29, No. 2, p 249–260). ACM

    Google Scholar 

  7. Yang J, Karlapalem K, Li Q (1997) Algorithms for materialized view design in data warehousing environment. In: VLDB (Vol. 97, p 25–29)

  8. Sohrabi MK, Azgomi H (2018) A survey on the combined use of optimization methods and game theory. Arch Comput Meth Eng. https://doi.org/10.1007/s11831-018-9300-5

    MathSciNet  Google Scholar 

  9. Salcedo-Sanz S, Del Ser J, Landa-Torres I, Gil-López S, Portilla-Figueras JA (2014) The coral reefs optimization algorithm: a novel metaheuristic for efficiently solving optimization problems. Sci World J. https://doi.org/10.1155/2014/739768

    Google Scholar 

  10. Ficco M, Esposito C, Palmieri F, Castiglione A (2018) A coral-reefs and game theory-based approach for optimizing elastic cloud resource allocation. Futur Gener Comput Syst 78(1):343–352

    Google Scholar 

  11. Salcedo-Sanz S, Pastor-Sánchez A, Prieto L, Blanco-Aguilera A, García-Herrera R (2014) Feature selection in wind speed prediction systems based on a hybrid coral reefs optimization–extreme learning machine approach. Energy Convers Manag 87:10–18

    Google Scholar 

  12. Yan C, Ma J, Luo H, Patel A (2019) Hybrid binary coral reefs optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical datasets. Chemom Intell Lab Syst 184:102–111

    Google Scholar 

  13. Durán-Rosal AM, Gutiérrez PA, Salcedo-Sanz S, Hervás-Martínez C (2018) A statistically-driven coral reef optimization algorithm for optimal size reduction of time series. Appl Soft Comput 63:139–153

    Google Scholar 

  14. Tsai C-W, Chang W-Y, Wang Y-C, Chen H (2019) A high-performance parallel coral reef optimization for data clustering. Soft Comput. https://doi.org/10.1007/s00500-019-03950-3

    Google Scholar 

  15. Bermejo E, Chica M, Damas S, Salcedo-Sanz S, Cordón O (2018) Coral reef optimization with substrate layers for medical image registration. Swarm Evol Comput 42:138–159

    Google Scholar 

  16. Gupta H (1997) Selection of views to materialize in a data warehouse. In: Database theory—ICDT'97. Springer, Berlin Heidelberg, pp 98–112

    Google Scholar 

  17. Mistry H, Roy P, Sudarshan S, Ramamritham K (2001) Materialized view selection and maintenance using multi-query optimization. In: ACM SIGMOD Record (Vol. 30, No. 2, p 307–318). ACM

  18. Sohrabi MK, Azgomi H (2017) TSGV: a table-like structure-based greedy method for materialized view selection in data warehouses. Turk J Electr Eng Comput Sci 25(4):3175–3187

    Google Scholar 

  19. Theodoratos D, Ligoudistianos S, Sellis T (2001) View selection for designing the global data warehouse. Data Knowl Eng 39(3):219–240

    MATH  Google Scholar 

  20. Wu X, Theodoratos D, Wang WH, Sellis T (2013) Optimizing XML queries: bitmapped materialized views vs. indexes. Inf Syst 38(6):863–884

    Google Scholar 

  21. Katsifodimos A, Manolescu I, Vassalos V (2012) Materialized view selection for XQuery workloads. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. ACM, p 565–576

  22. Afrati F, Damigos M, Gergatsoulis M (2014) On solving efficiently the view selection problem under bag and bag-set semantics. Inf Syst 42:153–176

    MATH  Google Scholar 

  23. Letrache K, El Beggar O, Ramdani M (2018) OLAP cube partitioning based on association rules method. Appl Intell 49(2):420–434

    Google Scholar 

  24. Bouakkaz M, Ouinten Y, Loudcher S, Fournier-Viger P (2018) Efficiently mining frequent itemsets applied for textual aggregation. Appl Intell 48(4):1013–1019

    Google Scholar 

  25. Sohrabi MK, Azgomi H (2019) Finding similar documents using frequent pattern mining methods. Int J Uncertainty Fuzziness Knowledge Based Syst 27(1):73–96

    Google Scholar 

  26. Aouiche K, Jouve PE, Darmont J (2006) Clustering-based materialized view selection in data warehouses. In: East European conference on advances in databases and information systems. Springer, Berlin Heidelberg, pp 81–95

    Google Scholar 

  27. Aouiche K, Darmont J (2009) Data mining-based materialized view and index selection in data warehouses. J Intell Inf Syst 33(1):65–93

    Google Scholar 

  28. Sohrabi MK, Azgomi H (2019) A survey on the combined use of optimization methods and game theory. Arch Comput Meth Eng. https://doi.org/10.1007/s11831-018-9300-5

    MathSciNet  Google Scholar 

  29. Azgomi H, Sohrabi MK (2018) A game theory based framework for materialized view selection in data warehouses. Eng Appl Artif Intell 71:125–137

    Google Scholar 

  30. Zhang C, Yang J (1999) Genetic algorithm for materialized view selection in data warehouse environments. In: Data warehousing and knowledge discovery. Springer, Berlin Heidelberg, pp 116–125

    Google Scholar 

  31. Lee M, Hammer J (2001) Speeding up materialized view selection in data warehouses using a randomized algorithm. Int J Coop Inf Syst 10(03):327–353

    Google Scholar 

  32. Yu JX, Yao X, Choi CH, Gou G (2003) Materialized view selection as constrained evolutionary optimization. IEEE Trans Syst Man Cybern Part C Appl Rev 33(4):458–467

    Google Scholar 

  33. Vijay Kumar TV, Kumar S (2012) Materialized view selection using genetic algorithm. Contemporary Computing, IC3: International Conference on Contemporary Computing, p 225–237

  34. Kalnis P, Mamoulis N, Papadias D (2002) View selection using randomized search. Data Knowl Eng 42(1):89–111

    MATH  Google Scholar 

  35. Derakhshan R, Dehne FK, Korn O, Stantic B (2006) Simulated annealing for materialized view selection in data warehousing environment. In: Databases and applications, pp 89–94

    Google Scholar 

  36. Kumar, T. V., & Kumar, S. (2012). Materialized view selection using simulated annealing. In: BDA. p 168–179

  37. Song X, Gao L (2010) An ant colony based algorithm for optimal selection of materialized view. In: Intelligent Computing and Integrated Systems (ICISS), 2010 International Conference on. IEEE, p 534–536

  38. Sun X, Wang Z (2009). An efficient materialized views selection algorithm based on PSO. In: Intelligent Systems and Applications, 2009. ISA 2009. International Workshop on. IEEE, p 1–4

  39. Li X, Qian X, Jiang J, Wang Z (2010). Shuffled frog leaping algorithm for materialized views selection. In: Education Technology and Computer Science (ETCS), 2010 Second International Workshop on (Vol. 3). IEEE, p 7–10

  40. Kumar TV, Arun B (2015) Materialized view selection using improvement based bee colony optimization. International Journal of Software Science and Computational Intelligence 7(4):35–61

    Google Scholar 

  41. Arun B, Kumar TV (2017) Materialized view selection using artificial bee colony optimization. Int J Intell Inf Technol 13(1):26–49

    Google Scholar 

  42. Vijay Kumar TV, Kumar S (2014) Materialized view selection using differential evolution. Int J Innov Comput Appl 6(2):102–113

    Google Scholar 

  43. Zhou L, Geng H, Xu M (2011) An improved algorithm for materialized view selection. J Comput 6(1):130–138

    Google Scholar 

  44. Phuboon-ob J, Auepanwiriyakul R (2007) Selecting materialized views using two-phase optimization with multiple view processing plan. World Academy of Science, Engineering and Technology, 27

  45. Suchyukorn B, Auepanwiriyakul R (2013) Dynamic materialized view selection using 2PO based on re-optimized multiple view processing plan. International Journal of Advancements in Computing Technology 5(14):150

    Google Scholar 

  46. Gosain A, Sachdeva K (2018) Materialized view selection using backtracking search optimization algorithm. In: Intelligent engineering informatics. Springer, Singapore, pp 241–251

    Google Scholar 

  47. Zhang C, Yao X, Yang J (2001) An evolutionary approach to materialized views selection in a data warehouse environment. IEEE Trans Syst Man Cybern Part C Appl Rev 31(3):282–294

    Google Scholar 

  48. Molina D, LaTorre A, Herrera F (2018) An insight into bio-inspired and evolutionary algorithms for global optimization: review, analysis, and lessons learnt over a decade of competitions. Cogn Comput 10(4):517–544

    Google Scholar 

  49. Srivastava S, Sahana SK (2017) Nested hybrid evolutionary model for traffic signal optimization. Appl Intell 46(1):113–123

    Google Scholar 

  50. Cheng F, Fu G, Zhang X, Qiu J (2019) Multi-objective evolutionary algorithm for optimizing the partial area under the ROC curve. Knowl-Based Syst 170:61–69

    Google Scholar 

  51. Ramírez A, Romero JR, Ventura S (2018) Interactive multi-objective evolutionary optimization of software architectures. Inf Sci 463–464:92–109

    MathSciNet  Google Scholar 

  52. Sohrabi MK, Azgomi H (2019) Evolutionary game theory approach to materialized view selection in data warehouses. Knowl-Based Syst 163:558–571

    Google Scholar 

  53. Kumar S, Kumar TVV (2018) A novel quantum-inspired evolutionary view selection algorithm. Sādhanā 43:166

    Google Scholar 

  54. Goswami R, Bhattacharyya DK, Dutta M (2017) Materialized view selection using evolutionary algorithm for speeding up big data query processing. J Intell Inf Syst 49(3):407–433

    Google Scholar 

  55. Sohrabi MK, Barforoush AA (2013) Parallel frequent itemset mining using systolic arrays. Knowl-Based Syst 37:462–471

    Google Scholar 

  56. Sohrabi MK (2018) A gossip based information fusion protocol for distributed frequent itemset mining. Enterp Inform Syst 12(6):674–694

    Google Scholar 

  57. Sohrabi MK, Taheri N (2018) A haoop-based parallel mining of frequent itemsets using N-lists. J Chin Inst Eng 41(3):229–238

    Google Scholar 

  58. Sohrabi MK, Azgomi H (2017) Parallel set similarity join on big data based on locality-sensitive hashing. Sci Comput Program 145:1–12

    Google Scholar 

  59. Golov N, Rönnbäck L (2017) Big data normalization for massively parallel processing databases. Comput Stand Inter 54(2):86–93

    Google Scholar 

  60. Wang H, Qin X, Zhou X, Li F, Qin Z, Zhu Q, Wang S (2015) Efficient query processing framework for big data warehouse: an almost join-free approach. Front Comp Sci 9(2):224–236

    MathSciNet  Google Scholar 

  61. Sebaa A, Chikh F, Nouicer A, Tari A (2018) Medical big data warehouse: architecture and system design, a case study: improving healthcare resources distribution. J Med Syst 42:59. https://doi.org/10.1007/s10916-018-0894-9

    Article  Google Scholar 

  62. Zhang C, Bi J, Xu S, Ramentol E, Fan G, Qiao B, Fujita H (2019) Multi-imbalance: an open-source software for multi-class imbalance learning. Knowl-Based Syst. https://doi.org/10.1016/j.knosys.2019.03.001

    Google Scholar 

  63. Fujita H, Cimr D (2019) Computer aided detection for fibrillations and flutters using deep convolutional neural network. Inf Sci 486:231–239

    Google Scholar 

  64. Zhang C, Liu C, Zhang X, Almpanidis G (2017) An up-to-date comparison of state-of-the-art classification algorithms. Expert Syst Appl 82:128–150

    Google Scholar 

  65. Hemmatian F, Sohrabi MK (2017) A survey on classification techniques for opinion mining and sentiment analysis. Artif Intell Rev. https://doi.org/10.1007/s10462-017-9599-6

    Google Scholar 

  66. Salcedo-Sanz S, Pastor-Sánchez A, Gallo-Marazuela D, Portilla-Figueras A (2013) A novel coral reefs optimization algorithm for multi-objective problems. In: International conference on intelligent data engineering and automated learning. Springer, Berlin, Heidelberg, pp 326–333

    Google Scholar 

  67. Sohrabi MK, Azgomi H RTLTDS dataset. Available at https://rtltds.github.io

  68. TPC-H data warehouse. Avalibale at http://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v2.17.1.pdf

  69. Cuevas A, Febrero M, Fraiman R (2004) An anova test for functional data. Comput Stat Data Anal 47(1):111–122

    MathSciNet  MATH  Google Scholar 

  70. Kim TK (2015) T test as a parametric statistic. Korean J Anesthesiol 68(6):540–546

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Karim Sohrabi.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Azgomi, H., Sohrabi, M.K. A novel coral reefs optimization algorithm for materialized view selection in data warehouse environments. Appl Intell 49, 3965–3989 (2019). https://doi.org/10.1007/s10489-019-01481-w

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-019-01481-w

Keywords

Navigation