Abstract
We present a technique for approximate aggregate query answering in OLAP environments, the most common application interfaces for a Data Warehouse Server (DWS). The technique combines an analytical interpretation of multi-dimensional data with the well-known Least Squares Approximation (LSA) method. It builds data synopses by interpreting the original data distribution as a set of discrete functions. These synopses, called Δ-Syn, are obtained by approximating the data with a set of polynomial coefficients and storing these coefficients instead of the original data. Queries are evaluated on the compressed representation, which reduces the number of disk accesses needed to compute the answer. We also report experimental results on several kinds of synthetic OLAP data cubes.
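The idea of storing polynomial coefficients in place of the data can be illustrated with a minimal one-dimensional sketch. It is not the Δ-Syn construction itself, whose details the abstract does not give: the function names (`fit_polynomial`, `approx_range_sum`), the single-dimension layout, and the sample values are all assumptions made for illustration. The sketch fits a low-degree polynomial to a row of cells by least squares and then estimates a range-SUM from the coefficients alone.

```python
from typing import List, Sequence


def solve_linear_system(a: Sequence[Sequence[float]], b: Sequence[float]) -> List[float]:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_polynomial(values: Sequence[float], degree: int) -> List[float]:
    """Least squares coefficients c with values[i] ~ sum(c[k] * i**k).

    Hypothetical helper: solves the normal equations (V^T V) c = V^T y,
    where V[i][k] = i**k and y is the row of cell values.
    """
    n = len(values)
    vtv = [[sum(float(i) ** (j + k) for i in range(n)) for k in range(degree + 1)]
           for j in range(degree + 1)]
    vty = [sum(values[i] * float(i) ** j for i in range(n)) for j in range(degree + 1)]
    return solve_linear_system(vtv, vty)


def approx_range_sum(coeffs: Sequence[float], lo: int, hi: int) -> float:
    """Estimate SUM over cells lo..hi (inclusive) using only the coefficients."""
    return sum(
        sum(c * float(i) ** k for k, c in enumerate(coeffs))
        for i in range(lo, hi + 1)
    )


if __name__ == "__main__":
    # Made-up row of 16 cells: a roughly linear trend with small deviations.
    cells = [3, 5, 8, 9, 11, 12, 15, 17, 20, 21, 22, 25, 28, 29, 30, 33]
    coeffs = fit_polynomial(cells, degree=2)  # 3 numbers stored instead of 16
    print("coefficients:", coeffs)
    print("approximate SUM(4..9):", approx_range_sum(coeffs, 4, 9))
    print("exact SUM(4..9):      ", sum(cells[4:10]))
```

In this sketch the synopsis for the row is the three coefficients, so a range query touches a constant amount of stored data regardless of the range width, at the cost of an approximation error that depends on how well a low-degree polynomial tracks the data.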
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cuzzocrea, A., Matrangolo, U. (2004). Analytical Synopses for Approximate Query Answering in OLAP Environments. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2004. Lecture Notes in Computer Science, vol 3180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30075-5_35
DOI: https://doi.org/10.1007/978-3-540-30075-5_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22936-0
Online ISBN: 978-3-540-30075-5
eBook Packages: Springer Book Archive