skip to main content
10.1145/308386.308455acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
Article
Free Access

Statistical estimators for relational algebra expressions

Authors Info & Claims
Published:01 March 1988Publication History

ABSTRACT

Present database systems process all the data related to a query before giving out responses. As a result, the size of the data to be processed becomes excessive for real-time/time-constrained environments. A new methodology is needed to cut down systematically the time to process the data involved in processing the query. To this end, we propose to use data samples and construct an approximate synthetic response to a given query.

In this paper, we consider only COUNT(E) type queries, where E is an arbitrary relational algebra expression. We make no assumptions about the distribution of attribute values and ordering of tuples in the input relations, and propose consistent and unbiased estimators for arbitrary COUNT(E) type queries. We design a sampling plan based on the cluster sampling method to improve the utilization of sampled data and to reduce the cost of sampling. We also evaluate the performance of the proposed estimators.

References

  1. Coch 77.Cochmn. w o, "~g T~que,". T~rd ~d John Wdey & Sons, 1977Google ScholarGoogle Scholar
  2. Chri 83.Chnstodoulakas, S. "Estunaung Record SelecuvtJes", Informaucm Systems, Vol 8, 1983Google ScholarGoogle Scholar
  3. Devo 84.Devote, J L, "Probabthty & Staumcs for Eng, neermg and Sclences", Brook/Cole, 1984Google ScholarGoogle Scholar
  4. DFHO 86.Datta, A, Fourmer, B. Hc~. W-C, and Ozsoyoglu, G, "The Implementat~n of SSDB". Proc Thml Internauonal Workshop oa Stausucal Databam Management", July 1986.Google ScholarGoogle Scholar
  5. Good 49.Goodman, LA., "On the Esamauon of the N~ of Classes m a Populaum", Ann Math Sta., 1949Google ScholarGoogle Scholar
  6. HoOT 87.Hou, W-C, Ozsoyoglu, G, and Taneja, B k., "Sta- U~cal Emmators for RelalLtonal Algebra Expres. mons", Tech Rpt. CES-87-15, CWRU, 1987Google ScholarGoogle Scholar
  7. Liu 68.Lm, C L, "Intmdu~m to Ccxnbmatonal Mathematics", McGraw-Hall, 196&Google ScholarGoogle Scholar
  8. Morg 81.Morgenstem, J P, "Compmer Based Management lnfommtton Systems Embodymg Answer Accuracy As a User Parameter", Ph D Thems, Umv of Cahfornm, Berkeley, 1981 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Olke 86.Olken, F, "Phymcal Database Support for Sc~enufic and Statlmcal Databases", Third Int. Scsenttfic and Statlmcal Databases Workshop, 1986 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. OlkR 86.Olken, F and Rotem, D, "Ssmple Random Samphng from Relational Databases", VLDB Conf 1986 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Ross 80.Ross, S M, "lntroductloa to Probabshty Models", 2nd Ed., Acndenac Press, 1980Google ScholarGoogle Scholar
  12. Rowe 85.Rowe, N C, "Antlsamphn8 for Emmattom An overvsew", IEE Trans on Software Eng., Oct 1985 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. SMO 86.Scheaffer, MendenhaH, and Ott, "Elementary Survey Samphn8", 3rd Ed, Duxbury press, 1986Google ScholarGoogle Scholar
  14. Sukh 84.Sukhatme, P V, etc , "Samphng ~ry of Surveys Apphcauon", 3rd Ed, New Delhi, Indm and Iowa State Umv Press, 1984Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    PODS '88: Proceedings of the seventh ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
    March 1988
    352 pages
    ISBN:0897912632
    DOI:10.1145/308386

    Copyright © 1988 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 1 March 1988

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate642of2,707submissions,24%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader