skip to main content
10.1145/2882903.2882919acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article
Public Access

Dynamic Prefetching of Data Tiles for Interactive Visualization

Published:14 June 2016Publication History

ABSTRACT

In this paper, we present ForeCache, a general-purpose tool for exploratory browsing of large datasets. ForeCache utilizes a client-server architecture, where the user interacts with a lightweight client-side interface to browse datasets, and the data to be browsed is retrieved from a DBMS running on a back-end server. We assume a detail-on-demand browsing paradigm, and optimize the back-end support for this paradigm by inserting a separate middleware layer in front of the DBMS. To improve response times, the middleware layer fetches data ahead of the user as she explores a dataset.

We consider two different mechanisms for prefetching: (a) learning what to fetch from the user's recent movements, and (b) using data characteristics (e.g., histograms) to find data similar to what the user has viewed in the past. We incorporate these mechanisms into a single prediction engine that adjusts its prediction strategies over time, based on changes in the user's behavior. We evaluated our prediction engine with a user study, and found that our dynamic prefetching strategy provides: (1) significant improvements in overall latency when compared with non-prefetching systems (430% improvement); and (2) substantial improvements in both prediction accuracy (25% improvement) and latency (88% improvement) relative to existing prefetching techniques.

References

  1. S. Agarwal, B. Mozafari, A. Panda, H. Milner, S. Madden, and I. Stoica. Blinkdb: queries with bounded errors and bounded response times on very large data. In Proc. EuroSys 2013, pages 29--42, New York, NY, USA, 2013. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Battle, M. Stonebraker, and R. Chang. Dynamic reduction of query result sets for interactive visualizaton. In IEEE BigDataVis Workshop, pages 1--8, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  3. E. Brown, A. Ottley, H. Zhao, Q. Lin, R. Souvenir, A. Endert, and R. Chang. Finding Waldo: Learning about Users from their Interactions. IEEE TVCG, 20(12):1663--1672, Dec. 2014.Google ScholarGoogle Scholar
  4. S. K. Card, G. G. Robertson, and J. D. Mackinlay. The Information Visualizer, an Information Workspace. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '91, pages 181--186, New York, NY, USA, 1991. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. U. Cetintemel, M. Cherniack, J. DeBrabant, Y. Diao, K. Dimitriadou, A. Kalinin, O. Papaemmanouil, and S. B. Zdonik. Query steering for interactive data exploration. In CIDR, 2013.Google ScholarGoogle Scholar
  6. S.-M. Chan, L. Xiao, J. Gerth, and P. Hanrahan. Maintaining interactivity while exploring massive time series. In VAST, 2008.Google ScholarGoogle Scholar
  7. S. F. Chen and J. Goodman. An empirical study of smoothing techniques for language modeling. Computer Speech & Language, 13(4):359--394, Oct. 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. Doshi, E. Rundensteiner, and M. Ward. Prefetching for visual data exploration. In Proc. DASFAA, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Fisher. Incremental, approximate database queries and uncertainty for exploratory visualization. In LDAV, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  10. N. Kamat, P. Jayachandran, K. Tunga, and A. Nandi. Distributed interactive cube exploration. ICDE, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  11. R. Kohavi et al. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Ijcai, volume 14, pages 1137--1145, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. H. Lee, J. S. Kim, S. D. Kim, K.-C. Kim, Y.-S. Kim, and J. Park. Adaptation of a Neighbor Selection Markov Chain for Prefetching Tiled Web GIS Data. ADVIS '02, pages 213--222, London, UK, UK, 2002. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Li, R. Guo, Z. Xu, and W. Feng. A Prefetching Model Based on Access Popularity for Geospatial Data in a Cluster-based Caching System. Int. J. Geogr. Inf. Sci., 26(10):1831--1844, Oct. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. L. Lins, J. Klosowski, and C. Scheidegger. Nanocubes for real-time exploration of spatiotemporal datasets. IEEE TVCG, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Z. Liu and J. Heer. The Effects of Interactive Latency on Exploratory Visual Analysis. IEEE TVCG, 20(12):2122--2131, Dec. 2014.Google ScholarGoogle Scholar
  16. Z. Liu, B. Jiang, and J. Heer. immens: Real-time visual querying of big data. Proc. EuroVis, 32, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Nielsen. Powers of 10: Time Scales in User Experience, Oct. 2009.Google ScholarGoogle Scholar
  18. A. Pauls and D. Klein. Faster and smaller n-gram language models. HLT, pages 258--267, Stroudsburg, PA, USA, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Pirolli and S. Card. The sensemaking process and leverage points for analyst technology as identified through cognitive task analysis. In Proc. International Conference on Intelligence Analysis, volume 2005, pages 2--4, 2005.Google ScholarGoogle Scholar
  20. G. Planthaber, M. Stonebraker, and J. Frew. Earthdb: Scalable analysis of modis data using scidb. In BigSpatial, pages 11--19, New York, NY, USA. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. K. Rittger, T. H. Painter, and J. Dozier. Assessment of methods for mapping snow cover from modis. Advances in Water Resources, 51(0):367 -- 380, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  22. E. Soroush, M. Balazinska, S. Krughoff, and A. Connolly. Efficient Iterative Processing in the SciDB Parallel Array Engine. In Proceedings of the 27th International Conference on Scientific and Statistical Database Management, SSDBM '15, pages 39:1--39:6, New York, NY, USA, 2015. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Stonebraker, P. Brown, A. Poliakov, and S. Raman. The architecture of scidb. In SSDBM, pages 1--16. Springer, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. R. Taft, M. Vartak, N. R. Satish, N. Sundaram, S. Madden, and M. Stonebraker. GenBase: A Complex Analytics Genomics Benchmark. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD '14, pages 177--188, New York, NY, USA, 2014. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Yesilmurat and V. Isler. Retrospective Adaptive Prefetching for Interactive Web GIS Applications. Geoinformatica, 16(3):435--466, July 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Dynamic Prefetching of Data Tiles for Interactive Visualization

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGMOD '16: Proceedings of the 2016 International Conference on Management of Data
        June 2016
        2300 pages
        ISBN:9781450335317
        DOI:10.1145/2882903

        Copyright © 2016 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 14 June 2016

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate785of4,003submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader