skip to main content
research-article

Clay: fine-grained adaptive partitioning for general database schemas

Published:01 November 2016Publication History
Skip Abstract Section

Abstract

Transaction processing database management systems (DBMSs) are critical for today's data-intensive applications because they enable an organization to quickly ingest and query new information. Many of these applications exceed the capabilities of a single server, and thus their database has to be deployed in a distributed DBMS. The key factor affecting such a system's performance is how the database is partitioned. If the database is partitioned incorrectly, the number of distributed transactions can be high. These transactions have to synchronize their operations over the network, which is considerably slower and leads to poor performance. Previous work on elastic database repartitioning has focused on a certain class of applications whose database schema can be represented in a hierarchical tree structure. But many applications cannot be partitioned in this manner, and thus are subject to distributed transactions that impede their performance and scalability.

In this paper, we present a new on-line partitioning approach, called Clay, that supports both tree-based schemas and more complex "general" schemas with arbitrary foreign key relationships. Clay dynamically creates blocks of tuples to migrate among servers during repartitioning, placing no constraints on the schema but taking care to balance load and reduce the amount of data migrated. Clay achieves this goal by including in each block a set of hot tuples and other tuples co-accessed with these hot tuples. To evaluate our approach, we integrate Clay in a distributed, main-memory DBMS and show that it can generate partitioning schemes that enable the system to achieve up to 15× better throughput and 99% lower latency than existing approaches.

References

  1. A. Adya, D. Myers, J. Howell, J. Elson, C. Meek, V. Khemani, S. Fulger, P. Gu, L. Bhuvanagiri, J. Hunter, et al. Slicer: Auto-sharding for datacenter applications. In USENIX Symposium on Operating Systems Design and Implementation, pages 739--753, 2016.Google ScholarGoogle Scholar
  2. J. Baker, C. Bond, J. C. Corbett, J. Furman, A. Khorlin, J. Larson, J.-M. Leon, Y. Li, A. Lloyd, and V. Yushprakh. Megastore: Providing scalable, highly available storage for interactive services. In Conference on Innovative Data Systems Research, pages 223--234, 2011.Google ScholarGoogle Scholar
  3. P. Bernstein, I. Cseri, N. Dani, N. Ellis, A. Kalhan, G. Kakivaya, D. B. Lomet, R. Manne, L. Novik, and T. Talius. Adapting Microsoft SQL server for cloud computing. In IEEE International Conference on Data Engineering, pages 1255--1263, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Cha, H. Haddadi, F. Benevenuto, and P. K. Gummadi. Measuring user influence in twitter: The million follower fallacy. In AAAI Conference on Web and Social Media, pages 10--17, 2010.Google ScholarGoogle Scholar
  5. C. Curino, E. Jones, Y. Zhang, and S. Madden. Schism: A workload-driven approach to database replication and partitioning. Proceedings of the VLDB Endowment, 3(1--2):48--57, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Das, D. Agrawal, and A. El Abbadi. Elastras: An elastic, scalable, and self-managing transactional database for the cloud. ACM Transactions on Database Systems, 38(1):5:1--5:45, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. E. Difallah, A. Pavlo, C. Curino, and P. Cudre-Mauroux. Oltp-bench: An extensible testbed for benchmarking relational databases. Proceedings of the VLDB Endowment, 7(4):277--288, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. J. Elmore, V. Arora, R. Taft, A. Pavlo, D. Agrawal, and A. El Abbadi. Squall: Fine-grained live reconfiguration for partitioned main memory databases. In ACM SIGMOD International Conference on Management of Data, pages 299--313, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Ghosh, W. Wang, G. Holla, and I. Gupta. Morphus: Supporting online reconfigurations in sharded nosql systems. In IEEE International Conference on Autonomic Computing, pages 1--10, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Gionis, F. Junqueira, V. Leroy, M. Serafini, and I. Weber. Piggybacking on social networks. Proceedings of the VLDB Endowment, 6(6):409--420, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y.-J. Hong and M. Thottethodi. Understanding and mitigating the impact of load imbalance in the memory caching tier. In ACM Symposium on Cloud Computing, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H-Store: A Next Generation OLTP DBMS. http://hstore.cs.brown.edu.Google ScholarGoogle Scholar
  13. B. A. Huberman, D. M. Romero, and F. Wu. Social networks that matter: Twitter under the microscope. SSRN 1313405, 2008.Google ScholarGoogle Scholar
  14. A. Jindal and J. Dittrich. Relax and let the database do the partitioning online. In International Workshop on Enabling Real-Time Business Intelligence, pages 65--80. 2012.Google ScholarGoogle ScholarCross RefCross Ref
  15. R. Kallman, H. Kimura, J. Natkins, A. Pavlo, A. Rasin, S. Zdonik, E. P. C. Jones, S. Madden, M. Stonebraker, Y. Zhang, J. Hugg, and D. J. Abadi. H-store: A high-performance, distributed main memory transaction processing system. Proceedings of the VLDB Endowment, 1(2):1496--1499, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Karypis and V. Kumar. Metis-unstructured graph partitioning and sparse matrix ordering system, version 5.0. http://glaros.dtc.umn.edu/gkhome/metis/metis/overview.Google ScholarGoogle Scholar
  17. B. Liu, J. Tatemura, O. Po, W.-P. Hsiung, and H. Hacigumus. Automatic entity-grouping for OLTP workloads. In IEEE International Conference on Data Engineering, pages 712--723, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  18. M. A. U. Nasir, G. D. F. Morales, D. García-Soriano, N. Kourtellis, and M. Serafini. The power of both choices: Practical load balancing for distributed stream processing engines. In IEEE International Conference on Data Engineering, pages 137--148, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  19. M. A. U. Nasir, G. D. F. Morales, N. Kourtellis, and M. Serafini. When two choices are not enough: Balancing at scale in distributed stream processing. In IEEE International Conference on Data Engineering, pages 589--600, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  20. R. Nehme and N. Bruno. Automated partitioning design in parallel database systems. In ACM SIGMOD International Conference on Management of data, pages 1137--1148, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. NuoDB. http://www.nuodb.com.Google ScholarGoogle Scholar
  22. I. Pandis, P. Tözün, R. Johnson, and A. Ailamaki. PLP: Page latch-free shared-everything OLTP. Proceedings of the VLDB Endowment, 4(10):610--621, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. Pavlo, C. Curino, and S. Zdonik. Skew-aware automatic database partitioning in shared-nothing, parallel OLTP systems. In ACM SIGMOD International Conference on Management of Data, pages 61--72, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. D. Porobic, E. Liarou, P. Tözün, and A. Ailamaki. Atrapos: Adaptive transaction processing on hardware islands. In IEEE International Conference on Data Engineering, pages 688--699, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  25. A. Quamar, K. A. Kumar, and A. Deshpande. Sword: scalable workload-aware data placement for transactional workloads. In International Conference on Extending Database Technology, pages 430--441, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. Serafini, E. Mansour, A. Aboulnaga, K. Salem, T. Rafiq, and U. F. Minhas. Accordion: Elastic scalability for database systems supporting distributed transactions. Proceedings of the VLDB Endowment, 7(12):1035--1046, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Stonebraker, A. Pavlo, R. Taft, and M. L. Brodie. Enterprise database applications and the cloud: A difficult road ahead. In IEEE International Conference on Cloud Engineering, pages 1--6, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. R. Taft, E. Mansour, M. Serafini, J. Duggan, A. J. Elmore, A. Aboulnaga, A. Pavlo, and M. Stonebraker. E-store: Fine-grained elastic partitioning for distributed transaction processing systems. Proceedings of the VLDB Endowment, 8(3):245--256, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. The TPC-C Benchmark, 1992. http://www.tpc.org/tpcc/.Google ScholarGoogle Scholar
  30. K. Q. Tran, J. F. Naughton, B. Sundarmurthy, and D. Tsirogiannis. JECB: A join-extension, code-based approach to OLTP data partitioning. In ACM SIGMOD International Conference on Management of Data, pages 39--50, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. B. Trushkowsky, P. Bodík, A. Fox, M. J. Franklin, M. I. Jordan, and D. A. Patterson. The SCADS director: Scaling a distributed storage system under stringent performance requirements. In USENIX Conference on File and Storage Technologies, pages 163--176, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. S. Yang, X. Yan, B. Zong, and A. Khan. Towards effective partition management for large graphs. In ACM SIGMOD International Conference on Management of Data, pages 517--528, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. E. Zamanian, C. Binnig, and A. Salama. Locality-aware partitioning in parallel database systems. In ACM SIGMOD International Conference on Management of Data, pages 17--30, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 10, Issue 4
    November 2016
    180 pages
    ISSN:2150-8097
    Issue’s Table of Contents

    Publisher

    VLDB Endowment

    Publication History

    • Published: 1 November 2016
    Published in pvldb Volume 10, Issue 4

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader