research-article

Hive: a warehousing solution over a map-reduce framework

Authors:
Ashish Thusoo

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Joydeep Sen Sarma

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Namit Jain

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Zheng Shao

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Prasad Chakka

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Suresh Anthony

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Hao Liu

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Pete Wyckoff

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

,
Raghotham Murthy

Facebook Data Infrastructure Team

Facebook Data Infrastructure Team
View Profile

Proceedings of the VLDB Endowment Volume 2 Issue 2pp 1626–1629https://doi.org/10.14778/1687553.1687609

Published:01 August 2009Publication History

Proceedings of the VLDB Endowment

Abstract

The size of data sets being collected and analyzed in the industry for business intelligence is growing rapidly, making traditional warehousing solutions prohibitively expensive. Hadoop [3] is a popular open-source map-reduce implementation which is being used as an alternative to store and process extremely large data sets on commodity hardware. However, the map-reduce programming model is very low level and requires developers to write custom programs which are hard to maintain and reuse.

References

A. Pavlo et. al. A Comparison of Approaches to Large-Scale Data Analysis. Proc. ACM SIGMOD, 2009. Google ScholarDigital Library
C. Ronnie et al. SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets. Proc. VLDB Endow., 1(2):1265--1276, 2008. Google ScholarDigital Library
Apache Hadoop. Available at http://wiki.apache.org/hadoop.Google Scholar
Hive Performance Benchmark. Available at https://issues.apache.org/jira/browse/HIVE-396.Google Scholar
Hive Language Manual. Available at http://wiki.apache.org/hadoop/Hive/LanguageManual.Google Scholar
Facebook Lexicon. Available at http://www.facebook.com/lexicon.Google Scholar
Apache Pig. http://wiki.apache.org/pig.Google Scholar
Apache Thrift. http://incubator.apache.org/thrift.Google Scholar

Index Terms

Hive: a warehousing solution over a map-reduce framework
1. Information systems
  1. Data management systems
    1. Information integration
      1. Data warehouses
  2. Information systems applications
    1. Data mining
    2. Decision support systems
      1. Data warehouses
2. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language features
        Frameworks

Recommendations

Major technical advancements in apache hive
SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data

Apache Hive is a widely used data warehouse system for Apache Hadoop, and has been adopted by many organizations for various big data analytics applications. Closely working with many users and organizations, we have identified several shortcomings of ...
Read More
Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition
Read More
Apache Hive 34 Success Secrets - 34 Most Asked Questions on Apache Hive - What You Need to Know
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

Proceedings of the VLDB Endowment Volume 2, Issue 2
August 2009
367 pages
ISSN:2150-8097
Issue’s Table of Contents
Sponsors
In-Cooperation
Publisher
VLDB Endowment
Publication History
- Published: 1 August 2009
Published in pvldb Volume 2, Issue 2
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 494
  Total Citations
  View Citations
- 6,955
  Total Downloads
- Downloads (Last 12 months)168
- Downloads (Last 6 weeks)20
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Hive: a warehousing solution over a map-reduce framework

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

Major technical advancements in apache hive

Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition

Apache Hive 34 Success Secrets - 34 Most Asked Questions on Apache Hive - What You Need to Know

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Hive: a warehousing solution over a map-reduce framework

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

Major technical advancements in apache hive

Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition

Apache Hive 34 Success Secrets - 34 Most Asked Questions on Apache Hive - What You Need to Know

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media