Toward network data independence

Published: 1 September 2003

Abstract

A number of researchers have become interested in the design of global-scale networked systems and applications. Our thesis here is that the database community's principles and technologies have an important role to play in the design of these systems. The point of departure is at the roots of database research: we generalize Codd's notion of data independence to physical environments beyond storage systems. We note analogies between the development of database indexes and the new generation of structured peer-to-peer networks. We illustrate the emergence of data independence in networks by surveying a number of recent network facilities and applications, seen through a database lens. We present a sampling of database query processing techniques that can contribute in this arena, and discuss methods for adoption of these technologies.

Published in: ACM SIGMOD Record, Volume 32, Issue 3, September 2003, 86 pages
ISSN: 0163-5808
DOI: 10.1145/945721
Copyright © 2003 Author
Publisher: Association for Computing Machinery, New York, NY, United States
