CycloidGrid: A proximity-aware P2P-based resource discovery architecture in volunteer computing systems

doi:10.1016/j.future.2012.08.010

Future Generation Computer Systems

Volume 29, Issue 6, August 2013, Pages 1583-1595

https://doi.org/10.1016/j.future.2012.08.010 Get rights and content

Abstract

Volunteer computing which benefits from idle cycles of volunteer resources over the Internet can integrate the power of hundreds to thousands of resources to achieve high computing power. In such an environment the resources are heterogeneous in terms of CPU speed, RAM, disk capacity, and network bandwidth. So finding a suitable resource to run a particular job becomes difficult. Resource discovery architecture is a key factor for overall performance of peer-to-peer based volunteer computing systems. The main contribution of this paper is to develop a proximity-aware resource discovery architecture for peer-to-peer based volunteer computing systems. The proposed resource discovery algorithm consists of two stages. In the first stage, it selects resources based on the requested quality of service and current load of peers. In the second stage, a resource with higher priority to communication delay is selected among the discovered resources. Communication delay between two peers is computed by a network model based on queuing theory, taking into account the background traffic of the Internet. Simulation results show that the proposed resource discovery algorithm improves the response time of user’s requests by a factor of 4.04 under a moderate load.

Highlights

► A resource discovery architecture in P2P volunteer computing systems is proposed. ► This architecture considers QoS constraints of request and communication overhead. ► We adapt the analytical model for computing communication delay between two peers. ► The background traffic of the Internet is considered in the analytical model. ► The proposed architecture is evaluated under realistic workload models.

Introduction

Volunteer computing in which volunteers donate processing and storage resources is an attractive cost efficient platform for running scientific projects with significant computational requirements (tens or hundreds of TeraFLOPs) [1]. Some of these projects are the SETI@home [2], Folding@home [3], EDGeS [4], DEGISCO [5] and EDGI [6] and Climate@home [7] projects.

Some of the popular volunteer computing systems are BOINC [8], [9], condor-like grid system [10], Entropia [11], XtremeWeb [12], Aneka [13], and SZTAKI [14]. Peer-to-Peer (P2P) based volunteer computing (VC) systems represent a decentralized, self-organized and scalable environment for running applications such as PastryGrid [15], BonjourGrid [16], ShareGrid [17], Condor-Flock P2P [18], and Self-Gridron [19].

A fundamental challenge in this large, decentralized and distributed resource sharing environment is efficient discovery of resources described by a set of attributes such as CPU speed, memory, and operating system type. Another challenge comes from these complex environments characterized by large scale and geographically scattered resources. In such an environment, millions of heterogeneous resources are scattered across geographically distributed nodes, therefore a resource discovery architecture with proximity-aware features has great impact on overall performance of the system [18]. There are many resource discovery algorithms in P2P-based volunteer computing systems [20], [21], [22], [23], [24], but none of them considers a model in computing communication overhead.

The main contribution of this work is to propose a proximity-aware resource discovery architecture considering quality of service (QoS) constraints of requests, and communication overhead in P2P-based VC systems. In a previous work [25], we presented an architecture for resource discovery in P2P-based VC systems. This architecture is called CycloidGrid, and it distributes a load between peers based on QoS constraints of requests, round trip time (RTT), and current load of resources (for more information refers to Section 3). In this research we focus on a network model based on queuing theory to compute more realistic communication overhead considering the background traffic of the Internet.

The proposed resource discovery algorithm is composed of two stages. In the first stage, resources are selected based on QoS constraints of requests and current load of peers. The QoS constraints can be selected from CPU speed, RAM requirements, hard disk requirements, operating system, and processor model. In the second stage, the algorithm selects a resource with higher priority to communication delay among the discovered resources in the first stage. To compute the communication delay between two peers in the system, each connection is modeled as a GI/GI/1 queue. Background traffic of the Internet is considered in the model to generate a more realistic communication delay. In summary, we have the following contributions in this paper:

•
We provide a resource discovery architecture in P2P-based volunteer computing systems considering QoS constraints of request and communication overhead.
•
We adapt the analytical model for computing communication delay between two peers. The background traffic of the Internet is considered during computation of communication delay between two peers.
•
We evaluate the proposed resource discovery architecture under realistic workload models and different numbers of peers to show the scalability of the system.

The rest of this paper is organized as follows: Section 2 presents related work. Section 3 discusses the CycloidGrid environment, including the application model, architecture, and churn management. Section 4 presents an analytical model for computing communication delay in CycloidGrid and resource discovery policy in this architecture. Section 5 describes the performance evaluation of the proposed resource discovery architecture under a realistic workload model. Conclusions and future directions are presented in Section 6.

Section snippets

Related work

There are several research works that have investigated resource discovery based on load balancing, QoS constraints of request, and proximity-aware features in P2P-based volunteer computing systems. These research works can be divided into two categories: in the first category, the resources discovery approach focuses on load balancing and QoS constraints of requests. Some works in this category consider load balancing and QoS constraints of requests and some of them only consider one of them.

CycloidGrid: Proximity-aware resource discovery architecture in peer-to-peer based volunteer computing systems

In this section CycloidGrid is discussed in detail. CycloidGrid is an architecture for resource discovery in P2P-based volunteer computing systems. Routing of requests in this architecture is done by Cycloid [36] as the P2P overlay. This P2P network is discussed in the following section briefly.

Resource discovery algorithm in CycloidGrid

In CycloidGrid, each request (job) containing some tasks is executed within a single peer. Each request has the following characteristics:

•
Number of independent tasks
•
Estimated duration of each task
•
QoS constraints of this job in terms of minimum CPU speed, RAM, hard disk requirements, operating system, and processor model. The number of QoS constraints varies between zero to five.

Each request is served in the system within two stages. During the first stage, a subset of resources is advertised

Performance evaluation

In order to evaluate the performance of the proposed resource discovery algorithm, we implemented CycloidGrid simulator as a discrete event simulator. CycloidGrid is written in Java and it is an extended version of Cycloid simulator [36] to emulate the P2P-based volunteer computing systems.

The physical network in CycloidGrid is emulated by the Brite topology generator [48]. A physical network with $n$ computers, which are connected by the Waxman model and different link bandwidth, are generated

Conclusion

In this paper, we propose a proximity-aware resource discovery architecture in P2P-based volunteer computing systems. We consider a request arriving into the system as the Bag of Tasks, where each request may have QoS constraints such as minimum CPU speed, RAM, hard disk requirements, operating system, and processor model. The resource discovery algorithm has two phases. In the first phase, it takes into account the load balancing and QoS constraints of requests, whereas in the second phase,

Acknowledgments

This project was partially supported by Iran Telecommunication Research Centre (ITRC). The authors would like to thank Rodrigo N. Calheiros for useful discussions.

Toktam Ghafarian received the B.S. degree and M.S. degree in computer engineering from Ferdowsi University of Mashhad, Iran in 1999 and 2002 respectively. She is now a Ph.D. student in the Department of Computer Engineering at Ferdowsi University of Mashhad, Iran. From May 2011 to December 2011, she was with the Cloud Computing and Distributed Systems (CLOUDS) Laboratory at the University of Melbourne, Australia as a visiting researcher. Her research interests include distributed and parallel

References (52)

D.H.J. Epema et al.
A worldwide flock of condors: load sharing among workstation clusters
Future Gener. Comput. Syst.
(1996)
A. Chien et al.
Entropia: architecture and performance of an enterprise desktop grid system
J. Parallel Distrib. Comput.
(2003)
F. Cappello et al.
Computing on large scale distributed systems: XtremWeb architecture, programming models, security, tests and convergence with grid
Future Gener. Comput. Syst.
(2005)
A.R. Butt et al.
A self-organizing flock of condors
J. Parallel Distrib. Comput.
(2006)
J.S. Kim et al.
Trade-offs in matchmaking job and balancing load for distributed desktop grids
Future Gener. Comput. Syst.
(2008)
C. Mastroianni et al.
A scalable super-peer approach for public scientific computation
Future Gener. Comput. Syst.
(2009)
H. Shen et al.
Cycloid: a scalable constant-degree p2p overlay network
Perform Evaluation
(2006)
F. Araujo et al.
A maximum independent set approach for collusion detection in voting pools
J. Parallel Distrib. Comput.
(2011)
F.A.B. da Silva et al.
Scalability limits of Bag-of-Tasks applications running on hierarchical platforms
J. Parallel Distrib. Comput.
(2011)
D. Kondo, B. Javadi, P. Malecot, F. Cappello, D.P. Anderson, Cost-Benefit analysis of cloud computing versus desktop...

D.P. Anderson et al.

SETI@home: an experiment in public-resource computing

Commun. ACM

(2002)

A.L. Beberg, D.L. Ensign, G. Jayachandran, S. Khaliq, V.S. Pande, Folding@home: lessons from eight years of volunteer...

G. Fedak, H. HE, O. Lodygensky, Z. Balaton, Z. Farkas, G. Gombas, P. Kacsuk, R. Lovas, A.C. Marosi, I. Kelley, I....

DEGISCO project,...

EDGI project,...

P. Guinnessy

Climate@home

Phys. Today

(2003)

D.P. Anderson, BOINC: a system for public-resource computing and storage, in: Proceeding of Grid Computing, Grid, 2004,...

A. Elwaer, A. Harrison, I. Kelley, I. Taylor, Attic: a case study for distributing data in BOINC projects, in: IEEE...

X. Chu, K. Nadiminti, C. Jin, S. Venugopal, R. Buyya, Aneka: next-generation enterprise grid platform for e-science and...

A.C. Marosi et al.

SZTAKI desktopgrid: building a scalable, secure platform for desktop grid computing

Making Grids Work

(2008)

H. Abbes et al.

A decentralized and fault-tolerant desktop grid system for distributed applications

Concurrency Computat: Pract. Exper.

(2010)

H. Abbes, C. Cerin, M. Jemni, Bonjourgrid: orchestration of multi-instances of grid middlewares on institutional...

C. Anglano et al.

The ShareGrid peer-to-peer desktop grid: infrastructure, applications, and performance evaluation

J. Grid Comput.

(2010)

E. Byun, H. Kim, S. Choi, S. Lee, Y.S. Han, J.M. Gil, S.Y. Jung, Self-gridron: reliable, autonomous, and fully...

T. Abdullah et al.

Hybrid resource discovery mechanism in ad hoc grid using structured overlay

Lect. Notes Comput. Sci.

(2009)

D. Lazaro, J.M. Marques, X. Vilajosana, Flexible resource discovery for decentralized P2P and volunteer computing...

Cited by (0)

Hossein Deldari received a B.S. degree in Physics from Ferdowsi University of Mashhad, Iran in 1970, his Masters degree in Computer Science from University of Oregon, USA in 1979 and his Ph.D. in Parallel and distributed systems from University of Leeds, England in 1995. His research interests include distributed and parallel computer systems, parallel programming, parallel algorithmic skeletons, parallel fuzzy genetic algorithms, grid/cluster and cloud computing, and multi-core architectures.

Bahman Javadi is a Lecturer in Networking and Cloud Computing at the University of Western Sydney, Australia. Prior to this appointment, he was a Research Fellow at the University of Melbourne, Australia. From 2008 to 2010, he was a Postdoctoral Fellow at the INRIA Rhone-Alpes, France. He received his M.S. and Ph.D. degrees in Computer Engineering from the Amirkabir University of Technology in 2001 and 2007, respectively. He was a Research Scholar at the School of Engineering and Information Technology, Deakin University, Australia during his Ph.D. course. He is co-founder of the Failure Trace Archive, which serves as a public repository of failure traces and algorithms for distributed systems. He has received numerous Best Paper Awards at IEEE/ACM conferences for his research papers. He has served as a program committee of many international conferences and workshops. His research interests include Cloud and Grid computing, performance evaluation of large-scale distributed computing systems, and reliability and fault tolerance.

Mohammad Hossien Yaghmaee was born in July 1971 in Mashad, Iran. He received his B.S. degree in Communication Engineering from Sharif University of Technology, Tehran, Iran in 1993, and M.S. degree in communication engineering from Tehran Polytechnic (Amirkabir) University of Technology in 1995. He received his Ph.D. degree in Communication Engineering from Tehran Polytechnic (Amirkabir) University of Technology in 2000. He has been a computer network engineer with several networking projects in Iran Telecommunication Research Center (ITRC) since 1992. From November 1998 to July 1999, he was with Network Technology Group (NTG), C&C Media Research Labs., NEC Corporation, Tokyo, Japan, as a visiting research scholar. From September 2007 to August 2008, he was with the Lane Department of Computer Science and Electrical Engineering, West Virginia University, Morgantown, USA as a visiting Associate Professor. He is the author of 3 books, all in the Farsi language. He has published more than 85 international conference and journal papers. His research interests are in Wireless Sensor Networks (WSNs), traffic and congestion control, high-speed networks, including ATM and MPLS, Quality of Services (QoS) and fuzzy logic control.

Rajkumar Buyya is Professor of Computer Science and Software Engineering; and Director of the Cloud Computing and Distributed Systems (CLOUDS) Laboratory at the University of Melbourne, Australia. He also serves as the founding CEO of Manjrasoft Pty Ltd., a spin-off company of the University, commercializing its innovations in Grid and Cloud Computing. He has authored and published over 300 research papers and four textbooks. The books on emerging topics that Dr. Buyya edited include, High Performance Cluster Computing (Prentice Hall, USA, 1999), Content Delivery Networks (Springer, Germany, 2008), Market-Oriented Grid and Utility Computing (Wiley, USA, 2009), and Cloud Computing: Principles and Paradigms (Wiley, USA, 2011). He is one of the highly cited authors in computer science and software engineering worldwide (h-index =52, gindex =111, 14 500 citations).

Software technologies for Grid and Cloud computing developed under Dr. Buyya’s leadership have gained rapid acceptance and are in use at several academic institutions and commercial enterprises in 40 countries around the world. Dr. Buyya has led the establishment and development of key community activities, including serving as foundation Chair of the IEEE Technical Committee on Scalable Computing and four IEEE conferences (CCGrid, Cluster, Grid, and e-Science). He has presented over 250 invited talks on his vision on IT Futures and advanced computing technologies at international conferences and institutions in Asia, Australia, Europe, North America, and South America. These contributions and international research leadership of Dr. Buyya are recognized through the award of the “2009 IEEE Medal for Excellence in Scalable Computing” from the IEEE Computer Society, USA. Manjrasoft’s Aneka technology for Cloud Computing developed under his leadership received the “2010 Asia Pacific Frost & Sullivan New Product Innovation Award”.

View full text

CycloidGrid: A proximity-aware P2P-based resource discovery architecture in volunteer computing systems

Abstract

Highlights

Introduction

Section snippets

Related work

CycloidGrid: Proximity-aware resource discovery architecture in peer-to-peer based volunteer computing systems

Resource discovery algorithm in CycloidGrid

Performance evaluation

Conclusion

Acknowledgments

Future Gener. Comput. Syst.

J. Parallel Distrib. Comput.

Future Gener. Comput. Syst.

J. Parallel Distrib. Comput.

Future Gener. Comput. Syst.

Future Gener. Comput. Syst.

Perform Evaluation

J. Parallel Distrib. Comput.

J. Parallel Distrib. Comput.

SETI@home: an experiment in public-resource computing

Commun. ACM

Climate@home

Phys. Today

SZTAKI desktopgrid: building a scalable, secure platform for desktop grid computing

Making Grids Work

A decentralized and fault-tolerant desktop grid system for distributed applications

Concurrency Computat: Pract. Exper.

The ShareGrid peer-to-peer desktop grid: infrastructure, applications, and performance evaluation

J. Grid Comput.

Hybrid resource discovery mechanism in ad hoc grid using structured overlay

Lect. Notes Comput. Sci.