ABSTRACT
Despite the considerable effort spent researching and developing multi-agent systems, the actual number of deployed systems is still surprisingly small. One of the reasons for the significant gap between developed and deployed systems is their brittleness. Multi-agent systems are susceptible to all of the same faults as any distributed system; they lack centralized control components, which makes it difficult to detect and treat failures of individual agents; and the agents making up the system are social, which risks fault propagation. These faults can impact system performance and lead to overall system failure. Multi-agent systems must be made more dependable before they will be deployed on a large scale. Using redundancy by replication of individual agents within a multi-agent system is one possible approach for improving fault-tolerance, and hence improving reliability and availability, two key components of dependability. Having a group of agents, a replicate group, act like an individual agent leads to increased complexity and system load, and it introduces new challenges to system construction. Using a message proxy to handle communication for the group, together with passive replication strategies, effectively deals with the complexity and overhead issues. This paper presents an architecture for implementing agent replicate groups using a message proxy and passive replicate group management. Experimentation and application testing using an implementation of the architecture are presented. The architecture is demonstrated to be a viable technique for increasing dependability in multi-agent systems.
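To make the idea of a message proxy fronting a passively replicated group concrete, the following is a minimal sketch and not the architecture evaluated in the paper, whose details the abstract does not give. It assumes a single primary replica handles every message, backups receive a state checkpoint after each handled message, and the proxy promotes the next backup when the primary fails. All names (`CounterAgent`, `ReplicateGroupProxy`, `ReplicaFailure`, `send`, `snapshot`, `restore`) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


class ReplicaFailure(Exception):
    """Hypothetical signal that a replica has crashed."""


@dataclass
class CounterAgent:
    """Toy agent whose entire state is a running total of received values."""
    name: str
    total: int = 0
    alive: bool = True

    def handle(self, value: int) -> int:
        if not self.alive:
            raise ReplicaFailure(self.name)
        self.total += value
        return self.total

    def snapshot(self) -> int:
        return self.total

    def restore(self, state: int) -> None:
        self.total = state


class ReplicateGroupProxy:
    """Single point of contact that makes a replicate group look like one agent.

    Assumption: replicas[0] is the primary; the rest are passive backups.
    """

    def __init__(self, replicas):
        if not replicas:
            raise ValueError("a replicate group needs at least one replica")
        self.replicas = list(replicas)

    @property
    def primary(self):
        return self.replicas[0]

    def send(self, value: int) -> int:
        while self.replicas:
            try:
                reply = self.primary.handle(value)
            except ReplicaFailure:
                # Drop the failed primary; the next backup becomes primary.
                self.replicas.pop(0)
                continue
            # Passive replication: only the primary does the work, and the
            # backups are brought up to date from its checkpoint.
            state = self.primary.snapshot()
            for backup in self.replicas[1:]:
                backup.restore(state)
            return reply
        raise RuntimeError("all replicas in the group have failed")
```

In this sketch, a caller that builds `ReplicateGroupProxy([CounterAgent("a1"), CounterAgent("a2")])` and calls `send` only ever talks to the proxy. If `a1` fails between two messages, the second message is answered by `a2` from the checkpointed state, so the failure is hidden from the rest of the system. Checkpointing after every message keeps the example short; a real system would likely trade checkpoint frequency against overhead.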
Index Terms
- Using dynamic proxy agent replicate groups to improve fault-tolerance in multi-agent systems