Resource Adaptive Agent Based Fault Tolerant Architecture

Date

2008-04-22T02:41:28Z

Publisher

Computer Science & Engineering

Abstract

A pervasive environment consists of an increasing number of mobile, tiny, and heterogeneous devices communicating through an interconnected network. As ubiquitous computing has seeped into various aspects of everyday life, demand for dependable systems has grown. However, providing reliability requires fault tolerance mechanisms that consume substantial time and resources. The dynamic nature and uncertainty of pervasive systems, coupled with the energy constraints of the devices involved, make fault tolerance a challenging task. In general, fault tolerance techniques rely on redundancy and duplication of user tasks. However, the additional cost and low resource availability prohibit implementing such methodologies in a pervasive environment. Traditional fault detection and recovery techniques need to be modified to make them applicable in a pervasive environment.

Pervasive Information Community Organization (PICO) is a framework consisting of software agents, called delegents, that perform services on behalf of users and devices. In the PICO framework, a computing community of collaborating delegents is formed to carry out application-specific services. PerSON (Service Overlay Network for Pervasive Environments) provides the service overlay network for implementing the community computing concept introduced in PICO. PerSON uses the device model proposed in PICO and provides an overlay network that abstracts the details of service creation, discovery, and utilization in a pervasive environment.

In this thesis we have developed a Resource Adaptive Agent System (RAAS), which is integrated with PerSON to enhance and facilitate the services PerSON provides. To deal with the dynamic nature of a pervasive environment and make the best use of the resources available in it, RAAS adds features such as fault tolerance, checkpointing, and resource-aware distribution of user requests to PerSON. RAAS not only reacts to failures but also deals proactively with probable future failures and, if required, reassigns a user task from the most recently saved checkpoint.
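
The combination of resource-aware assignment, checkpointing, and reactive or proactive reassignment described above can be illustrated with a small sketch. This is not the RAAS or PerSON implementation: the `Device` and `Task` classes, the `LOW_ENERGY` threshold, the cheapest-device selection rule, and the `on_step` fault-injection hook are all hypothetical names and simplifications chosen for illustration.

```python
from dataclasses import dataclass, field

LOW_ENERGY = 2.0  # hypothetical reserve below which a device is considered at risk


@dataclass
class Device:
    name: str
    energy: float         # remaining energy budget (arbitrary units)
    cost_per_step: float  # energy consumed per unit of work
    alive: bool = True


@dataclass
class Task:
    total_steps: int
    checkpoint: int = 0   # number of steps covered by the last saved checkpoint
    history: list = field(default_factory=list)  # device name for each executed step


def pick_device(devices):
    """Resource-aware choice: the cheapest live device that can afford one more step."""
    usable = [d for d in devices
              if d.alive and d.energy - d.cost_per_step >= LOW_ENERGY]
    return min(usable, key=lambda d: d.cost_per_step) if usable else None


def run_task(task, devices, checkpoint_every=2, on_step=None):
    done = task.checkpoint
    device = pick_device(devices)
    while done < task.total_steps:
        if device is None:
            raise RuntimeError("no device has enough energy left")
        if not device.alive:
            # Reactive recovery: work since the last checkpoint is lost.
            done = task.checkpoint
            device = pick_device(devices)
            continue
        if device.energy - device.cost_per_step < LOW_ENERGY:
            # Proactive migration: save progress while the device is still up.
            task.checkpoint = done
            device = pick_device(devices)
            continue
        device.energy -= device.cost_per_step
        done += 1
        task.history.append(device.name)
        if done % checkpoint_every == 0:
            task.checkpoint = done
        if on_step:
            on_step(device, done)  # hook used here only to inject failures
    task.checkpoint = done
    return task
```

In this sketch a proactive migration loses no work, because the checkpoint is taken before the device runs out of energy, whereas a reactive recovery re-executes every step since the last periodic checkpoint. A real system would also have to account for the cost of taking and transferring checkpoints, which the sketch ignores.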

Demonstration applications that perform data-intensive tasks have been developed and tested on RAAS. For a set of tasks, energy savings of about 40% were achieved by adding the resource adaptiveness feature to PerSON. The energy savings achieved are proportional to the size of the tasks and depend on the devices available in the environment.
