International Journal of Engineering in Computer Science

P-ISSN: 2663-3582, E-ISSN: 2663-3590
Printed Journal   |   Refereed Journal   |   Peer Reviewed Journal

2019, Vol. 1, Issue 2, Part A

Efficient checkpoint algorithm for distributed system


Author(s): Neeraj Rathore and Jyoti Rathore

Abstract: The Grid is rapidly emerging as the means for coordinated resource sharing and problem solving in multi-institutional virtual organizations while providing dependable, consistent, pervasive access to global resources. The emergence of computational Grids and the potential for seamless aggregation and interactions between distributed services and resources, has led to the start of new era of computing. Tremendously large number and the heterogeneous nature of Grid Computing resource make the resource management a significantly challenging job. Resource management scenarios often include resource discovery, resource monitoring, resource inventories, resource provisioning, fault isolation, variety of autonomic capabilities and service level management activities. Out of this fault tolerance has become the main topic of research as till date there is no single system that can be called as the complete system that will handle all the faults in grids. Checkpointing is one of the fault-tolerant techniques to restore faults and to restart job fast. The algorithms for checkpointing on distributed systems have been under study for years. These algorithms can be classified into three classes: coordinated, uncoordinated and communication-induced algorithms. In this paper, a checkpointing algorithm that has minimum checkpointing counts equivalent to periodic checkpointing algorithm has been proposed. For relatively short rollback distance at faulty situations and produces better performance rather than other algorithms in terms of task completion time, in both fault-free and faulty situations. This algorithm has been implemented in Alchemi.NET because it did not currently support any fault tolerance mechanism.

DOI: 10.33545/26633582.2019.v1.i2a.22

Pages: 59-66 | Views: 899 | Downloads: 424

Download Full Article: Click Here
How to cite this article:
Neeraj Rathore, Jyoti Rathore. Efficient checkpoint algorithm for distributed system. Int J Eng Comput Sci 2019;1(2):59-66. DOI: 10.33545/26633582.2019.v1.i2a.22
International Journal of Engineering in Computer Science

International Journal of Engineering in Computer Science

International Journal of Engineering in Computer Science
Call for book chapter