Design and Implementation of a Consistent Time Service for Fault-Tolerant Distributed Systems

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Clock-related operations are one of the many sources of replica non-determinism and of replica inconsistency in fault-tolerant distributed systems. In passive replication, if the primary server crashes, the next clock value returned by the new primary server might have actually rolled back in time, which can lead to undesirable consequences for the replicated application. The same problem can happen for active replication where the result of the first replica to respond is taken as the next clock value. In this paper, we describe the design and implementation of a consistent time service for fault-tolerant distributed systems. The consistent time service introduces a group clock that is consistent across the replicas and that ensures the determinism of the replicas with respect to clock-related operations. The group clock is monotonically increasing, is transparent to the application and is fault-tolerant. The consistent time service guarantees the consistency of the group clock even when faults occur, when new replicas are added into the group and when failed replicas recover.
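
The rollback described in the abstract can be illustrated with a small sketch. This is not the paper's algorithm, which the abstract does not spell out. It only contrasts a naive clock read after failover with a guarded read that never returns less than the last value handed to the application. The names `Replica`, `GroupClock`, `skew`, and `true_time` are hypothetical, and a single in-memory `GroupClock` object stands in for state that a real service would have to keep consistent across replicas.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    """Hypothetical replica whose hardware clock is skewed from true time."""
    name: str
    skew: float  # seconds this replica's clock runs ahead (+) or behind (-)

    def local_clock(self, true_time: float) -> float:
        # Value the replica's own physical clock would report.
        return true_time + self.skew


class GroupClock:
    """Illustrative guard: remembers the last value returned and never goes below it.

    Assumption: last_value is shared by all replicas. Here one object models
    that; a real service would need to replicate it.
    """

    def __init__(self) -> None:
        self.last_value = float("-inf")

    def read(self, replica: Replica, true_time: float) -> float:
        proposed = replica.local_clock(true_time)
        # Keep the larger of the proposal and the last returned value,
        # so the result is non-decreasing across failovers.
        self.last_value = max(self.last_value, proposed)
        return self.last_value


primary = Replica("primary", skew=+5.0)  # runs 5 s fast
backup = Replica("backup", skew=-5.0)    # runs 5 s slow

# Naive failover: the application reads whichever replica is primary.
naive_before = primary.local_clock(100.0)  # read before the primary crashes
naive_after = backup.local_clock(101.0)    # read from the new primary

# Guarded failover: reads go through the shared group clock.
clock = GroupClock()
group_before = clock.read(primary, 100.0)
group_after = clock.read(backup, 101.0)

print("naive:  ", naive_before, "->", naive_after)
print("guarded:", group_before, "->", group_after)
```

In the naive case the second reading is smaller than the first even though real time advanced, which is the rollback the abstract warns about. The guard here only makes readings non-decreasing. The abstract's stronger property, a monotonically increasing clock that stays consistent as replicas fail, join, and recover, would need more than this sketch provides.
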
Original language: English
Title of host publication: Proceedings of the International Conference on Dependable Systems and Networks
Place of Publication: USA
Publisher: IEEE Computer Society
Pages: 341-350
Number of pages: 10
State: Published - Dec 1 2003
Event: 2003 International Conference on Dependable Systems and Networks - San Francisco, CA, United States
Duration: Jun 22 2003 → Jun 25 2003

Conference

Conference: 2003 International Conference on Dependable Systems and Networks
Country/Territory: United States
City: San Francisco, CA
Period: 06/22/03 → 06/25/03
