Asynchronous Optimistic Rollback Recovery Using Secure Distributed Time

Get BibTex-formatted data

Author

Sean W. Smith,David B. Johnson,J.D. Tygar

Entry type

techreport

Abstract

In an asynchronous distributed computation, processes may fail and restart from saved state. A protocol for "optimistic rollback recovery" must recover the sytem when other processes may depend on lost states at failed processes. Previous work has used forms of partial order clocks to track potential causality. Our research addresses two crucial short- comings: the rollback problem also involves tracking a second level of partial order time (potential knowledge of failures and rollbacks), and protocols based on partial order clocks are open to inherent security and privacy risks. We have developed a "distributed time" framework that provides the tools for multiple levels of time abstraction, and for identifying and solving the corresponding security and privacy risks. This paper applies our framework to the rollback problem. We derive a new optimistic rollback recovery protocol that provides "completely asynchronous" recovery (thus directly supporting concurrent recovery and tolerating network partitions) and that enables processes to take full advantage of their maximum potential knowledge of orphans (thus reducing the worst case bound on asynchronous recovery after a single failure from exponetial to at most one rollback per process). By explicitly tracking and utilizing both levels of partial order time, our protocol substantially improves on previous work in optimistic recovery. Our work also provides a foundation for incorporating security and privacy in optimistic rollback recovery.

Date

1994 – March

Address

Pittsburgh, PA 15213

Institution

Carnegie Mellon University

Key alpha

Smith

Publication Date

0000-00-00

Location

A hard-copy of this is in the Papers Cabinet

BibTex-formatted data

To refer to this entry, you may select and copy the text below and paste it into your BibTex document. Note that the text may not contain all macros that BibTex supports.

@Techreport{ Smith,
	title = "Asynchronous Optimistic Rollback Recovery Using Secure Distributed Time",
	author = "Sean W. Smith,David B. Johnson,J.D. Tygar",
	year = "1994",
	month = "March",
	address = "Pittsburgh, PA 15213",
	institution = "Carnegie Mellon University",
	abstract = "In an asynchronous distributed computation, processes may fail and restart
from saved state. A protocol for "optimistic rollback recovery" must
recover the sytem when other processes may depend on lost states at
failed processes. Previous work has used forms of partial order clocks to
track potential causality. Our research addresses two crucial short-
comings: the rollback problem also involves tracking a second level of
partial order time (potential knowledge of failures and rollbacks), and
protocols based on partial order clocks are open to inherent security
and privacy risks. We have developed a "distributed time" framework that
provides the tools for multiple levels of time abstraction, and for
identifying and solving the corresponding security and privacy risks.
This paper applies our framework to the rollback problem. We derive a
new optimistic rollback recovery protocol that provides "completely
asynchronous" recovery (thus directly supporting concurrent recovery
and tolerating network partitions) and that enables processes to take
full advantage of their maximum potential knowledge of orphans (thus
reducing the worst case bound on asynchronous recovery after a single
failure from exponetial to at most one rollback per process). By
explicitly tracking and utilizing both levels of partial order time,
our protocol substantially improves on previous work in optimistic
recovery. Our work also provides a foundation for incorporating
security and privacy in optimistic rollback recovery.",
}