Abstrakti
Consider a fully-connected synchronous distributed system of n nodes, where up to f nodes may be faulty and every node starts in an arbitrary initial state. In the synchronous counting problem, all nodes need to eventually agree on a counter that is increased by one modulo some C in each round. In the self-stabilising firing squad problem, the task is to eventually guarantee that all non-faulty nodes have simultaneous responses to external inputs: if a subset of the correct nodes receive an external “go” signal as input, then all correct nodes should agree on a round (in the not-too-distant future) in which to jointly output a “fire” signal. Moreover, no node should generate a “fire” signal without some correct node having previously received a “go” signal as input. We present a framework reducing both tasks to binary consensus at very small cost: we maintain the resilience of the underlying consensus routine, while the stabilisation time and message size are, up to constant factors, bounded by the sum of the cost of the consensus routine for f faults and recursively applying our scheme to f’<f/2 faults. For example, we obtain a deterministic algorithm for self-stabilising Byzantine firing squads with optimal resilience f<n/3, asymptotically optimal stabilisation and response time O(f), and message size O(\log f). As our framework does not restrict the type of consensus routines used, we also obtain efficient randomised solutions, and it is straightforward to adapt our framework to allow for f<n/2 omission or f<n crash faults, respectively. Our results resolve various open questions on the two problems, most prominently whether (communication-efficient) self-stabilising Byzantine firing squads or (randomised) sublinear-time solutions for either problem exist. For example, we obtain a deterministic algorithm for self-stabilising Byzantine firing squads with optimal resilience f < n/3, asymptotically optimal stabilisation and response time O(f), and message size O(log f). As our framework does not restrict the type of consensus routines used, we can also obtain efficient randomised solutions, and it is straightforward to adapt our framework to allow f < n/2 omission or f < n crash faults.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | Stabilization, Safety, and Security of Distributed Systems - 18th International Symposium, SSS 2016, Proceedings |
Kustantaja | Springer |
Sivut | 263-280 |
Sivumäärä | 18 |
Vuosikerta | 10083 LNCS |
ISBN (painettu) | 9783319492582 |
DOI - pysyväislinkit | |
Tila | Julkaistu - 2016 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | International Symposium on Stabilization, Safety, and Security of Distributed Systems - Lyon, Ranska Kesto: 7 marrask. 2016 → 10 marrask. 2016 Konferenssinumero: 18 |
Julkaisusarja
Nimi | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Vuosikerta | 10083 LNCS |
ISSN (painettu) | 03029743 |
ISSN (elektroninen) | 16113349 |
Conference
Conference | International Symposium on Stabilization, Safety, and Security of Distributed Systems |
---|---|
Lyhennettä | SSS |
Maa/Alue | Ranska |
Kaupunki | Lyon |
Ajanjakso | 07/11/2016 → 10/11/2016 |