{"id":336,"date":"2019-07-17T21:00:30","date_gmt":"2019-07-18T01:00:30","guid":{"rendered":"http:\/\/www.podc.org\/podc2019\/?page_id=336"},"modified":"2021-10-29T10:40:36","modified_gmt":"2021-10-29T14:40:36","slug":"table-of-contents","status":"publish","type":"page","link":"https:\/\/www.podc.org\/podc2019\/table-of-contents\/","title":{"rendered":"Table of Contents"},"content":{"rendered":"\n<p>\u00a0<\/p>\n<div id=\"DLtoc\">\n<div id=\"DLheader\">\n<h1>PODC &#8217;19- Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing<\/h1>\n<a class=\"DLcitLink\" title=\"Go to the ACM Digital Library for additional information about this proceeding\" href=\"https:\/\/dl.acm.org\/citation.cfm?id=3293611\"><img loading=\"lazy\" decoding=\"async\" class=\"DLlogo\" src=\"https:\/\/dl.acm.org\/img\/dllogo.png\" alt=\"Digital Library logo\" width=\"30\" height=\"30\" \/>Full Citation in the ACM Digital Library<\/a><\/div>\n<div id=\"DLcontent\">\n<h2>SESSION: Awards<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688514\">2019 Edsger W. Dijkstra Prize in Distributed Computing<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Lorenzo Alvisi<\/li>\n<li class=\"nameList\">Shlomi Dolev<\/li>\n<li class=\"nameList\">Faith Ellen (chair)<\/li>\n<li class=\"nameList\">Idit Keidar<\/li>\n<li class=\"nameList\">Fabian Kuhn<\/li>\n<li class=\"nameList Last\">Jukka Suomela<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The committee decided to award the 2019 Edsger W. Dijkstra Prize in Distributed Computing to Alessandro Panconesi and Aravind Srinivasan for their paper Randomized Distributed Edge Coloring via an Extension of the Chernoff-Hoeffding Bounds, SIAM Journal on Computing, volume 26, number 2, 1997, pages 350-368. A preliminary version of this paper appeared as Fast Randomized Algorithms for Distributed Edge Coloring, Proceedings of the Eleventh Annual ACM Symposium Principles of Distributed Computing (PODC), 1992, pages 251-262.<\/p>\n<p>The paper presents a simple synchronous algorithm in which processes at the nodes of an undirected network color its edges so that the edges adjacent to each node have different colors. It is randomized, using 1.6\u0394 + O(log<sup>1+\u03b6<\/sup>n) colors and O(log n) rounds with high probability for any constant \u03b6&gt;0, where n is the number of nodes and is the maximum degree of the nodes. This was the first nontrivial distributed algorithm for the edge coloring problem and has influenced a great deal of follow-up work. Edge coloring has applications to many other problems in distributed computing such as routing, scheduling, contention resolution, and resource allocation.<\/p>\n<p>In spite of its simplicity, the analysis of their edge coloring algorithm is highly nontrivial. Chernoff-Hoeffding bounds, which assume random variables to be independent, cannot be used. Instead, they develop upper bounds for sums of negatively correlated random variables, for example, which arise when sampling without replacement. More generally, they extend Chernoff-Hoeffding bounds to certain random variables they call \u03bb-correlated. This has directly inspired more specialized concentration inequalities. The new techniques they introduced have also been applied to the analyses of important randomized algorithms in a variety of areas including optimization, machine learning, cryptography, streaming, quantum computing, and mechanism design.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688525\">2019 Principles of Distributed Computing Doctoral Dissertation Award<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Prasad Jayanti<\/li>\n<li class=\"nameList\">Nancy A. Lynch<\/li>\n<li class=\"nameList\">Boaz Patt-Shamir<\/li>\n<li class=\"nameList Last\">Ulrich Schmid (chair)<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The winner of the 2019 Principles of Distributed Computing Doctoral Dissertation Award is Dr. Sepehr Assadi for his dissertation Combinatorial Optimization on Massive Datasets: Streaming, Distributed, and Massively Parallel Computation, written under the supervision of Prof. Sanjeev Khanna at the University of Pennsylvania.<\/p>\n<p>The thesis resolves a number of long-standing problems in the exciting and still relatively new area of sublinear computation. The area of sublinear computation focuses on design of algorithms that use sublinear space, time, or communication to solve global optimization problems on very large datasets. In addition to addressing a wide range of different problems, comprising graph optimization problems (matching, vertex cover, and connectivity), submodular optimization (set cover and maximum coverage), and resource-constrained optimization (combinatorial auctions and learning), these problems are studied in three different models of computation, namely, streaming algorithms, multiparty communication, and massively parallel computation (MPC). The thesis also reveals interesting relations between these different models, including generic algorithmic and analysis techniques that can be applied in all of them.<\/p>\n<p>For many fundamental optimization problems, the thesis gives asymptotically matching algorithmic and intractability results, completely resolving several long-standing problems. This is accomplished by using a broad spectrum of mathematical methods in very detailed and intricate proofs. In addition to a wide variety of classic techniques, ranging from graph theory, combinatorics, probability, linear algebra and calculus, it also makes heavy use of communication complexity and information theory, for example.<\/p>\n<p>Sepehr&#8217;s dissertation work has been published in a remarkably large number of top-conference papers. It received multiple best paper awards and multiple special issue invitations, as well as two invitations to the Highlights of Algorithms (HALG) conference. Due to its contributions to the field of distributed computing and all the merits mentioned above, the award committee unanimously selected this thesis as the winner of the 2019 Principles of Distributed Computing Doctoral Dissertation Award.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Keynote Lecture 1<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688526\">Local Computation Algorithms<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Ronitt Rubinfeld<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Consider a setting in which inputs to and outputs from a computational problem are so large, that there is not time to read them in their entirety. However, if one is only interested in small parts of the output at any given time, is it really necessary to solve the entire computational problem? Is it even necessary to view the whole input? We survey recent work in the model of &#8220;local computation algorithms&#8221; which for a given input, supports queries by a user to values of specified bits of a legal output. The goal is to design local computation algorithms in such a way that very little of the input needs to be seen in order to determine the value of any single bit of the output. Though this model describes sequential computations, techniques from local distributed algorithms have been extremely important in designing efficient local computation algorithms.<\/p>\n<p>In this talk, we describe results on a variety of problems for which sublinear time and space local computation algorithms have been developed &#8212; we will give special focus to finding maximal independent sets and sparse spanning graphs.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 1<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688527\">Symmetry Breaking in the Plane: Rendezvous by Robots with Unknown Attributes<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Jurek Czyzowicz<\/li>\n<li class=\"nameList\">Leszek Gasieniec<\/li>\n<li class=\"nameList\">Ryan Killick<\/li>\n<li class=\"nameList Last\">Evangelos Kranakis<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We study a fundamental question related to the feasibility of deterministic symmetry breaking in the infinite Euclidean plane for two robots that have minimal or no knowledge of the respective capabilities and &#8220;measuring instruments&#8221; of themselves and each other. Assume that two anonymous mobile robots are placed at different locations at unknown distance d from each other on the infinite Euclidean plane. Each robot knows neither the location of itself nor of the other robot. The robots cannot communicate wirelessly, but have a certain nonzero visibility radius r (with range r unknown to the robots). By rendezvous we mean that they are brought at distance at most r of each other by executing symmetric (identical) mobility algorithms. The robots are moving with unknown and constant but not necessarily identical speeds, their clocks and pedometers may be asymmetric, and their chirality inconsistent.<\/p>\n<p>We demonstrate that rendezvous for two robots is feasible under the studied model iff the robots have either: different speeds; or different clocks; or different orientations but equal chiralities. When the rendezvous is feasible, we provide a universal algorithm which always solves rendezvous despite the fact that the robots have no knowledge of which among their respective parameters may be different.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688528\">Composable Computation in Discrete Chemical Reaction Networks<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Eric E. Severson<\/li>\n<li class=\"nameList\">David Haley<\/li>\n<li class=\"nameList Last\">David Doty<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We study the composability of discrete chemical reaction networks (CRNs) that stably compute (i.e., with probability 0 of error) integer-valued functions \u0192:N<sup>d<\/sup>\u2192 N. We consider output-oblivious CRNs in which the output species is never a reactant (input) to any reaction. The class of output-oblivious CRNs is fundamental, appearing in earlier studies of CRN computation, because it is precisely the class of CRNs that can be composed by simply renaming the output of the upstream CRN to match the input of the downstream CRN.<\/p>\n<p>Our main theorem precisely characterizes the functions f stably computable by output-oblivious CRNs with an initial leader. The key necessary condition is that for sufficiently large inputs, f is the minimum of a finite number of nondecreasing quilt-affine functions. (An affine function is linear with a constant offset; a quilt-affine function is linear with a periodic offset).<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688529\">How to Spread a Rumor: Call Your Neighbors or Take a Walk?<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">George Giakkoupis<\/li>\n<li class=\"nameList\">Frederik Mallmann-Trenn<\/li>\n<li class=\"nameList Last\">Hayk Saribekyan<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We study the problem of randomized information dissemination in networks. We compare the now standard PUSH-PULL protocol, with agent-based alternatives where information is disseminated by a collection of agents performing independent random walks. In the VISIT-EXCHANGE protocol, both nodes and agents store information, and each time an agent visits a node, the two exchange all the information they have. In the MEET-EXCHANGE protocol, only the agents store information, and exchange their information with each agent they meet.<\/p>\n<p>We consider the broadcast time of a single piece of information in an n-node graph for the above three protocols, assuming a linear number of agents that start from the stationary distribution. We observe that there are graphs on which the agent-based protocols are significantly faster than PUSH-PULL, and graphs where the converse is true. We attribute the good performance of agent-based algorithms to their inherently fair bandwidth utilization, and conclude that, in certain settings, agent-based information dissemination, separately or in combination with PUSH-PULL, can significantly improve the broadcast time.<\/p>\n<p>The graphs considered above are highly non-regular. Our main technical result is that on any regular graph of at least logarithmic degree, PUSH-PULL and VISIT-EXCHANGE have the same asymptotic broadcast time. The proof uses a novel coupling argument which relates the random choices of vertices in PUSH-PULL with the random walks in VISIT-EXCHANGE. Further, we show that the broadcast time of MEET-EXCHANGE is asymptotically at least as large as the other two&#8217;s on all regular graphs, and strictly larger on some regular graphs.<\/p>\n<p>As far as we know, this is the first systematic and thorough comparison of the running times of these very natural information dissemination protocols.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688520\">Efficient Size Estimation and Impossibility of Termination in Uniform Dense Population Protocols<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">David Doty<\/li>\n<li class=\"nameList Last\">Mahsa Eftekhari<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We study uniform population protocols: networks of anonymous agents whose pairwise interactions are chosen at random, where each agent uses an identical transition algorithm that does not depend on the population size n. Many existing polylog(n) time protocols for leader election and majority computation are nonuniform: to operate correctly, they require all agents to be initialized with an approximate estimate of n (specifically, the value \u0142floor\u0142og n\\rfloor). Our first main result is a uniform protocol for calculating \u0142og(n) \\pm O(1) with high probability in O(\u0142og^2 n) time and O(\u0142og^4 n) states (O(\u0142og \u0142og n) bits of memory). The protocol is not terminating : it does not signal when the estimate is close to the true value of \u0142og n. If it could be made terminating with high probability, this would allow composition with protocols requiring a size estimate initially. We do show how our main protocol can be indirectly composed with others in a simple and elegant way, based on leaderless phase clocks, demonstrating that those protocols can in fact be made uniform. However, our second main result implies that the protocol cannot be made terminating, a consequence of a much stronger result: a uniform protocol for any task requiring more than constant time cannot be terminating even with probability bounded above 0, if infinitely many initial configurations are dense : any state present initially occupies \u00d8mega(n) agents. (In particular no leader is allowed.) Crucially, the result holds no matter the memory or time permitted.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688521\">On Counting the Population Size<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Petra Berenbrink<\/li>\n<li class=\"nameList\">Dominik Kaaser<\/li>\n<li class=\"nameList Last\">Tomasz Radzik<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We consider the problem of counting the population size in the population model. In this model, we are given a distributed system of n identical agents which interact in pairs with the goal to solve a common task. In each time step, the two interacting agents are selected uniformly at random. In this paper, we consider so-called uniform protocols, where the actions of two agents upon an interaction may not depend on the population size n. We present two population protocols to count the size of the population: protocol Approximate, which computes with high probability either [log n] or [log n], and protocol CountExact, which computes the exact population size in optimal O(log n) interactions, using \u00d5 (n) states. Both protocols can also be converted to stable protocols that give a correct result with probability 1 by using an additional multiplicative factor of O(log n) states.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688522\">Self-Stabilizing Leader Election<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Hsueh-Ping Chen<\/li>\n<li class=\"nameList Last\">Ho-Lin Chen<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In this paper, we study the self-stabilizing leader election (SSLE) problem in population protocols. We construct a non-deterministic population protocol that can solve SSLE on directed rings of all sizes. Our algorithm uses a constant number of states and can be converted to a deterministic population protocol on undirected rings using previous techniques [8]. Furthermore, we extend our algorithm to perform SSLE on directed and undirected tori of arbitrary sizes.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688523\">Logarithmic Expected-Time Leader Election in Population Protocol Model<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Yuichi Sudo<\/li>\n<li class=\"nameList\">Fukuhito Ooshita<\/li>\n<li class=\"nameList\">Taisuke Izumi<\/li>\n<li class=\"nameList\">Hirotsugu Kakugawa<\/li>\n<li class=\"nameList Last\">Toshimitsu Masuzawa<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In this paper, we present a leader election protocol in the population protocol model that stabilizes within O(log n) parallel time in expectation with O(log n) states per agent, where n is the number of agents. Given a rough knowledge m of the population size n such that m \u2265 = log<sub>2<\/sub> n and m=O(log n), this protocol guarantees that exactly one leader is elected and the unique leader is kept forever thereafter.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688524\">On Site Fidelity and the Price of Ignorance in Swarm Robotic Central Place Foraging Algorithms<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Abhinav Aggarwal<\/li>\n<li class=\"nameList\">G. Matthew Fricke<\/li>\n<li class=\"nameList\">Diksha Gupta<\/li>\n<li class=\"nameList Last\">Melanie E. Moses<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A key factor limiting the performance of central place foraging algorithms is the awareness of the agent(s) about the location of food items around the nest. We study the ratio of how much time an ignorant agent takes relative to an omniscient forager for complete collection of food items in the arena. This effectively quantifies the penalty each algorithm pays for not knowing (or choosing to ignore information gained about) where the resources are located. We model the effect of depletion of food items from the arena on the foraging efficiency over time and analytically verify that returning to the location of the last food item found strongly helps in counteracting this effect. To the best of our knowledge, these results have only been empirically argued so far.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 2<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688535\">Improved Distributed Expander Decomposition and Nearly Optimal Triangle Enumeration<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Yi-Jun Chang<\/li>\n<li class=\"nameList Last\">Thatchaphol Saranurak<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>An(\u03b5,\u03c6)-expander decomposition of a graph G=(V,E) is a clustering of the vertices V=V<sub>1<\/sub>\u222a\u2026\u222a V<sub>x<\/sub> such that (1) each cluster V<sub>i<\/sub> induces subgraph with conductance at least \u03c6, and (2) the number of inter-cluster edges is at most \u03b5|E|. In this paper, we give an improved distributed expander decomposition, and obtain a nearly optimal distributed triangle enumeration algorithm in the CONGEST model.<\/p>\n<p>Specifically, we construct an (\u03b5,\u03c6)-expander decomposition with \u03c6=(\u03b5\/log n)<sup>2 O(k)<\/sup> in O(n<sup>2\/k<\/sup> \u22c5 poly (1\/\u03c6, log n))rounds for any \u03b5 \u2208(0,1) and positive integer k. For example, a (1\/n<sup>o(1)<\/sup>, 1\/n<sup>o(1)<\/sup>)-expander decomposition only requires O(n<sup>o(1)<\/sup>) rounds to compute, which is optimal up to subpolynomial factors, and a (0.01,1\/poly log n)-expander decomposition can be computed in O(n<sup>\u03b3<\/sup>) rounds, for any arbitrarily small constant \u03b3 &gt; 0. Previously, the algorithm by Chang, Pettie, and Zhang can construct a (1\/6,1\/poly log n)-expander decomposition using \u00d5 (n<sup>1-\u03b4<\/sup>) rounds for any \u03b4 &gt; 0, with a caveat that the algorithm is allowed to throw away a set of edges into an extra part which form a subgraph with arboricity at most n<sup>\u03b4<\/sup>. Our algorithm does not have this caveat.<\/p>\n<p>By slightly modifying the distributed algorithm for routing on expanders by Ghaffari, Kuhn and Su [PODC&#8217;17], we obtain a triangle enumeration algorithm using \u00d5(n<sup>1\/3<\/sup>) rounds. This matches the lower bound by Izumi and LeGall [PODC&#8217;17] and Pandurangan, Robinson and Scquizzato [SPAA&#8217;18] of \u00d8(n<sup>1\/3<\/sup>) which holds even in the CONGESTED-CLIQUE model. To the best of our knowledge, this provides the first non-trivial example for a distributed problem that has essentially the same complexity (up to a polylogarithmic factor) in both CONGEST and CONGESTED-CLIQUE.<\/p>\n<p>The key technique in our proof is the first distributed approximation algorithm for finding a low conductance cut that is as balanced as possible. Previous distributed sparse cut algorithms do not have this nearly most balanced guarantee.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688536\">Fast Approximate Shortest Paths in the Congested Clique<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Keren Censor-Hillel<\/li>\n<li class=\"nameList\">Michal Dory<\/li>\n<li class=\"nameList\">Janne H. Korhonen<\/li>\n<li class=\"nameList Last\">Dean Leitersdorf<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We design fast deterministic algorithms for distance computation in the CONGESTED CLIQUE model. Our key contributions include:<\/p>\n<ul>\n<li>A (2+\u03b5)-approximation for all-pairs shortest paths problem in O(log<sup>2<\/sup>n \/ \u03b5) rounds on unweighted undirected graphs. With a small additional additive factor, this also applies for weighted graphs. This is the first sub-polynomial constant-factor approximation for APSP in this model.<\/li>\n<li>A (1+\u03b5)-approximation for multi-source shortest paths problem from O(\u221an) sources in O(log<sup>2<\/sup> n \/ \u03b5) rounds on weighted undirected graphs. This is the first sub-polynomial algorithm obtaining this approximation for a set of sources of polynomial size.<\/li>\n<\/ul>\n<p>Our main techniques are new distance tools that are obtained via improved algorithms for sparse matrix multiplication, which we leverage to construct efficient hopsets and shortest paths. Furthermore, our techniques extend to additional distance problems for which we improve upon the state-of-the-art, including diameter approximation, and an exact single-source shortest paths algorithm for weighted undirected graphs in \u00d5 (n<sub>1\/6<\/sub>) rounds.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688537\">Quantum Distributed Algorithm for the All-Pairs Shortest Path Problem in the CONGEST-CLIQUE Model<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Taisuke Izumi<\/li>\n<li class=\"nameList Last\">Fran{c c}ois Le Gall<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The All-Pairs Shortest Path problem (APSP) is one of the most central problems in distributed computation. In the CONGEST-CLIQUE model, in which n nodes communicate with each other over a fully connected network by exchanging messages of O(\u0142og n) bits in synchronous rounds, the best known general algorithm for APSP uses \u00d5(n<sub>1\/3<\/sub>) rounds. Breaking this barrier is a fundamental challenge in distributed graph algorithms. In this paper we investigate for the first time quantum distributed algorithms in the CONGEST-CLIQUE model, where nodes can exchange messages of O(log n) quantum bits, and show that this barrier can be broken: we construct a \u00d5(n<sub>1\/4<\/sub>)-round quantum distributed algorithm for the APSP over directed graphs with polynomial weights in the CONGEST-CLIQUE model. This speedup in the quantum setting contrasts with the case of the standard CONGEST model, for which Elkin et al. (PODC 2014) showed that quantum communication does not offer significant advantages over classical communication.<\/p>\n<p>Our quantum algorithm is based on a relationship discovered by Vassilevska Williams and Williams (JACM 2018) between the APSP and the detection of negative triangles in a graph. The quantum part of our algorithm exploits the framework for quantum distributed search recently developed by Le Gall and Magniez (PODC 2018). Our main technical contribution is a method showing how to implement multiple quantum searches (one for each edge in the graph) in parallel without introducing congestions.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688538\">Deterministic Distributed Dominating Set Approximation in the CONGEST Model<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Janosch Deurer<\/li>\n<li class=\"nameList\">Fabian Kuhn<\/li>\n<li class=\"nameList Last\">Yannic Maus<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We develop deterministic approximation algorithms for the minimum dominating set problem in the CONGEST model with an almost optimal approximation guarantee. For \u03b5 1\/ poly log \u0394 we obtain two algorithms with approximation factor (1 + \u03b5)(1 + \u0142 n (\u0394 + 1)) and with runtimes 2<sup>O<\/sup>(\u221a log n log log n) and O(\u0394 poly log \u0394 + poly log \u0394 log* n), respectively. Further we show how dominating set approximations can be deterministically transformed into a connected dominating set in the CONGEST model while only increasing the approximation guarantee by a constant factor. This results in a deterministic O(log \u0394)-approximation algorithm for the minimum connected dominating set with time complexity 2O(\u221a log n log log n).<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688539\">Optimal Distributed Covering Algorithms<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Ran Ben Basat<\/li>\n<li class=\"nameList\">Guy Even<\/li>\n<li class=\"nameList\">Ken-ichi Kawarabayashi<\/li>\n<li class=\"nameList Last\">Gregory Schwartzman<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We present a time-optimal deterministic distributed algorithm for approximating a minimum weight vertex cover in hypergraphs of rank \u0192. This problem is equivalent to the Minimum Weight Set Cover problem in which the frequency of every element is bounded by \u0192. The approximation factor of our algorithm is (\u0192 + \u03b5). Let \u0394 denote the maximum degree in the hypergraph. Our algorithm runs in the CONGEST model and requires O(log \u0394\/log log \u0394) rounds, for constants \u03b5 \u2208 (0,1] and \u0192 \u2208 N<sup>+<\/sup>. This is the first distributed algorithm for this problem whose running time does not depend on the vertex weights nor the number of vertices. Thus adding another member to the exclusive family of emphprovably optimal distributed algorithms.<\/p>\n<p>For constant values of \u0192 and \u03b5, our algorithm improves over the (&amp;3402; + \u03b5)-approximation algorithm of [16] whose running time is O(log \u0394 + log W), where W is the ratio between the largest and smallest vertex weights in the graph. Our algorithm also achieves an \u0192-approximation for the problem in O(\u0192 log n) rounds, improving over the classical result of [13] that achieves a running time of O(\u0192 log <sup>2<\/sup> n). Finally, for weighted vertex cover (\u0192=2) our algorithm achieves a deterministic running time of O(log n), matching the randomized previously best result of [14].<\/p>\n<p>We also show that integer covering-programs can be reduced to the Minimum Weight Set Cover problem in the distributed setting. This allows us to achieve an (\u0192 + \u03b5)-approximate integral solution in O(1 + \u0192 \/log n)\u22c5 log \u0394 over log log \u0394+(\u0192 \u22c5 log M)<sup>1.01<\/sup>\u22c5 log <sup>\u03b5-1<\/sup> \u22c5(log \u0394)<sup>0.01<\/sup>)) rounds, where \u0192 bounds the number of variables in a constraint, \u0394 bounds the number of constraints a variable appears in, and M=max{1,1\/a min},, a<sub>min<\/sub>, where a<sub>min<\/sub> is the smallest normalized constraint coefficient. This significantly improves over the results of [16] for the integral case, which achieves the same guarantees in O(\u03b5<sup>-4<\/sup> \u22c5 \u0192<sup>4<\/sup> \u22c5 log \u0192 \u22c5 log(M \u22c5 \u0394)) rounds.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 3<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688530\">Secure Distributed Computing Made (Nearly) Optimal<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Merav Parter<\/li>\n<li class=\"nameList Last\">Eylon Yogev<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In this paper, we study secure distributed algorithms that are nearly optimal, with respect to running time, for the given input graph G. Roughly speaking, an algorithm is secure if the nodes learn only their final output while gaining no information on the input (or output) of other nodes.<\/p>\n<p>A graph theoretic framework for secure distributed computation was recently introduced by the authors (SODA 2019). This framework is quite general and it is based on a new combinatorial structure called private neighborhood trees : a collection of n trees T(u<sub>1<\/sub>), \u2026, T(u<sub>n<\/sub>) such that each tree T(u<sub>i<\/sub>) spans the neighbors of u<sub>i<\/sub> without going through u<sub>i<\/sub>. Intuitively, each tree T(u<sub>i<\/sub>) allows all neighbors of u<sub>i<\/sub> to exchange a secret that is hidden from u<sub>i<\/sub>. The efficiency of the framework depends on two key parameters of these trees: their depth and the amount of overlap. In a (d,c)-private neighborhood trees each tree T(u<sub>i<\/sub>) has depth O(d) and each edge e \u2208 G appears in at most O(c) different trees. An existentially optimal construction of private neighborhood trees with d=O(\u0394 \u2026 D) and c=\u00d5 (D) was presented therein. We make two key contributions:<\/p>\n<p><b>Universally Optimal Private Trees:<\/b> We show a combinatorial construction of nearly (universally) optimal (d,c)-private neighborhood trees with d + c=\u00d5 (OPT(G)) for any input graph G. Perhaps surprisingly, we show that OPT(G) is equal to the best depth possible for these trees even without the congestion constraint. We also present efficient distributed constructions of these private trees.<\/p>\n<p><b>Optimal Secure Computation:<\/b> Using the optimal constructions above, we get a secure compiler for distributed algorithms where the overhead for each round is \u00d5 (poly(\u0394)\u2026 OPT(G)). As our second key contribution, we design an optimal compiler with an overhead of merely \u00d5 (OPT(G)) per round for a class of &#8220;simple&#8221; algorithms. This class includes many standard distributed algorithms such as Luby-MIS, the standard logarithmic-round algorithms for matching and \u0394 + 1-coloring, as well as the computation of aggregate functions.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688531\">With Great Speed Come Small Buffers: Space-Bandwidth Tradeoffs for Routing<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Avery Miller<\/li>\n<li class=\"nameList\">Boaz Patt-Shamir<\/li>\n<li class=\"nameList Last\">Will Rosenbaum<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We consider the Adversarial Queuing Theory (AQT) model, where packet arrivals are subject to a maximum average rate 0 \u2264 \u03c1 \u2264 1 and burstiness \u03c3 \u2264 0. In this model, we analyze the size of buffers required to avoid overflows in the basic case of a path. Our main results characterize the space required by the average rate and the number of distinct destinations: we show that O(\u2113 d<sup>1\/\u2113<\/sup> + \u03c3) space suffice, where d is the number of distinct destinations and \u2113=\u230b1\/\u03c1\u230a and we show that \u03a9(1 over \u2113 <sup>d1\/\u2113<\/sup> + \u03c3) space is necessary. For directed trees, we describe an algorithm whose buffer space requirement is at most 1 + d&#8217; + \u03c3 where d&#8217; is the maximum number of destinations on any root-leaf path.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688532\">Plain SINR is Enough!<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Magnus M. Halldorsson<\/li>\n<li class=\"nameList Last\">Tigran Tonoyan<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We develop randomized distributed algorithms for many of the most fundamental communication problems in the wireless SINR model, including (multi-message) broadcast, local broadcast, coloring, MIS, and aggregation. The complexity of the algorithms is optimal up to polylogarithmic preprocessing time. It shows &#8212; contrary to expectation &#8212; that the plain vanilla SINR model is just as powerful and fast (modulo the preprocessing) as various extensions studied, including power control, carrier sense, collision detection, free acknowledgements, and GPS location. A key component of the algorithms is an efficient simulation of CONGEST algorithms on a constant-density SINR backbone.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688533\">Efficient Multiparty Interactive Coding for Insertions, Deletions, and Substitutions<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Ran Gelles<\/li>\n<li class=\"nameList\">Yael Tauman Kalai<\/li>\n<li class=\"nameList Last\">Govind Ramnarayan<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In the field of interactive coding, two or more parties wish to carry out a distributed computation over a communication network that may be noisy. The ultimate goal is to develop efficient coding schemes that can tolerate a high level of noise while increasing the communication by only a constant factor (i.e., constant rate).<\/p>\n<p>In this work we consider synchronous communication networks over an arbitrary topology, in the powerful adversarial insertion-deletion noise model. Namely, the noisy channel may adversarially alter the content of any transmitted symbol, as well as completely remove a transmitted symbol or inject a new symbol into the channel.<\/p>\n<p>We provide efficient, constant rate schemes that successfully conduct any computation with high probability as long as the adversary corrupts at most \u03b5 over m fraction of the total communication, where m is the number of links in the network and \u03b5 is a small constant. This scheme assumes an oblivious adversary which is independent of the parties&#8217; inputs and randomness. We can remove this assumption and resist a worst-case adversary at the price of being resilient to \u03b5 over m log m errors.<\/p>\n<p>While previous work considered the insertion-deletion noise model in the two-party setting, to the best of our knowledge, our scheme is the first multiparty scheme that is resilient to insertions and deletions. Furthermore, our scheme is the first computationally efficient scheme in the multiparty setting that is resilient to adversarial noise.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688534\">Multiparty Interactive Communication with Private Channels<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Abhinav Aggarwal<\/li>\n<li class=\"nameList\">Varsha Dani<\/li>\n<li class=\"nameList\">Thomas P. Hayes<\/li>\n<li class=\"nameList Last\">Jared Saia<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A group of n players wants to run a distributed protocol \u2118 over a network where communication occurs via private point-to-point channels. Can we efficiently simulate \u2118 in the presence of an adversary who knows \u2118 and is able to maliciously flip bits on the channels? We show that this is possible, even when L, the number of bits sent in \u2118, the average message size \u03b1 in \u2118, and T, the number of bits flipped by the adversary are not known in advance. In particular, we show how to create a robust version of \u2118, \u2118 such that 1) \u2118&#8217; fails with probability at most \u03b4, for any \u03b4&gt;0; and 2) \u2118&#8217; sends O( L (1 + (1\/\u03b1) \u0142og (n L\/\u03b4)) + T) bits. We note that if \u03b1 is \u03a9 (log (n L\/\u03b4), then \u2118 sends only O(L+T) bits, and is therefore within a constant factor of optimal. Critically, our result requires that \u2118 runs correctly in an asynchronous network and our protocol \u2118 must run in a synchronous network.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688545\">Coded State Machine &#8212; Scaling State Machine Execution under Byzantine Faults<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Songze Li<\/li>\n<li class=\"nameList\">Saeid Sahraei<\/li>\n<li class=\"nameList\">Mingchao Yu<\/li>\n<li class=\"nameList\">Salman Avestimehr<\/li>\n<li class=\"nameList\">Sreeram Kannan<\/li>\n<li class=\"nameList Last\">Pramod Viswanath<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We introduce Coded State Machine (CSM), an information-theoretic framework to securely and efficiently execute multiple state machines on Byzantine nodes. The standard method of solving this problem is using State Machine Replication, which achieves high security at the cost of low efficiency. CSM simultaneously achieves the optimal linear scaling in storage, throughput, and security with increasing network size. The storage is scaled via the design of Lagrange coded states and coded input commands that require the same storage size as their origins. The computational efficiency is scaled using a novel delegation algorithm, called INTERMIX, which is an information-theoretically verifiable matrix-vector multiplication algorithm of independent interest.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688546\">On Termination of a Flooding Process<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Walter Hussak<\/li>\n<li class=\"nameList Last\">Amitabh Trehan<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Flooding is a fundamental distributed algorithms technique. Consider the following flooding process, for simplicity, in a synchronous message passing network: A distinguished node begins the flooding process by sending the (same) message to all its neighbours in the first round. In subsequent rounds, every node receiving the message relays a copy of the message further to all those, and only those, nodes it did not receive the message from in the previous round. However, the nodes do not remember if they&#8217;ve taken part in the flooding before and therefore will repeat the process every time they get a message. In other words, they execute an amnesiac flooding process with memory only of the present round. The flooding process terminates in a particular round when no edge in the network carries the message in that, and, hence, subsequent, rounds. We call this process Amnesiac Flooding (AF).<\/p>\n<p>In this work, the main question we address is whether AF will terminate on an arbitrary network (graph) and in what time? We show that, indeed, AF terminates on any arbitrary graph. Further, AF terminates in at most D rounds in bipartite graphs and at most 2D + 1 rounds in non-bipartite graphs &#8211; in this brief announcement, we show this for the bipartite case only.<\/p>\n<p>We also show that in a natural asynchronous variant of AF, an adversary can always ensure non-termination.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Keynote Lecture 2<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688547\">Towards a Theory of Randomized Shared Memory Algorithms<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Philipp Woelfel<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Randomization has become an invaluable tool to overcome some of the problems associated with asynchrony and faultiness. Allowing processors to use random bits helps to break symmetry, and to reduce the likelihood of undesirable schedules. As a consequence, randomized techniques can lead to simpler and more efficient algorithms, and sometimes to solutions of otherwise unsolvable computational problems. However, the design and the analysis of randomized shared memory algorithms remains challenging. This talk will give an overview of recent progress towards developing a theory of randomized shared memory algorithms.<\/p>\n<p>For many years, linearizability [6] has been the gold standard of distributed correctness conditions, and the corner stone of modular programming. In deterministic algorithms, implemented linearizable methods can be assumed to be atomic. But when processes can make random choices, the situation is not the same: Probability distributions of outcomes of algorithms using linearizable methods may be very different from those using equivalent atomic operations [4]. In general, modular algorithm design is much more difficult for randomized algorithms than for deterministic ones. The first part of the talk will present a correctness condition [2, 5] that is suitable for randomized algorithms in certain settings, and will explain why in other settings no such correctness condition exists [3] and what we can do about that.<\/p>\n<p>To this date, almost all randomized shared memory algorithms are Las Vegas, meaning they permit no error. Monte Carlo algorithms, which allow errors to occur with small probability, have been studied thoroughly for sequential systems. But in the shared memory world such algorithms have been neglected. The second part of this talk will discuss recent attempts to devise Monte Carlo algorithms for fundamental shared memory problems (e.g., [1]). It will also present some general techniques, that have proved useful in the design of concurrent randomized algorithms.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 4<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688548\">Optimal Memory-Anonymous Symmetric Deadlock-Free Mutual Exclusion<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Zahra Aghazadeh<\/li>\n<li class=\"nameList\">Damien Imbs<\/li>\n<li class=\"nameList\">Michel Raynal<\/li>\n<li class=\"nameList\">Gadi Taubenfeld<\/li>\n<li class=\"nameList Last\">Philipp Woelfel<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The notion of an anonymous shared memory, introduced by Taubenfeld in PODC 2017, considers that processes use different names for the same memory location. As an example, a location name A used by a process p and a location name B \u2260 A used by another process q can correspond to the very same memory location X, and similarly for the names B used by p and A used by q which may (or may not) correspond to the same memory location Y \u2260 X. In this context, the PODC paper presented a 2-process symmetric deadlock-free mutual exclusion (mutex) algorithm and a necessary condition on the size m of the anonymous memory for the existence of such an n-process algorithm. This condition states that m must be belongs to M(n) {1} where M(n)= {m: \u2200 \u2113: (1) &lt; \u2113 \u2264 n: gcd(\u2113,m)=1). Symmetric means here that,process identities define a specific data type which allows a process to check only if two identities are equal or not.<\/p>\n<p>The present paper presents two optimal deadlock-free symmetric mutual exclusion algorithms for n-process systems where communication is through m registers. The first algorithm, which considers anonymous read\/write registers, works for any m which is \u2265 n and belongs to the set M(n). It follows that this condition on m is both necessary and sufficient for symmetric deadlock-free mutual exclusion in this anonymity context, and this algorithm is optimal with respect to m The second algorithm, which considers anonymous read\/modify\/write atomic registers, works for any m\u2208 M(n), which is shown to be necessary and sufficient for anonymous read\/modify\/write registers. It follows that, when m &gt; 1, m \u2208 M(n) is a tight characterization of the size of the anonymous shared memory needed to solve deadlock-free mutex, be the registers read\/write or read\/modify\/write.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688549\">Constant Amortized RMR Abortable Mutex for CC and DSM<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Prasad Jayanti<\/li>\n<li class=\"nameList Last\">Siddhartha Jayanti<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The Abortable mutual exclusion problem, proposed by Scott and Scherer in response to the needs in real time systems and databases, is a variant of mutual exclusion that allows processes to abort from their attempt to acquire the lock. Worst-case constant remote memory reference (RMR) algorithms for mutual exclusion using hardware instructions such as Fetch&amp;Add or Fetch&amp;Store have long existed for both Cache Coherent (CC) and Distributed Shared Memory (DSM) multiprocessors, but no such algorithms are known for abortable mutual exclusion. Even relaxing the worst-case requirement to amortized, algorithms are only known for the CC model.<\/p>\n<p>In this paper, we improve this state-of-the-art by designing a deterministic algorithm that uses Fetch&amp;Store (FAS) to achieve amortized O(1) RMR in both the CC and DSM models. Our algorithm supports Fast Abort (a process aborts within six steps of receiving the abort signal), and has the following additional desirable properties: it supports an arbitrary number of processes of arbitrary names, requires only O(1) space per process, and satisfies a novel fairness condition that we call &#8220;Airline FCFS&#8221;. Our algorithm is short and practical with fewer than a dozen lines of code.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688540\">A Recoverable Mutex Algorithm with Sub-logarithmic RMR on Both CC and DSM<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Prasad Jayanti<\/li>\n<li class=\"nameList\">Siddhartha Jayanti<\/li>\n<li class=\"nameList Last\">Anup Joshi<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In light of recent advances in non-volatile main memory technology, Golab and Ramaraju reformulated the traditional mutex problem into the novel Recoverable Mutual Exclusion (RME) problem. In the best known solution for RME, due to Golab and Hendler from PODC 2017, a process incurs at most O(\u221a log n log log n) remote memory references (RMRs) per passage on a system with n processes, where a passage is an interval from when a process enters the Try section to when it subsequently returns to Remainder. Their algorithm, however, guarantees this bound only for cache-coherent (CC) multiprocessors, leaving open the question of whether a similar bound is possible for distributed shared memory (DSM) multiprocessors.<\/p>\n<p>We answer this question affirmatively by designing an algorithm for a system with n processes, such that, it satisfies the same complexity bound as Golab and Hendler&#8217;s for both CC and DSM multiprocessors. Our algorithm has some additional advantages over Golab and Hendler&#8217;s: (i) its Exit section is wait-free, (ii) it uses only the Fetch-and-Store instruction, and (iii) on a CC machine our algorithm needs each process to have a cache of only O(1) words, while their algorithm needs a cache of size that is a function of n.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688541\">Randomized Concurrent Set Union and Generalized Wake-Up<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Siddhartha Jayanti<\/li>\n<li class=\"nameList\">Robert E. Tarjan<\/li>\n<li class=\"nameList Last\">Enric Boix-Adser\u00e0<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We consider the disjoint set union problem in the asynchronous shared memory multiprocessor computation model. We design a randomized algorithm that performs at most O(log n) work per operation (with high probability), and performs at most O(m #8226; (\u03b1(n, m\/(np)) + log(np\/m + 1)) total work in expectation for a problem instance with m operations on n elements solved by p processes. Our algorithm is the first to have work bounds that grow sublinearly with p against an adversarial scheduler.<\/p>\n<p>We use Jayanti&#8217;s Wake Up problem and our newly defined Generalized Wake Up problem to prove several lower bounds on concurrent set union. We show an \u03a9(log min {n,p}) expected work lower bound on the cost of any single operation on a set union algorithm. This shows that our single-operation upper bound is optimal across all algorithms when p = n<sup>\u03a9(1)<\/sup>. Furthermore, we identify a class of &#8220;symmetric algorithms&#8221; that captures the complexities of all the known algorithms for the disjoint set union problem, and prove an \u03a9(m\u2022(\u03b1(n, m(np)) + log(np\/m + 1))) expected total work lower bound on algorithms of this class, thereby showing that our algorithm has optimal total work complexity for this class. Finally, we prove that any randomized algorithm, symmetric or not, cannot breach an \u03a9(m \u2022(\u03b1(n, m\/n) + log log(np\/m + 1))) expected total work lower bound.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688542\">Strongly Linearizable Implementations of Snapshots and Other Types<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Sean Ovens<\/li>\n<li class=\"nameList Last\">Philipp Woelfel<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Linearizability is the gold standard of correctness conditions for shared memory algorithms, and historically has been considered the practical equivalent of atomicity. However, it has been shown that replacing atomic objects with linearizable implementations can affect the probability distribution of execution outcomes in randomized algorithms. Thus, linearizable objects are not always suitable replacements for atomic objects. A stricter correctness condition called strong linearizability has been developed and shown to be appropriate for randomized algorithms in a strong adaptive adversary model[16].<\/p>\n<p>We devise several new lock-free strongly linearizable implementations from atomic registers. In particular, we give the first strongly linearizable lock-free snapshot implementation that uses bounded space. This improves on the unbounded space solution of Denysyuk and Woelfel[14]. As a building block, our algorithm uses a lock-free strongly linearizable ABA-detecting register. We obtain this object by modifying the wait-free linearizable ABA-detecting register of Aghazadeh and Woelfel [5], which, as we show, is not strongly linearizable.<\/p>\n<p>Aspnes and Herlihy[8] identified a wide class of types that have wait-free linearizable implementations. These types require that any pair of operations either commute, or one overwrites the other. Aspnes and Herlihy gave a general wait-free linearizable implementation of such types, employing an atomic snapshot object. We show that this implementation is strongly linearizable, proving that all types in this class have a lock-free strongly linearizable implementation from atomic registers.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688543\">Fast Concurrent Data Sketches<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Arik Rinberg<\/li>\n<li class=\"nameList\">Alexander Spiegelman<\/li>\n<li class=\"nameList\">Edward Bortnikov<\/li>\n<li class=\"nameList\">Eshcar Hillel<\/li>\n<li class=\"nameList\">Idit Keidar<\/li>\n<li class=\"nameList Last\">Hadar Serviansky<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Data sketches are approximate succinct summaries of long data streams. They are widely used for processing massive amounts of data and answering statistical queries about it. Existing libraries producing sketches are very fast, but do not allow parallelism for creating sketches using multiple threads or querying them while they are being built. We present a generic approach to parallelising data sketches efficiently and allowing them to be queried in real time, while bounding the error that such parallelism introduces. Utilising relaxed semantics and the notion of strong linearisability we prove our algorithm&#8217;s correctness and analyse the error it induces in two specific sketches. Our implementation achieves high scalability while keeping the error small. We have contributed one of our concurrent sketches to the open-source data sketches library.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688544\">Self-Stabilizing Snapshot Objects for Asynchronous Failure-Prone Networked Systems<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Chryssis Georgiou<\/li>\n<li class=\"nameList\">Oskar Lundstr\u00f6m<\/li>\n<li class=\"nameList Last\">Elad M. Schiller<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A snapshot object simulates the behavior of an array of single-writer\/multi-reader shared registers that can be read atomically. Delporte-Gallet et al. proposed two fault-tolerant algorithms for snapshot objects in asynchronous crash-prone message-passing systems. Their first algorithm is non-blocking; it allows snapshot operations to terminate once all write operations had ceased. It uses O(n) messages of O(n v) bits, where n is the number of nodes and v is the number of bits it takes to represent the object. Their second algorithm allows snapshot operations to always terminate independently of write operations. It incurs O(n^2) messages. The fault model of Delporte-Gallet et al. considers node failures (crashes). We aim at the design of even more robust snapshot objects. We do so through the lenses of self-stabilization&#8212;a very strong notion of fault-tolerance. In addition to Delporte-Gallet et al.&#8217;s fault model, a self-stabilizing algorithm can recover after the occurrence of transient faults; these faults represent arbitrary violations of the assumptions according to which the system was designed to operate (as long as the code stays intact). In particular, in this work, we propose self-stabilizing variations of Delporte-Gallet et al.&#8217;s non-blocking algorithm and always-terminating algorithm. Our algorithms have similar communication costs to the ones by Delporte-Gallet et al. and O(1) recovery time (in terms of asynchronous cycles) from transient faults. The main differences are that our proposal considers repeated gossiping of O(v) bits messages and deals with bounded space, which is a prerequisite for self-stabilization.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688655\">The Recoverable Consensus Hierarchy<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Wojciech Golab<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Herlihy&#8217;s consensus hierarchy ranks the power of various synchronization primitives for solving consensus in a model where asynchronous processes communicate through shared memory, and may fail by halting. This paper revisits the consensus hierarchy in a model with crash-recovery failures, where the specification of consensus, called recoverable consensus in this paper, is weakened by allowing non-terminating executions when a process fails infinitely often. Two variations of this model are considered: with independent process failures, and with simultaneous (i.e., system-wide) process failures. We prove two fundamental results: (i) Test-And-Set is at level 2 of the recoverable consensus hierarchy if failures are simultaneous, and similarly for any primitive at level 2 of the traditional consensus hierarchy; and (ii) Test-And-Set drops to level 1 of the hierarchy if failures are independent, unless the number of such failures is bounded. To our knowledge, this is the first separation between the simultaneous and independent crash-recovery failure models with respect to the computability of consensus.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688656\">How Fast Reads Affect Multi-Valued Register Simulations<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Soma Chaudhuri<\/li>\n<li class=\"nameList\">Reginald Frank<\/li>\n<li class=\"nameList Last\">Jennifer L. Welch<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We consider the problem of simulating a k-valued register in a wait-free manner using binary registers as building blocks, where k 2. We show that for any simulation using atomic binary base registers to simulate a safe k-valued register in which the read algorithm takes the optimal number of steps (log<sub>2<\/sub> k), the write algorithm must take at least log<sub>2<\/sub> k steps in the worst case. A fortiori, the same lower bound applies when the simulated register should be regular. Previously known algorithms show that both these lower bounds are tight. We also show that in order to simulate an atomic k-valued register for two readers, the optimal number of steps for the read algorithm must be strictly larger than log<sub>2<\/sub> k.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 5<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688657\">Topological Characterization of Consensus under General Message Adversaries<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Thomas Nowak<\/li>\n<li class=\"nameList\">Ulrich Schmid<\/li>\n<li class=\"nameList Last\">Kyrill Winkler<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In this paper, we provide a rigorous characterization of consensus solvability in synchronous directed dynamic networks controlled by an arbitrary message adversary using point-set topology: We extend the approach introduced by Alpern and Schneider in 1985 by introducing two novel topologies on the space of infinite executions: the process-view topology, induced by a distance function that relies on the local view of a given process in an execution, and the minimum topology, which is induced by a distance function that focuses on the local view of the process that is the last to distinguish two executions. We establish some simple but powerful topological results, which not only lead to a topological explanation of bivalence arguments, but also provide necessary and sufficient topological conditions on the admissible graph sequences of a message adversary for solving consensus. In particular, we characterize consensus solvability in terms of connectivity of the set of admissible graph sequences. For non-compact message adversaries, which are not limit-closed in the sense that there is a convergent sequence of graph sequences whose limit is not permitted, this requires the exclusion of all &#8220;fair&#8221; and &#8220;unfair&#8221; limit sequences that coincide with the forever bivalent runs constructed in bivalence proofs. For both compact and non-compact message adversaries, we also provide tailored characterizations of consensus solvability, i.e., tight conditions for impossibility and existence of algorithms, based on the broadcastability of the connected components of the set of admissible graph sequences.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688658\">Can Distributed Uniformity Testing Be Local?<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Uri Meir<\/li>\n<li class=\"nameList\">Dor Minzer<\/li>\n<li class=\"nameList Last\">Rotem Oshman<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In the distributed uniformity testing problem, k servers draw samples from some unknown distribution, and the goal is to determine whether the unknown distribution is uniform or whether it is \u03b5-far from uniform, where \u03b5 is a proximity parameter. Each server decides whether to accept or reject, and these decisions are sent to a referee, who makes a final decision based on the servers&#8217; local decisions. Uniformity testing is a particularly useful building-block, because it is complete for the problem of testing identity to any fixed distribution.<\/p>\n<p>It was recently shown that distributing the task of uniformity testing allows each server to draw fewer samples than are needed in the centralized case, but so far the number of samples required for distributed uniformity testing has not been well understood. In this paper we settle this question, and also investigate the cost of using local decision rules, such as rejecting iff at least one server wants to reject (the usual decision rule used in local distributed decision). To answer these questions, we develop a new Fourier-based technique for proving lower bounds on the sample complexity of distribution testing, which lends itself particularly well to the distributed case.<\/p>\n<p>Using our technique, we tightly characterize the number of samples required for uniformity testing when the referee can apply any decision function to the servers&#8217; local decisions. We also show that if the network rejects whenever one server wants to reject, then the cost of uniformity testing is much higher, and in fact we do not gain compared to the centralized case unless the number of servers is exponential in \u03a9 (1\/\u03b5). Finally, we apply our lower bound technique to the case where the referee applies a threshold decision rule, and also generalize a lower bound from[1] for learning an unknown input distribution.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688659\">Hardness of Distributed Optimization<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Nir Bacrach<\/li>\n<li class=\"nameList\">Keren Censor-Hillel<\/li>\n<li class=\"nameList\">Michal Dory<\/li>\n<li class=\"nameList\">Yuval Efron<\/li>\n<li class=\"nameList\">Dean Leitersdorf<\/li>\n<li class=\"nameList Last\">Ami Paz<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>This paper studies lower bounds for fundamental optimization problems in the CONGEST model. We show that solving problems exactly in this model can be a hard task, by providing tilde\u03a9mega (n<sup>2<\/sup>) lower bounds for cornerstone problems, such as minimum dominating set (MDS), Hamiltonian path, Steiner tree and max-cut. These are almost tight, since all of these problems can be solved optimally in O(n<sup>2<\/sup>) rounds. Moreover, we show that even in bounded-degree graphs and even in simple graphs with maximum degree 5 and logarithmic diameter, it holds that various tasks, such as finding a maximum independent set (MaxIS) or a minimum vertex cover, are still difficult, requiring a near-tight number of tilde\u03a9 (n) rounds.<\/p>\n<p>Furthermore, we show that in some cases even approximations are difficult, by providing an tilde\u03a9 (n<sup>2<\/sup>) lower bound for a (7\/8+\u03b5)-approximation for MaxIS, and a nearly-linear lower bound for an O(log n )-approximation for the k-MDS problem for any constant k geq\u2265 2, as well as for several variants of the Steiner tree problem.<\/p>\n<p>Our lower bounds are based on a rich variety of constructions that leverage novel observations, and reductions among problems that are specialized for the CONGEST model. However, for several additional approximation problems, as well as for exact computation of some central problems in P, such as maximum matching and max flow, we show that such constructions cannot be designed, by which we exemplify some limitations of this framework.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688650\">Broadcast Congested Clique: Planted Cliques and Pseudorandom Generators<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Lijie Chen<\/li>\n<li class=\"nameList Last\">Ofer Grossman<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We develop techniques to prove lower bounds for the BCAST(log n) Broadcast Congested Clique model (a distributed message passing model where in each round, each processor can broadcast an O(log n)-sized message to all other processors). Our techniques are built to prove bounds for natural input distributions. So far, all lower bounds for problems in the model relied on constructing specifically tailored graph families for the specific problem at hand, resulting in lower bounds for artificially constructed inputs, instead of natural input distributions.<\/p>\n<p>One of our results is a lower bound for the directed planted clique problem. In this problem, an input graph is either a random directed graph (each directed edge is included with probability 1\/2), or a random graph with a planted clique of size k. That is, k randomly chosen vertices have all of the edges between them included, and all other edges in the graph appear with probability 1\/2. The goal is to determine whether a clique exists. We show that when k = n(1\/4 &#8211; \u03b5), this problem requires a number of rounds polynomial in n.<\/p>\n<p>Additionally, we construct a pseudo-random generator which fools the Broadcast Congested Clique. This allows us to show that every k round randomized algorithm in which each processor uses up to n random bits can be efficiently transformed into an O(k)-round randomized algorithm in which each processor uses only up to O(k log n) random bits, while maintaining a high success probability. The pseudo-random generator is simple to describe, computationally very cheap, and its seed size is optimal up to constant factors. However, the analysis is quite involved, and is based on the new technique for proving lower bounds in the model.<\/p>\n<p>The technique also allows us to prove the first average case lower bound for the Broadcast Congested Clique, as well as an average-case time hierarchy. We hope our technique will lead to more lower bounds for problems such as triangle counting, APSP, MST, diameter, and more, for natural input distributions.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688651\">Connectivity Lower Bounds in Broadcast Congested Clique<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Shreyas Pai<\/li>\n<li class=\"nameList Last\">Sriram V. Pemmaraju<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We prove three new lower bounds for graph connectivity in the 1-bit broadcast congested clique model, BCC(1). First, in the KT-0 version of BCC(1), in which nodes are aware of neighbors only through port numbers, we show an \u00d8mega(log n) round lower bound for CONNECTIVITY even for constant-error randomized Monte Carlo algorithms. The deterministic version of this result can be obtained via the well-known &#8220;edge-crossing&#8221; argument, but, the randomized version of this result requires establishing new combinatorial results regarding the indistinguishability graph induced by inputs. In our second result, we show that the \u00d8mega(log n) lower bound result extends to the KT-1 version of the BCC(1) model, in which nodes are aware of IDs of all neighbors, though our proof works only for deterministic algorithms. Since nodes know IDs of their neighbors in the KT-1 model, it is no longer possible to play &#8220;edge-crossing&#8221; tricks; instead we present a reduction from the 2-party communication complexity problem PARTITION in which Alice and Bob are give two set partitions on [n] and are required to determine if the join of these two set partitions equals the trivial one-part set partition. While our KT-1 CONNECTIVITY lower bound holds only for deterministic algorithms, in our third result we extend this \u00d8mega(log n) KT-1 lower bound to constant-error Monte Carlo algorithms for the closely related CONNECTED COMPONENTS problem. We use information-theoretic techniques to obtain this result. All our results hold for the seemingly easy special case of CONNECTIVITY in which an algorithm has to distinguish an instance with one cycle from an instance with multiple cycles. Our results showcase three rather different lower bound techniques and lay the groundwork for further improvements in lower bounds for CONNECTIVITY in the BCC(1) model.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688652\">Does Preprocessing Help under Congestion?<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Klaus-Tycho Foerster<\/li>\n<li class=\"nameList\">Janne H. Korhonen<\/li>\n<li class=\"nameList\">Joel Rybicki<\/li>\n<li class=\"nameList Last\">Stefan Schmid<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>This paper investigates the power of preprocessing in the CONGEST model. Schmid and Suomela (ACM HotSDN 2013) introduced the SUPPORTED CONGEST model to study the application of distributed algorithms in Software-Defined Networks (SDNs). In this paper, we show that a large class of lower bounds in the CONGEST model still hold in the SUPPORTED model, highlighting the robustness of these bounds. This also raises the question how much does preprocessing help in the CONGEST model<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 6<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688653\">The Distributed Complexity of Locally Checkable Problems on Paths is Decidable<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Alkida Balliu<\/li>\n<li class=\"nameList\">Sebastian Brandt<\/li>\n<li class=\"nameList\">Yi-Jun Chang<\/li>\n<li class=\"nameList\">Dennis Olivetti<\/li>\n<li class=\"nameList\">Mika\u00ebl Rabie<\/li>\n<li class=\"nameList Last\">Jukka Suomela<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Consider a computer network that consists of a path with n nodes. The nodes are labeled with inputs from a constant-sized set, and the task is to find output labels from a constant-sized set subject to some local constraints&#8212;more formally, we have an LCL (locally checkable labeling) problem. How many communication rounds are needed (in the standard LOCAL model of computing) to solve this problem?<\/p>\n<p>It is well known that the answer is always either O(1) rounds, or \u0398(log<sub>\u22c5<\/sub> n) rounds, or \u0398(n) rounds. In this work we show that this question is decidable (albeit PSPACE-hard): we present an algorithm that, given any LCL problem defined on a path, outputs the distributed computational complexity of this problem and the corresponding asymptotically optimal algorithm.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688654\">Hardness of Exact Distance Queries in Sparse Graphs Through Hub Labeling<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Adrian Kosowski<\/li>\n<li class=\"nameList\">Przemyslaw Uznanski<\/li>\n<li class=\"nameList Last\">Laurent Viennot<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A distance labeling scheme is an assignment of bit-labels to the vertices of an undirected, unweighted graph such that the distance between any pair of vertices can be decoded solely from their labels. An important class of distance labeling schemes is that of hub labelings, where a node \u03bd \u2208 G stores its distance to the so-called hubs S<sub>\u03bd<\/sub> \u2286 V, chosen so that for any u,\u03bd \u2208 V there is w \u2208 S<sub>u<\/sub> \u2229 S<sub>v<\/sub> belonging to some shortest uv path. Notice that for most existing graph classes, the best distance labelling constructions existing use at some point a hub labeling scheme at least as a key building block.<\/p>\n<p>Our interest lies in hub labelings of sparse graphs, i.e., those with |E(G)| = O (n), for which we show a lowerbound of n 2<sup>O<\/sup> (<sup>\u221alog n)<\/sup> for the average size of the hubsets. Additionally, we show a hub-labeling construction for sparse graphs of average size O(\u221an RS (n)<sup>c<\/sup>) for some 0 &lt; c &lt; 1, where RS (n) is the so-called Ruzsa-Szemer\u00e9di function, linked to structure of induced matchings in dense graphs. This implies that further improving the lower bound on hub labeling size to n over 2(log n)o(1 would require a breakthrough in the study of lower bounds on RS (n), which have resisted substantial improvement in the last 70 years.<\/p>\n<p>For general distance labeling of sparse graphs, we show a lowerbound of 1 over 2<sup>\u0398<\/sup>(\u221alog n) SumIndex (n), where SumIndex (n) is the communication complexity of the SUM-I problem over Z<sub>n<\/sub>. Our results suggest that the best achievable hub-label size and distance-label size in sparse graphs may be \u0398(n over 2<sup>(log n)c<\/sup> ) for some 0 &lt; c &lt; 1.,<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688665\">On the Complexity of Distributed Splitting Problems<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Philipp Bamberger<\/li>\n<li class=\"nameList\">Mohsen Ghaffari<\/li>\n<li class=\"nameList\">Fabian Kuhn<\/li>\n<li class=\"nameList\">Yannic Maus<\/li>\n<li class=\"nameList Last\">Jara Uitto<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>One of the fundamental open problems in the area of distributed graph algorithms is whether randomization is needed for efficient symmetry breaking. While there are poly log n-time randomized algorithms for all the classic symmetry breaking problems, for many of them, the best deterministic algorithms are almost exponentially slower. The following basic local splitting problem, which is known as weak splitting, takes a central role in this context: Each node of a graph G=(V,E) has to be colored red or blue such that each node of sufficiently large degree has at least one neighbor of each color. Ghaffari, Kuhn, and Maus [STOC &#8217;17] showed that this seemingly simple problem is complete w.r.t. the above fundamental open question in the following sense: If there is an efficient poly log n-time determinstic distributed algorithm for weak splitting, then there is such an algorithm for all locally checkable graph problems for which an efficient randomized algorithm exists. We investigate the distributed complexity of weak splitting and some closely related problems and we in particular obtain the following results:<\/p>\n<ul>\n<li>We obtain efficient algorithms for special cases of weak splitting in nearly regular graphs. We show that if \u03b4=\u00d8(log n) and \u0394 are the minimum and maximum degrees of G, weak splitting can be solved deterministically in time O #916;(\u221a over \u03b4 \u2022 poly(log n)). Further, if \u03b4 = \u00d8(log log n) and \u0394 \u2264 2<sup>\u03b5 \u03b4<\/sup>, the time complexity is O(\u0394 over \u03b4\u22c5poly(log log n)).<\/li>\n<li>We prove that the following two related problems are also complete in the same sense: (I) Color the nodes of a graph with C \u2264 poly log n colors such that each node with a sufficiently large polylogarithmic degree has at least 2 log n different colors among its neighbors, and (II) Color the nodes with a large constant number of colors so that for each node of a sufficiently large at least logarithmic degree d(v), the number of neighbors of each color is at most (1-\u03b5d(v) for some constant \u03b5 &gt; 0.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688666\">On the Use of Randomness in Local Distributed Graph Algorithms<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Mohsen Ghaffari<\/li>\n<li class=\"nameList Last\">Fabian Kuhn<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We attempt to better understand randomization in local distributed graph algorithms by exploring how randomness is used and what we can gain from it:<\/p>\n<ul>\n<li>We first ask the question of how much randomness is needed to obtain efficient randomized algorithms. We show that for all locally checkable problems with poly log n-time randomized algorithms, there are such algorithms even if either (I) there is a only a single (private) independent random bit in each poly log n-neighborhood of the graph, (II) the (private) bits of randomness of different nodes are only poly log n-wise independent, or (III) there are only poly log n bits of global shared randomness (and no private randomness).<\/li>\n<li>Second, we study how much we can improve the error probability of randomized algorithms. For all locally checkable problems with poly log n-time randomized algorithms, we show that there are such algorithms that succeed with probability 1-n<sup>-2 \u03b5(log log n) 2<\/sup> and more generally T-round algorithms, for T \u2265 poly log n, with success probability 1-n<sup>-2 \u03b5log 2T<\/sup>. We also show that poly log n-time randomized algorithms with success probability 1-2<sup>-2 log \u03b5 n<\/sup> for some \u03b5 &gt; 0 can be derandomized to poly log n-time deterministic algorithms.<\/li>\n<\/ul>\n<p>Both of the directions mentioned above, reducing the amount of randomness and improving the success probability, can be seen as partial derandomization of existing randomized algorithms. In all the above cases, we also show that any significant improvement of our results would lead to a major breakthrough, as it would imply significantly more efficient deterministic distributed algorithms for a wide class of problems.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688667\">Message Reduction in the LOCAL Model is a Free Lunch<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Shimon Bitton<\/li>\n<li class=\"nameList\">Yuval Emek<\/li>\n<li class=\"nameList\">Taisuke Izumi<\/li>\n<li class=\"nameList Last\">Shay Kutten<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A new spanner construction algorithm is presented, working under the LOCAL model assuming unique edge IDs. Given an n-node communication graph, a spanner with a constant stretch and \u00d5(n<sup>1 + c<\/sup>) edges (for any small constant c &gt; 0) is constructed efficiently &#8212; i.e., in a constant number of rounds and a message complexity of \u00d5 (n<sup>1 + 2c<\/sup>) whp.<\/p>\n<p>One of the many known applications of spanners is for reducing the number of messages of various algorithms. However, usually, one still needs to pay the cost of constructing the spanner. Due to the efficiency of the spanner construction here, we show that every t-round LOCAL algorithm can be transformed into a randomized one with the same asymptotic time complexity and \u00d5(t<sup>2<\/sup>n<sup>1 + O(1\/log t)<\/sup>) message complexity. All previous message-reduction schemes for LOCAL algorithms incur either an O(log n)-multiplicative or an O(polylog (n))-additive blow-up of the round complexity.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688668\">P-SLOCAL-Completeness of Maximum Independent Set Approximation<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Yannic Maus<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We prove that the maximum independent set approximation problem with polylogarithmic approximation factor is P-SLOCAL-complete.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Keynote Lecture 3<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688669\">Engineering Distributed Systems that We Can Trust (and Also Run)<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Ilya Sergey<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The interest in formal methods and verification of correctness-critical distributed systems is on the rise in the past few years. But what are the gains from proving statements about software in full mathematical rigour? Do they justify the high cost of verification? And how far can we extend our trust in formal methods when talking about realistic distributed systems and their client programs?<\/p>\n<p>This talk is in three parts. First, I will provide an overview of the state of the art in machine-assisted reasoning about distributed consensus protocols, their implementations, and applications. Next, I will discuss the trade-offs that have to be made in order to enable mechanised proofs about runnable systems code, as well as implications of the assumptions made to describe the real-world execution environments. Lastly, I will focus on the ongoing work propelled by the programming languages community towards engineering modular proofs about distributed protocols-a way to build correct-by-construction composite systems from verified reusable components.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 7<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688660\">The Consensus Number of a Cryptocurrency<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Rachid Guerraoui<\/li>\n<li class=\"nameList\">Petr Kuznetsov<\/li>\n<li class=\"nameList\">Matteo Monti<\/li>\n<li class=\"nameList\">Matej Pavlovi\u010d<\/li>\n<li class=\"nameList Last\">Dragos-Adrian Seredinschi<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Many blockchain-based algorithms, such as Bitcoin, implement a decentralized asset transfer system, often referred to as a cryptocurrency. As stated in the original paper by Nakamoto, at the heart of these systems lies the problem of preventing double-spending ; this is usually solved by achieving consensus on the order of transfers among the participants. By treating the asset transfer problem as a concurrent object and determining its consensus number, we show that consensus is not necessary to prevent double-spending. We first consider the problem as defined by Nakamoto, where only a single process&#8212;the account owner&#8212;can withdraw from each account. Safety and liveness need to be ensured for correct account owners, whereas misbehaving account owners might be unable to perform transfers. We show that the consensus number of an asset transfer object is 1. We then consider a more general k-shared asset transfer object where up to k processes can atomically withdraw from the same account, and show that this object has consensus number k. We first establish these these results in the context of shared memory with benign faults, in order to properly understand the level of difficulty of the asset transfer problem. Then, we translate our result in the more practically relevant message passing setting with Byzantine players. We describe an asynchronous Byzantine fault-tolerant asset transfer implementation that is both simpler and more efficient than state-of-the-art consensus-based solutions. Our results are applicable to both the permissioned (private) and permissionless (public) setting, as normally their differentiation is hidden by the abstractions on top of which our algorithms are based.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688661\">Communication Complexity of Byzantine Agreement, Revisited<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Ittai Abraham<\/li>\n<li class=\"nameList\">T-H. Hubert Chan<\/li>\n<li class=\"nameList\">Danny Dolev<\/li>\n<li class=\"nameList\">Kartik Nayak<\/li>\n<li class=\"nameList\">Rafael Pass<\/li>\n<li class=\"nameList\">Ling Ren<\/li>\n<li class=\"nameList Last\">Elaine Shi<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>As Byzantine Agreement (BA) protocols find application in large-scale decentralized cryptocurrencies, an increasingly important problem is to design BA protocols with improved communication complexity. A few existing works have shown how to achieve subquadratic BA under an adaptive adversary. Intriguingly, they all make a common relaxation about the adaptivity of the attacker, that is, if an honest node sends a message and then gets corrupted in some round, the adversary cannot erase the message that was already sent &#8211; henceforth we say that such an adversary cannot perform &#8220;after-the-fact removal&#8221;. By contrast, many (super-)quadratic BA protocols in the literature can tolerate after-the-fact removal. In this paper, we first prove that disallowing after-the-fact removal is necessary for achieving subquadratic-communication BA.<\/p>\n<p>Next, we show a new subquadratic binary BA construction (of course, assuming no after-the-fact removal) that achieves near- optimal resilience and expected constant rounds under standard cryptographic assumptions and a public-key infrastructure (PKI). In comparison, all known subquadratic protocols make additional strong assumptions such as random oracles or the ability of honest nodes to erase secrets from memory, and even with these strong assumptions, no prior work can achieve the above properties. Lastly, we show that some setup assumption is necessary for achieving subquadratic multicast-based BA.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688662\">Exact Byzantine Consensus on Undirected Graphs under Local Broadcast Model<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Muhammad Samir Khan<\/li>\n<li class=\"nameList\">Syed Shalan Naqvi<\/li>\n<li class=\"nameList Last\">Nitin H. Vaidya<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>This paper considers the Byzantine consensus problem for nodes with binary inputs. The nodes are interconnected by a network represented as an undirected graph, and the system is assumed to be synchronous. Under the classical point-to-point communication model, it is well-known that the following two conditions are both necessary and sufficient to achieve Byzantine consensus among n nodes in the presence of up to \u0192 Byzantine faulty nodes: n &amp; 3 #8805; 3 \u2265 \u0192+ 1 and vertex connectivity at least 2 \u0192 + 1. In the classical point-to-point communication model, it is possible for a faulty node to equivocate, i.e., transmit conflicting information to different neighbors. Such equivocation is possible because messages sent by a node to one of its neighbors are not overheard by other neighbors.<\/p>\n<p>This paper considers the local broadcast model. In contrast to the point-to-point communication model, in the local broadcast model, messages sent by a node are received identically by all of its neighbors. Thus, under the local broadcast model, attempts by a node to send conflicting information can be detected by its neighbors. Under this model, we show that the following two conditions are both necessary and sufficient for Byzantine consensus: vertex connectivity at least \u230b 3 f\u0192 \/ 2 \u230a + 1 and minimum node degree at least 2 \u0192. Observe that the local broadcast model results in a lower requirement for connectivity and the number of nodes n, as compared to the point-to-point communication model.<\/p>\n<p>We extend the above results to a hybrid model that allows some of the Byzantine faulty nodes to equivocate. The hybrid model bridges the gap between the point-to-point and local broadcast models, and helps to precisely characterize the trade-off between equivocation and network requirements.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688663\">Asymptotically Optimal Validated Asynchronous Byzantine Agreement<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Ittai Abraham<\/li>\n<li class=\"nameList\">Dahlia Malkhi<\/li>\n<li class=\"nameList Last\">Alexander Spiegelman<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We provide a new protocol for Validated Asynchronous Byzantine Agreement in the authenticated setting. Validated (multi-valued) Asynchronous Byzantine Agreement is a key building block in constructing Atomic Broadcast and fault-tolerant state machine replication in the asynchronous setting. Our protocol has optimal resilience of \u0192 &lt; n\/3 Byzantine failures and asymptotically optimal expected O(1) running time to reach agreement. Honest parties in our protocol send only an expected O(n<sup>2<\/sup>) messages where each message contains a value and a constant number of signatures. Hence our total expected communication is O(n<sup>2<\/sup>) words. The best previous result of Cachin et al. from 2001 solves Validated Byzantine Agreement with optimal resilience and O(1) expected time but with O(n<sup>3<\/sup>) expected word communication. Our work addresses an open question of Cachin et al. from 2001 and improves the expected word communication from O(n<sup>3<\/sup>) to asymptotically optimal O(n<sup>2<\/sup>).<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688664\">HotStuff: BFT Consensus with Linearity and Responsiveness<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Maofan Yin<\/li>\n<li class=\"nameList\">Dahlia Malkhi<\/li>\n<li class=\"nameList\">Michael K. Reiter<\/li>\n<li class=\"nameList\">Guy Golan Gueta<\/li>\n<li class=\"nameList Last\">Ittai Abraham<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We present HotStuff, a leader-based Byzantine fault-tolerant replication protocol for the partially synchronous model. Once network communication becomes synchronous, HotStuff enables a correct leader to drive the protocol to consensus at the pace of actual (vs. maximum) network delay&#8211;a property called responsiveness&#8212;and with communication complexity that is linear in the number of replicas. To our knowledge, HotStuff is the first partially synchronous BFT replication protocol exhibiting these combined properties. Its simplicity enables it to be further pipelined and simplified into a practical, concise protocol for building large-scale replication services.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688675\">Fault Tolerant Gradient Clock Synchronization<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Johannes Bund<\/li>\n<li class=\"nameList\">Christoph Lenzen<\/li>\n<li class=\"nameList Last\">Will Rosenbaum<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Synchronizing clocks in distributed systems is well-understood, both in terms of fault-tolerance in fully connected systems, and the optimal achievable local skew in general fault-free networks. However, so far nothing non-trivial is known about the local skew that can be achieved in non-fully-connected topologies even under a single Byzantine fault. In this work, we show that asymptotically optimal local skew can be achieved in the presence of Byzantine faults.<\/p>\n<p>Our approach combines the Lynch-Welch algorithm [19] for synchronizing a clique of n nodes with up to \u0192 &lt; n\/3 Byzantine faults, and the gradient clock synchronization (GCS) algorithm by Lenzen et al. [15] in order to render the latter resilient to faults. This is not possible on general graphs, so we augment an arbitrary input graph G by replacing each node with a fully connected cluster of 3 \u0192 +1 copies, and execute an instance of the Lynch-Welch algorithm within each cluster. We interpret the clusters as supernodes executing the GCS algorithm on G, where each node in the cluster maintains an estimate of the logical clock of its supernode. By also fully connecting clusters corresponding to neighbors in l G, supernodes maintain estimates of neighboring clusters&#8217; logical clocks. We achieve asymptotically optimal local skew, assuming that no cluster contains more than \u0192 faulty nodes. This construction yields factors of O(\u0192) and O(\u0192<sup>2<\/sup>) overheads in terms of nodes and edges, respectively. Since tolerating \u0192 faulty neighbors trivially requires degrees larger than \u0192, these overheads are asymptotically optimal.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688676\">Bootstrapping Public Blockchains Without a Trusted Setup<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Abhinav Aggarwal<\/li>\n<li class=\"nameList\">Mahnush Movahedi<\/li>\n<li class=\"nameList\">Jared Saia<\/li>\n<li class=\"nameList Last\">Mahdi Zamani<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We propose a protocol that allows the participants of a permissionless decentralized system to agree on a set of identities in the presence of a computationally-bounded Byzantine adversary. Our protocol guarantees that the fraction of identities belonging to the adversary in the set of identities is at most equal to the total computational hash power of the adversary.<\/p>\n<p>We significantly improve on the existing state-of-the-art in the following four ways. First, our algorithm runs in expected O(1) rounds, in contrast to previous results which require O(log n\/log log n) rounds, where n is the number of initial nodes in the system. Second, we require each node to solve just one computational puzzle, whereas previous algorithms require O(log n\/log log n) puzzles per node. Third, our algorithm sends only O(n) bits per node in expectation, whereas previous algorithms send O( (log<sup>2<\/sup> n\/log log n)) bits in expectation. Finally, in contrast to past results, our algorithm handles dynamic joining and leaving of nodes.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 8<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688677\">Hardness of Minimal Symmetry Breaking in Distributed Computing<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Alkida Balliu<\/li>\n<li class=\"nameList\">Juho Hirvonen<\/li>\n<li class=\"nameList\">Dennis Olivetti<\/li>\n<li class=\"nameList Last\">Jukka Suomela<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A graph is weakly 2-colored if the nodes are labeled with colors black and white such that each black node is adjacent to at least one white node and vice versa. In this work we study the distributed computational complexity of weak 2-coloring in the standard \u0142ocal model of distributed computing, and how it is related to the distributed computational complexity of other graph problems.<\/p>\n<p>First, we show that weak 2-coloring is a minimal distributed symmetry-breaking problem for regular even-degree trees and high-girth graphs: if there is any non-trivial locally checkable labeling problem that is solvable in o(log<sup>\u22c5<\/sup> n) rounds with a distributed graph algorithm in the middle of a regular even-degree tree, then weak 2-coloring is also solvable in o(log<sup>\u22c5<\/sup> n) rounds there.<\/p>\n<p>Second, we prove a tight lower bound of \u03a9(log <sup>\u22c5<\/sup> n) for the distributed computational complexity of weak 2-coloring in regular trees; previously only a lower bound of \u03a9 n(log log<sup>\u22c5<\/sup> n) was known. By minimality, the same lower bound holds for any non-trivial locally checkable problem inside regular even-degree trees.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688678\">An Automatic Speedup Theorem for Distributed Problems<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Sebastian Brandt<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Recently, Brandt et al.\\ [STOC&#8217;16] proved a lower bound for the distributed Lov\u00e1sz Local Lemma, which has been conjectured to be tight for sufficiently relaxed LLL criteria by Chang and Pettie [FOCS&#8217;17]. At the heart of their result lies a speedup technique that, for graphs of girth at least 2t+2, transforms any t-round algorithm for one specific LLL problem into a (t-1)-round algorithm for the same problem. We substantially improve on this technique by showing that such a speedup exists for any locally checkable problem \u00b6i, with the difference that the problem \u00b6i_1 the inferred (t-1)-round algorithm solves is not (necessarily) the same problem as \u00b6i. Our speedup is automatic in the sense that there is a fixed procedure that transforms a description for \u00b6i into a description for \u00b6i_1 and reversible in the sense that any (t-1)-round algorithm for \u00b6i_1 can be transformed into a t-round algorithm for \u00b6i. In particular, for any locally checkable problem \u00b6i with exact deterministic time complexity T(n, \u0394) \u0142eq t on graphs with n nodes, maximum node degree \u0394, and girth at least 2t+2, there is a sequence of problems \u00b6i_1, \u00b6i_2, \\dots with time complexities T(n, \u0394)-1, T(n, \u0394)-2, \\dots, that can be inferred from \u00b6i. As a first application of our generalized speedup, we solve a long-standing open problem of Naor and Stockmeyer [STOC&#8217;93]: we show that weak 2-coloring in odd-degree graphs cannot be solved in o(\u0142og^* \u0394) rounds, thereby providing a matching lower bound to their upper bound.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688679\">A Sharp Threshold Phenomenon for the Distributed Complexity of the Lov\u00e1sz Local Lemma<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Sebastian Brandt<\/li>\n<li class=\"nameList\">Yannic Maus<\/li>\n<li class=\"nameList Last\">Jara Uitto<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The Lov\u00e1 sz Local Lemma (LLL) says that, given a set of bad events that depend on the values of some random variables and where each event happens with probability at most p and depends on at most d other events, there is an assignment of the variables that avoids all bad events if the LLL criterion ep(d+1)&lt;1 is satisfied. Nowadays, in the area of distributed graph algorithms it has also become a powerful framework for developing&#8212;mostly randomized&#8212;algorithms. A classic result by Moser and Tardos yields an O(\u0142og^2 n) algorithm for the distributed Lov\u00e1 sz Local Lemma [JACM&#8217;10] if ep(d + 1) &lt; 1 is satisfied. Given a stronger criterion, i.e., demanding a smaller error probability, it is conceivable that we can find better algorithms. Indeed, for example Chung, Pettie and Su [PODC&#8217;14] gave an O(\u0142og_epd^2 n) algorithm under the epd^2 &lt; 1 criterion. Going further, Ghaffari, Harris and Kuhn introduced an 2^O(\\sqrt\u0142og \u0142og n ) time algorithm given d^8 p = O(1) [FOCS&#8217;18]. On the negative side, Brandt et al.\\ and Chang et al.\\ showed that we cannot go below \u00d8mega(\u0142og \u0142og n) (randomized) [STOC&#8217;16] and \u00d8mega(\u0142og n) (deterministic) [FOCS&#8217;16], respectively, under the criterion p\u0142eq 2^-d . Furthermore, there is a lower bound of \u00d8mega(\u0142og^* n) that holds for any criterion. In this paper, we study the dependency of the distributed complexity of the LLL problem on the chosen LLL criterion. We show that for the fundamental case of each random variable of the considered LLL instance being associated with an edge of the input graph, that is, each random variable influences at most two events, a sharp threshold phenomenon occurs at p = 2^-d : we provide a simple deterministic (!) algorithm that matches the \u00d8mega(\u0142og^* n) lower bound in bounded degree graphs, if p &lt; 2^-d , whereas for p \\geq 2^-d , the \u00d8mega(\u0142og \u0142og n) randomized and the \u00d8mega(\u0142og n) deterministic lower bounds hold. In many applications variables affect more than two events; our main contribution is to extend our algorithm to the case where random variables influence at most three different bad events. We show that, surprisingly, the sharp threshold occurs at the exact same spot, providing evidence for our conjecture that this phenomenon always occurs at p = 2^-d , independent of the number r of events that are affected by a variable. Almost all steps of the proof framework we provide for the case r=3 extend directly to the case of arbitrary r; consequently, our approach serves as a step towards characterizing the complexity of the LLL under different exponential criteria.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 9<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688670\">Reconfigurable Atomic Transaction Commit<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Manuel Bravo<\/li>\n<li class=\"nameList Last\">Alexey Gotsman<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Modern data stores achieve scalability by partitioning data into shards and fault-tolerance by replicating each shard across several servers. A key component of such systems is a Transaction Certification Service (TCS), which atomically commits a transaction spanning multiple shards. Existing TCS protocols require 2f+1 crash-stop replicas per shard to tolerate f failures. In this paper we present atomic commit protocols that require only f+1 replicas and reconfigure the system upon failures using an external reconfiguration service. We furthermore rigorously prove that these protocols correctly implement a recently proposed TCS specification. We present protocols in two different models&#8212;the standard asynchronous message-passing model and a model with Remote Direct Memory Access (RDMA), which allows a machine to access the memory of another machine over the network without involving the latter&#8217;s CPU. Our protocols are inspired by a recent FARM system for RDMA-based transaction processing. Our work codifies the core ideas of FARM as distributed TCS protocols, rigorously proves them correct and highlights the trade-offs required by the use of RDMA.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688671\">The Impact of RDMA on Agreement<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Marcos K. Aguilera<\/li>\n<li class=\"nameList\">Naama Ben-David<\/li>\n<li class=\"nameList\">Rachid Guerraoui<\/li>\n<li class=\"nameList\">Virendra Marathe<\/li>\n<li class=\"nameList Last\">Igor Zablotchi<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Remote Direct Memory Access (RDMA) is becoming widely available in data centers. This technology allows a process to directly read and write the memory of a remote host, with a mechanism to control access permissions. In this paper, we study the fundamental power of these capabilities. We consider the well-known problem of achieving consensus despite failures, and find that RDMA can improve the inherent trade-off in distributed computing between failure resilience and performance. Specifically, we show that RDMA allows algorithms that simultaneously achieve high resilience and high performance, while traditional algorithms had to choose one or another. With Byzantine failures, we give an algorithm that only requires n \\geq 2f_P + 1 processes (where f_P is the maximum number of faulty processes) and decides in two (network) delays in common executions. With crash failures, we give an algorithm that only requires n \\geq f_P + 1 processes and also decides in two delays. Both algorithms tolerate a minority of memory failures inherent to RDMA, and they provide safety in asynchronous systems and liveness with standard additional assumptions.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688672\">Hyaline: Fast and Transparent Lock-Free Memory Reclamation<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Ruslan Nikolaev<\/li>\n<li class=\"nameList Last\">Binoy Ravindran<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We present a new lock-free safe memory reclamation algorithm, Hyaline, which is fast, scalable, and transparent to the underlying data structures. Hyaline easily handles virtually unbounded number of threads that can be created and deleted dynamically, while retaining O(1) reclamation cost. We also extend Hyaline to avoid situations where stalled threads prevent others from reclaiming newly allocated objects, a common problem with epoch-based reclamation. Our evaluation reveals that Hyaline&#8217;s throughput is high &#8212; it steadily outperformed other reclamation schemes by &gt;10% in one test and yielded even higher gains in oversubscribed scenarios.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688673\">Layering Data Structures over Skip Graphs for Increased NUMA Locality<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Samuel Thomas<\/li>\n<li class=\"nameList Last\">Hammurabi Mendes<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We present a lock-free, linearizable, and NUMA-aware data structure that implements sets, maps, and priority queue abstract data types (ADTs), based on using thread-local, sequential maps that are used to &#8220;jump&#8221; to suitable positions in a lock-free, linearizable variant of a skip graph. Our skip graph is suitably constrained in height and subjected to a data partition scheme that reduces contention and increases NUMA locality. We developed an additional skip graph variant, which we call sparse skip graph, that causes our thread-local maps as well as our shared structure to become more sparse. Compared to using regular skip graphs, sparse skip graphs show increased performance in workloads dominated by &#8220;insert&#8221; or &#8220;remove&#8221; operations, and comparable performance in workloads dominated by &#8220;contains&#8221; operations.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 10<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688674\">Partially Replicated Causally Consistent Shared Memory: Lower Bounds and An Algorithm<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Zhuolun Xiang<\/li>\n<li class=\"nameList Last\">Nitin H. Vaidya<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The focus of this paper is on causal consistency in a partially replicated distributed shared memory (DSM) system that provides the abstraction of shared read\/write registers. Maintaining causal consistency in distributed shared memory systems has received significant attention in the past, mostly on full replication wherein each replica stores a copy of all the registers in the shared memory. To ensure causal consistency, all causally preceding updates must be performed before an update is performed at any given replica. Therefore, some mechanism for tracking causal dependencies is required, such as vector timestamps with the number of vector elements being equal to the number of replicas in the context of full replication. In this paper, we investigate causal consistency in partially replicated systems, wherein each replica may store only a subset of the shared registers. Building on the past work, this paper makes three key contributions:<\/p>\n<ul>\n<li>present a necessary condition on the metadata (which we refer as a timestamp) that must be maintained by each replica to be able to track causality accurately. The necessary condition identifies a set of directed edges in a share graph that a replica&#8217;s timestamp must keep track of.<\/li>\n<li>We present an algorithm for achieving causal consistency using a timestamp that matches the above necessary condition, thus showing that the condition is necessary and sufficient.<\/li>\n<li>We define a measurement of timestamp space size and present a lower bound (in bits) on the size of the timestamps. The lower bound matches our algorithm in several special cases.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688685\">Vorpal: Vector Clock Ordering For Large Persistent Memory Systems<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Kunal Korgaonkar<\/li>\n<li class=\"nameList\">Joseph Izraelevitz<\/li>\n<li class=\"nameList\">Jishen Zhao<\/li>\n<li class=\"nameList Last\">Steven Swanson<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In systems with non-volatile main memories (NVMMs), programmers must carefully control the order in which writes become persistent. Otherwise, what will remain in persistence after a crash may be unusable upon recovery. Prior art has already explored semantic models for specifying this persist order, but most enforcement algorithms for the order are not scalable to large server machines because they assume that the machine contains only one or two memory controllers. In this paper, we describe a collection of provably correct algorithms for enforcing the persist-order across writes, generated at many different cores, and persisted across numerous different memory controllers. Relative to existing solutions, our algorithms improve performance by 48% by reducing both traffic and serialization overheads.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688686\">On the Parallels between Paxos and Raft, and how to Port Optimizations<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Zhaoguo Wang<\/li>\n<li class=\"nameList\">Changgeng Zhao<\/li>\n<li class=\"nameList\">Shuai Mu<\/li>\n<li class=\"nameList\">Haibo Chen<\/li>\n<li class=\"nameList Last\">Jinyang Li<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In recent years, Raft has surpassed Paxos to become the more popular consensus protocol in the industry. While many researchers have observed the similarities between the two protocols, no one has shown how Raft and Paxos are formally related to each other. In this paper, we present a formal mapping between Raft and Paxos, and use this knowledge to port a certain class of optimizations from Paxos to Raft. In particular, our porting method can automatically generate an optimized protocol specification with guaranteed correctness. As case studies, we port and evaluate two optimizations, Mencius and Paxos Quorum Lease to Raft.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688687\">Linearizable State Machine Replication of State-Based CRDTs without Logs<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Jan Skrzypczak<\/li>\n<li class=\"nameList\">Florian Schintke<\/li>\n<li class=\"nameList Last\">Thorsten Sch\u00fctt<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>General solutions of state machine replication have to ensure that all replicas apply the same commands in the same order, even in the presence of failures. Such strict ordering incurs high synchronization costs due to the use of distributed consensus or a leader. This paper presents a protocol for linearizable state machine replication of conflict-free replicated data types (CRDTs) that neither requires consensus nor a leader. By leveraging the properties of state-based CRDTs&#8212;in particular the monotonic growth of a join semilattice&#8212;synchronization overhead is greatly reduced. In addition, updates just need a single round trip and modify the state &#8216;in-place&#8217; without the need for a log. Furthermore, the message size overhead for coordination consists of a single counter per message. While reads in the presence of concurrent updates are not wait-free without a coordinator, we show that more than 97\\,% of reads can be handled in one or two round trips under highly concurrent accesses. Our protocol achieves high throughput without auxiliary processes such as command log management or leader election. It is well suited for all practical scenarios that need linearizable access on CRDT data on a fine-granular scale.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688688\">On Mixing Eventual and Strong Consistency: Bayou Revisited<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Maciej Kokoci\u0144ski<\/li>\n<li class=\"nameList\">Tadeusz Kobus<\/li>\n<li class=\"nameList Last\">Pawe\u0142 T. Wojciechowski<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In this paper we study the properties of eventually consistent distributed systems that feature arbitrarily complex semantics and mix eventual and strong consistency. These systems execute requests in a highly-available, weakly-consistent fashion, but also enable stronger guarantees through additional inter-replica synchronization mechanisms that require the ability to solve distributed consensus. We use the seminal Bayou system as a case study, and then generalize our findings to a whole class of systems. We show dubious and unintuitive behaviour exhibited by those systems and provide a theoretical framework for reasoning about their correctness. We also state an impossibility result that formally proves the inherent limitation of such systems, namely temporary operation reordering, which admits interim disagreement between replicas on the relative order in which the client requests were executed.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 11<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688689\">Massively Parallel Algorithms for Finding Well-Connected Components in Sparse Graphs<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Sepehr Assadi<\/li>\n<li class=\"nameList\">Xiaorui Sun<\/li>\n<li class=\"nameList Last\">Omri Weinstein<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Massively parallel computation (MPC) algorithms for graph problems have witnessed a resurgence of interest in recent years. Despite major progress for numerous graph problems however, the complexity of the sparse graph connectivity problem in this model has remained elusive: While classical logarithmic-round PRAM algorithms for finding connected components in any n-vertex graph have been known for more than three decades (and imply the same bounds for MPC model), no o(log n)-round MPC algorithms are known for this task with truly sublinear in n memory per machine (which is the only interesting regime for sparse graphs with O(n) edges). It is conjectured that an o(log n)-round algorithm for connectivity on general sparse graphs with n<sup>1-\u03a9 (1)<\/sup> per-machine memory may not exist, a conjecture that also forms the basis for multiple conditional hardness results on the round complexity of other problems in the MPC model.<\/p>\n<p>We take an opportunistic approach towards the sparse graph connectivity problem by designing an algorithm with improved performance in terms of the connectivity structure of the input graph. Formally, we design an MPC algorithm that finds all connected components with spectral gap at least \u03bb in a graph in O(log log n + log(1\/\u03bb)) MPC rounds and n<sup>\u03b4<\/sup> memory per machine for any constant \u03b4 \u2208 (0,1). While this algorithm still requires \u0398(log n) rounds in the worst-case, it achieves an exponential round reduction on &#8220;well-connected&#8221; components with \u03bb \u2265 1\/polylog(n) using only n<sup>\u03b4<\/sup> memory per machine and \u0142(n) total memory, and still operates in o(log n)l rounds even when \u03bb = 1\/n<sup>o(1)<\/sup>.<\/p>\n<p>En-route to our main result, we design a new distributed data structure for performing independent random walks from all vertices simultaneously, as well as a new leader-election algorithm with exponentially faster round complexity on random graphs.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688680\">The Complexity of (\u0394+1) Coloring in Congested Clique, Massively Parallel Computation, and Centralized Local Computation<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Yi-Jun Chang<\/li>\n<li class=\"nameList\">Manuela Fischer<\/li>\n<li class=\"nameList\">Mohsen Ghaffari<\/li>\n<li class=\"nameList\">Jara Uitto<\/li>\n<li class=\"nameList Last\">Yufan Zheng<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>In this paper, we present new randomized algorithms that improve the complexity of the classic (\u0394+1)-coloring problem, and its generalization (\u0394+1)-list-coloring, in three well-studied models of distributed, parallel, and centralized computation: <b>Distributed Congested Clique:<\/b> We present an O(1)-round randomized algorithm for (\u0394 + 1)-list-coloring in the congested clique model of distributed computing. This settles the asymptotic complexity of this problem. It moreover improves upon the O(log* \u0394)-round randomized algorithms of Parter and Su [DISC&#8217;18] and O((log log \u0394)\u22c5 log* \u0394)-round randomized algorithm of Parter [ICALP&#8217;18].<\/p>\n<p><b>Massively Parallel Computation:<\/b> We present a randomized (\u0394 + 1)-list-coloring algorithm with round complexity O(\u221a log log n ) in the Massively Parallel Computation (MPC) model with strongly sublinear memory per machine. This algorithm uses a memory of O(n<sup>\u03b1<\/sup>) per machine, for any desirable constant \u03b1 &gt; 0, and a total memory of \u00d5 (m), where m is the number of edges in the graph. Notably, this is the first coloring algorithm with sublogarithmic round complexity, in the sublinear memory regime of MPC. For the quasilinear memory regime of MPC, an O(1)-round algorithm was given very recently by Assadi et al. [SODA&#8217;19].<\/p>\n<p><b>Centralized Local Computation:<\/b> We show that (\u0394 + 1)-list-coloring can be solved by a randomized algorithm with query complexity \u0394 O(1) \u2026 O(log n), in the centralized local computation model. The previous state of the art for (\u0394+1)-list-coloring in the centralized local computation model are based on simulation of known LOCAL algorithms. The deterministic O(\u221a \u0394 poly log \u0394 + log* n)-round LOCAL algorithm of Fraigniaud et al. [FOCS&#8217;16] can be implemented in the centralized local computation model with query complexity \u0394<sup>O<\/sup>(\u221a \u0394 poly log \u0394) \u2026 O(log* n); the randomized O(log* \u0394) + 2^<sup>O<\/sup>(\u221a log log n)-round LOCAL algorithm of Chang et al. [STOC&#8217;18] can be implemented in the centralized local computation model with query complexity \u0394<sup>O<\/sup>(log* \u0394) \u2026 O(log n).<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688681\">Massively Parallel Computation of Matching and MIS in Sparse Graphs<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Soheil Behnezhad<\/li>\n<li class=\"nameList\">Sebastian Brandt<\/li>\n<li class=\"nameList\">Mahsa Derakhshan<\/li>\n<li class=\"nameList\">Manuela Fischer<\/li>\n<li class=\"nameList\">MohammadTaghi Hajiaghayi<\/li>\n<li class=\"nameList\">Richard M. Karp<\/li>\n<li class=\"nameList Last\">Jara Uitto<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The Massively Parallel Computation (MPC) model serves as a common abstraction of many modern large-scale parallel computation frameworks and has recently gained a lot of importance, especially in the context of classic graph problems. In this work, we mainly consider maximal matching and maximal independent set problems in the MPC model.<\/p>\n<p>These problems are known to admit efficient MPC algorithms if the space available per machine is near-linear in the number n of nodes. This is not only often significantly more than what we can afford, but also allows for easy if not trivial solutions for sparse graphs&#8212;which are common in real-world large-scale graphs. We are, therefore, interested in the low-memory MPC model, where the space per machine is restricted to be strongly sublinear, that is, n<sup>\u03b4<\/sup> for any constant 0 &lt; \u03b4 &lt; 1.<\/p>\n<p>We parametrize our algorithms by the arboricity \u03bb of the input graph. Our key ingredient is a degree reduction technique that reduces these problems in graphs with arboricity \u03bb to the corresponding problems in graphs with maximum degree poly(\u03bb, log n) in O(log<sup>2<\/sup> log n) rounds, giving rise to O(\u221a log \u03bb \u22c5 log log \u03bb + log <sup>2<\/sup> log n)-round algorithms.<\/p>\n<p>Our result is particularly interesting for graphs with poly log n arboricity as for such graphs, we get O(log <sup>2<\/sup> log n)-round algorithms. This covers most natural families of sparse graphs and almost exponentially improves over previous algorithms that all required log <sup>\u03a9(1)<\/sup> n rounds in this regime of MPC.<\/p>\n<p>Finally, our maximal matching algorithm can be employed to obtain a (1+\u03b5)-approximate maximum cardinality matching, a (2+\u03b5)-approximate maximum weighted matching, as well as a 2-approximate minimum vertex cover in essentially the same number of rounds.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688682\">Weighted Matchings via Unweighted Augmentations<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Buddhima Gamlath<\/li>\n<li class=\"nameList\">Sagar Kale<\/li>\n<li class=\"nameList\">Slobodan Mitrovic<\/li>\n<li class=\"nameList Last\">Ola Svensson<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We design a generic method to reduce the task of finding weighted matchings to that of finding short augmenting paths in unweighted graphs. This method enables us to provide efficient implementations for approximating weighted matchings in the massively parallel computation (MPC) model and in the streaming model.<\/p>\n<p>For the MPC and the multi-pass streaming model, we show that any algorithm computing a (1-\u03b4)-approximate unweighted matching in bipartite graphs can be translated into an algorithm that computes a (1-(\u03b5(\u03b4))-approximate maximum weighted matching. Furthermore, this translation incurs only a constant factor (that depends on \u03b5 &gt; 0) overhead in the complexity. Instantiating this with the current best MPC algorithm for unweighted matching yields a (1 &#8211; \u03b5)-approximation algorithm for maximum weighted matching that uses O<sub>\u03b5<\/sub>(log log n) rounds, O(m\/n) machines per round, and O(npoly(logn)) memory per machine. This improves upon the previous best approximation guarantee of (1\/2-\u03b5) for weighted graphs. In the context of single-pass streaming with random edge arrivals, our techniques yield a (1\/2+c)-approximation algorithm thus breaking the natural barrier of 1\/2.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Session 12<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688683\">Implementing Mediators with Asynchronous Cheap Talk<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Ittai Abraham<\/li>\n<li class=\"nameList\">Danny Dolev<\/li>\n<li class=\"nameList\">Ivan Geffner<\/li>\n<li class=\"nameList Last\">Joseph Y. Halpern<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>A mediator can help non-cooperative agents obtain an equilibrium that may otherwise not be possible. We study the ability of players to obtain the same equilibrium without a mediator, using only cheap talk, that is, nonbinding pre-play communication. Previous work has considered this problem in a synchronous setting. Here we consider the effect of asynchrony on the problem, and provide upper bounds for implementing mediators. Considering asynchronous environments introduces new subtleties, including exactly what solution concept is most appropriate and determining what move is played if the cheap talk goes on forever. Different results are obtained depending on whether the move after such &#8220;infinite play&#8221; is under the control of the players or part of the description of the game.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688684\">Distributed Minimum Degree Spanning Trees<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Michael Dinitz<\/li>\n<li class=\"nameList\">Magnus M. Halldorsson<\/li>\n<li class=\"nameList\">Taisuke Izumi<\/li>\n<li class=\"nameList Last\">Calvin Newport<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The minimum degree spanning tree (MDST) problem requires the construction of a spanning tree T for graph G, such that the maximum degree of T is the smallest among all spanning trees of G. Let d be this MDST degree for a given graph. In this paper, we present a randomized distributed approximation algorithm for the MDST problem that constructs a spanning tree with maximum degree in O(d+log n ). With high probability in n, the algorithm runs in O((D + \u221an) log<sub>4<\/sub> n) rounds, in the broadcast-CONGEST model, where D is the graph diameter and n is the graph size. We then show how to derandomize this algorithm, obtaining the same asymptotic guarantees on degree and time complexity, but now requiring the standard CONGEST model. Although efficient approximation algorithms for the MDST problem have been known in the sequential setting since the 1990&#8217;s (finding an exact solution is NP-hard), our algorithms are the first efficient distributed solutions. We conclude by proving a lower bound that establishes that any randomized MDST algorithm that guarantees a maximum degree in \u223c\u03a9 (d) requires &amp;\u00d8#8764;\u03a9 (n<sup>1\/3<\/sup>) rounds, and any deterministic solution requires \u223c\u03a9 (n<sup>1\/2<\/sup>) rounds. These bounds proves our deterministic algorithm to be asymptotically optimal, and eliminates the possibility of significantly more efficient randomized solutions.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688695\">Improved Distributed Approximations for Minimum-Weight Two-Edge-Connected Spanning Subgraph<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Michal Dory<\/li>\n<li class=\"nameList Last\">Mohsen Ghaffari<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>The minimum-weight 2-edge-connected spanning subgraph (2-ECSS) problem is a natural generalization of the well-studied minimum-weight spanning tree (MST) problem, and it has received considerable attention in the area of network design. The latter problem asks for a minimum-weight subgraph with an edge connectivity of 1 between each pair of vertices while the former strengthens this edge-connectivity requirement to 2. Despite this resemblance, the 2-ECSS problem is considerably more complex than MST. While MST admits a linear-time centralized exact algorithm, 2-ECSS is NP-hard and the best known centralized approximation algorithm for it (that runs in polynomial time) gives a 2-approximation.<\/p>\n<p>In this paper, we give a deterministic distributed algorithm with round complexity of \u00d5 (D + \u221an) that computes a (9 + \u03b5)-approximation of 2-ECSS, for any constant \u03b5 &gt; 0. Up to logarithmic factors, this complexity matches the \u00d8 (D + \u221a n) lower bound that can be derived from the technique of Das Sarma et al. [STOC&#8217;11], as shown by Censor-Hillel and Dory [OPODIS&#8217;17]. Our result is the first distributed constant approximation for 2-ECSS in the nearly optimal time and it improves on a recent randomized algorithm of Dory [PODC&#8217;18], which achieved an O(\u0142og n)-approximation in \u00d5 (D+\u221a ) rounds.<\/p>\n<p>We also present an alternative algorithm for O(log n)-approximation, whose round complexity is linear in the low-congestion shortcut parameter of the network&#8212;following a framework introduced by Ghaffari and Haeupler [SODA&#8217;16]. This algorithm has round complexity \u00d6 (D+\u221an) in worst-case networks but it provably runs much faster in many well-behaved graph families of interest. For instance, it runs in \u00d5 (D) time in planar networks and those with bounded genus, bounded path-width or bounded tree-width.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688696\">Near-Additive Spanners In Low Polynomial Deterministic CONGEST Time<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Michael Elkin<\/li>\n<li class=\"nameList Last\">Shaked Matar<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>Given a pair of parameters \u03b1 \u2265 1,\u03b2 \u2265 0, a subgraph G&#8217;=(V,H) of an n-vertex unweighted undirected graph G=(V,E) is called an (\u03b1,\u03b2)-spanner if for every pair u,\u03bd \u2208 V of vertices, we have d<sub>G&#8217;<\/sub> (u,\u03bd)\u2264 \u03b1 d<sub>G<\/sub> (u,\u03b1)+\u03b2. If \u03b2=0 the spanner is called a multiplicative \u03b1-spanner, and if \u03b1 = 1+\u03b5, for an arbitrarily small \u03b5&gt;0, the spanner is said to be near-additive.<\/p>\n<p>Graph spanners [5,36], are a fundamental and extremely well-studied combinatorial construct, with a multitude of applications in distributed computing and in other areas. Near-additive spanners, introduced in [27], preserve large distances much more faithfully than the more traditional multiplicative spanners. Also, recent lower bounds [1] ruled out the existence of arbitrarily sparse purely additive spanners (i.e., spanners with \u03b1=1), and therefore showed that essentially near-additive spanners provide the best approximation of distances that one can hope for.<\/p>\n<p>Numerous distributed algorithms, for constructing sparse near-additive spanners, were devised in [17,20,25,28,40]. In particular, there are now known efficient randomized algorithms in the CONGEST model that construct such spanners [25]., and also there are efficient deterministic algorithms in the LOCAL model [17]. However, the only known deterministic CONGEST-model algorithm for the problem [20] requires super-linear time in n. In this paper, we remedy the situation and devise an efficient deterministic CONGEST-model algorithm for constructing arbitrarily sparse near-additive spanners.<\/p>\n<p>The running time of our algorithm is low polynomial, i.e., roughly O(\u03b2 \u22c5 n<sup>\u03c1<\/sup>), where \u03c1 &gt; 0 is an arbitrarily small positive constant that affects the additive term \u03b2. In general, the parameters of our new algorithm and of the resulting spanner are at the same ballpark as the respective parameters of the state-of-the-art randomized algorithm due to [25].<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688697\">A Trivial Yet Optimal Solution to Vertex Fault Tolerant Spanners<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Greg Bodwin<\/li>\n<li class=\"nameList Last\">Shyamal Patel<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>We give a short and easy upper bound on the worst-case size of fault tolerant spanners, which improves on all prior work and is fully optimal at least in the setting of vertex faults.<\/p>\n<\/div>\n<\/div>\n<h2>SESSION: Tutorials<\/h2>\n<div class=\"DLabstract\">\u00a0<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688698\">From Classical to Blockchain Consensus: What Are the Exact Algorithms?<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Yanhong A. Liu<\/li>\n<li class=\"nameList Last\">Scott D. Stoller<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>This tutorial describes well-known algorithms for distributed consensus problems, from classical consensus to blockchain consensus, and discusses exact algorithms that are high-level as in pseudocode and directly executable at the same time. The tutorial consists of five parts:<\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ol>(1) A introduction to different distributed consensus problems, from classical consensus and Byzantine consensus to blockchain consensus.<\/ol>\n<\/li>\n<li style=\"list-style-type: none;\">\n<ol>(2) An overview of well-known algorithms, from Paxos for classical consensus to the Bitcoin algorithm for blockchain consensus, including important variants such as Viewstamped Replication and Virtual Synchrony, as well as Proof-of-Stake vs. Proof-of-Work.<\/ol>\n<\/li>\n<li style=\"list-style-type: none;\">\n<ol>(3) An overview of a method and language, DistAlgo, for expressing distributed algorithms precisely at a high-level as pseudocode and having them be directly executable at the same time.<\/ol>\n<\/li>\n<li style=\"list-style-type: none;\">\n<ol>(4) A study of exact algorithms expressed at a high level for the most extensively studied algorithm variants, including Lamport&#8217;s Paxos for classical consensus and Nakamoto&#8217;s Bitcoin algorithm for blockchain consensus.<\/ol>\n<\/li>\n<li style=\"list-style-type: none;\">\n<ol>(5) A demo of the direct execution of these exact algorithms by distributed processes.<\/ol>\n<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688699\">Byzantine Fault Tolerant State Machine Replication in Any Programming Language<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Ethan Buchman<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>State machine replication is a fundamental primitive in fault tolerant distributed computing, but few production tools exist to support the replication of arbitrary state machines. The tools that do exist, like Apache Zookeeper, CoreOS&#8217;s etcd, and Hashicrop&#8217;s Consul, include an implementation of a consensus algorithm (eg. ZAB or Raft) for replication, and a service-discovery oriented key-value store as the state machine. While these tools can tolerate crash failures, they cannot tolerate malicious or adversarial (&#8220;Byzantine&#8221;) faults.<\/p>\n<p>We present Tendermint, a production-grade Byzantine Fault Tolerant State Machine Replication engine written in Go. Tendermint supports replication for state machines written in any language by using a socket protocol to communicate between the state machine and the replication engine. Tendermint is being used on the public internet today to secure upwards of 1 Billion USD in value, with deployments supporting hundreds of consensus nodes. In this workshop, we provide an overview of the Tendermint system and demonstrate how to build Byzantine Fault Tolerant applications in Go and Javascript. We will also introduce the Cosmos Hub, an advanced Proof-of-Stake cryptocurrency system built on Tendermint.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688690\">Central Control over Distributed Asynchronous Systems: A Tutorial on Software-Defined Networks and Consistent Network Updates<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList Last\">Klaus-Tycho Foerster<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>This tutorial will give an introduction to a topic that lies at the intersection of distributed computing and networking, and combines asynchronous distributed systems with central control, namely consistent updates in Software-Defined Networks (SDNs). We will give an overview on current models and algorithms, but also selected related topics, in particular those of potential interest to the PODC community, showcasing avenues for further research.<\/p>\n<p>In more detail, SDNs have been an intensively studied topic in networking over the last years, but much of its focus has been on (logical) central control, abstracting away most of its underlying foundation, namely that a network is still a distributed asynchronous system at its core. Summarized in a simplified way, SDNs come with the promise that the network state can be optimized and updated from a global point of view. However, such a simplification becomes especially problematic when consistency guarantees have to maintained. In asynchronous distributed systems, it is not possible to simultaneously change the state of all nodes, such a naive approach will lead to an inconsistent mix of old and new states, introducing e.g. forwarding loops. Notwithstanding, most approaches tackle these issues from the viewpoints of the networking\/systems communities, and we believe could henceforth greatly benefit from connections to and ideas from the PODC community.<\/p>\n<\/div>\n<\/div>\n<h3><a class=\"DLtitleLink\" title=\"Get the Full Text from the ACM Digital Library\" href=\"https:\/\/dl.acm.org\/authorize?N688691\">Tutorial: Specifying, Implementing, and Verifying Algorithms for Persistent Memory<\/a><\/h3>\n<ul class=\"DLauthors\">\n<li class=\"nameList First\">Diego Cepeda<\/li>\n<li class=\"nameList\">Sakib Chowdhury<\/li>\n<li class=\"nameList Last\">Wojciech Golab<\/li>\n<\/ul>\n<div class=\"DLabstract\">\n<div>\n<p>High-density byte-addressable non-volatile memory became a reality earlier this year when Intel launched the long-awaited Optane persistent memory module. This tutorial is intended for researchers interested in using persistent memory to construct fault-tolerant data structures that can maintain state consistently across power outages and system crashes without relying on conventional secondary storage. A number of practical and theoretical topics will be covered including hardware purchasing considerations, operating system and programming language support for persistent memory, definitions of correctness properties for fault-tolerant data structures, techniques for implementing fault-tolerant concurrency control and memory management, as well as verification of correctness.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>\u00a0 PODC &#8217;19- Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing Full Citation in the ACM Digital Library SESSION: Awards \u00a0 2019 Edsger W. Dijkstra Prize in Distributed Computing Lorenzo Alvisi Shlomi Dolev Faith Ellen (chair) Idit Keidar Fabian Kuhn Jukka Suomela The committee decided to award the 2019 Edsger W. Dijkstra &hellip; <a href=\"https:\/\/www.podc.org\/podc2019\/table-of-contents\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Table of Contents&#8221;<\/span><\/a><\/p>\n","protected":false},"author":11,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-336","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/pages\/336","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/users\/11"}],"replies":[{"embeddable":true,"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/comments?post=336"}],"version-history":[{"count":3,"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/pages\/336\/revisions"}],"predecessor-version":[{"id":339,"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/pages\/336\/revisions\/339"}],"wp:attachment":[{"href":"https:\/\/www.podc.org\/podc2019\/wp-json\/wp\/v2\/media?parent=336"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}