arXiv (Cornell University)
A Graphical Interactive Debugger for Distributed Systems
June 2018 • Doug Woos, Zachary Tatlock, Michael D. Ernst, Thomas E. Anderson
Designing and debugging distributed systems is notoriously difficult. The correctness of a distributed system is largely determined by its handling of failure scenarios. The sequence of events leading to a bug can be long and complex, and it often includes message reorderings and failures. On single-node systems, interactive debuggers enable stepping through an execution of the program, but they cannot easily simulate failure scenarios or control the order in which messages are delivered. Oddit…