Predicate control for active debugging of distributed programs
- 27 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 763-769
- https://doi.org/10.1109/ipps.1998.670013
Abstract
Existing approaches to debugging distributed systems involve a cycle of passive observation followed by computation replaying. We propose predicate control as an active approach to debugging such systems. The predicate control approach involves a cycle of observation followed by controlled replaying of computation, based on observation. We formalize the predicate control problem for both off-line and on-line scenarios. We prove that off-line predicate control for general boolean predicates is NP-hard. However, we provide an efficient solution for off-line predicate control for the class of disjunctive predicates. We further solve on-line predicate control for disjunctive predicates under certain restrictions on the system. Lastly, we demonstrate how both off-line and on-line predicate control facilitate distributed debugging by allowing the programmer to control computations to maintain global safety properties.Keywords
This publication has 9 references indexed in Scilit:
- Predicate control for active debugging of distributed programsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A distributed K-mutual exclusion algorithmPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Detection of strong unstable predicates in distributed programsIEEE Transactions on Parallel and Distributed Systems, 1996
- Optimal tracing and replay for debugging message-passing parallel programsThe Journal of Supercomputing, 1995
- A Taxonomy of Distributed Mutual ExclusionJournal of Parallel and Distributed Computing, 1993
- Detecting atomic sequences of predicates in distributed computationsPublished by Association for Computing Machinery (ACM) ,1993
- Distributed snapshotsACM Transactions on Computer Systems, 1985
- Modeling a solution for a control problem in distributed systems by restrictionsTheoretical Computer Science, 1981
- Time, clocks, and the ordering of events in a distributed systemCommunications of the ACM, 1978