Software defects and their impact on system availability-a study of field failures in operating systems
Top Cited Papers
- 10 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Defects reported between 1986 and 1989 in the MVS operating system are studied in order to gain the insight needed to provide a clear strategy for avoiding or tolerating them. Typical defects (regular) are compared to those that corrupt a program's memory (overlay), given that overlays are considered by field services to be particularly hard to find and fix. It is shown that the impact of an overlay defect is, on average, much higher than that of a regular defect, that boundary conditions and allocation management are the major causes of overlay defects, not timing, and that most overlays are small and corrupt data near the data that the programmer meant to update. Further analysis is provided on defects in fixes to other defects, failure symptoms, and the impact of defects on customers.<>Keywords
This publication has 12 references indexed in Scilit:
- Defects and reliability analysis of large software systems: field experiencePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Understanding large system failures-a fault injection experimentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- FIAT-fault injection based automated testing environmentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Defect type and its impact on the growth curve (software development)Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- On the Reliability of the IBM MVS/XA Operating SystemIEEE Transactions on Software Engineering, 1987
- Measurement and modeling of computer reliability as affected by system activityACM Transactions on Computer Systems, 1986
- Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical AccuracyStatistical Science, 1986
- Software errors and complexity: an empirical investigation0Communications of the ACM, 1984
- Persistent Software ErrorsIEEE Transactions on Software Engineering, 1981
- An analysis of errors and their causes in system programsIEEE Transactions on Software Engineering, 1975