Effect of System Workload on Operating System Reliability: A Study on IBM 3081
- 1 December 1985
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering
- Vol. SE-11 (12), 1438-1448
- https://doi.org/10.1109/tse.1985.232180
Abstract
This paper presents an analysis of operating system failures on an IBM 3081 running VM/SP. We find three broad categories of software failures: error handling (ERH), program control or logic (CTL), and hardware related (HS); it is found that more than 25 percent of software failures occur in the hardware/software interface. Measurements show that results on software reliability cannot be considered representative unless the system workload is taken into account. For example, it is shown that the risk of a software failure increases in a nonlinear fashion with the amount of interactive processing, as measured by parameters such as the paging rate and the amount of overhead (operating system CPU time). The overall CPU execution rate, although measured to be close to 100 percent most of the time, is not found to correlate strongly with the occurrence of failures. The paper discusses possible reasons for the observed workload failure dependency based on detailed investigations of the failure data.Keywords
This publication has 12 references indexed in Scilit:
- A STATISTICAL LOAD DEPENDENCY MODEL FOR CPU ERRORS AT SLACPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Hardware-Related Software Errors: Measurement and AnalysisIEEE Transactions on Software Engineering, 1985
- A Statistical Failure/Load Relationship: Results of a Multicomputer StudyIEEE Transactions on Computers, 1982
- Persistent Software ErrorsIEEE Transactions on Software Engineering, 1981
- Theories of Software Reliability: How Good Are They and How Can They Be Improved?IEEE Transactions on Software Engineering, 1980
- Measurement and experimentation in software engineeringProceedings of the IEEE, 1980
- The measurement and management of software reliabilityProceedings of the IEEE, 1980
- On learning how to predictProceedings of the IEEE, 1980
- Fault-Tolerant SoftwareIEEE Transactions on Reliability, 1979
- Virtual machine advantages in security, integrity, and decision support systems [Authors' response]IBM Systems Journal, 1976