Discussion Between Reviewers Does Not Improve Reliability of Peer Review of Hospital Quality
- 1 February 2000
- journal article
- review article
- Published by Wolters Kluwer Health in Medical Care
- Vol. 38 (2), 152-161
- https://doi.org/10.1097/00005650-200002000-00005
Abstract
Peer review is used to make final judgments about quality of care in many quality assurance activities. To overcome the low reliability of peer review, discussion between several reviewers is often recommended to point out overlooked information or allow for reconsideration of opinions and thus improve reliability. The authors assessed the impact of discussion between 2 reviewers on the reliability of peer review. A group of 13 board-certified physicians completed a total of 741 structured implicit record reviews of 95 records for patients who experienced severe adverse events related to laboratory abnormalities while in the hospital (hypokalemia, hyperkalemia, renal failure, hyponatremia, and digoxin toxicity). They independently assessed the degree to which each adverse event was caused by medical care and the quality of the care leading up to the adverse event. Working in pairs, they then discussed differences of opinion, clarified factual discrepancies, and rerated the record. The authors compared the reliability of each measure before and after discussion, and between and within pairs of reviewers, using the intraclass correlation coefficient for continuous ratings and the kappa statistic for a dichotomized rating. The assessment of whether the laboratory abnormality was iatrogenic had a reliability of 0.46 before discussion and 0.71 after discussion between paired reviewers, indicating considerably improved agreement between the members of a pair. However, across reviewer pairs, the reviewer reliability was 0.36 before discussion and 0.40 after discussion. Similarly, for the rating of overall quality of care, reliability of physician review went from 0.35 before discussion to 0.58 after discussion as assessed by pair. However, across pairs the reliability increased only from 0.14 to 0.17. 
Even for prediscussion ratings, reliability was substantially higher between 2 members of a pair than across pairs, suggesting that reviewers who work in pairs learn to be more consistent with each other even before discussion, but this consistency also did not improve overall reliability across pairs. When 2 physicians discuss a record that they are reviewing, it substantially improves the agreement between those 2 physicians. However, this improvement is illusory, as discussion does not improve the overall reliability as assessed by examining the reliability between physicians who were part of different discussions. This finding may also have implications with regard to how disagreements are resolved on consensus panels, guideline committees, and reviews of literature quality for meta-analyses.
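The abstract reports agreement on the dichotomized rating using the kappa statistic. As a rough illustration only, the sketch below computes Cohen's kappa for two reviewers. The function name `cohens_kappa` and the ratings are hypothetical and are not taken from the study, and the authors' actual computation may have differed.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who rated the same records.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    """
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must rate the same non-empty set of records")

    n = len(ratings_a)

    # Proportion of records on which the two raters gave the same rating.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Agreement expected by chance, from each rater's marginal frequencies.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")

    return (observed - expected) / (1.0 - expected)


# Hypothetical dichotomized ratings (1 = iatrogenic, 0 = not) for 10 records.
reviewer_1 = [1, 1, 0, 0, 1, 0, 1, 0, 1, 1]
reviewer_2 = [1, 0, 0, 0, 1, 0, 1, 1, 1, 1]

kappa = cohens_kappa(reviewer_1, reviewer_2)
print(round(kappa, 3))  # 0.583
```

Here the reviewers agree on 8 of 10 records (0.80) and chance agreement is 0.52, giving kappa = 0.28 / 0.48, or about 0.58. Computing the same statistic for reviewers drawn from different pairs would correspond to the across-pair comparison described in the abstract.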