Discussion Between Reviewers Does Not Improve Reliability of Peer Review of Hospital Quality

Abstract
Peer review is used to make final judgments about quality of care in many quality assurance activities. Because the reliability of such review is low, discussion between several reviewers is often recommended: discussion can surface overlooked information and allow reviewers to reconsider their opinions, and is therefore expected to improve reliability. The authors assessed the impact of discussion between 2 reviewers on the reliability of peer review. A group of 13 board-certified physicians completed a total of 741 structured implicit record reviews of 95 records for patients who experienced severe adverse events related to laboratory abnormalities while in the hospital (hypokalemia, hyperkalemia, renal failure, hyponatremia, and digoxin toxicity). The reviewers independently assessed the degree to which each adverse event was caused by medical care and the quality of the care leading up to the adverse event. Working in pairs, they then discussed differences of opinion, clarified factual discrepancies, and rerated each record. The authors compared the reliability of each measure before and after discussion, both within and across pairs of reviewers, using the intraclass correlation coefficient for continuous ratings and the kappa statistic for a dichotomized rating. Within pairs, the assessment of whether the laboratory abnormality was iatrogenic had a reliability of 0.46 before discussion and 0.71 after discussion, indicating considerably improved agreement between the members of a pair. Across pairs, however, reliability was 0.36 before discussion and 0.40 after discussion. Similarly, for the rating of overall quality of care, within-pair reliability rose from 0.35 before discussion to 0.58 after discussion, whereas across pairs it increased only from 0.14 to 0.17. Even for prediscussion ratings, reliability was substantially higher between the 2 members of a pair than across pairs, suggesting that reviewers who work in pairs learn to be more consistent with each other even before discussion; this learned consistency, however, did not improve overall reliability across pairs. When 2 physicians discuss a record they are reviewing, their agreement with each other improves substantially. This improvement is illusory, however: discussion does not improve overall reliability, as measured by agreement between physicians who took part in different discussions. This finding may also have implications for how disagreements are resolved on consensus panels, guideline committees, and reviews of literature quality for meta-analyses.
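
For readers unfamiliar with the reliability statistics named above, the following is a minimal, illustrative sketch (not the authors' code) of how agreement between a pair of reviewers might be quantified: a one-way random-effects intraclass correlation coefficient for continuous ratings and Cohen's kappa for a dichotomized rating. The toy data, variable names, and the specific ICC formulation are assumptions for demonstration only.

    # Illustrative sketch only; the ratings below are hypothetical, not study data.
    import numpy as np
    from sklearn.metrics import cohen_kappa_score

    def icc_oneway(ratings):
        """ICC(1,1) for an (n_subjects x k_raters) array of continuous ratings."""
        ratings = np.asarray(ratings, dtype=float)
        n, k = ratings.shape
        grand_mean = ratings.mean()
        subject_means = ratings.mean(axis=1)
        # Between-subject and within-subject mean squares from a one-way ANOVA
        ms_between = k * ((subject_means - grand_mean) ** 2).sum() / (n - 1)
        ms_within = ((ratings - subject_means[:, None]) ** 2).sum() / (n * (k - 1))
        return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

    # Hypothetical quality-of-care scores from 2 reviewers, before and after discussion
    before = np.array([[3, 4], [2, 4], [5, 3], [1, 2], [4, 4], [3, 2], [2, 3], [5, 5]])
    after  = np.array([[3, 3], [3, 3], [4, 4], [1, 2], [4, 4], [3, 3], [2, 2], [5, 5]])
    print("Within-pair ICC before discussion:", round(icc_oneway(before), 2))
    print("Within-pair ICC after discussion: ", round(icc_oneway(after), 2))

    # Dichotomized judgment (e.g., 1 = adverse event judged iatrogenic, 0 = not)
    reviewer_a = [1, 0, 1, 1, 0, 0, 1, 0]
    reviewer_b = [1, 0, 0, 1, 0, 1, 1, 0]
    print("Kappa:", round(cohen_kappa_score(reviewer_a, reviewer_b), 2))

The study's central point is captured by where such statistics are computed: agreement within a discussing pair can rise after discussion even while agreement computed across reviewers from different pairs stays essentially unchanged.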