Evaluation of diagnostic tests without gold standards

Abstract
This paper reviews statistical methods developed to estimate the sensitivity and specificity of screening or diagnostic tests when the fallible tests are not evaluated against a gold standard. It gives a brief summary of the earlier historical developments and focuses on the more recent methods. It covers Bayesian approaches and longitudinal studies with repeated testing. In particular, it reviews the procedures that do not require the assumption of independence between tests conditional on the true disease status.