Comparison of the computer programs DEREK and TOPKAT to predict bacterial mutagenicity

Abstract
The performance of two computer programs, DEREK and TOPKAT, was examined with regard to predicting the outcome of the Ames bacterial mutagenicity assay. The results of over 400 Ames tests conducted at Glaxo Wellcome (now GlaxoSmithKline) during the last 15 years on a wide variety of chemical classes were compared with the mutagenicity predictions of both computer programs. DEREK was considered concordant with the Ames assay if (i) the Ames assay was negative (not mutagenic) and no structural alerts for mutagenicity were identified or (ii) the Ames assay was positive (mutagenic) and at least one structural alert was identified. Conversely, the DEREK output was considered discordant if (i) the Ames assay was negative and any structural alert was identified or (ii) the Ames assay was positive and no structural alert was identified. The overall concordance of the DEREK program with the Ames results was 65% and the overall discordance was 35%, based on over 400 compounds. About 23% of the test molecules were outside the permissible limits of the optimum prediction space of TOPKAT. Another 4% of the compounds were either not processable or had indeterminate mutagenicity predictions; these molecules were excluded from the TOPKAT analysis. If the TOPKAT probability was (i) ≥0.7 the molecule was predicted to be mutagenic, (ii) ≤0.3 the compound was predicted to be non-mutagenic and (iii) between 0.3 and 0.7 the prediction was considered indeterminate. From over 300 acceptable predictions, the overall TOPKAT concordance was 73% and the overall discordance was 27%. While the overall concordance of the TOPKAT program was higher than DEREK, TOPKAT fared more poorly than DEREK in the critical Ames-positive category, where 60% of the compounds were incorrectly predicted by TOPKAT as negative but were mutagenic in the Ames test. For DEREK, 54% of the Ames-positive molecules had no structural alerts and were predicted to be non-mutagenic. Alternative methods of analyzing the output of the programs to increase the accuracy with Ames-positive compounds are discussed.