Assessment of Band-Based Similarity Coefficients for Automatic Type and Subtype Classification of Microbial Isolates Analyzed by Pulsed-Field Gel Electrophoresis
- 1 November 2005
- journal article
- research article
- Published by American Society for Microbiology in Journal of Clinical Microbiology
- Vol. 43 (11), 5483-5490
- https://doi.org/10.1128/jcm.43.11.5483-5490.2005
Abstract
Pulsed-field gel electrophoresis (PFGE) has been the typing method of choice for strain identification in epidemiological studies of several bacterial species of medical importance. The usual procedure for the comparison of strains and assignment of strain type and subtype relies on visual assessment of band difference number, followed by an incremental assignment to the group hosting the most similar type previously seen. Band-based similarity coefficients, such as the Dice or the Jaccard coefficient, are then used for dendrogram construction, which provides a quantitative assessment of strain similarity. PFGE type assignment is based on the definition of a threshold linkage value, below which strains are assigned to the same group. This is typically performed empirically by inspecting the hierarchical cluster analysis dendrogram containing the strains of interest. This approach has the problem that the threshold value selected is dependent on the linkage method used for dendrogram construction. Furthermore, the use of a linkage method skews the original similarity values between strains. In this paper we assess the goodness of classification of several band-based similarity coefficients by comparing it with the band difference number for PFGE type and subtype classification using receiver operating characteristic curves. The procedure described was applied to a collection of PFGE results for 1,798 isolates of Streptococcus pneumoniae , which documented 96 types and 396 subtypes. The band-based similarity coefficients were found to perform equally well for type classification, but with different proportions of false-positive and false-negative classifications in their minimal false discovery rate when they were used for subtype classification.Keywords
This publication has 22 references indexed in Scilit:
- Effect of the Seven-Valent Conjugate Pneumococcal Vaccine on Carriage and Drug Resistance of Streptococcus pneumoniae in Healthy Children Attending Day-Care Centers in LisbonThe Pediatric Infectious Disease Journal, 2005
- Clonal Relationships between Invasive and CarriageStreptococcus pneumoniaeand Serotype‐ and Clone‐Specific Differences in Invasive Disease PotentialThe Journal of Infectious Diseases, 2003
- Nomenclature of Major Antimicrobial-Resistant Clones of Streptococcus pneumoniae Defined by the Pneumococcal Molecular Epidemiology NetworkJournal of Clinical Microbiology, 2001
- Molecular Typing of Methicillin-ResistantStaphylococcus aureusby Pulsed-Field Gel Electrophoresis: Comparison of Results Obtained in a Multilaboratory Effort Using Identical Protocols and MRSA StrainsMicrobial Drug Resistance, 2000
- AFLP genotyping and fingerprintingTrends in Ecology & Evolution, 1999
- The use of the area under the ROC curve in the evaluation of machine learning algorithmsPattern Recognition, 1997
- AFLP: a new technique for DNA fingerprintingNucleic Acids Research, 1995
- Separation of yeast chromosome-sized DNAs by pulsed field gradient gel electrophoresisCell, 1984
- Zoogeographical Studies on the Soleoid Fishes Found in Japan and its Neighhouring Regions-IINIPPON SUISAN GAKKAISHI, 1957
- Measures of the Amount of Ecologic Association Between SpeciesEcology, 1945