VFDB 2019: a comparative pathogenomic platform with an interactive web interface
Top Cited Papers
Open Access
- 5 November 2018
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 47 (D1), D687-D692
- https://doi.org/10.1093/nar/gky1080
Abstract
The virulence factor database (VFDB, http://www.mgc.ac.cn/VFs/) is devoted to providing the scientific community with a comprehensive warehouse and online platform for deciphering bacterial pathogenesis. The various combinations, organizations and expressions of virulence factors (VFs) are responsible for the diverse clinical symptoms of pathogen infections. Currently, whole-genome sequencing is widely used to decode potential novel or variant pathogens both in emergent outbreaks and in routine clinical practice. However, the efficient characterization of pathogenomic compositions remains a challenge for microbiologists or physicians with limited bioinformatics skills. Therefore, we introduced to VFDB an integrated and automatic pipeline, VFanalyzer, to systematically identify known/potential VFs in complete/draft bacterial genomes. VFanalyzer first constructs orthologous groups within the query genome and preanalyzed reference genomes from VFDB to avoid potential false positives due to paralogs. Then, it conducts iterative and exhaustive sequence similarity searches among the hierarchical prebuilt datasets of VFDB to accurately identify potential untypical/strain-specific VFs. Finally, via a context-based data refinement process for VFs encoded by gene clusters, VFanalyzer can achieve relatively high specificity and sensitivity without manual curation. In addition, a thoroughly optimized interactive web interface is introduced to present VFanalyzer reports in comparative pathogenomic style for easy online analysis.Keywords
Funding Information
- Ministry of Science and Technology of China (2016YFC1202404, 2015CB554204)
- CAMS Innovation Fund for Medical Sciences (2017-I2M-3-017)
This publication has 25 references indexed in Scilit:
- Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regionsNucleic Acids Research, 2013
- VFDB 2012 update: toward the genetic diversity and molecular evolution of bacterial virulence factorsNucleic Acids Research, 2011
- Open-Source Genomic Analysis of Shiga-Toxin–ProducingE. coliO104:H4New England Journal of Medicine, 2011
- Origins of theE. coliStrain Causing an Outbreak of Hemolytic–Uremic Syndrome in GermanyNew England Journal of Medicine, 2011
- Sequence-Based Prediction of Type III Secreted ProteinsPLoS Pathogens, 2009
- VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogensBMC Bioinformatics, 2008
- VFDB 2008 release: an enhanced web-based resource for comparative pathogenomicsNucleic Acids Research, 2007
- Identifying bacterial genes and endosymbiont DNA with GlimmerBioinformatics, 2007
- OrthoMCL: Identification of Ortholog Groups for Eukaryotic GenomesGenome Research, 2003
- HomologyTrends in Genetics, 2000