A fuzzy guided genetic algorithm for operon prediction
Open Access
- 25 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (8), 1403-1407
- https://doi.org/10.1093/bioinformatics/bti156
Abstract
Motivation: The operon structure of the prokaryotic genome is a critical input for the reconstruction of regulatory networks at the whole-genome level. As experimental methods for detecting operons are difficult and time-consuming, efforts are being put into developing computational methods that can use available biological information to predict operons.
Method: A genetic algorithm is developed to evolve a starting population of putative operon maps of the genome into progressively better predictions. Fuzzy scoring functions based on multiple criteria are used to assess the ‘fitness’ of the newly evolved operon maps and to guide their evolution.
Results: The algorithm organizes the whole genome into operons. The fuzzy guided genetic algorithm-based approach makes it possible to use diverse biological information, such as genome sequence data, functional annotations and conservation across multiple genomes, to guide the organization process. This approach does not require any prior training with experimental operons. The predictions from this algorithm for Escherichia coli K12 and Bacillus subtilis are evaluated against experimentally discovered operons for these organisms. The accuracy of the method is evaluated using an ROC (receiver operating characteristic) analysis. The area under the ROC curve is around 0.9, which indicates excellent accuracy.
Contact: roschen_csir@rediffmail.com
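The idea described under Method can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the encoding, the membership functions or the genetic operators, so everything below is an assumption made for illustration. An operon map is encoded as one boolean per adjacent gene pair (joined or split), the fuzzy criteria are reduced to intergenic distance and a shared functional label combined with a fuzzy AND (minimum), the thresholds of 50 and 300 bp are arbitrary, and the `Gene` type and all function names are hypothetical. Conservation across genomes is left out.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    start: int
    end: int
    strand: str    # "+" or "-"
    function: str  # hypothetical functional category label


def mu_close(distance, full=50, zero=300):
    """Fuzzy membership of 'short intergenic distance' (illustrative thresholds)."""
    if distance <= full:
        return 1.0
    if distance >= zero:
        return 0.0
    return (zero - distance) / (zero - full)


def mu_same_operon(a, b):
    """Degree to which adjacent genes a, b look co-transcribed (assumed criteria)."""
    if a.strand != b.strand:
        return 0.0
    mu_func = 1.0 if a.function == b.function else 0.4  # assumed partial credit
    return min(mu_close(b.start - a.end), mu_func)      # fuzzy AND


def fitness(joined, genes):
    """Mean agreement between a map and the fuzzy evidence for each gene pair."""
    total = 0.0
    for i, is_joined in enumerate(joined):
        mu = mu_same_operon(genes[i], genes[i + 1])
        total += mu if is_joined else 1.0 - mu
    return total / len(joined)


def evolve(genes, pop_size=30, generations=60, mutation_rate=0.05, seed=0):
    """Evolve operon maps for a list of at least two genes in genome order."""
    rng = random.Random(seed)
    n = len(genes) - 1

    def score(candidate):
        return fitness(candidate, genes)

    pop = [[rng.random() < 0.5 for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        next_pop = pop[:2]  # elitism: carry the two best maps over unchanged
        while len(next_pop) < pop_size:
            p1 = max(rng.sample(pop, 3), key=score)  # tournament selection
            p2 = max(rng.sample(pop, 3), key=score)
            cut = rng.randrange(1, n) if n > 1 else 0
            child = p1[:cut] + p2[cut:]              # one-point crossover
            child = [(not g) if rng.random() < mutation_rate else g
                     for g in child]                 # bit-flip mutation
            next_pop.append(child)
        pop = next_pop
    return max(pop, key=score)


def to_operons(joined):
    """Convert the joined/split encoding into lists of gene indices."""
    operons, current = [], [0]
    for i, is_joined in enumerate(joined):
        if is_joined:
            current.append(i + 1)
        else:
            operons.append(current)
            current = [i + 1]
    operons.append(current)
    return operons
```

Calling `to_operons(evolve(genes))` on a list of `Gene` records sorted by position would yield the predicted operons as groups of gene indices. Because the fitness here only scores how well a map agrees with the fuzzy evidence, no experimentally known operons are needed for training, which mirrors the claim in the abstract; known operons would only be used afterwards, for example in an ROC evaluation.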