DDBJ working on evaluation and classification of bacterial genes in INSDC
Open Access
- 15 November 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Databae), D13-D15
- https://doi.org/10.1093/nar/gkl908
Abstract
DNA Data Bank of Japan (DDBJ) ( ) newly collected and released 12 927 184 entries or 13 787 688 598 bases in the period from July 2005 to June 2006. The released data contain honeybee expressed sequence tags (ESTs), re-examined and re-annotated complete genome data of Escherichia coli K-12 W3110, medaka WGS and human MGA. We also systematically evaluated and classified the genes in the complete bacterial genomes submitted to the International Nucleotide Sequence Database Collaboration (INSDC, ) that is composed of DDBJ, EMBL Bank and GenBank. The examination and classification selected 557 000 genes as reliable ones among all the bacterial genes predicted by us.Keywords
This publication has 12 references indexed in Scilit:
- DDBJ in preparation for overview of research activities behind data submissionsNucleic Acids Research, 2006
- Identifying Protein Function—A Call for Community ActionPLoS Biology, 2004
- Rfam: an RNA family databaseNucleic Acids Research, 2003
- The hemK gene in Escherichia coli encodes the N5-glutamine methyltransferase that modifies peptide release factorsThe EMBO Journal, 2002
- HemK, a class of protein methyl transferase with similarity to DNA methyl transferases, methylates polypeptide chain release factors, and hemK knockout induces defects in translational terminationProceedings of the National Academy of Sciences, 2002
- A probabilistic method for identifying start codons in bacterial genomesBioinformatics, 2001
- The InterPro database, an integrated documentation resource for protein families, domains and functional sitesNucleic Acids Research, 2001
- Improved microbial gene identification with GLIMMERNucleic Acids Research, 1999
- Microbial gene identification using interpolated Markov modelsNucleic Acids Research, 1998
- tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic SequenceNucleic Acids Research, 1997