CDK-Taverna: an open workflow environment for cheminformatics
Open Access
- 29 March 2010
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 11 (1), 159
- https://doi.org/10.1186/1471-2105-11-159
Abstract
Background: Small molecules are of increasing interest for bioinformatics in areas such as metabolomics and drug discovery. The recent release of large open access chemistry databases generates a demand for flexible tools to process them and discover new knowledge. To freely support open science based on these data resources, it is desirable for the processing tools to be open source and available to everyone.

Results: Here we describe a novel combination of the workflow engine Taverna and the cheminformatics library Chemistry Development Kit (CDK), resulting in an open source workflow solution for cheminformatics. We have implemented more than 160 different workers to handle specific cheminformatics tasks. We describe the applications of CDK-Taverna in various usage scenarios.

Conclusions: The combination of the workflow engine Taverna and the Chemistry Development Kit provides the first open source cheminformatics workflow solution for the biosciences. With the Taverna community working towards a more powerful workflow engine and a more user-friendly interface, CDK-Taverna has the potential to become a free alternative to existing proprietary workflow tools.