Directly e-mailing authors of newly published papers encourages community curation
Open Access
- 1 May 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Database: The Journal of Biological Databases and Curation
- Vol. 2012, bas024
- https://doi.org/10.1093/database/bas024
Abstract
Much of the data within Model Organism Databases (MODs) comes from manual curation of the primary research literature. Given limited funding and an increasing density of published material, a significant challenge facing all MODs is how to efficiently and effectively prioritize the most relevant research papers for detailed curation. Here, we report recent improvements to the triaging process used by FlyBase. We describe an automated method to directly e-mail corresponding authors of new papers, requesting that they list the genes studied and indicate ('flag') the types of data described in the paper using an online tool. Based on the author-assigned flags, papers are then prioritized for detailed curation and channelled to appropriate curator teams for full data extraction. The overall response rate has been 44% and the flagging of data types by authors is sufficiently accurate for effective prioritization of papers. In summary, we have established a sustainable community curation program, with the result that FlyBase curators now spend less time triaging and can devote more effort to the specialized task of detailed data extraction.Keywords
This publication has 8 references indexed in Scilit:
- Automatic categorization of diverse experimental information in the bioscience literatureBMC Bioinformatics, 2012
- FlyBase 101 - the basics of navigating FlyBaseNucleic Acids Research, 2011
- WormBase 2012: more genomes, more data, new websiteNucleic Acids Research, 2011
- Community annotation in biologyBiology Direct, 2010
- Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curationBMC Bioinformatics, 2009
- Integrating text mining into the MGI biocuration workflowDatabase: The Journal of Biological Databases and Curation, 2009
- The Arabidopsis Information Resource (TAIR): gene structure and function annotationNucleic Acids Research, 2007
- Biomedical Language Processing: What's Beyond PubMed?Molecular Cell, 2006