Wikidata as a knowledge graph for the life sciences
Open Access
- 17 March 2020
- journal article
- research article
- Published by eLife Sciences Publications, Ltd in eLife
Abstract
Wikidata is a community-maintained knowledge base that has been assembled from repositories in the fields of genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases, and that adheres to the FAIR principles of findability, accessibility, interoperability and reusability. Here we describe the breadth and depth of the biomedical knowledge contained within Wikidata, and discuss the open-source tools we have built to add information to Wikidata and to synchronize it with source databases. We also demonstrate several use cases for Wikidata, including the crowdsourced curation of biomedical ontologies, phenotype-based diagnosis of disease, and drug repurposing.
Funding Information
- National Institute of General Medical Sciences (R01 GM089820)
- National Institute of General Medical Sciences (U54 GM114833)
- National Institute of General Medical Sciences (R01 GM100039)
- National Human Genome Research Institute (R00HG007940)
- National Cancer Institute (U24CA237719)
- V Foundation for Cancer Research (V2018-007)
- National Institute of Allergy and Infectious Diseases (R01 AI126785)
- National Center for Advancing Translational Sciences (UL1 TR002550)