Systematic identification of genetic systems associated with phenotypes in patients with rare genomic copy number variations
- 1 March 2021
- journal article
- research article
- Published by Springer Nature in Human Genetics
- Vol. 140 (3), 457-475
- https://doi.org/10.1007/s00439-020-02214-7
Abstract
Copy number variation (CNV) related disorders tend to show complex phenotypic profiles that do not match known diseases. This makes it difficult to ascertain their underlying molecular basis. A potential solution is to compare the affected genomic regions of multiple patients that share a pathological phenotype, looking for commonalities. Here, we present a novel approach to associate phenotypes with functional systems (FunSys), in terms of GO categories and KEGG and Reactome pathways, based on patient data. The approach uses genomic and phenomic data from the same patients, finding shared genomic regions between patients with similar phenotypes. These regions are mapped to genes to find associated functional systems. We applied the approach to analyse patients in the DECIPHER database with de novo CNVs, finding functional systems associated with most phenotypes, often due to mutations affecting related genes in the same genomic region. Manual inspection of the ten top-scoring phenotypes found multiple FunSys connections supported by previous studies for seven of them. The workflow also produces reports focussed on the genes and FunSys connected to the different phenotypes, alongside patient-specific reports, which give details of the associated genes and FunSys for each individual in the cohort. These can be run in "confidential" mode, preserving patient confidentiality. The workflow presented here can be used to associate phenotypes with functional systems using data at the level of a whole cohort of patients, identifying important connections that could not be found when considering patients individually. The full workflow is available for download, enabling it to be run on any patient cohort for which phenotypic and CNV data are available.
Funding Information
- Fundación Progreso y Salud (PI-0075-2017)
- Instituto de Salud Carlos III (SAF2016-78041-C2-1-R)
- Junta de Andalucía (CTS-486)
- Fundación Ramón Areces
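The core idea described in the abstract (finding genomic regions shared between patients with the same phenotype, then mapping those regions to genes) can be illustrated with a minimal sketch. This is not the authors' workflow: the data structures, function names, coordinates, phenotype labels and gene names below are all hypothetical, intervals are assumed to be half-open, and the published approach's scoring and its enrichment step against GO, KEGG and Reactome are omitted.

```python
from collections import defaultdict
from typing import NamedTuple


class CNV(NamedTuple):
    patient: str
    chrom: str
    start: int  # half-open interval [start, end); an assumption of this sketch
    end: int


class Gene(NamedTuple):
    name: str
    chrom: str
    start: int
    end: int


def shared_regions(cnvs, min_patients=2):
    """Return segments covered by CNVs from at least `min_patients` patients."""
    by_chrom = defaultdict(list)
    for cnv in cnvs:
        by_chrom[cnv.chrom].append(cnv)
    regions = []
    for chrom in sorted(by_chrom):
        items = by_chrom[chrom]
        # Split the chromosome at every CNV breakpoint and test each segment.
        points = sorted({p for c in items for p in (c.start, c.end)})
        for lo, hi in zip(points, points[1:]):
            patients = {c.patient for c in items if c.start <= lo and hi <= c.end}
            if len(patients) >= min_patients:
                regions.append((chrom, lo, hi, frozenset(patients)))
    return regions


def genes_in_region(region, genes):
    """Names of genes that overlap a (chrom, lo, hi, patients) region."""
    chrom, lo, hi, _ = region
    return {g.name for g in genes if g.chrom == chrom and g.start < hi and g.end > lo}


def phenotype_to_genes(cnvs, phenotypes, genes):
    """Map each phenotype to genes in regions shared by patients showing it."""
    all_phenotypes = sorted({p for terms in phenotypes.values() for p in terms})
    result = {}
    for phenotype in all_phenotypes:
        subset = [c for c in cnvs if phenotype in phenotypes.get(c.patient, set())]
        found = set()
        for region in shared_regions(subset):
            found |= genes_in_region(region, genes)
        result[phenotype] = found
    return result


# Hypothetical toy cohort: three patients, two phenotypes, three genes.
cnvs = [
    CNV("P1", "chr1", 100, 500),
    CNV("P2", "chr1", 300, 800),
    CNV("P3", "chr2", 50, 200),
]
phenotypes = {"P1": {"PHENO_A", "PHENO_B"}, "P2": {"PHENO_A"}, "P3": {"PHENO_B"}}
genes = [
    Gene("GENE1", "chr1", 350, 450),
    Gene("GENE2", "chr1", 600, 700),
    Gene("GENE3", "chr2", 100, 150),
]

result = phenotype_to_genes(cnvs, phenotypes, genes)
```

In this toy cohort, only the two patients with PHENO_A have overlapping CNVs, so only that phenotype is linked to a gene. A realistic analysis would additionally need a statistical measure of association and an enrichment test over the resulting gene sets, neither of which is attempted here.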