Diverse convergent evidence in the genetic analysis of complex disease: Coordinating omic, informatic, and experimental evidence to better identify and validate risk factors

Timothy H. Ciesielski, Sarah A. Pendergrass, Marquitta J. White, Nuri Kodaman, Rafal S. Sobota, Minjun Huang, Jacquelaine Bartlett, Jing Li, Qinxin Pan, Jiang Gui, Scott B. Selleck, Christopher I. Amos, Marylyn D. Ritchie, Jason H. Moore, Scott M. Williams

Research output: Contribution to journalArticle

11 Scopus citations

Abstract

In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/ validation, promote effective science communication, and guide future research directions.

Original languageEnglish (US)
Article number10
JournalBioData Mining
Volume7
Issue number1
DOIs
StatePublished - Jun 30 2014

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology
  • Genetics
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint Dive into the research topics of 'Diverse convergent evidence in the genetic analysis of complex disease: Coordinating omic, informatic, and experimental evidence to better identify and validate risk factors'. Together they form a unique fingerprint.

  • Cite this

    Ciesielski, T. H., Pendergrass, S. A., White, M. J., Kodaman, N., Sobota, R. S., Huang, M., Bartlett, J., Li, J., Pan, Q., Gui, J., Selleck, S. B., Amos, C. I., Ritchie, M. D., Moore, J. H., & Williams, S. M. (2014). Diverse convergent evidence in the genetic analysis of complex disease: Coordinating omic, informatic, and experimental evidence to better identify and validate risk factors. BioData Mining, 7(1), [10]. https://doi.org/10.1186/1756-0381-7-10