Constrained maximum entropy models to select genotype interactions associated with censored failure times

Aotian Yang, David Miller, Qing Pan

Research output: Contribution to journal (Article)

1 Scopus citation

Abstract

We propose a novel screening method targeting genotype interactions associated with disease risk. The proposed method extends the maximum entropy conditional probability model to address disease occurrences over time. Continuous occurrence times are grouped into intervals. The model estimates the conditional distribution over the disease occurrence intervals given individual genotypes by maximizing the corresponding entropy subject to constraints linking genotype interactions to time intervals. The EM algorithm is employed to handle uncertain observations, that is, those whose disease occurrence time is censored. A stepwise greedy search is proposed to screen a large number of candidate constraints, and the minimum description length criterion is employed to select the optimal set of constraints. Extensive simulations show that about five quantile-dependent intervals are sufficient to categorize disease outcomes into different risk groups. Performance depends on sample size, number of genotypes, and minor allele frequencies. The proposed method outperforms the likelihood ratio test, the Lasso, and a previous maximum entropy method that handles only binary (disease occurrence, non-occurrence) outcomes. Finally, a genome-wide association study (GWAS) of patients with type 1 diabetes is used to illustrate our method. Novel one-genotype and two-genotype interactions associated with neuropathy are identified.
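Two steps named in the abstract, grouping continuous occurrence times into quantile-based intervals and weighting censored observations in an EM E-step, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names (`quantile_intervals`, `interval_index`, `e_step_weights`) are hypothetical, and the sketch assumes that a censored subject's event may fall in its censoring interval or any later one and that all model probabilities are strictly positive.

```python
import numpy as np

def quantile_intervals(event_times, n_intervals=5):
    """Interior cut points at evenly spaced quantiles of the event times."""
    probs = np.linspace(0.0, 1.0, n_intervals + 1)[1:-1]
    return np.quantile(event_times, probs)

def interval_index(times, cuts):
    """Map each time to an interval index in 0..len(cuts)."""
    return np.searchsorted(cuts, times, side="right")

def e_step_weights(model_probs, interval_idx, censored):
    """Posterior weights over intervals for each subject.

    Observed subjects get a one-hot weight on their interval.
    Censored subjects (assumption) get the current model probabilities
    restricted to intervals at or after the censoring interval,
    renormalized to sum to one.
    """
    n_intervals = model_probs.shape[1]
    grid = np.arange(n_intervals)[None, :]
    idx = interval_idx[:, None]
    mask = np.where(censored[:, None], grid >= idx, grid == idx)
    weights = model_probs * mask
    return weights / weights.sum(axis=1, keepdims=True)

# Toy data: ten event times split into five quantile-based intervals.
times = np.arange(1.0, 11.0)
cuts = quantile_intervals(times, n_intervals=5)
idx = interval_index(times, cuts)

# Two subjects in interval 2 under a uniform model: one observed, one censored.
probs = np.full((2, 5), 0.2)
weights = e_step_weights(probs, np.array([2, 2]), np.array([False, True]))
```

In a full EM loop these weights would stand in for the unknown interval labels when the constrained model is refit. The constraint search and the minimum description length selection are outside the scope of this sketch.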

Original language: English (US)
Article number: 18400243
Journal: Journal of Bioinformatics and Computational Biology
Volume: 16
Issue number: 6
State: Published - Dec 1 2018

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
