A model-free approach for detecting interactions in genetic association studies

Jiahan Li, Jun Dan, Chunlei Li, Rongling Wu

Research output: Contribution to journalArticle

13 Citations (Scopus)

Abstract

Over the past few decades, genome-wide association studies analyzed by efficient statistical procedures have successfully identified single-nucleotide polymorphisms (SNPs) that are associated with complex traits or human diseases. However, due to the overwhelming number of SNPs, most approaches have focused on additive genetic model without genome-wide SNP-SNP interactions. In this study, we propose an efficient statistical procedure in a genetic model-free framework for detecting SNPs exhibiting main genetic effects as well as epistatic interactions. Specifically, the association between phenotype and genotype is characterized by an unknown function to be estimated using nonparametric techniques, and a two-stage non-parametric independence screening procedure is proposed to sequentially identify potentially important main genetic effects and interactions. Finally, the subset of genetic predictors implied by two-stage non-parametric independence screening is analyzed by penalized regressions such as LASSO, and a final model is identified. In this framework, specific genetic model is not assumed and interactions are not only amongmarginally important SNPs.Therefore, SNPs that are involved in genetic regulatory networks but missed by previous studies are expected to be recognized. In simulation studies, we show that the procedure is computationally efficient and has an outstanding finite sample performance in selecting potential SNPs as well as SNP^SNP interactions. A real data analysis further indicates the importance of epistatic interactions in explaining body mass index.

Original languageEnglish (US)
Article numberbbt082
Pages (from-to)1057-1068
Number of pages12
JournalBriefings in bioinformatics
Volume15
Issue number6
DOIs
StatePublished - Aug 2 2013

Fingerprint

Genetic Association Studies
Nucleotides
Polymorphism
Single Nucleotide Polymorphism
Genetic Models
Screening
Genes
Genome-Wide Association Study
Body Mass Index
Genome

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Molecular Biology

Cite this

Li, Jiahan ; Dan, Jun ; Li, Chunlei ; Wu, Rongling. / A model-free approach for detecting interactions in genetic association studies. In: Briefings in bioinformatics. 2013 ; Vol. 15, No. 6. pp. 1057-1068.
@article{3d63046bbe3142a4b8dbd6c2ac19f94e,
title = "A model-free approach for detecting interactions in genetic association studies",
abstract = "Over the past few decades, genome-wide association studies analyzed by efficient statistical procedures have successfully identified single-nucleotide polymorphisms (SNPs) that are associated with complex traits or human diseases. However, due to the overwhelming number of SNPs, most approaches have focused on additive genetic model without genome-wide SNP-SNP interactions. In this study, we propose an efficient statistical procedure in a genetic model-free framework for detecting SNPs exhibiting main genetic effects as well as epistatic interactions. Specifically, the association between phenotype and genotype is characterized by an unknown function to be estimated using nonparametric techniques, and a two-stage non-parametric independence screening procedure is proposed to sequentially identify potentially important main genetic effects and interactions. Finally, the subset of genetic predictors implied by two-stage non-parametric independence screening is analyzed by penalized regressions such as LASSO, and a final model is identified. In this framework, specific genetic model is not assumed and interactions are not only amongmarginally important SNPs.Therefore, SNPs that are involved in genetic regulatory networks but missed by previous studies are expected to be recognized. In simulation studies, we show that the procedure is computationally efficient and has an outstanding finite sample performance in selecting potential SNPs as well as SNP^SNP interactions. A real data analysis further indicates the importance of epistatic interactions in explaining body mass index.",
author = "Jiahan Li and Jun Dan and Chunlei Li and Rongling Wu",
year = "2013",
month = "8",
day = "2",
doi = "10.1093/bib/bbt082",
language = "English (US)",
volume = "15",
pages = "1057--1068",
journal = "Briefings in Bioinformatics",
issn = "1467-5463",
publisher = "Oxford University Press",
number = "6",

}

A model-free approach for detecting interactions in genetic association studies. / Li, Jiahan; Dan, Jun; Li, Chunlei; Wu, Rongling.

In: Briefings in bioinformatics, Vol. 15, No. 6, bbt082, 02.08.2013, p. 1057-1068.

Research output: Contribution to journalArticle

TY - JOUR

T1 - A model-free approach for detecting interactions in genetic association studies

AU - Li, Jiahan

AU - Dan, Jun

AU - Li, Chunlei

AU - Wu, Rongling

PY - 2013/8/2

Y1 - 2013/8/2

N2 - Over the past few decades, genome-wide association studies analyzed by efficient statistical procedures have successfully identified single-nucleotide polymorphisms (SNPs) that are associated with complex traits or human diseases. However, due to the overwhelming number of SNPs, most approaches have focused on additive genetic model without genome-wide SNP-SNP interactions. In this study, we propose an efficient statistical procedure in a genetic model-free framework for detecting SNPs exhibiting main genetic effects as well as epistatic interactions. Specifically, the association between phenotype and genotype is characterized by an unknown function to be estimated using nonparametric techniques, and a two-stage non-parametric independence screening procedure is proposed to sequentially identify potentially important main genetic effects and interactions. Finally, the subset of genetic predictors implied by two-stage non-parametric independence screening is analyzed by penalized regressions such as LASSO, and a final model is identified. In this framework, specific genetic model is not assumed and interactions are not only amongmarginally important SNPs.Therefore, SNPs that are involved in genetic regulatory networks but missed by previous studies are expected to be recognized. In simulation studies, we show that the procedure is computationally efficient and has an outstanding finite sample performance in selecting potential SNPs as well as SNP^SNP interactions. A real data analysis further indicates the importance of epistatic interactions in explaining body mass index.

AB - Over the past few decades, genome-wide association studies analyzed by efficient statistical procedures have successfully identified single-nucleotide polymorphisms (SNPs) that are associated with complex traits or human diseases. However, due to the overwhelming number of SNPs, most approaches have focused on additive genetic model without genome-wide SNP-SNP interactions. In this study, we propose an efficient statistical procedure in a genetic model-free framework for detecting SNPs exhibiting main genetic effects as well as epistatic interactions. Specifically, the association between phenotype and genotype is characterized by an unknown function to be estimated using nonparametric techniques, and a two-stage non-parametric independence screening procedure is proposed to sequentially identify potentially important main genetic effects and interactions. Finally, the subset of genetic predictors implied by two-stage non-parametric independence screening is analyzed by penalized regressions such as LASSO, and a final model is identified. In this framework, specific genetic model is not assumed and interactions are not only amongmarginally important SNPs.Therefore, SNPs that are involved in genetic regulatory networks but missed by previous studies are expected to be recognized. In simulation studies, we show that the procedure is computationally efficient and has an outstanding finite sample performance in selecting potential SNPs as well as SNP^SNP interactions. A real data analysis further indicates the importance of epistatic interactions in explaining body mass index.

UR - http://www.scopus.com/inward/record.url?scp=84913583552&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84913583552&partnerID=8YFLogxK

U2 - 10.1093/bib/bbt082

DO - 10.1093/bib/bbt082

M3 - Article

C2 - 24273216

AN - SCOPUS:84913583552

VL - 15

SP - 1057

EP - 1068

JO - Briefings in Bioinformatics

JF - Briefings in Bioinformatics

SN - 1467-5463

IS - 6

M1 - bbt082

ER -