Powerful genetic association analysis for common or rare variants with high-dimensional structured traits

Xiang Zhan, Ni Zhao, Anna Plantinga, Timothy A. Thornton, Karen N. Conneely, Michael P. Epstein, Michael C. Wu

Research output: Contribution to journal, Article

4 Citations (Scopus)

Abstract

Many genetic association studies collect a wide range of complex traits. As these traits may be correlated and share a common genetic mechanism, joint analysis can be statistically more powerful and biologically more meaningful. However, most existing tests for multiple traits cannot be used for high-dimensional and possibly structured traits, such as network-structured transcriptomic pathway expressions. To overcome these limitations, in this article we propose the dual kernel-based association test (DKAT) for testing the association between multiple traits and multiple genetic variants, both common and rare. In DKAT, two separate kernels describe the phenotypic and genotypic similarity, respectively, between pairs of subjects. Using kernels allows for capturing structure while accommodating dimensionality. The association between traits and genetic variants is then summarized by a coefficient that measures the association between the two kernel matrices. Finally, DKAT evaluates the hypothesis of nonassociation using an analytical P-value calculation that requires no computationally expensive resampling procedures. By collapsing information in both traits and genetic variants using kernels, the proposed DKAT is shown to have a correct type-I error rate and higher power than other existing methods in both simulation studies and an application to a study of genetic regulation of pathway gene expressions.
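The kernel-comparison idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the linear kernels, the centering step, the normalized trace coefficient, the simulated data, and the names center_kernel and kernel_association are all assumptions made for illustration, and the analytical P-value calculation described in the abstract is not reproduced here.

```python
import numpy as np


def center_kernel(K):
    """Double-center an n x n kernel matrix: H K H with H = I - 11^T / n."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    return H @ K @ H


def kernel_association(K, L):
    """Normalized trace coefficient between two centered kernel matrices.

    Assumed form (hypothetical, for illustration only):
        tr(Kc Lc) / sqrt(tr(Kc Kc) * tr(Lc Lc))
    For symmetric matrices, tr(A B) equals the sum of elementwise products.
    """
    Kc = center_kernel(K)
    Lc = center_kernel(L)
    num = np.sum(Kc * Lc)
    den = np.sqrt(np.sum(Kc * Kc) * np.sum(Lc * Lc))
    return num / den


# Simulated data (assumption): 50 subjects, 10 variants, 200 traits.
rng = np.random.default_rng(0)
n = 50
G = rng.binomial(2, 0.3, size=(n, 10)).astype(float)  # minor-allele counts
Y = rng.normal(size=(n, 200))                         # high-dimensional traits

# Linear kernels (assumption): pairwise similarity between subjects.
K_geno = G @ G.T
K_pheno = Y @ Y.T

stat = kernel_association(K_pheno, K_geno)
print(f"kernel association coefficient: {stat:.4f}")
```

Both kernels are n x n regardless of how many traits or variants are measured, which is how a kernel formulation accommodates high-dimensional traits. A structured kernel (for example, one built from a pathway network) could replace the linear trait kernel without changing the rest of the sketch.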

Original language: English (US)
Pages (from-to): 1779-1790
Number of pages: 12
Journal: Genetics
Volume: 206
Issue number: 4
DOI: https://doi.org/10.1534/genetics.116.199646
State: Published - Jan 1 2017

Fingerprint

Gene Expression Regulation
Genetic Association Studies

All Science Journal Classification (ASJC) codes

  • Genetics

Cite this

Zhan, X., Zhao, N., Plantinga, A., Thornton, T. A., Conneely, K. N., Epstein, M. P., & Wu, M. C. (2017). Powerful genetic association analysis for common or rare variants with high-dimensional structured traits. Genetics, 206(4), 1779-1790. https://doi.org/10.1534/genetics.116.199646
Zhan, Xiang ; Zhao, Ni ; Plantinga, Anna ; Thornton, Timothy A. ; Conneely, Karen N. ; Epstein, Michael P. ; Wu, Michael C. / Powerful genetic association analysis for common or rare variants with high-dimensional structured traits. In: Genetics. 2017 ; Vol. 206, No. 4. pp. 1779-1790.
@article{b647bd3fd1ed469db363ebc0587c9ce0,
title = "Powerful genetic association analysis for common or rare variants with high-dimensional structured traits",
author = "Xiang Zhan and Ni Zhao and Anna Plantinga and Thornton, {Timothy A.} and Conneely, {Karen N.} and Epstein, {Michael P.} and Wu, {Michael C.}",
year = "2017",
month = "1",
day = "1",
doi = "10.1534/genetics.116.199646",
language = "English (US)",
volume = "206",
pages = "1779--1790",
journal = "Genetics",
issn = "0016-6731",
publisher = "Genetics Society of America",
number = "4",
}

Zhan, X, Zhao, N, Plantinga, A, Thornton, TA, Conneely, KN, Epstein, MP & Wu, MC 2017, 'Powerful genetic association analysis for common or rare variants with high-dimensional structured traits', Genetics, vol. 206, no. 4, pp. 1779-1790. https://doi.org/10.1534/genetics.116.199646

Powerful genetic association analysis for common or rare variants with high-dimensional structured traits. / Zhan, Xiang; Zhao, Ni; Plantinga, Anna; Thornton, Timothy A.; Conneely, Karen N.; Epstein, Michael P.; Wu, Michael C.

In: Genetics, Vol. 206, No. 4, 01.01.2017, p. 1779-1790.

TY - JOUR

T1 - Powerful genetic association analysis for common or rare variants with high-dimensional structured traits

AU - Zhan, Xiang

AU - Zhao, Ni

AU - Plantinga, Anna

AU - Thornton, Timothy A.

AU - Conneely, Karen N.

AU - Epstein, Michael P.

AU - Wu, Michael C.

PY - 2017/1/1

Y1 - 2017/1/1

UR - http://www.scopus.com/inward/record.url?scp=85027013507&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85027013507&partnerID=8YFLogxK

U2 - 10.1534/genetics.116.199646

DO - 10.1534/genetics.116.199646

M3 - Article

VL - 206

SP - 1779

EP - 1790

JO - Genetics

JF - Genetics

SN - 0016-6731

IS - 4

ER -