Epigenetic combinatorial patterns predict disease variants

Yu Zhang

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

Most genetic variants identified in genome-wide association studies are noncoding and are likely tagging nearby causal variants. It is a challenging task to pinpoint the precise locations of disease-causal variants and understand their functions in disease. A promising approach to improve fine mapping is to integrate the functional data currently available on hundreds of human tissues and cell types. Although there are several methods that use functional data to prioritize disease variants, they mainly use linear models, or equivalent naive likelihood-based models for prediction. Here, we investigate whether study of the combinatorial patterns of functional data across cell types can improve prediction accuracy for disease variants. Using functional annotation in 127 human cell types, we first introduce a Bayesian method to identify recurring cell-type-specificity partitions on the scale of the genome. We show that our de novo identification of epigenome partition patterns agrees well with known cell-type origins and that the associated functional elements are strongly enriched in disease variants. Using epigenetic cell-type specificity in addition to enrichment of functional elements, we further demonstrate that the power to predict disease variants can be greatly improved over that achievable with linear models. Our approach thus provides a new way to prioritize disease functional variants for testing.

Original languageEnglish (US)
Article number71
JournalFrontiers in Genetics
Volume8
Issue numberMAY
DOIs
StatePublished - May 30 2017

Fingerprint

Epigenomics
Linear Models
Bayes Theorem
Genome-Wide Association Study
Genome

All Science Journal Classification (ASJC) codes

  • Molecular Medicine
  • Genetics
  • Genetics(clinical)

Cite this

@article{9fca687358af4cd1a919879467b7817c,
title = "Epigenetic combinatorial patterns predict disease variants",
abstract = "Most genetic variants identified in genome-wide association studies are noncoding and are likely tagging nearby causal variants. It is a challenging task to pinpoint the precise locations of disease-causal variants and understand their functions in disease. A promising approach to improve fine mapping is to integrate the functional data currently available on hundreds of human tissues and cell types. Although there are several methods that use functional data to prioritize disease variants, they mainly use linear models, or equivalent naive likelihood-based models for prediction. Here, we investigate whether study of the combinatorial patterns of functional data across cell types can improve prediction accuracy for disease variants. Using functional annotation in 127 human cell types, we first introduce a Bayesian method to identify recurring cell-type-specificity partitions on the scale of the genome. We show that our de novo identification of epigenome partition patterns agrees well with known cell-type origins and that the associated functional elements are strongly enriched in disease variants. Using epigenetic cell-type specificity in addition to enrichment of functional elements, we further demonstrate that the power to predict disease variants can be greatly improved over that achievable with linear models. Our approach thus provides a new way to prioritize disease functional variants for testing.",
author = "Yu Zhang",
year = "2017",
month = "5",
day = "30",
doi = "10.3389/fgene.2017.00071",
language = "English (US)",
volume = "8",
journal = "Frontiers in Genetics",
issn = "1664-8021",
publisher = "Frontiers Media S. A.",
number = "MAY",

}

Epigenetic combinatorial patterns predict disease variants. / Zhang, Yu.

In: Frontiers in Genetics, Vol. 8, No. MAY, 71, 30.05.2017.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Epigenetic combinatorial patterns predict disease variants

AU - Zhang, Yu

PY - 2017/5/30

Y1 - 2017/5/30

N2 - Most genetic variants identified in genome-wide association studies are noncoding and are likely tagging nearby causal variants. It is a challenging task to pinpoint the precise locations of disease-causal variants and understand their functions in disease. A promising approach to improve fine mapping is to integrate the functional data currently available on hundreds of human tissues and cell types. Although there are several methods that use functional data to prioritize disease variants, they mainly use linear models, or equivalent naive likelihood-based models for prediction. Here, we investigate whether study of the combinatorial patterns of functional data across cell types can improve prediction accuracy for disease variants. Using functional annotation in 127 human cell types, we first introduce a Bayesian method to identify recurring cell-type-specificity partitions on the scale of the genome. We show that our de novo identification of epigenome partition patterns agrees well with known cell-type origins and that the associated functional elements are strongly enriched in disease variants. Using epigenetic cell-type specificity in addition to enrichment of functional elements, we further demonstrate that the power to predict disease variants can be greatly improved over that achievable with linear models. Our approach thus provides a new way to prioritize disease functional variants for testing.

AB - Most genetic variants identified in genome-wide association studies are noncoding and are likely tagging nearby causal variants. It is a challenging task to pinpoint the precise locations of disease-causal variants and understand their functions in disease. A promising approach to improve fine mapping is to integrate the functional data currently available on hundreds of human tissues and cell types. Although there are several methods that use functional data to prioritize disease variants, they mainly use linear models, or equivalent naive likelihood-based models for prediction. Here, we investigate whether study of the combinatorial patterns of functional data across cell types can improve prediction accuracy for disease variants. Using functional annotation in 127 human cell types, we first introduce a Bayesian method to identify recurring cell-type-specificity partitions on the scale of the genome. We show that our de novo identification of epigenome partition patterns agrees well with known cell-type origins and that the associated functional elements are strongly enriched in disease variants. Using epigenetic cell-type specificity in addition to enrichment of functional elements, we further demonstrate that the power to predict disease variants can be greatly improved over that achievable with linear models. Our approach thus provides a new way to prioritize disease functional variants for testing.

UR - http://www.scopus.com/inward/record.url?scp=85043785554&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85043785554&partnerID=8YFLogxK

U2 - 10.3389/fgene.2017.00071

DO - 10.3389/fgene.2017.00071

M3 - Article

C2 - 28611825

AN - SCOPUS:85043785554

VL - 8

JO - Frontiers in Genetics

JF - Frontiers in Genetics

SN - 1664-8021

IS - MAY

M1 - 71

ER -