A varying threshold method for ChIP peak-calling using multiple sources of information

Kuan Bei Chen, Yu Zhang

Research output: Contribution to journalArticle

5 Scopus citations

Abstract

Motivation: Gene regulation commonly involves interaction among DNA, proteins and biochemical conditions. Using chromatin immunoprecipitation (ChIP) technologies, protein-DNA interactions are routinely detected in the genome scale. Computational methods that detect weak protein-binding signals and simultaneously maintain a high specificity yet remain to be challenging. An attractive approach is to incorporate biologically relevant data, such as protein co-occupancy, to improve the power of protein-binding detection. We call the additional data related with the target protein binding as supporting tracks. Results: We propose a novel but rigorous statistical method to identify protein occupancy in ChIP data using multiple supporting tracks (PASS2). We demonstrate that utilizing biologically related information can significantly increase the discovery of true protein-binding sites, while still maintaining a desired level of false positive calls. Applying the method to GATA1 restoration in mouse erythroid cell line, we detected many new GATA1-binding sites using GATA1 co-occupancy data.

Original languageEnglish (US)
Article numberbtq379
Pages (from-to)i504-i510
JournalBioinformatics
Volume26
Issue number18
DOIs
StatePublished - Sep 4 2010

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint Dive into the research topics of 'A varying threshold method for ChIP peak-calling using multiple sources of information'. Together they form a unique fingerprint.

  • Cite this