A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data

Yang Xie, Wei Pan, Kyeong S. Jeong, Guanghua Xiao, Arkady B. Khodursky

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

The genome-wide DNA-protein-binding data, DNA sequence data and gene expression data represent complementary means to deciphering global and local transcriptional regulatory circuits. Combining these different types of data can not only improve the statistical power, but also provide a more comprehensive picture of gene regulation. In this paper, we propose a novel statistical model to augment protein-DNA-binding data with gene expression and DNA sequence data when available. We specify a hierarchical Bayes model and use Markov chain Monte Carlo simulations to draw inferences. Both simulation studies and an analysis of an experimental data set show that the proposed joint modeling method can significantly improve the specificity and sensitivity of identifying target genes as compared with conventional approaches relying on a single data source.

Original languageEnglish (US)
Pages (from-to)489-503
Number of pages15
JournalStatistics in Medicine
Volume29
Issue number4
DOIs
StatePublished - Feb 20 2010

Keywords

  • Bayesian model
  • ChIP-chip data
  • Joint modeling
  • Microarray

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data'. Together they form a unique fingerprint.

Cite this