DNA motif elucidation using belief propagation

Ka-Chun Wong, Tak-Ming Chan, Chengbin Peng, Yue Li, Zhaolei Zhang

Research output: Contribution to journalArticlepeer-review

39 Scopus citations

Abstract

Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).
Original languageEnglish (US)
Pages (from-to)e153-e153
Number of pages1
JournalNucleic Acids Research
Volume41
Issue number16
DOIs
StatePublished - Jun 29 2013

ASJC Scopus subject areas

  • Genetics

Fingerprint Dive into the research topics of 'DNA motif elucidation using belief propagation'. Together they form a unique fingerprint.

Cite this