TY - JOUR
T1 - A linear model for transcription factor binding affinity prediction in protein binding microarrays
AU - Annala, Matti
AU - Laurila, Kirsti
AU - Lähdesmäki, Harri
AU - Nykter, Matti
PY - 2011
Y1 - 2011
N2 - Protein binding microarrays (PBM) are a high throughput technology used to characterize protein-DNA binding. The arrays measure a protein's affinity toward thousands of double-stranded DNA sequences at once, producing a comprehensive binding specificity catalog. We present a linear model for predicting the binding affinity of a protein toward DNA sequences based on PBM data. Our model represents the measured intensity of an individual probe as a sum of the binding affinity contributions of the probe's subsequences. These subsequences characterize a DNA binding motif and can be used to predict the intensity of protein binding against arbitrary DNA sequences. Our method was the best performer in the Dialogue for Reverse Engineering Assessments and Methods 5 (DREAM5) transcription factor/DNA motif recognition challenge. For the DREAM5 bonus challenge, we also developed an approach for the identification of transcription factors based on their PBM binding profiles. Our approach for TF identification achieved the best performance in the bonus challenge.
AB - Protein binding microarrays (PBM) are a high throughput technology used to characterize protein-DNA binding. The arrays measure a protein's affinity toward thousands of double-stranded DNA sequences at once, producing a comprehensive binding specificity catalog. We present a linear model for predicting the binding affinity of a protein toward DNA sequences based on PBM data. Our model represents the measured intensity of an individual probe as a sum of the binding affinity contributions of the probe's subsequences. These subsequences characterize a DNA binding motif and can be used to predict the intensity of protein binding against arbitrary DNA sequences. Our method was the best performer in the Dialogue for Reverse Engineering Assessments and Methods 5 (DREAM5) transcription factor/DNA motif recognition challenge. For the DREAM5 bonus challenge, we also developed an approach for the identification of transcription factors based on their PBM binding profiles. Our approach for TF identification achieved the best performance in the bonus challenge.
KW - Cell Differentiation
KW - cytology/immunology; Transcription
KW - Genetic
KW - genetics/immunology; Oligonucleotide Array Sequence Analysis; STAT6 Transcription Factor
KW - genetics/immunology; Th2 Cells
KW - immunology; Gene Expression; Gene Expression Profiling; Gene Expression Regulation
KW - immunology; Genome-Wide Association Study; Humans; Interleukin-4
KW - Cell Differentiation
KW - cytology/immunology; Transcription
KW - Genetic
KW - genetics/immunology; Oligonucleotide Array Sequence Analysis; STAT6 Transcription Factor
KW - genetics/immunology; Th2 Cells
KW - immunology; Gene Expression; Gene Expression Profiling; Gene Expression Regulation
KW - immunology; Genome-Wide Association Study; Humans; Interleukin-4
KW - Cell Differentiation
KW - cytology/immunology
KW - Genetic
KW - genetics
KW - Transcription
KW - immunology
KW - Genome-Wide Association Study
KW - Gene Expression
KW - Gene Expression Profiling
KW - Gene Expression Regulation
KW - Humans
KW - Interleukin-4
UR - http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0020059
U2 - 10.1371/journal.pone.0020059
DO - 10.1371/journal.pone.0020059
M3 - Article
VL - 6
SP - 1
EP - 13
JO - PloS one
JF - PloS one
SN - 1932-6203
IS - 5
M1 - e20059
ER -