Submitted on June 25, 2008
Accepted on May 1, 2009
Diversity and Complexity in DNA Recognition by Transcription Factors
Gwenael Badis 1
, Michael F. Berger 2
, Anthony A. Philippakis 3
, Shaheynoor Talukder 4
, Andrew R. Gehrke 5
, Savina A. Jaeger 5
, Esther T. Chan 6
, Genita Metzler 7, Anastasia Vedenko 8, Xiaoyu Chen 1, Hanna Kuznetsov 7, Chi-Fong Wang 9, David Coburn 1, Daniel E. Newburger 5, Quaid Morris 10, Timothy R. Hughes 11*, Martha L. Bulyk 12*
1 Banting and Best Department of Medical Research, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.
2 Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA.; Committee on Higher Degrees in Biophysics, Harvard University, Cambridge, MA 02138, USA.
3 Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA.; Harvard-MIT Division of Health Sciences and Technology (HST), Harvard Medical School, Boston, MA 02115, USA.; Committee on Higher Degrees in Biophysics, Harvard University, Cambridge, MA 02138, USA.
4 Banting and Best Department of Medical Research, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.; Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.
5 Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA.
6 Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.
7 Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
8 Department of Biology, Wellesley College, Wellesley, MA 02481, USA.
9 Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
10 Banting and Best Department of Medical Research, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.; Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.; Department of Computer Science, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.; Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.
11 Banting and Best Department of Medical Research, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.; Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.; Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON, Canada M5S 3E1.
12 Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA.; Department of Pathology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA.; Harvard-MIT Division of Health Sciences and Technology (HST), Harvard Medical School, Boston, MA 02115, USA.; Committee on Higher Degrees in Biophysics, Harvard University, Cambridge, MA 02138, USA.
* To whom correspondence should be addressed.
Timothy R. Hughes , E-mail: t.hughes{at}utoronto.ca
Martha L. Bulyk , E-mail: mlbulyk{at}receptor.med.harvard.edu
These authors contributed equally to this work.
Sequence preferences of DNA-binding proteins are a primary mechanism by which cells interpret the genome. Despite these proteins central importance in physiology, development, and evolution, comprehensive DNA-binding specificities have been determined experimentally for few proteins. Here, we used microarrays containing all 10-base-pair sequences to examine the binding specificities of 104 distinct mouse DNA-binding proteins representing 22 structural classes. Our results reveal a complex landscape of binding, with virtually every protein analyzed possessing unique preferences. Roughly half of the proteins each recognized multiple distinctly different sequence motifs, challenging our molecular understanding of how proteins interact with their DNA binding sites. This complexity in DNA recognition may be important in gene regulation and in evolution of transcriptional regulatory networks.