Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.


Science 3 October 1997:
Vol. 278. no. 5335, pp. 82 - 87
DOI: 10.1126/science.278.5335.82

Research Articles

De Novo Protein Design: Fully Automated Sequence Selection

Bassil I. Dahiyat, dagger Stephen L. Mayo *

The first fully automated design and experimental validation of a novel sequence for an entire protein is described. A computational design algorithm based on physical chemical potential functions and stereochemical constraints was used to screen a combinatorial library of 1.9 × 1027 possible amino acid sequences for compatibility with the design target, a beta beta alpha protein motif based on the polypeptide backbone structure of a zinc finger domain. A BLAST search shows that the designed sequence, full sequence design 1 (FSD-1), has very low identity to any known protein sequence. The solution structure of FSD-1 was solved by nuclear magnetic resonance spectroscopy and indicates that FSD-1 forms a compact well-ordered structure, which is in excellent agreement with the design target structure. This result demonstrates that computational methods can perform the immense combinatorial search required for protein design, and it suggests that an unbiased and quantitative algorithm can be used in various structural contexts.

B I. Dahiyat, Division of Chemistry and Chemical Engineering, California Institute of Technology, mail code 147-75, Pasadena, CA 91125, USA.
S. L. Mayo, Howard Hughes Medical institute and Division of Biology, California Institute of Technology, mail code 147-75, Pasadena, CA 91125, USA.
*   To whom correspondence should be addressed. E-mail: steve{at}mayo.caltech.edu

dagger    Present address: Xencor, Pasadena, CA 91106, USA.

Read the Full Text



THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
Design, expression and characterization of mutants of fasciculin optimized for interaction with its target, acetylcholinesterase.
O. Sharabi, Y. Peleg, E. Mashiach, E. Vardy, Y. Ashani, I. Silman, J. L. Sussman, and J. M. Shifman (2009)
Protein Eng. Des. Sel. 22, 641-648
   Abstract »    Full Text »    PDF »
Protein design in biological networks: from manipulating the input to modifying the output.
A. M. Van der Sloot, C. Kiel, L. Serrano, and F. Stricher (2009)
Protein Eng. Des. Sel. 22, 537-542
   Abstract »    Full Text »    PDF »
Challenges in the computational design of proteins.
M. Suarez and A. Jaramillo (2009)
J R Soc Interface 6, S477-S491
   Abstract »    Full Text »    PDF »
Alteration of enzyme specificity by computational loop remodeling and design.
P. M. Murphy, J. M. Bolduc, J. L. Gallaher, B. L. Stoddard, and D. Baker (2009)
PNAS 106, 9215-9220
   Abstract »    Full Text »    PDF »
Computational structure-based redesign of enzyme activity.
C.-Y. Chen, I. Georgiev, A. C. Anderson, and B. R. Donald (2009)
PNAS 106, 3764-3769
   Abstract »    Full Text »    PDF »
An antibody loop replacement design feasibility study and a loop-swapped dimer structure.
L. A. Clark, P. A. Boriack-Sjodin, E. Day, J. Eldredge, C. Fitch, M. Jarpe, S. Miller, Y. Li, K. Simon, and H. W.T. van Vlijmen (2009)
Protein Eng. Des. Sel. 22, 93-101
   Abstract »    Full Text »    PDF »
NMR-detected conformational exchange observed in a computationally designed variant of protein G{beta}1.
K. A. Crowhurst and S. L. Mayo (2008)
Protein Eng. Des. Sel. 21, 577-587
   Abstract »    Full Text »    PDF »
Evaluating and optimizing computational protein design force fields using fixed composition-based negative design.
O. Alvizo and S. L. Mayo (2008)
PNAS 105, 12242-12247
   Abstract »    Full Text »    PDF »
PVS: a web server for protein sequence variability analysis tuned to facilitate conserved epitope discovery.
M. Garcia-Boronat, C. M. Diez-Rivero, E. L. Reinherz, and P. A. Reche (2008)
Nucleic Acids Res. 36, W35-W41
   Abstract »    Full Text »    PDF »
High-resolution design of a protein loop.
X. Hu, H. Wang, H. Ke, and B. Kuhlman (2007)
PNAS 104, 17668-17673
   Abstract »    Full Text »    PDF »
Dead-End Elimination with Backbone Flexibility.
I. Georgiev and B. R. Donald (2007)
Bioinformatics 23, i185-i194
   Abstract »    Full Text »    PDF »
Altered Tethering of the SspB Adaptor to the ClpXP Protease Causes Changes in Substrate Delivery.
K. E. McGinness, D. N. Bolon, M. Kaganovich, T. A. Baker, and R. T. Sauer (2007)
J. Biol. Chem. 282, 11465-11473
   Abstract »    Full Text »    PDF »
Functional residues serve a dominant role in mediating the cooperativity of the protein ensemble.
T. Liu, S. T. Whitten, and V. J. Hilser (2007)
PNAS 104, 4347-4352
   Abstract »    Full Text »    PDF »
Synthesis and Selection of De Novo Proteins That Bind and Impede Cellular Functions of an Essential Mycobacterial Protein.
A. Rao, G. Ram, A. K. Saini, R. Vohra, K. Kumar, Y. Singh, and A. Ranganathan (2007)
Appl. Envir. Microbiol. 73, 1320-1331
   Abstract »    Full Text »    PDF »
Computationally designed libraries of fluorescent proteins evaluated by preservation and diversity of function.
T. P. Treynor, C. L. Vizcarra, D. Nedelcu, and S. L. Mayo (2007)
PNAS 104, 48-53
   Abstract »    Full Text »    PDF »
Combinatorial methods for small-molecule placement in computational enzyme design.
J. K. Lassila, H. K. Privett, B. D. Allen, and S. L. Mayo (2006)
PNAS 103, 16710-16715
   Abstract »    Full Text »    PDF »
A Monte Carlo Sampling Method of Amino Acid Sequences Adaptable to Given Main-Chain Atoms in the Proteins.
K. Ogata, K. Soejima, and J. Higo (2006)
J. Biochem. 140, 543-552
   Abstract »    Full Text »    PDF »
Ca2+/calmodulin-dependent protein kinase II (CaMKII) is activated by calmodulin with two bound calciums.
J. M. Shifman, M. H. Choi, S. Mihalas, S. L. Mayo, and M. B. Kennedy (2006)
PNAS 103, 13968-13973
   Abstract »    Full Text »    PDF »
Application of the multiensemble sampling to the equilibrium folding of proteins.
H. S. Son, S.-Y. Kim, J. Lee, and K.-K. Han (2006)
Bioinformatics 22, 1832-1837
   Abstract »    Full Text »    PDF »
RosettaDesign server for protein design..
Y. Liu and B. Kuhlman (2006)
Nucleic Acids Res. 34, W235-W238
   Abstract »    Full Text »    PDF »
Generation and analysis of proline mutants in protein G.
E. J. Choi and S. L. Mayo (2006)
Protein Eng. Des. Sel. 19, 285-289
   Abstract »    Full Text »    PDF »
Common attributes of native-state structures of proteins, disordered proteins, and amyloid.
T. X. Hoang, L. Marsella, A. Trovato, F. Seno, J. R. Banavar, and A. Maritan (2006)
PNAS 103, 6883-6888
   Abstract »    Full Text »    PDF »
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study.
G. Chikenji, Y. Fujitsuka, and S. Takada (2006)
PNAS 103, 3141-3146
   Abstract »    Full Text »    PDF »
Rational Design of Intercellular Adhesion Molecule-1 (ICAM-1) Variants for Antagonizing Integrin Lymphocyte Function-associated Antigen-1-dependent Adhesion.
G. Song, G. A. Lazar, T. Kortemme, M. Shimaoka, J. R. Desjarlais, D. Baker, and T. A. Springer (2006)
J. Biol. Chem. 281, 5042-5049
   Abstract »    Full Text »    PDF »
Residue-rotamer-reduction algorithm for the protein side-chain conformation problem.
W. Xie and N. V. Sahinidis (2006)
Bioinformatics 22, 188-194
   Abstract »    Full Text »    PDF »
An Active Enzyme Constructed from a 9-Amino Acid Alphabet.
K. U. Walter, K. Vamvaca, and D. Hilvert (2005)
J. Biol. Chem. 280, 37742-37746
   Abstract »    Full Text »    PDF »
Progress in Modeling of Protein Structures and Interactions.
O. Schueler-Furman, C. Wang, P. Bradley, K. Misura, and D. Baker (2005)
Science 310, 638-642
   Abstract »    Full Text »    PDF »
Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design.
G. Cheng, B. Qian, R. Samudrala, and D. Baker (2005)
Nucleic Acids Res. 33, 5861-5867
   Abstract »    Full Text »    PDF »
Specificity versus stability in computational protein design.
D. N. Bolon, R. A. Grant, T. A. Baker, and R. T. Sauer (2005)
PNAS 102, 12724-12729
   Abstract »    Full Text »    PDF »
Application of the "Codon-shuffling" Method: SYNTHESIS AND SELECTION OF DE NOVO PROTEINS AS ANTIBACTERIALS.
A. Rao, S. Chopra, G. Ram, A. Gupta, and A. Ranganathan (2005)
J. Biol. Chem. 280, 23605-23614
   Abstract »    Full Text »    PDF »
Computational Thermostabilization of an Enzyme.
A. Korkegian, M. E. Black, D. Baker, and B. L. Stoddard (2005)
Science 308, 857-860
   Abstract »    Full Text »    PDF »
Computationally designed variants of Escherichia coli chorismate mutase show altered catalytic activity.
J. K. Lassila, J. R. Keeffe, P. Oelschlaeger, and S. L. Mayo (2005)
Protein Eng. Des. Sel. 18, 161-163
   Abstract »    Full Text »    PDF »
Solving and analyzing side-chain positioning problems using linear and integer programming.
C. L. Kingsford, B. Chazelle, and M. Singh (2005)
Bioinformatics 21, 1028-1039
   Abstract »    Full Text »    PDF »
Protein sequence entropy is closely related to packing density and hydrophobicity.
H. Liao, W. Yeh, D. Chiang, R.L. Jernigan, and B. Lustig (2005)
Protein Eng. Des. Sel. 18, 59-64
   Abstract »    Full Text »    PDF »
The crystal structure of human endonuclease VIII-like 1 (NEIL1) reveals a zincless finger motif required for glycosylase activity.
S. Doublie, V. Bandaru, J. P. Bond, and S. S. Wallace (2004)
PNAS 101, 10284-10289
   Abstract »    Full Text »    PDF »
Site-directed protein recombination as a shortest-path problem.
J. B. Endelman, J. J. Silberg, Z.-G. Wang, and F. H. Arnold (2004)
Protein Eng. Des. Sel. 17, 589-594
   Abstract »    Full Text »    PDF »
Computational design of receptors for an organophosphate surrogate of the nerve agent soman.
M. Allert, S. S. Rizk, L. L. Looger, and H. W. Hellinga (2004)
PNAS 101, 7907-7912
   Abstract »    Full Text »    PDF »
Paradigms for computational nucleic acid design.
R. M. Dirks, M. Lin, E. Winfree, and N. A. Pierce (2004)
Nucleic Acids Res. 32, 1392-1403
   Abstract »    Full Text »    PDF »
A Semidefinite Programming Approach to Side Chain Positioning with New Rounding Strategies.
B. Chazelle, C. Kingsford, and M. Singh (2004)
INFORMS Journal on Computing 16, 380-392
   Abstract »    PDF »
Probabilistic approach to the design of symmetric protein quaternary structures.
X. Fu, H. Kono, and J. G. Saven (2003)
Protein Eng. Des. Sel. 16, 971-977
   Abstract »    Full Text »    PDF »
Functional tuning of a salvaged green fluorescent protein variant with a new sequence space by directed evolution.
S.-H. Nam, K.-H. Oh, G.-J. Kim, and H.-S. Kim (2003)
Protein Eng. Des. Sel. 16, 1099-1105
   Abstract »    Full Text »    PDF »
Design of a Novel Globular Protein Fold with Atomic-Level Accuracy.
B. Kuhlman, G. Dantas, G. C. Ireton, G. Varani, B. L. Stoddard, and D. Baker (2003)
Science 302, 1364-1368
   Abstract »    Full Text »    PDF »
Solution structure of a de novo protein from a designed combinatorial library.
Y. Wei, S. Kim, D. Fela, J. Baum, and M. H. Hecht (2003)
PNAS 100, 13270-13273
   Abstract »    Full Text »    PDF »
Exploring the origins of binding specificity through the computational redesign of calmodulin.
J. M. Shifman and S. L. Mayo (2003)
PNAS 100, 13274-13279
   Abstract »    Full Text »    PDF »
Computational design of a Zn2+ receptor that controls bacterial gene expression.
M. A. Dwyer, L. L. Looger, and H. W. Hellinga (2003)
PNAS 100, 11255-11260
   Abstract »    Full Text »    PDF »
Using protein design for homology detection and active site searches.
J. Pei, N. V. Dokholyan, E. I. Shakhnovich, and N. V. Grishin (2003)
PNAS 100, 11361-11366
   Abstract »    Full Text »    PDF »
Identifying residue-residue clashes in protein hybrids by using a second-order mean-field approach.
G. L. Moore and C. D. Maranas (2003)
PNAS 100, 5091-5096
   Abstract »    Full Text »    PDF »
Combining computational and experimental screening for rapid optimization of protein properties.
R. J. Hayes, J. Bentzien, M. L. Ary, M. Y. Hwang, J. M. Jacinto, J. Vielmetter, A. Kundu, and B. I. Dahiyat (2002)
PNAS 99, 15926-15931
   Abstract »    Full Text »    PDF »
An Alanine-Zipper Structure Determined by Long Range Intermolecular Interactions.
J. Liu and M. Lu (2002)
J. Biol. Chem. 277, 48708-48713
   Abstract »    Full Text »    PDF »
A simple physical model for binding energy hot spots in protein-protein complexes.
T. Kortemme and D. Baker (2002)
PNAS 99, 14116-14121
   Abstract »    Full Text »    PDF »
Folding free energy function selects native-like protein sequences in the core but not on the surface.
A. Jaramillo, L. Wernisch, S. Hery, and S. J. Wodak (2002)
PNAS 99, 13554-13559
   Abstract »    Full Text »    PDF »
Protein Design is NP-hard.
N. A. Pierce and E. Winfree (2002)
Protein Eng. Des. Sel. 15, 779-782
   Abstract »    Full Text »    PDF »
Designability of alpha -helical proteins.
E. G. Emberly, N. S. Wingreen, and C. Tang (2002)
PNAS 99, 11163-11168
   Abstract »    Full Text »    PDF »
Rationally designed mutations convert de novo amyloid-like fibrils into monomeric beta -sheet proteins.
W. Wang and M. H. Hecht (2002)
PNAS 99, 2760-2765
   Abstract »    Full Text »    PDF »
A method for optimizing potential-energy functions by a hierarchical design of the potential-energy landscape: Application to the UNRES force field.
A. Liwo, P. Arlukowicz, C. Czaplewski, S. Oldziej, J. Pillardy, and H. A. Scheraga (2002)
PNAS 99, 1937-1942
   Abstract »    Full Text »    PDF »
Protein topology and stability define the space of allowed sequences.
P. Koehl and M. Levitt (2002)
PNAS
   Abstract »    Full Text »    PDF »
Protein design from in silico dynamic information: the emergence of the `turn-dock-lock' motif.
A. Fernandez (2002)
Protein Eng. Des. Sel. 15, 1-6
   Abstract »    Full Text »    PDF »
Enzyme-like proteins by computational design.
D. N. Bolon and S. L. Mayo (2001)
PNAS
   Abstract »    Full Text »    PDF »
Searching sequence space for protein catalysts.
S. V. Taylor, K. U. Walter, P. Kast, and D. Hilvert (2001)
PNAS
   Abstract »    Full Text »    PDF »
Conversion of monomeric protein L to an obligate dimer by computational protein design.
B. Kuhlman, J. W. O'Neill, D. E. Kim, K. Y. J. Zhang, and D. Baker (2001)
PNAS
   Abstract »    Full Text »    PDF »
Knowledge-based potential defined for a rotamer library to design protein sequences.
M. Ota, Y. Isogai, and K. Nishikawa (2001)
Protein Eng. Des. Sel. 14, 557-564
   Abstract »    Full Text »    PDF »
Tryptophan zippers: Stable, monomeric beta -hairpins.
A. G. Cochran, N. J. Skelton, and M. A. Starovasnik (2001)
PNAS
   Abstract »    Full Text »
Computational method to reduce the search space for directed protein evolution.
C. A. Voigt, S. L. Mayo, F. H. Arnold, and Z.-G. Wang (2001)
PNAS 98, 3778-3783
   Abstract »    Full Text »    PDF »
Altering dimerization specificity by changes in surface electrostatics.
M. J. Nohaile, Z. S. Hendsch, B. Tidor, and R. T. Sauer (2001)
PNAS
   Abstract »    Full Text »
Recent improvements in prediction of protein structure by global optimization of a potential energy function.
J. Pillardy, C. Czaplewski, A. Liwo, J. Lee, D. R. Ripoll, R. Ka, S. Oldziej, W. J. Wedemeyer, K. D. Gibson, Y. A. Arnautova, et al. (2001)
PNAS
   Abstract »    Full Text »
Native protein sequences are close to optimal for their structures.
B. Kuhlman and D. Baker (2000)
PNAS 97, 10383-10388
   Abstract »    Full Text »    PDF »
Inaugural Article: Retrostructural analysis of metalloproteins: Application to the design of a minimal model for diiron proteins.
A. Lombardi, C. M. Summa, S. Geremia, L. Randaccio, V. Pavone, and W. F. DeGrado (2000)
PNAS 97, 6298-6305
   Abstract »    Full Text »    PDF »
Use of a quantitative structure-property relationship to design larger model proteins that fold rapidly.
A. R. Dinner, E. Verosub, and M. Karplus (1999)
Protein Eng. Des. Sel. 12, 909-917
   Abstract »    Full Text »    PDF »
Tanford-Kirkwood electrostatics for protein modeling.
J. J. Havranek and P. B. Harbury (1999)
PNAS 96, 11145-11150
   Abstract »    Full Text »    PDF »
A mini-protein designed by removing a module from barnase: molecular modeling and NMR measurements of the conformation.
K.-i. Takahashi, T. Noguti, H. Hojo, K. Yamauchi, M. Kinoshita, S. Aimoto, T. Ohkubo, and M. Go (1999)
Protein Eng. Des. Sel. 12, 673-680
   Abstract »    Full Text »    PDF »
Solution structure and dynamics of a de novo designed three-helix bundle protein.
S. T. R. Walsh, H. Cheng, J. W. Bryson, H. Roder, and W. F. DeGrado (1999)
PNAS 96, 5486-5491
   Abstract »    Full Text »    PDF »
Tolerance of Arc repressor to multiple-alanine substitutions.
B. M. Brown and R. T. Sauer (1999)
PNAS 96, 1983-1988
   Abstract »    Full Text »    PDF »
High-Resolution Protein Design with Backbone Freedom.
P. B. Harbury, J. J. Plecs, B. Tidor, T. Alber, and P. S. Kim (1998)
Science 282, 1462-1467
   Abstract »    Full Text »
Design of a 20-Amino Acid, Three-Stranded {beta}-Sheet Protein.
T. Kortemme, M. Ramírez-Alvarado, and L. Serrano (1998)
Science 281, 253-256
   Abstract »    Full Text »    PDF »
The Crystal Structure of Indoleglycerol-phosphate Synthase from Thermotoga maritima. KINETIC STABILIZATION BY SALT BRIDGES.
T. Knochel, A. Pappenberger, J. N. Jansonius, and K. Kirschner (2002)
J. Biol. Chem. 277, 8626-8634
   Abstract »    Full Text »    PDF »
Protein topology and stability define the space of allowed sequences.
P. Koehl and M. Levitt (2002)
PNAS 99, 1280-1285
   Abstract »    Full Text »    PDF »
Recent improvements in prediction of protein structure by global optimization of a potential energy function.
J. Pillardy, C. Czaplewski, A. Liwo, J. Lee, D. R. Ripoll, R. Kazmierkiewicz, S. Oldziej, W. J. Wedemeyer, K. D. Gibson, Y. A. Arnautova, et al. (2001)
PNAS 98, 2329-2333
   Abstract »    Full Text »    PDF »
Altering dimerization specificity by changes in surface electrostatics.
M. J. Nohaile, Z. S. Hendsch, B. Tidor, and R. T. Sauer (2001)
PNAS 98, 3109-3114
   Abstract »    Full Text »    PDF »
Tryptophan zippers: Stable, monomeric beta -hairpins.
A. G. Cochran, N. J. Skelton, and M. A. Starovasnik (2001)
PNAS 98, 5578-5583
   Abstract »    Full Text »    PDF »
Conversion of monomeric protein L to an obligate dimer by computational protein design.
B. Kuhlman, J. W. O'Neill, D. E. Kim, K. Y. J. Zhang, and D. Baker (2001)
PNAS 98, 10687-10691
   Abstract »    Full Text »    PDF »
Searching sequence space for protein catalysts.
S. V. Taylor, K. U. Walter, P. Kast, and D. Hilvert (2001)
PNAS 98, 10596-10601
   Abstract »    Full Text »    PDF »
From the Cover: Enzyme-like proteins by computational design.
D. N. Bolon and S. L. Mayo (2001)
PNAS 98, 14274-14279
   Abstract »    Full Text »    PDF »



To Advertise     Find Products


Science. ISSN 0036-8075 (print), 1095-9203 (online)