Science 3 October 1997:
Vol. 278, no. 5335, pp. 82-87
DOI: 10.1126/science.278.5335.82

Research Articles

De Novo Protein Design: Fully Automated Sequence Selection

Bassil I. Dahiyat,† Stephen L. Mayo*

The first fully automated design and experimental validation of a novel sequence for an entire protein is described. A computational design algorithm based on physical chemical potential functions and stereochemical constraints was used to screen a combinatorial library of 1.9 × 10^27 possible amino acid sequences for compatibility with the design target, a ββα protein motif based on the polypeptide backbone structure of a zinc finger domain. A BLAST search shows that the designed sequence, full sequence design 1 (FSD-1), has very low identity to any known protein sequence. The solution structure of FSD-1 was solved by nuclear magnetic resonance spectroscopy and indicates that FSD-1 forms a compact, well-ordered structure, which is in excellent agreement with the design target structure. This result demonstrates that computational methods can perform the immense combinatorial search required for protein design, and it suggests that an unbiased and quantitative algorithm can be used in various structural contexts.

B. I. Dahiyat, Division of Chemistry and Chemical Engineering, California Institute of Technology, mail code 147-75, Pasadena, CA 91125, USA.
S. L. Mayo, Howard Hughes Medical Institute and Division of Biology, California Institute of Technology, mail code 147-75, Pasadena, CA 91125, USA.
* To whom correspondence should be addressed. E-mail: steve@mayo.caltech.edu

† Present address: Xencor, Pasadena, CA 91106, USA.

THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
NMR-detected conformational exchange observed in a computationally designed variant of protein Gβ1.
K. A. Crowhurst and S. L. Mayo (2008)
Protein Eng. Des. Sel. 21, 577-587
Evaluating and optimizing computational protein design force fields using fixed composition-based negative design.
O. Alvizo and S. L. Mayo (2008)
PNAS 105, 12242-12247
PVS: a web server for protein sequence variability analysis tuned to facilitate conserved epitope discovery.
M. Garcia-Boronat, C. M. Diez-Rivero, E. L. Reinherz, and P. A. Reche (2008)
Nucleic Acids Res. 36, W35-W41
A de novo designed protein-protein interface.
P.-S. Huang, J. J. Love, and S. L. Mayo (2007)
Protein Sci. 16, 2770-2774
High-resolution design of a protein loop.
X. Hu, H. Wang, H. Ke, and B. Kuhlman (2007)
PNAS 104, 17668-17673
Dead-End Elimination with Backbone Flexibility.
I. Georgiev and B. R. Donald (2007)
Bioinformatics 23, i185-i194
Altered Tethering of the SspB Adaptor to the ClpXP Protease Causes Changes in Substrate Delivery.
K. E. McGinness, D. N. Bolon, M. Kaganovich, T. A. Baker, and R. T. Sauer (2007)
J. Biol. Chem. 282, 11465-11473
Computational design and biochemical characterization of maize nonspecific lipid transfer protein variants for biosensor applications.
E. J. Choi, J. Mao, and S. L. Mayo (2007)
Protein Sci. 16, 582-588
Functional residues serve a dominant role in mediating the cooperativity of the protein ensemble.
T. Liu, S. T. Whitten, and V. J. Hilser (2007)
PNAS 104, 4347-4352
Synthesis and Selection of De Novo Proteins That Bind and Impede Cellular Functions of an Essential Mycobacterial Protein.
A. Rao, G. Ram, A. K. Saini, R. Vohra, K. Kumar, Y. Singh, and A. Ranganathan (2007)
Appl. Envir. Microbiol. 73, 1320-1331
Computationally designed libraries of fluorescent proteins evaluated by preservation and diversity of function.
T. P. Treynor, C. L. Vizcarra, D. Nedelcu, and S. L. Mayo (2007)
PNAS 104, 48-53
New algorithms and an in silico benchmark for computational enzyme design.
A. Zanghellini, L. Jiang, A. M. Wollacott, G. Cheng, J. Meiler, E. A. Althoff, D. Rothlisberger, and D. Baker (2006)
Protein Sci. 15, 2785-2794
Combinatorial methods for small-molecule placement in computational enzyme design.
J. K. Lassila, H. K. Privett, B. D. Allen, and S. L. Mayo (2006)
PNAS 103, 16710-16715
A Monte Carlo Sampling Method of Amino Acid Sequences Adaptable to Given Main-Chain Atoms in the Proteins.
K. Ogata, K. Soejima, and J. Higo (2006)
J. Biochem. 140, 543-552
Ca2+/calmodulin-dependent protein kinase II (CaMKII) is activated by calmodulin with two bound calciums.
J. M. Shifman, M. H. Choi, S. Mihalas, S. L. Mayo, and M. B. Kennedy (2006)
PNAS 103, 13968-13973
Configurational-bias sampling technique for predicting side-chain conformations in proteins.
T. Jain, D. S. Cerutti, and J. A. McCammon (2006)
Protein Sci. 15, 2029-2039
Simple electrostatic model improves designed protein sequences.
E. S. Zollars, S. A. Marshall, and S. L. Mayo (2006)
Protein Sci. 15, 2014-2018
Application of the multiensemble sampling to the equilibrium folding of proteins.
H. S. Son, S.-Y. Kim, J. Lee, and K.-K. Han (2006)
Bioinformatics 22, 1832-1837
CIRSE: A solvation energy estimator compatible with flexible protein docking and design applications.
D. S. Cerutti, T. Jain, and J. A. McCammon (2006)
Protein Sci. 15, 1579-1596
RosettaDesign server for protein design.
Y. Liu and B. Kuhlman (2006)
Nucleic Acids Res. 34, W235-W238
Generation and analysis of proline mutants in protein G.
E. J. Choi and S. L. Mayo (2006)
Protein Eng. Des. Sel. 19, 285-289
Common attributes of native-state structures of proteins, disordered proteins, and amyloid.
T. X. Hoang, L. Marsella, A. Trovato, F. Seno, J. R. Banavar, and A. Maritan (2006)
PNAS 103, 6883-6888
Affinity enhancement of an in vivo matured therapeutic antibody using structure-based computational design.
L. A. Clark, P. A. Boriack-Sjodin, J. Eldredge, C. Fitch, B. Friedman, K. J.M. Hanf, M. Jarpe, S. F. Liparoto, Y. Li, A. Lugovskoy, et al. (2006)
Protein Sci. 15, 949-960
Repeat protein architectures predicted by a continuum representation of fold space.
A. C. Hausrath and A. Goriely (2006)
Protein Sci. 15, 753-760
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study.
G. Chikenji, Y. Fujitsuka, and S. Takada (2006)
PNAS 103, 3141-3146
Rational Design of Intercellular Adhesion Molecule-1 (ICAM-1) Variants for Antagonizing Integrin Lymphocyte Function-associated Antigen-1-dependent Adhesion.
G. Song, G. A. Lazar, T. Kortemme, M. Shimaoka, J. R. Desjarlais, D. Baker, and T. A. Springer (2006)
J. Biol. Chem. 281, 5042-5049
Residue-rotamer-reduction algorithm for the protein side-chain conformation problem.
W. Xie and N. V. Sahinidis (2006)
Bioinformatics 22, 188-194
An Active Enzyme Constructed from a 9-Amino Acid Alphabet.
K. U. Walter, K. Vamvaca, and D. Hilvert (2005)
J. Biol. Chem. 280, 37742-37746
Progress in Modeling of Protein Structures and Interactions.
O. Schueler-Furman, C. Wang, P. Bradley, K. Misura, and D. Baker (2005)
Science 310, 638-642
Improvement in protein functional site prediction by distinguishing structural and functional constraints on protein family evolution using computational design.
G. Cheng, B. Qian, R. Samudrala, and D. Baker (2005)
Nucleic Acids Res. 33, 5861-5867
Specificity versus stability in computational protein design.
D. N. Bolon, R. A. Grant, T. A. Baker, and R. T. Sauer (2005)
PNAS 102, 12724-12729
Application of the "Codon-shuffling" Method: Synthesis and Selection of De Novo Proteins as Antibacterials.
A. Rao, S. Chopra, G. Ram, A. Gupta, and A. Ranganathan (2005)
J. Biol. Chem. 280, 23605-23614
Computational Thermostabilization of an Enzyme.
A. Korkegian, M. E. Black, D. Baker, and B. L. Stoddard (2005)
Science 308, 857-860
Action-at-a-distance interactions enhance protein binding affinity.
B. A. Joughin, D. F. Green, and B. Tidor (2005)
Protein Sci. 14, 1363-1369
Computationally designed variants of Escherichia coli chorismate mutase show altered catalytic activity.
J. K. Lassila, J. R. Keeffe, P. Oelschlaeger, and S. L. Mayo (2005)
Protein Eng. Des. Sel. 18, 161-163
Solving and analyzing side-chain positioning problems using linear and integer programming.
C. L. Kingsford, B. Chazelle, and M. Singh (2005)
Bioinformatics 21, 1028-1039
Protein sequence entropy is closely related to packing density and hydrophobicity.
H. Liao, W. Yeh, D. Chiang, R.L. Jernigan, and B. Lustig (2005)
Protein Eng. Des. Sel. 18, 59-64
Folding Trp-Cage to NMR Resolution Native Structure Using a Coarse-Grained Protein Model.
F. Ding, S. V. Buldyrev, and N. V. Dokholyan (2005)
Biophys. J. 88, 147-155
In silico protein design by combinatorial assembly of protein building blocks.
H.-H. (G.) Tsai, C.-J. Tsai, B. Ma, and R. Nussinov (2004)
Protein Sci. 13, 2753-2765
The crystal structure of human endonuclease VIII-like 1 (NEIL1) reveals a zincless finger motif required for glycosylase activity.
S. Doublie, V. Bandaru, J. P. Bond, and S. S. Wallace (2004)
PNAS 101, 10284-10289
Site-directed protein recombination as a shortest-path problem.
J. B. Endelman, J. J. Silberg, Z.-G. Wang, and F. H. Arnold (2004)
Protein Eng. Des. Sel. 17, 589-594
Computational design of receptors for an organophosphate surrogate of the nerve agent soman.
M. Allert, S. S. Rizk, L. L. Looger, and H. W. Hellinga (2004)
PNAS 101, 7907-7912
The response of internal dynamics to hydrophobic core mutations in the SH3 domain from the Fyn tyrosine kinase.
A. Mittermaier and L. E. Kay (2004)
Protein Sci. 13, 1088-1099
Energy functions for protein design I: Efficient and accurate continuum electrostatics and solvation.
N. Pokala and T. M. Handel (2004)
Protein Sci. 13, 925-936
Improved side-chain prediction accuracy using an ab initio potential energy function and a very large rotamer library.
R. W. Peterson, P. L. Dutton, and A. J. Wand (2004)
Protein Sci. 13, 735-751
Paradigms for computational nucleic acid design.
R. M. Dirks, M. Lin, E. Winfree, and N. A. Pierce (2004)
Nucleic Acids Res. 32, 1392-1403
A Semidefinite Programming Approach to Side Chain Positioning with New Rounding Strategies.
B. Chazelle, C. Kingsford, and M. Singh (2004)
INFORMS Journal on Computing 16, 380-392
Understanding the determinants of stability and folding of small globular proteins from their energetics.
G. Tiana, F. Simona, G. M.S. De Mori, R. A. Broglia, and G. Colombo (2004)
Protein Sci. 13, 113-124
Probabilistic approach to the design of symmetric protein quaternary structures.
X. Fu, H. Kono, and J. G. Saven (2003)
Protein Eng. Des. Sel. 16, 971-977
Functional tuning of a salvaged green fluorescent protein variant with a new sequence space by directed evolution.
S.-H. Nam, K.-H. Oh, G.-J. Kim, and H.-S. Kim (2003)
Protein Eng. Des. Sel. 16, 1099-1105
Design of a Novel Globular Protein Fold with Atomic-Level Accuracy.
B. Kuhlman, G. Dantas, G. C. Ireton, G. Varani, B. L. Stoddard, and D. Baker (2003)
Science 302, 1364-1368
Solution structure of a de novo protein from a designed combinatorial library.
Y. Wei, S. Kim, D. Fela, J. Baum, and M. H. Hecht (2003)
PNAS 100, 13270-13273
Exploring the origins of binding specificity through the computational redesign of calmodulin.
J. M. Shifman and S. L. Mayo (2003)
PNAS 100, 13274-13279
A de novo redesign of the WW domain.
C. M. Kraemer-Pecore, J. T.J. Lecomte, and J. R. Desjarlais (2003)
Protein Sci. 12, 2194-2205
Computational design of a Zn2+ receptor that controls bacterial gene expression.
M. A. Dwyer, L. L. Looger, and H. W. Hellinga (2003)
PNAS 100, 11255-11260
Using protein design for homology detection and active site searches.
J. Pei, N. V. Dokholyan, E. I. Shakhnovich, and N. V. Grishin (2003)
PNAS 100, 11361-11366
Amyloid-forming peptides selected proteolytically from phage display library.
K. Koscielska-Kasprzak and J. Otlewski (2003)
Protein Sci. 12, 1675-1685
Importance of α-helix N-capping motif in stabilization of ββα fold.
K. Koscielska-Kasprzak, T. Cierpicki, and J. Otlewski (2003)
Protein Sci. 12, 1283-1289
Identifying residue-residue clashes in protein hybrids by using a second-order mean-field approach.
G. L. Moore and C. D. Maranas (2003)
PNAS 100, 5091-5096
Combining computational and experimental screening for rapid optimization of protein properties.
R. J. Hayes, J. Bentzien, M. L. Ary, M. Y. Hwang, J. M. Jacinto, J. Vielmetter, A. Kundu, and B. I. Dahiyat (2002)
PNAS 99, 15926-15931
An Alanine-Zipper Structure Determined by Long Range Intermolecular Interactions.
J. Liu and M. Lu (2002)
J. Biol. Chem. 277, 48708-48713
Crystal structures and increased stabilization of the protein G variants with switched folding pathways NuG1 and NuG2.
S. Nauli, B. Kuhlman, I. Le Trong, R. E. Stenkamp, D. Teller, and D. Baker (2002)
Protein Sci. 11, 2924-2931
A simple physical model for binding energy hot spots in protein-protein complexes.
T. Kortemme and D. Baker (2002)
PNAS 99, 14116-14121
Folding free energy function selects native-like protein sequences in the core but not on the surface.
A. Jaramillo, L. Wernisch, S. Hery, and S. J. Wodak (2002)
PNAS 99, 13554-13559
Protein Design is NP-hard.
N. A. Pierce and E. Winfree (2002)
Protein Eng. Des. Sel. 15, 779-782
Designability of α-helical proteins.
E. G. Emberly, N. S. Wingreen, and C. Tang (2002)
PNAS 99, 11163-11168
Computational stabilization of human growth hormone.
A. V. Filikov, R. J. Hayes, P. Luo, D. M. Stark, C. Chan, A. Kundu, and B. I. Dahiyat (2002)
Protein Sci. 11, 1452-1461
BetaCore, a designed water soluble four-stranded antiparallel β-sheet protein.
N. Carulla, C. Woodward, and G. Barany (2002)
Protein Sci. 11, 1539-1551
Development of a cytokine analog with enhanced stability using computational ultrahigh throughput screening.
P. Luo, R. J. Hayes, C. Chan, D. M. Stark, M. Y. Hwang, J. M. Jacinto, P. Juvvadi, H. S. Chung, A. Kundu, M. L. Ary, et al. (2002)
Protein Sci. 11, 1218-1226
Rationally designed mutations convert de novo amyloid-like fibrils into monomeric β-sheet proteins.
W. Wang and M. H. Hecht (2002)
PNAS 99, 2760-2765
The role of aromatic residues in the hydrophobic core of the villin headpiece subdomain.
B. S. Frank, D. Vardar, D. A. Buckley, and C. J. McKnight (2002)
Protein Sci. 11, 680-687
A method for optimizing potential-energy functions by a hierarchical design of the potential-energy landscape: Application to the UNRES force field.
A. Liwo, P. Arlukowicz, C. Czaplewski, S. Oldziej, J. Pillardy, and H. A. Scheraga (2002)
PNAS 99, 1937-1942
Protein topology and stability define the space of allowed sequences.
P. Koehl and M. Levitt (2002)
PNAS
Protein design from in silico dynamic information: the emergence of the 'turn-dock-lock' motif.
A. Fernandez (2002)
Protein Eng. Des. Sel. 15, 1-6
Enzyme-like proteins by computational design.
D. N. Bolon and S. L. Mayo (2001)
PNAS
Searching sequence space for protein catalysts.
S. V. Taylor, K. U. Walter, P. Kast, and D. Hilvert (2001)
PNAS
Conversion of monomeric protein L to an obligate dimer by computational protein design.
B. Kuhlman, J. W. O'Neill, D. E. Kim, K. Y. J. Zhang, and D. Baker (2001)
PNAS
Knowledge-based potential defined for a rotamer library to design protein sequences.
M. Ota, Y. Isogai, and K. Nishikawa (2001)
Protein Eng. Des. Sel. 14, 557-564
Tryptophan zippers: Stable, monomeric β-hairpins.
A. G. Cochran, N. J. Skelton, and M. A. Starovasnik (2001)
PNAS
Computational estimation of specific side chain interaction energies in α helices.
S. Fisinger, L. Serrano, and E. Lacroix (2001)
Protein Sci. 10, 809-818
Computational method to reduce the search space for directed protein evolution.
C. A. Voigt, S. L. Mayo, F. H. Arnold, and Z.-G. Wang (2001)
PNAS 98, 3778-3783
Altering dimerization specificity by changes in surface electrostatics.
M. J. Nohaile, Z. S. Hendsch, B. Tidor, and R. T. Sauer (2001)
PNAS
Recent improvements in prediction of protein structure by global optimization of a potential energy function.
J. Pillardy, C. Czaplewski, A. Liwo, J. Lee, D. R. Ripoll, R. Ka, S. Oldziej, W. J. Wedemeyer, K. D. Gibson, Y. A. Arnautova, et al. (2001)
PNAS
Design, synthesis, and characterization of a novel hemoprotein.
Z. Xu and R. S. Farid (2001)
Protein Sci. 10, 236-249
Optimization of binding electrostatics: Charge complementarity in the barnase-barstar protein complex.
L.-P. Lee and B. Tidor (2001)
Protein Sci. 10, 362-377
Native protein sequences are close to optimal for their structures.
B. Kuhlman and D. Baker (2000)
PNAS 97, 10383-10388