

Science 5 June 1992:
Vol. 256. no. 5062, pp. 1443 - 1445
DOI: 10.1126/science.1604319

Copyright © 1992 by American Association for the Advancement of Science



Exhaustive matching of the entire protein sequence database

GH Gonnet, MA Cohen, and SA Benner

Institute for Scientific Computation, Swiss Federal Institute of Technology, Zurich, Switzerland.

The entire protein sequence database has been exhaustively matched. Definitive mutation matrices and models for scoring gaps were obtained from the matching and used to organize the sequence database as sets of evolutionarily connected components. The methods developed are general and can be used to manage sequence data generated by major genome sequencing projects. The alignments made possible by the exhaustive matching are the starting point for successful de novo prediction of the folded structures of proteins, for reconstructing sequences of ancient proteins and metabolisms in ancient organisms, and for obtaining new perspectives in structural biochemistry.
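To illustrate what organizing a database "as sets of evolutionarily connected components" can mean in practice, here is a minimal sketch in Python. It is not the authors' implementation: it assumes the all-against-all matching has already produced a list of sequence pairs judged to be significantly similar, and it groups the sequences with a standard union-find structure. The function name, sequence identifiers, and match list are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def connected_components(ids, matches):
    """Group sequence ids into components linked by pairwise matches.

    ids     -- iterable of sequence identifiers (hypothetical names)
    matches -- iterable of (id_a, id_b) pairs assumed to be significant matches
    """
    parent = {s: s for s in ids}

    def find(x):
        # Walk up to the root, shortening the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in matches:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two components

    groups = defaultdict(list)
    for s in ids:
        groups[find(s)].append(s)
    # Sort for a deterministic, readable result.
    return sorted(sorted(g) for g in groups.values())


# Hypothetical identifiers and matches, for illustration only.
ids = ["seqA", "seqB", "seqC", "seqD", "seqE", "seqF"]
matches = [("seqA", "seqB"), ("seqB", "seqC"), ("seqD", "seqE")]
components = connected_components(ids, matches)
print(components)
```

In this sketch seqA and seqC end up in the same component even though they were never matched directly, because both match seqB; a sequence with no significant match, such as seqF, forms a component of its own.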







Science. ISSN 0036-8075 (print), 1095-9203 (online)