Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.


Science 11 June 1971:
Vol. 172. no. 3988, pp. 1089 - 1096
DOI: 10.1126/science.172.3988.1089

Articles

Fitting Discrete Probability Distributions to Evolutionary Events

Thomas Uzzell 1 and Kendall W. Corbin 2

1 The Peabody Museum of Natural History of Yale University, New Haven, Connecticut
2 Yale University, The University of Minnesota, Minneapolis

The assumptions underlying the use of the Poisson distribution are essentially that the probability of an event is small but nearly identical for all occurrences and that the occurrence of an event does not alter the probability of recurrence of such events. These assumptions do not seem to be met for evolutionary events since (i) the probability of fixing nucleotide codon substitutions is not equal for all substitutions at a codon, and probably varies for the same substitution in different lineages; (ii) the probability of fixing codon substitutions varies among positions of a cistron; and (iii) the fixation of a nucleotide codon substitution at one position in a cistron modifies, and may even promote, the fixation of a codon substitution elsewhere along the cistron. Natural selection presumably is the causative factor that acts to modify the probability of a nucleotide codon substitution's being fixed in a population.

The use of the negative binomial distribution is consistent with the evidence that selective pressure on amino acid or nucleotide codon positions varies both among codon positions of a cistron and at a particular position during evolutionary time.

If the number of fixations of nucleotide codon substitutions per position of cistrons encoding cytochromes c are phyletically inferred (phylogeny based on a paleontological record) rather than phenetically inferred (based on paired comparisons of extant species' differences in the absence of a phylogeny) the distribution of these fixation data cannot be described adequately by a single Poisson distribution. The fit of these same data to a negative binomial distribution is very satisfactory.

It has been argued that the fit of phenetically inferred fixation data, which do not take account of parallel or reverse fixations, to the Poisson distribution was supportive evidence for the hypothesis that protein evolution results from the fixation of selectively neutral codon substitutions. This argument now appears to be undercut by the evidence that data on nucleotide codon fixation are more probably distributed according to the negative binomial distribution.

The fact that fixation data can be described by a particular discrete probability distribution does not of itself provide insight into the mechanisms of the evolutionary process. However, the facts—(i) that the assumptions underlying the use of the negative binomial distribution adequately deal with the varying probability of fixing amino acid or nucleotide codon substitutions at and among the positions of a cistron and (ii) that the negative binomial distribution provides an excellent fit for the phyletically inferred fixation data—suggest that the negative binomial is a very appropriate discrete probability distribution for describing evolutionary events.

Amino acids or their nucleotide codon substitutions may be fixed at a position of a cistron as though selectively neutral relative to the codon being replaced, even though the codon position will not be selectively neutral, since many amino acids cannot function there. The negative binomial distribution treats this situation well whereas a single Poisson distribution could only be satisfactory if all codon positions that could vary were selectively neutral.


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
The impact of single substitutions on multiple sequence alignments.
S. Klaere, T. Gesell, and A. von Haeseler (2008)
Phil Trans R Soc B 363, 4041-4047
   Abstract »    Full Text »    PDF »
Penalized Likelihood Phylogenetic Inference: Bridging the Parsimony-Likelihood Gap.
J. Kim and M. J. Sanderson (2008)
Syst Biol 57, 665-674
   Abstract »    Full Text »    PDF »
Site-specific evolutionary rates in proteins are better modeled as non-independent and strictly relative.
A. D. Fernandes and W. R. Atchley (2008)
Bioinformatics 24, 2177-2183
   Abstract »    Full Text »    PDF »
Overdispersion of the Molecular Clock: Temporal Variation of Gene-Specific Substitution Rates in Drosophila.
T. Bedford and D. L. Hartl (2008)
Mol. Biol. Evol. 25, 1631-1638
   Abstract »    Full Text »    PDF »
Difficulties in Testing for Covarion-Like Properties of Sequences under the Confounding Influence of Changing Proportions of Variable Sites.
N. Gruenheit, P. J. Lockhart, M. Steel, and W. Martin (2008)
Mol. Biol. Evol. 25, 1512-1520
   Abstract »    Full Text »    PDF »
A Nonparametric Method for Accommodating and Testing Across-Site Rate Variation.
J. P. Huelsenbeck and M. A. Suchard (2007)
Syst Biol 56, 975-987
   Abstract »    Full Text »    PDF »
Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.
M. D. Rasmussen and M. Kellis (2007)
Genome Res. 17, 1932-1942
   Abstract »    Full Text »    PDF »
Evolutionary rates vary among rRNA structural elements.
S. Smit, J. Widmann, and R. Knight (2007)
Nucleic Acids Res. 35, 3339-3354
   Abstract »    Full Text »    PDF »
Molecular Clock: An Anti-neo-Darwinian Legacy.
N. Takahata (2007)
Genetics 176, 1-6
   Full Text »    PDF »
Evolutionary Framework for Protein Sequence Evolution and Gene Pleiotropy.
X. Gu (2007)
Genetics 175, 1813-1822
   Abstract »    Full Text »    PDF »
Artifactual Phylogenies Caused by Correlated Distribution of Substitution Rates among Sites and Lineages: The Good, the Bad, and the Ugly.
V. Ruano-Rubio and M. A. Fares (2007)
Syst Biol 56, 68-82
   Abstract »    Full Text »    PDF »
Prediction of Function Divergence in Protein Families Using the Substitution Rate Variation Parameter Alpha.
S. Abhiman, C. O. Daub, and E. L. L. Sonnhammer (2006)
Mol. Biol. Evol. 23, 1406-1413
   Abstract »    Full Text »    PDF »
A Simple and Robust Statistical Test for Detecting the Presence of Recombination.
T. C. Bruen, H. Philippe, and D. Bryant (2006)
Genetics 172, 2665-2681
   Abstract »    Full Text »    PDF »
Under-parameterized Model of Sequence Evolution Leads to Bias in the Estimation of Diversification Rates from Molecular Phylogenies.
L. J. Revell, L. J. Harmon, and R. E. Glor (2005)
Syst Biol 54, 973-983
   Full Text »    PDF »
Site-to-Site Variation of Synonymous Substitution Rates.
S. K. Pond and S. V. Muse (2005)
Mol. Biol. Evol. 22, 2375-2385
   Abstract »    Full Text »    PDF »
Maximum Likelihood Outperforms Maximum Parsimony Even When Evolutionary Rates Are Heterotachous.
S. R. Gadagkar and S. Kumar (2005)
Mol. Biol. Evol. 22, 2139-2141
   Abstract »    Full Text »    PDF »
Curious Parallels and Curious Connections--Phylogenetic Thinking in Biology and Historical Linguistics.
Q. D. Atkinson and R. D. Gray (2005)
Syst Biol 54, 513-526
   Abstract »    Full Text »    PDF »
A sequence sub-sampling algorithm increases the power to detect distant homologues.
C. R. Johnston and D. C. Shields (2005)
Nucleic Acids Res. 33, 3772-3778
   Abstract »    Full Text »    PDF »
REVCOM: a robust Bayesian method for evolutionary rate estimation.
A. J. Bordner and R. Abagyan (2005)
Bioinformatics 21, 2315-2321
   Abstract »    Full Text »    PDF »
Likelihood, Parsimony, and Heterogeneous Evolution.
M. Spencer, E. Susko, and A. J. Roger (2005)
Mol. Biol. Evol. 22, 1161-1164
   Abstract »    Full Text »    PDF »
Testing Substitution Models Within a Phylogenetic Tree.
G. Weiss and A. von Haeseler (2003)
Mol. Biol. Evol. 20, 572-578
   Abstract »    Full Text »    PDF »
The Pattern of Amino Acid Replacements in {alpha}/{beta}-Barrels.
A. M. Dean, C. Neuhauser, E. Grenier, and G. B. Golding (2002)
Mol. Biol. Evol. 19, 1846-1864
   Abstract »    Full Text »    PDF »
Inference of functional regions in proteins by quantification of evolutionary constraints.
A. L. Simon, E. A. Stone, and A. Sidow (2002)
PNAS 99, 2912-2917
   Abstract »    Full Text »    PDF »
Heterotachy, an Important Process of Protein Evolution.
P. Lopez, D. Casane, and H. Philippe (2002)
Mol. Biol. Evol. 19, 1-7
   Abstract »    Full Text »    PDF »
Estimating Absolute Rates of Molecular Evolution and Divergence Times: A Penalized Likelihood Approach.
M. J. Sanderson (2002)
Mol. Biol. Evol. 19, 101-109
   Abstract »    Full Text »    PDF »
From Complete Genomes to Measures of Substitution Rate Variability Within and Between Proteins.
N. V. Grishin, Y. I. Wolf, and E. V. Koonin (2000)
Genome Res. 10, 991-1000
   Abstract »    Full Text »
Pattern of Nucleotide Substitution and Rate Heterogeneity in the Hypervariable Regions I and II of Human mtDNA.
S. Meyer, G. Weiss, and A. von Haeseler (1999)
Genetics 152, 1103-1110
   Abstract »    Full Text »
Evolution of the RECQ Family of Helicases: A Drosophila Homolog, Dmblm, Is Similar to the Human Bloom Syndrome Gene.
K. Kusano, M. E. Berres, and W. R. Engels (1999)
Genetics 151, 1027-1039
   Abstract »    Full Text »
An olfactory-specific glutathione-S-transferase in the sphinx moth Manduca sexta.
M. Rogers, M. Jani, and R. Vogt (1999)
J. Exp. Biol. 202, 1625-1637
   Abstract »
Inference of Population History Using a Likelihood Approach.
G. Weiss and A. von Haeseler (1998)
Genetics 149, 1539-1546
   Abstract »    Full Text »    PDF »
Correlation Between the Substitution Rate and Rate Variation Among Sites in Protein Evolution.
J. Zhang and X. Gu (1998)
Genetics 149, 1615-1625
   Abstract »    Full Text »    PDF »
Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.
X. Gu and W.-H. Li (1998)
PNAS 95, 5899-5905
   Abstract »    Full Text »    PDF »
Response: Dating the Cenancester of Organisms.
R. F. Doolittle, D.-F. Feng, S. Tsang, G. Cho, and E. Little (1996)
Science 274, 1751-1753
   PDF »
Evolution at two levels in humans and chimpanzees.
M. King and A. Wilson (1975)
Science 188, 107-116
   PDF »
Amino Acid Difference Formula to Help Explain Protein Evolution.
R. Grantham (1974)
Science 185, 862-864
   Abstract »    PDF »
Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms.
M. Nei, P. Xu, and G. Glazko (2001)
PNAS 98, 2497-2502
   Abstract »    Full Text »    PDF »



To Advertise     Find Products


Science. ISSN 0036-8075 (print), 1095-9203 (online)