

Science, 12 November 2004:
Vol. 306, no. 5699, pp. 1172-1174
DOI: 10.1126/science.1102036

Reports

Prospects for Building the Tree of Life from Large Sequence Databases

Amy C. Driskell,1,2* Cécile Ané,1†‡ J. Gordon Burleigh,1† Michelle M. McMahon,1† Brian C. O'Meara,2† Michael J. Sanderson1

We assess the phylogenetic potential of ~300,000 protein sequences sampled from Swiss-Prot and GenBank. Although only a small subset of these data was potentially phylogenetically informative, this subset retained a substantial fraction of the original taxonomic diversity. Sampling biases in the databases necessitate building phylogenetic data sets that have large numbers of missing entries. However, an analysis of two "supermatrices" suggests that even data sets with as much as 92% missing data can provide insights into broad sections of the tree of life.

1 Section of Evolution and Ecology, University of California, One Shields Avenue, Davis, CA 95616, USA.
2 Center for Population Biology, University of California, One Shields Avenue, Davis, CA 95616, USA.

† These authors contributed equally to this work.

‡ Present address: Department of Statistics, University of Wisconsin, Medical Science Center, 1300 University Avenue, Madison, WI 53706, USA.

* To whom correspondence should be addressed. E-mail: acdriskell@ucdavis.edu









Science. ISSN 0036-8075 (print), 1095-9203 (online)