Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.

Site Tools

  • AAAS
  • Subscribe
  • Feedback

Site Search

Search Advanced

Science 12 September 2003:
Vol. 301. no. 5639, p. 1482
DOI: 10.1126/science.1086558

Technical Comments

Comment on "Hexapod Origins: Monophyletic or Paraphyletic?"

Nardi et al. (1) suggested, rather cautiously, that hexapods (insects plus collembolans in their data set) might be a diphyletic rather than a monophyletic group. According to their interpretation, collembolans evolved separately from other insects and emerged before crustaceans. This unexpected result has huge consequences for the interpretation of both morphological and developmental evolution in arthropods (2) and therefore deserves further scrutiny— especially from a methodological standpoint.

Nardi et al. drew their conclusions from maximum likelihood and Bayesian analyses at the amino acid level of four of the 13 mitochondrial proteins for both the original 35-taxon data set and a 15-taxon subset. However, phylogenetic analyses of amino acids carry several potential caveats. First, the currently available models of mitochondrial amino acid substitution are based on empirically deduced matrices from mammalian-dominated sequence databases. Second, the maximum likelihood analysis used in (1) does not model the variation of rate across sites, which is known to be one of the most important parameters of the likelihood model (3). Third, bias in nucleotide composition also affects the amino acid composition of the gene product, thereby causing potential problems for phylogenetic reconstruction (4).

Some of these pitfalls might be avoided by analyzing nucleotide sequences for which more realistic models of sequence evolution and powerful reconstruction methods are available. In particular, we have recently shown that in the case of mammalian complete mitochondrial genomes (5), it is possible to deal with saturation and base composition heterogeneity by recoding nucleotides as purines (R) and pyrimidines (Y). This approach provided a solution to longstanding controversies concerning the position of the root of the mammalian tree (5).

Applying this strategy to nucleotides from the original Nardi et al. data set strongly suggests that by correcting for different artefacts it is possible to extract a useful historical signal. Unlike Nardi et al., we were able to place the honeybee (Apis) and louse (Heterodoxus) within Insecta (Fig. 1). The artefactual position of these taxa as sister-groups of ticks in (1) was explained as being a consequence of high shared AT nucleotide composition in the mitochondrial genome sequences. From our results, base composition heterogeneity seems to be more easily accommodated in phylogenetic reconstructions using nucleotides. More importantly, our analysis conforms to classical views of arthropod phylogeny: Collembolans are a sister group of insects, and these monophyletic hexapods group with crustaceans into Pancrustacea (Fig. 1). One remaining problem with this tree concerns the paraphyly of myriapods induced by the nesting of the centipede (Lithobius) inside chelicerates.


 Fig. 1. Bayesian 50% majority rule consensus tree with associated branch lengths obtained using nucleotide sequences of cox1, cox2, cox3, and cytb (3750 sites) corresponding to the 35-taxon data set in (1). The first and third codon positions were RY-coded, whereas second codon positions were kept as nucleotides. MrBayes version 3.0b4 (12) was used to perform a partitioned-likelihood Bayesian search in which three independent substitution models were attributed to each codon position—a two-state substitution model + I + {Gamma} for RY-coded first and third codon positions, and a GTR + I + model for second codon position nucleotides. Four incrementally heated Metropolis-coupled Markov chain Monte Carlo (MCMCMC) were run for 500,000 generations, sampling trees and parameters every 10 generations. The consensus tree was obtained from the 35,000 trees sampled after the initial burn-in period. Values at nodes indicate Bayesian posterior probabilities (* = 1.00). Note that the terminal branch lengths leading to the bee (Apis) and louse (Heterodoxus) have been reduced by a factor of three. Underlined taxa are not included in the 25-taxon data set. [View Larger Version of this Image (25K GIF file)]
 

As noted in (1), the phylogenetic analysis performed on the 35-taxon data set indicates uneven rates of evolution among taxa, making it difficult to draw firm conclusions about relationships between lineages. To test the collembolan position further, Nardi et al. reduced the data set to 15 taxa with more homogeneous evolution rates and amino acid compositions. Despite their conservative analysis, they still reported collembolans outside both insects and crustaceans, rendering hexapods diphyletic. However, such a reduced data set is particularly prone to systematic biases from low taxon sampling (6). Although deleting taxa with anomalous rates and base composition can be helpful, care must be taken not to delete taxa that could leave isolated branches and lead to a "long branches attract" phenomenon (7). More specifically, the inclusion of a single outgroup can have a strong impact on phylogenetic reconstruction, even in the absence of rate heterogeneity (8). In the case of placental mammal mitogenomics, taxon sampling has been shown to be a major source of phylogenetic error (9), and we found that increasing the number and diversity of taxa produced excellent agreement between nuclear and mitochondrial sequence data (10).

To maximize taxon sampling, we constructed a well-balanced 25-taxon data set designed to break isolated long branches (especially in the outgroup) without adding strong rate heterogeneity. Phylogenetic analyses of this nucleotide data set, including RY-coded third codon positions, produced a tree in which Arthropoda, Pancrustacea, Hexapoda, Insecta, and Pterygota all appear as monophyletic groups, though with variable support (Fig. 2). Moreover, this topology is much more compatible with current views of arthropod phylogeny (11). The probability of randomly selecting a topology compatible with this prior hypothesis is so small (10) that it provides strong evidence in favor of its veracity. Obviously, additional complete mitochondrial genomes are needed to strengthen the tree further. However, with the data and methods currently available, the hypothesis of a common ancestry for extant hexapods cannot be rejected.


 Fig. 2. Maximum likelihood (ML) phylogram obtained using nucleotide sequences of cox1, cox2, cox3, and cytb for a 25-taxon data set (3777 sites). The third codon positions were RY-coded, whereas first and second codon positions were kept as nucleotides. PAUP* (13) was used to perform a ML heuristic search under the best fitting GTR + I + {Gamma} model and associated ML estimates of parameters as determined by Modeltest version 3.06 (14). A partitioned-likelihood Bayesian search was carried out with MrBayes (12) using a GTR + I + {Gamma} model for first and second codon position nucleotides and a two-state substitution model + I + {Gamma} for the RY-coded third codon positions, with the same parameter settings as in Fig. 1. Values at nodes indicate ML bootstrap proportions (100 replications)/Bayesian posterior probabilities. The two collembolans are figured in bold. [View Larger Version of this Image (21K GIF file)]
 

Frédéric Delsuc
Matthew J. Phillips
David Penny

The Allan Wilson Center for Molecular
Ecology and Evolution
Institute of Molecular BioSciences,
Science Tower D
Massey University
Post Office Box 11-222
Palmerston North, New Zealand
E-mail: D.Penny{at}massey.ac.nz


References and Notes


Received for publication 7 May 2003. Accepted for publication 15 August 2003.



THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
Mitochondrial Genomes and Avian Phylogeny: Complex Characters and Resolvability without Explosive Radiations.
G. C. Gibb, O. Kardailsky, R. T. Kimball, E. L. Braun, and D. Penny (2007)
Mol. Biol. Evol. 24, 269-280
   Abstract »    Full Text »    PDF »
Ancestry of neuronal monoamine transporters in the Metazoa.
S. Caveney, W. Cladman, L. Verellen, and C. Donly (2006)
J. Exp. Biol. 209, 4858-4868
   Abstract »    Full Text »    PDF »
Early Penguin Fossils, Plus Mitochondrial Genomes, Calibrate Avian Evolution.
K. E. Slack, C. M. Jones, T. Ando, G. L. Harrison, R. E. Fordyce, U. Arnason, and D. Penny (2006)
Mol. Biol. Evol. 23, 1144-1155
   Abstract »    Full Text »    PDF »
Southern Hemisphere Springtails: Could Any Have Survived Glaciation of Antarctica?.
M. I. Stevens, P. Greenslade, I. D. Hogg, and P. Sunnucks (2006)
Mol. Biol. Evol. 23, 874-882
   Abstract »    Full Text »    PDF »
The Phylogenetic Positions of Three Basal-Hexapod Groups (Protura, Diplura, and Collembola) Based on Ribosomal RNA Gene Sequences.
Y.-x. Luan, J. M. Mallatt, R.-d. Xie, Y.-m. Yang, and W.-y. Yin (2005)
Mol. Biol. Evol. 22, 1579-1592
   Abstract »    Full Text »    PDF »
Mitochondrial genomes suggest that hexapods and crustaceans are mutually paraphyletic.
C. E Cook, Q. Yue, and M. Akam (2005)
Proc R Soc B 272, 1295-1304
   Abstract »    Full Text »    PDF »
Evidence for Multiple Reversals of Asymmetric Mutational Constraints during the Evolution of the Mitochondrial Genome of Metazoa, and Consequences for Phylogenetic Inferences.
A. Hassanin, N. Leger, and J. Deutsch (2005)
Syst Biol 54, 277-298
   Abstract »    Full Text »    PDF »
Visualizing Conflicting Evolutionary Hypotheses in Large Collections of Trees: Using Consensus Networks to Study the Origins of Placentals and Hexapods.
B. R. Holland, F. Delsuc, V. Moulton, and A. Baker (2005)
Syst Biol 54, 66-76
   Abstract »    Full Text »    PDF »
Genome-Scale Phylogeny and the Detection of Systematic Biases.
M. J. Phillips, F. Delsuc, and D. Penny (2004)
Mol. Biol. Evol. 21, 1455-1458
   Abstract »    Full Text »    PDF »
Four New Avian Mitochondrial Genomes Help Get to Basic Evolutionary Questions in the Late Cretaceous.
G. L. (A. Harrison, P. A. McLenachan, M. J. Phillips, K. E. Slack, A. Cooper, and D. Penny (2004)
Mol. Biol. Evol. 21, 974-983
   Abstract »    Full Text »    PDF »



To Advertise     Find Products

ADVERTISEMENT

Featured Jobs

Science. ISSN 0036-8075 (print), 1095-9203 (online)