Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.


Science 20 June 2008:
Vol. 320. no. 5883, pp. 1632 - 1635
DOI: 10.1126/science.1158395

Reports

Phylogeny-Aware Gap Placement Prevents Errors in Sequence Alignment and Evolutionary Analysis

Ari Löytynoja* and Nick Goldman

Genetic sequence alignment is the basis of many evolutionary and comparative studies, and errors in alignments lead to errors in the interpretation of evolutionary information in genomes. Traditional multiple sequence alignment methods disregard the phylogenetic implications of gap patterns that they create and infer systematically biased alignments with excess deletions and substitutions, too few insertions, and implausible insertion-deletion–event histories. We present a method that prevents these systematic errors by recognizing insertions and deletions as distinct evolutionary events. We show theoretically and practically that this improves the quality of sequence alignments and downstream analyses over a wide range of realistic alignment problems. These results suggest that insertions and sequence turnover are more common than is currently thought and challenge the conventional picture of sequence evolution and mechanisms of functional and structural changes.

European Molecular Biology Laboratory—European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK.

* To whom correspondence should be addressed. E-mail: ari{at}ebi.ac.uk

Read the Full Text


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
Reproducing the manual annotation of multiple sequence alignments using a SVM classifier.
C. Blouin, S. Perry, A. Lavell, E. Susko, and A. J. Roger (2009)
Bioinformatics 25, 3093-3098
   Abstract »    Full Text »    PDF »
Evolutionary Trajectories of Primate Genes Involved in HIV Pathogenesis.
M. Ortiz, N. Guex, E. Patin, O. Martin, I. Xenarios, A. Ciuffi, L. Quintana-Murci, and A. Telenti (2009)
Mol. Biol. Evol. 26, 2865-2875
   Abstract »    Full Text »    PDF »
eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations.
J. Muller, D. Szklarczyk, P. Julien, I. Letunic, A. Roth, M. Kuhn, S. Powell, C. von Mering, T. Doerks, L. J. Jensen, et al. (2009)
Nucleic Acids Res.
   Abstract »    Full Text »    PDF »
Upcoming challenges for multiple sequence alignment methods in the high-throughput era.
C. Kemena and C. Notredame (2009)
Bioinformatics 25, 2455-2465
   Abstract »    Full Text »    PDF »
Assessment of Microbial Communities by Graph Partitioning in a Study of Soil Fungi in Two Alpine Meadows.
L. Zinger, E. Coissac, P. Choler, and R. A. Geremia (2009)
Appl. Envir. Microbiol. 75, 5863-5870
   Abstract »    Full Text »    PDF »
'Candidatus Liberibacter solanacearum', associated with plants in the family Solanaceae.
L. W. Liefting, B. S. Weir, S. R. Pennycook, and G. R. G. Clover (2009)
Int J Syst Evol Microbiol 59, 2274-2276
   Abstract »    Full Text »    PDF »
A Machine-Learning Approach Reveals That Alignment Properties Alone Can Accurately Predict Inference of Lateral Gene Transfer from Discordant Phylogenies.
M. Roettger, W. Martin, and T. Dagan (2009)
Mol. Biol. Evol. 26, 1931-1939
   Abstract »    Full Text »    PDF »
Uniting Alignments and Trees.
A. Loytynoja and N. Goldman (2009)
Science 324, 1528-1529
   Abstract »    Full Text »    PDF »
Sequence progressive alignment, a framework for practical large-scale probabilistic consistency alignment.
B. Paten, J. Herrero, K. Beal, and E. Birney (2009)
Bioinformatics 25, 295-301
   Abstract »    Full Text »    PDF »
Toward Resolving Deep Neoaves Phylogeny: Data, Signal Enhancement, and Priors.
R. C. Pratt, G. C. Gibb, M. Morgan-Richards, M. J. Phillips, M. D. Hendy, and D. Penny (2009)
Mol. Biol. Evol. 26, 313-326
   Abstract »    Full Text »    PDF »
Origin of the Genetic Components of the Vomeronasal System in the Common Ancestor of all Extant Vertebrates.
W. E. Grus and J. Zhang (2009)
Mol. Biol. Evol. 26, 407-419
   Abstract »    Full Text »    PDF »
Problems and Solutions for Estimating Indel Rates and Length Distributions.
R. A. Cartwright (2009)
Mol. Biol. Evol. 26, 473-480
   Abstract »    Full Text »    PDF »
Genome-wide nucleotide-level mammalian ancestor reconstruction.
B. Paten, J. Herrero, S. Fitzgerald, K. Beal, P. Flicek, I. Holmes, and E. Birney (2008)
Genome Res. 18, 1829-1843
   Abstract »    Full Text »    PDF »



To Advertise     Find Products


Science. ISSN 0036-8075 (print), 1095-9203 (online)