Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.
Freiburg University

Site Tools

  • AAAS
  • Subscribe
  • Feedback

Site Search

Search Advanced

Science 28 February 2003:
Vol. 299. no. 5611, pp. 1391 - 1394
DOI: 10.1126/science.1081331

Reports

Phylogenetic Shadowing of Primate Sequences to Find Functional Regions of the Human Genome

Dario Boffelli,12 Jon McAuliffe,3 Dmitriy Ovcharenko,2 Keith D. Lewis,2 Ivan Ovcharenko,12 Lior Pachter,4 Edward M. Rubin12*

Nonhuman primates represent the most relevant model organisms to understand the biology of Homo sapiens. The recent divergence and associated overall sequence conservation between individual members of this taxon have nonetheless largely precluded the use of primates in comparative sequence studies. We used sequence comparisons of an extensive set of Old World and New World monkeys and hominoids to identify functional regions in the human genome. Analysis of these data enabled the discovery of primate-specific gene regulatory elements and the demarcation of the exons of multiple genes. Much of the information content of the comprehensive primate sequence comparisons could be captured with a small subset of phylogenetically close primates. These results demonstrate the utility of intraprimate sequence comparisons to discover common mammalian as well as primate-specific functional elements in the human genome, which are unattainable through the evaluation of more evolutionarily distant species.

1 U.S. Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA.
2 Department of Genome Sciences, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
3 Department of Statistics,
4 Department of Mathematics, University of California, Berkeley, CA 94720, USA.
*   To whom correspondence should be addressed. E-mail: emrubin{at}lbl.gov


Read the Full Text


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
Combining statistical alignment and phylogenetic footprinting to detect regulatory elements.
R. Satija, L. Pachter, and J. Hein (2008)
Bioinformatics 24, 1236-1242
   Abstract »    Full Text »    PDF »
Comparative Analysis of the MIR319a MicroRNA Locus in Arabidopsis and Related Brassicaceae.
N. Warthmann, S. Das, C. Lanz, and D. Weigel (2008)
Mol. Biol. Evol. 25, 892-902
   Abstract »    Full Text »    PDF »
Dietary Change and Adaptive Evolution of enamelin in Humans and Among Primates.
J. L. Kelley and W. J. Swanson (2008)
Genetics 178, 1595-1603
   Abstract »    Full Text »    PDF »
Amino acid polymorphisms in Arabidopsis phytochrome B cause differential responses to light.
D. L. Filiault, C. A. Wessinger, J. R. Dinneny, J. Lutes, J. O. Borevitz, D. Weigel, J. Chory, and J. N. Maloof (2008)
PNAS 105, 3157-3162
   Abstract »    Full Text »    PDF »
Confidence in comparative genomics.
E. H. Margulies (2008)
Genome Res. 18, 199-200
   Full Text »    PDF »
Qualifying the relationship between sequence conservation and molecular function.
G. M. Cooper and C. D. Brown (2008)
Genome Res. 18, 201-205
   Abstract »    Full Text »    PDF »
Large-Scale Appearance of Ultraconserved Elements in Tetrapod Genomes and Slowdown of the Molecular Clock.
S. Stephen, M. Pheasant, I. V. Makunin, and J. S. Mattick (2008)
Mol. Biol. Evol. 25, 402-408
   Abstract »    Full Text »    PDF »
Reliable prediction of regulator targets using 12 Drosophila genomes.
P. Kheradpour, A. Stark, S. Roy, and M. Kellis (2007)
Genome Res. 17, 1919-1931
   Abstract »    Full Text »    PDF »
Drosophila Biology in the Genomic Age.
T. A. Markow and P. M. O'Grady (2007)
Genetics 177, 1269-1276
   Abstract »    Full Text »    PDF »
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.
E. H. Margulies, G. M. Cooper, G. Asimenos, D. J. Thomas, C. N. Dewey, A. Siepel, E. Birney, D. Keefe, A. S. Schwartz, M. Hou, et al. (2007)
Genome Res. 17, 760-774
   Abstract »    Full Text »    PDF »
De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures.
K. L. S. Ng and S. K. Mishra (2007)
Bioinformatics 23, 1321-1330
   Abstract »    Full Text »    PDF »
Joint Estimates of Quantitative Trait Locus Effect and Frequency Using Synthetic Recombinant Populations of Drosophila melanogaster.
S. J. Macdonald and A. D. Long (2007)
Genetics 176, 1261-1281
   Abstract »    Full Text »    PDF »
Discovering transcriptional regulatory regions in Drosophila by a nonalignment method for phylogenetic footprinting.
A. Sosinsky, B. Honig, R. S. Mann, and A. Califano (2007)
PNAS 104, 6305-6310
   Abstract »    Full Text »    PDF »
Multiple sequence alignment: In pursuit of homologous DNA positions.
S. Kumar and A. Filipski (2007)
Genome Res. 17, 127-135
   Abstract »    Full Text »    PDF »
Th2 Cell-Selective Enhancement of Human IL13 Transcription by IL13-1112C>T, a Polymorphism Associated with Allergic Inflammation.
L. Cameron, R. B. Webster, J. M. Strempel, P. Kiesler, M. Kabesch, H. Ramachandran, L. Yu, D. A. Stern, P. E. Graves, I. C. Lohman, et al. (2006)
J. Immunol. 177, 8633-8642
   Abstract »    Full Text »    PDF »
Experimental validation of predicted mammalian erythroid cis-regulatory modules.
H. Wang, Y. Zhang, Y. Cheng, Y. Zhou, D. C. King, J. Taylor, F. Chiaromonte, J. Kasturi, H. Petrykowska, B. Gibb, et al. (2006)
Genome Res. 16, 1480-1492
   Abstract »    Full Text »    PDF »
Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques.
L. Elnitski, V. X. Jin, P. J. Farnham, and S. J.M. Jones (2006)
Genome Res. 16, 1455-1464
   Abstract »    Full Text »    PDF »
Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA.
N. Pierstorff, C. M. Bergman, and T. Wiehe (2006)
Bioinformatics 22, 2858-2864
   Abstract »    Full Text »    PDF »
Cis-regulatory Evolution of Chalcone-Synthase Expression in the Genus Arabidopsis.
J. de Meaux, A. Pop, and T. Mitchell-Olds (2006)
Genetics 174, 2181-2202
   Abstract »    Full Text »    PDF »
Large-Scale cis-Element Detection by Analysis of Correlated Expression and Sequence Conservation between Arabidopsis and Brassica oleracea.
G. Haberer, M. T. Mader, P. Kosarev, M. Spannagl, L. Yang, and K. F.X. Mayer (2006)
Plant Physiology 142, 1589-1602
   Abstract »    Full Text »    PDF »
Thematic review series: Systems Biology Approaches to Metabolic and Cardiovascular Disorders. Approaches to lipid metabolism gene identification and characterization in the postgenomic era.
K. Reue and L. Vergnes (2006)
J. Lipid Res. 47, 1891-1907
   Abstract »    Full Text »    PDF »
The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle..
T. Pramila, W. Wu, S. Miles, W. S. Noble, and L. L. Breeden (2006)
Genes & Dev. 20, 2266-2278
   Abstract »    Full Text »    PDF »
Evolutionary simulations to detect functional lineage-specific genes.
I. Dupanloup and H. Kaessmann (2006)
Bioinformatics 22, 1815-1822
   Abstract »    Full Text »    PDF »
Computational identification of transcriptional regulatory elements in DNA sequence.
D. GuhaThakurta (2006)
Nucleic Acids Res. 34, 3585-3598
   Abstract »    Full Text »    PDF »
Close sequence comparisons are sufficient to identify human cis-regulatory elements.
S. Prabhakar, F. Poulin, M. Shoukry, V. Afzal, E. M. Rubin, O. Couronne, and L. A. Pennacchio (2006)
Genome Res. 16, 855-863
   Abstract »    Full Text »    PDF »
POXO: a web-enabled tool series to discover transcription factor binding sites..
M. Kankainen, P. Pehkonen, P. Rosenstom, P. Toronen, G. Wong, and L. Holm (2006)
Nucleic Acids Res. 34, W534-W540
   Abstract »    Full Text »    PDF »
Short blocks from the noncoding parts of the human genome have instances within nearly all known genes and relate to biological processes.
I. Rigoutsos, T. Huynh, K. Miranda, A. Tsirigos, A. McHardy, and D. Platt (2006)
PNAS 103, 6605-6610
   Abstract »    Full Text »    PDF »
Activation of Zoosporogenesis-Specific Genes in Phytophthora infestans Involves a 7-Nucleotide Promoter Motif and Cold-Induced Membrane Rigidity..
S. Tani and H. Judelson (2006)
Eukaryot. Cell 5, 745-752
   Abstract »    Full Text »    PDF »
Genome-wide identification of direct targets of the Drosophila retinal determination protein Eyeless.
E. J. Ostrin, Y. Li, K. Hoffman, J. Liu, K. Wang, L. Zhang, G. Mardon, and R. Chen (2006)
Genome Res. 16, 466-476
   Abstract »    Full Text »    PDF »
Molecular Evolution of the Primate Developmental Genes MSX1 and PAX9.
G. H. Perry, B. C. Verrelli, and A. C. Stone (2006)
Mol. Biol. Evol. 23, 644-654
   Abstract »    Full Text »    PDF »
Identification of transposable elements using multiple alignments of related genomes.
A. Caspi and L. Pachter (2006)
Genome Res. 16, 260-270
   Abstract »    Full Text »    PDF »
Defining the mammalian CArGome.
Q. Sun, G. Chen, J. W. Streb, X. Long, Y. Yang, C. J. Stoeckert Jr., and J. M. Miano (2006)
Genome Res. 16, 197-207
   Abstract »    Full Text »    PDF »
Genome annotation past, present, and future: How to define an ORF at each locus.
M. R. Brent (2005)
Genome Res. 15, 1777-1786
   Abstract »    Full Text »    PDF »
Camels and zebrafish, viruses and cancer: a microRNA update.
E. Berezikov and R. H.A. Plasterk (2005)
Hum. Mol. Genet. 14, R183-R190
   Abstract »    Full Text »    PDF »
Using bioinformatics and genome analysis for new therapeutic interventions.
D. W. Mount and R. Pandey (2005)
Mol. Cancer Ther. 4, 1636-1643
   Abstract »    Full Text »    PDF »
Analysis of intronic conserved elements indicates that functional complexity might represent a major source of negative selection on non-coding sequences.
M. Sironi, G. Menozzi, G. P. Comi, R. Cagliani, N. Bresolin, and U. Pozzoli (2005)
Hum. Mol. Genet. 14, 2533-2546
   Abstract »    Full Text »    PDF »
Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences.
D. C. King, J. Taylor, L. Elnitski, F. Chiaromonte, W. Miller, and R. C. Hardison (2005)
Genome Res. 15, 1051-1060
   Abstract »    Full Text »    PDF »
Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.
A. Siepel, G. Bejerano, J. S. Pedersen, A. S. Hinrichs, M. Hou, K. Rosenbloom, H. Clawson, J. Spieth, L. W. Hillier, S. Richards, et al. (2005)
Genome Res. 15, 1034-1050
   Abstract »    Full Text »    PDF »
Dcode.org anthology of comparative genomic tools.
G. G. Loots and I. Ovcharenko (2005)
Nucleic Acids Res. 33, W56-W64
   Abstract »    Full Text »    PDF »
Distribution and intensity of constraint in mammalian genomic sequence.
G. M. Cooper, E. A. Stone, G. Asimenos, NISC Comparative Sequencing Program, E. D. Green, S. Batzoglou, and A. Sidow (2005)
Genome Res. 15, 901-913
   Abstract »    Full Text »    PDF »
Identification of regulatory targets of tissue-specific transcription factors: application to retina-specific gene regulation.
J. Qian, N. Esumi, Y. Chen, Q. Wang, I. Chowers, and D. J. Zack (2005)
Nucleic Acids Res. 33, 3479-3491
   Abstract »    Full Text »    PDF »
Subtree power analysis and species selection for comparative genomics.
J. D. McAuliffe, M. I. Jordan, and L. Pachter (2005)
PNAS 102, 7900-7905
   Abstract »    Full Text »    PDF »
Prospects for identifying functional variation across the genome.
S. J. Macdonald and A. D. Long (2005)
PNAS 102, 6614-6621
   Abstract »    Full Text »    PDF »
Identification of functional transcription factor binding sites using closely related Saccharomyces species.
S. W. Doniger, J. Huh, and J. C. Fay (2005)
Genome Res. 15, 701-709
   Abstract »    Full Text »    PDF »
Allele-Specific Assay Reveals Functional Variation in the Chalcone Synthase Promoter of Arabidopsis thaliana That Is Compatible with Neutral Evolution.
J. de Meaux, U. Goebel, A. Pop, and T. Mitchell-Olds (2005)
PLANT CELL 17, 676-690
   Abstract »    Full Text »    PDF »
Identifying Signatures of Selection at the Enhancer of split Neurogenic Gene Complex in Drosophila.
S. J. Macdonald and A. D. Long (2005)
Mol. Biol. Evol. 22, 607-619
   Abstract »    Full Text »    PDF »
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution.
S. Richards, Y. Liu, B. R. Bettencourt, P. Hradecky, S. Letovsky, R. Nielsen, K. Thornton, M. J. Hubisz, R. Chen, R. P. Meisel, et al. (2005)
Genome Res. 15, 1-18
   Abstract »    Full Text »    PDF »
DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants.
E. Barta, E. Sebestyen, T. B. Palfy, G. Toth, C. P. Ortutay, and L. Patthy (2005)
Nucleic Acids Res. 33, D86-D90
   Abstract »    Full Text »    PDF »
Mulan: Multiple-sequence local alignment and visualization for studying function and evolution.
I. Ovcharenko, G. G. Loots, B. M. Giardine, M. Hou, J. Ma, R. C. Hardison, L. Stubbs, and W. Miller (2005)
Genome Res. 15, 184-194
   Abstract »    Full Text »    PDF »
Uprobe: A genome-wide universal probe resource for comparative physical mapping in vertebrates.
W. A. Kellner, R. T. Sullivan, B. H. Carlson, NISC Comparative Sequencing Program, and J. W. Thomas (2005)
Genome Res. 15, 166-173
   Abstract »    Full Text »    PDF »
Reconstructing large regions of an ancestral mammalian genome in silico.
M. Blanchette, E. D. Green, W. Miller, and D. Haussler (2004)
Genome Res. 14, 2412-2423
   Abstract »    Full Text »    PDF »
Intraspecies sequence comparisons for annotating genomes.
D. Boffelli, C. V. Weer, L. Weng, K. D. Lewis, M. I. Shoukry, L. Pachter, D. N. Keys, and E. M. Rubin (2004)
Genome Res. 14, 2406-2411
   Abstract »    Full Text »    PDF »
Phylogenetic Analysis of 5'-Noncoding Regions From the ABA-Responsive rab16/17 Gene Family of Sorghum, Maize and Rice Provides Insight Into the Composition, Organization and Function of cis-Regulatory Modules.
C. D. Buchanan, P. E. Klein, and J. E. Mullet (2004)
Genetics 168, 1639-1654
   Abstract »    Full Text »    PDF »
Decoding Human Regulatory Circuits.
W. Thompson, M. J. Palumbo, W. W. Wasserman, J. S. Liu, and C. E. Lawrence (2004)
Genome Res. 14, 1967-1974
   Abstract »    Full Text »    PDF »
Large-scale sequencing of the CD33-related Siglec gene cluster in five mammalian species reveals rapid evolution by multiple mechanisms.
T. Angata, E. H. Margulies, E. D. Green, and A. Varki (2004)
PNAS 101, 13251-13256
   Abstract »    Full Text »    PDF »
ECR Browser: a tool for visualizing and accessing data from comparisons of multiple vertebrate genomes.
I. Ovcharenko, M. A. Nobrega, G. G. Loots, and L. Stubbs (2004)
Nucleic Acids Res. 32, W280-W286
   Abstract »    Full Text »    PDF »
Identifying Candidate Causal Variants Responsible for Altered Activity of the ABCB1 Multidrug Resistance Gene.
N. Soranzo, G. L. Cavalleri, M. E. Weale, N. W. Wood, C. Depondt, R. Marguerie, S. M. Sisodiya, and D. B. Goldstein (2004)
Genome Res. 14, 1333-1344
   Abstract »    Full Text »    PDF »
eShadow: A Tool for Comparing Closely Related Sequences.
I. Ovcharenko, D. Boffelli, and G. G. Loots (2004)
Genome Res. 14, 1191-1198
   Abstract »    Full Text »    PDF »
Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes.
G. Kreiman (2004)
Nucleic Acids Res. 32, 2889-2900
   Abstract »    Full Text »    PDF »
Identification of Transcription Factor Binding Sites Upstream of Human Genes Regulated by the Phosphatidylinositol 3-Kinase and MEK/ERK Signaling Pathways.
J. W. Tullai, M. E. Schaffer, S. Mullenbrock, S. Kasif, and G. M. Cooper (2004)
J. Biol. Chem. 279, 20167-20177
   Abstract »    Full Text »    PDF »
Comparison of Human Chromosome 21 Conserved Nongenic Sequences (CNGs) With the Mouse and Dog Genomes Shows That Their Selective Constraint Is Independent of Their Genic Environment.
E. T. Dermitzakis, E. Kirkness, S. Schwarz, E. Birney, A. Reymond, and S. E. Antonarakis (2004)
Genome Res. 14, 852-859
   Abstract »    Full Text »    PDF »
Genetic variation responsible for mouse strain differences in integrin {alpha}2 expression is associated with altered platelet responses to collagen.
T.-T. Li, S. Larrucea, S. Souza, S. M. Leal, J. A. Lopez, E. M. Rubin, B. Nieswandt, and P. F. Bray (2004)
Blood 103, 3396-3402
   Abstract »    Full Text »    PDF »
Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes.
G. M. Cooper, M. Brudno, E. A. Stone, I. Dubchak, S. Batzoglou, and A. Sidow (2004)
Genome Res. 14, 539-548
   Abstract »    Full Text »    PDF »
Identification of Evolutionary Hotspots in the Rodent Genomes.
V. B. Yap and L. Pachter (2004)
Genome Res. 14, 574-579
   Abstract »    Full Text »    PDF »
Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human.
M. Brudno, A. Poliakov, A. Salamov, G. M. Cooper, A. Sidow, E. M. Rubin, V. Solovyev, S. Batzoglou, and I. Dubchak (2004)
Genome Res. 14, 685-692
   Abstract »    Full Text »    PDF »
MAVID: Constrained Ancestral Alignment of Multiple Sequences.
N. Bray and L. Pachter (2004)
Genome Res. 14, 693-699
   Abstract »    Full Text »    PDF »
Visualization of Multiple Genome Annotations and Alignments With the K-BROWSER.
K. Chakrabarti and L. Pachter (2004)
Genome Res. 14, 716-720
   Abstract »    Full Text »    PDF »
Understanding Milk's Bioactive Components: A Goal for the Genomics Toolbox.
R. E. Ward and J. B. German (2004)
J. Nutr. 134, 962S-967S
   Abstract »    Full Text »    PDF »
Noncoding Sequences Conserved in a Limited Number of Mammals in the SIM2 Interval are Frequently Functional.
K. A. Frazer, H. Tao, K. Osoegawa, P. J. de Jong, X. Chen, M. F. Doherty, and D. R. Cox (2004)
Genome Res. 14, 367-372
   Abstract »    Full Text »    PDF »
Analysis of Multiple Genomic Sequence Alignments: A Web Resource, Online Tools, and Lessons Learned From Analysis of Mammalian SCL Loci.
M. A. Chapman, I. J. Donaldson, J. Gilbert, D. Grafham, J. Rogers, A. R. Green, and B. Gottgens (2004)
Genome Res. 14, 313-318
   Abstract »    Full Text »    PDF »
Linkage of Calpain 10 to Type 2 Diabetes: The Biological Rationale.
N. J. Cox, M. G. Hayes, C. A. Roe, T. Tsuchiya, and G. I. Bell (2004)
Diabetes 53, S19-25
   Abstract »    Full Text »
Birth and Evolutionary History of a Human Minisatellite.
F. Boan, M. G. Blanco, J. Quinteiro, S. Mourino, and J. Gomez-Marquez (2004)
Mol. Biol. Evol. 21, 228-235
   Abstract »    Full Text »    PDF »
JASPAR: an open-access database for eukaryotic transcription factor binding profiles.
A. Sandelin, W. Alkema, P. Engstrom, W. W. Wasserman, and B. Lenhard (2004)
Nucleic Acids Res. 32, D91-94
   Abstract »    Full Text »    PDF »
CONREAL: Conserved Regulatory Elements Anchored Alignment Algorithm for Identification of Transcription Factor Binding Sites by Phylogenetic Footprinting.
E. Berezikov, V. Guryev, R. H.A. Plasterk, and E. Cuppen (2004)
Genome Res. 14, 170-178
   Abstract »    Full Text »    PDF »
Comparative genomic analysis as a tool for biological discovery.
M. A. Nobrega and L. A. Pennacchio (2004)
J. Physiol. 554, 31-39
   Abstract »    Full Text »    PDF »
Identification and Characterization of Multi-Species Conserved Sequences.
E. H. Margulies, M. Blanchette, NISC Comparative Sequencing Program, D. Haussler, and E. D. Green (2003)
Genome Res. 13, 2507-2518
   Abstract »    Full Text »    PDF »
Evolutionary Discrimination of Mammalian Conserved Non-Genic Sequences (CNGs).
E. T. Dermitzakis, A. Reymond, N. Scamuffa, C. Ucla, E. Kirkness, C. Rossier, and S. E. Antonarakis (2003)
Science 302, 1033-1035
   Abstract »    Full Text »    PDF »
Report of the National Heart, Lung, and Blood Institute Workshop on Lipoprotein(a) and Cardiovascular Disease: Recent Advances and Future Directions.
S. M. Marcovina, M. L. Koschinsky, J. J. Albers, and S. Skarlatos (2003)
Clin. Chem. 49, 1785-1796
   Abstract »    Full Text »    PDF »
T Cell-Specific Expression of the Human TNF-{alpha} Gene Involves a Functional and Highly Conserved Chromatin Signature in Intron 3.
R. Barthel and A. E. Goldfeld (2003)
J. Immunol. 171, 3612-3619
   Abstract »    Full Text »    PDF »
MAVID multiple alignment server.
N. Bray and L. Pachter (2003)
Nucleic Acids Res. 31, 3525-3526
   Abstract »    Full Text »    PDF »
Regulatory Elements of the Floral Homeotic Gene AGAMOUS Identified by Phylogenetic Footprinting and Shadowing.
R. L. Hong, L. Hamaguchi, M. A. Busch, and D. Weigel (2003)
PLANT CELL 15, 1296-1309
   Abstract »    Full Text »
Conserved Noncoding Sequences among Cultivated Cereal Genomes Identify Candidate Regulatory Sequence Elements and Patterns of Promoter Evolution.
H. Guo and S. P. Moose (2003)
PLANT CELL 15, 1143-1158
   Abstract »    Full Text »



ADVERTISEMENT
Click Me!

ADVERTISEMENT
Click Me!

To Advertise     Find Products


AAAS Logo HWP Logo

Magazine  |  News  |  Signaling  |  Careers  |  Multimedia  |  Collections  |  Help  |  Site Map  |  RSS