Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.
Click Me!

Site Tools

  • AAAS
  • Subscribe
  • Feedback

Site Search

Search Advanced

Science 6 July 2001:
Vol. 293. no. 5527, pp. 104 - 111
DOI: 10.1126/science.1060310


Abstract
Full Text
Human Chromosome 19 and Related Regions in Mouse: Conservative and Lineage-Specific Evolution
Paramvir Dehal, Paul Predki, Anne S. Olsen, Art Kobayashi, Peg Folta, Susan Lucas, Miriam Land, Astrid Terry, Carol L. Ecale Zhou, Sam Rash, Qing Zhang, Laurie Gordon, Joomyeong Kim, Christopher Elkin, Martin J. Pollard, Paul Richardson, Dan Rokhsar, Ed Uberbacher, Trevor Hawkins, Albert Branscomb, and Lisa Stubbs

Supplementary Material

Web table A. Ordered Catalog of known and predicted HSA19 genes and mouse orthologs

Genes identified or predicted in the assembled HSA19 sequence are listed in order, from the HSA19p centromere to the HSA19q telomere. "PS" in column 1 indicates that the gene in the corresponding row is a known pseudogene. Pseudogenes are listed only if they are known to be non-coding members of specific gene families or if they correspond to loci included in reference gene sets. Entries in column 2 identify genes that correspond to Refseq [R] or human locus link [LL] entries; locus link entries that have been assigned to HSA19 [LL-19], or genes corresponding to mouse locus link entries only [LL-MX, where X is the mouse chromosome location] are also indicated. Sequenced HSA19 clones that contain each gene are listed in column 3 with corresponding Genbank accession numbers listed in column 4. Gene names, listed in column 5, correspond to officially assigned RefSeq, Locus link, or unigene cluster names where available; alternative names are shown in parentheses in some cases. Genes with significant identities to named mouse, but not human genes are listed with "Mus" before the gene name. The suffix "-like" (e.g. HSTPCR86-like) refers to genes that are highly similar to, but not identical to, named sequence entries in public databases. "No identity" refers to loci without official ID or significant similarities, but with at least one high probability EST match. "Unknown" in column 5 identifies genes predicted on the basis of clustered mouse conserved sequence elements without significant matches to sequence entries in public cDNA databases. Unnamed genes that are members of clustered gene families, including zinc finger genes (ZNF), olfactory receptor (OR), Cytochrome p450 (Cyp2), pregnancy specific glycoprotein/ carcinoembryonic antigen (PSG, CEACAM), sialic acid glycoprotein (SIGLEC) are listed in column 5 by these family identifiers only; family symbols are also included in parentheses next to genes with names that do not clearly define the gene's affiliation. Plate-row-well address of mouse BAC clones that contain related genes are listed in columns 7 and 8; clone names in italics indicate that, presumably due to evolutionary rearrangements (e.g. inversions, transpositions, gene duplications), mouse clones listed are derived from a different mouse chromosomal location than expected from known syntenic homology relationships . Names and accession numbers of mouse clones that have been sequenced by other sequencing groups are underlined; these clone names are included specifically if their sequence fills gaps in our coverage or provides finished sequence to complement JGI draft sequence. Unless otherwise noted mouse clone numbers refer to plate-row-well addresses from the RPCI-23 BAC Library; plate-row-well addresses of mouse clones from other libraries are preceded by an identifying prefix (RG-MBAC, Caltech/Research Genetics 129/Sv BAC library ; RP21, Roswell Park 129/Sv PAC library ). Genbank accession number for each mouse BAC is shown in column 7. "NC" in column 7 indicates that the corresponding mouse region has not been cloned or sequenced. Boxes in column 6 mark positions of ZNF and OR clusters, as summarized in Table 2 of the text. Detailed information about genes in each family and mouse counterparts are given in WebTables C, and D (ZNF genes) and G (OR genes). Since orthologous relationships do not exist within most of these clustered families, mouse clones that contain syntenically homologous genes are boxed in columns 7 and 8. A more detailed version of this table, with direct links to DNA sequence, regional displays, supportive evidence for predicted genes and other information, is available on the project website (http://bahama.jgi-psf.org/pub/ch19). This web-based catalog will be updated regularly as HSA19 and mouse sequence is completed and assemblies are revised.


Web Table B: Summary of repeat content in human chromosome 19 and related mouse sequence

Human Chromosome 19
Complex Repeats
Homologous Mouse BACs
Complex Repeats
No. of elementsCoverage (bp)Coverage %No. of elementsCoverage (bp)Coverage %
SINEs597041491508627.55%SINEs54252676286112.49%
SINE/Alu529751404581025.94%SINE/Alu------
SINE/B1------SINE/B12765531675175.85%
SINE/B2------SINE/B2-B42178232240105.96%
SINE/IDs------SINE/IDs35762504450.46%
SINE/MIR67298692761.61%SINE/MIR12291208890.22%
LINEs13440659580512.18%LINEs747650767399.38%
LINE/L1833853729659.92%LINE/L1671349710369.18%
LINE/L2496112037982.22%LINE/L27471040310.19%
LINE/L3/CR1141190420.04%LINE/L3/CR11616720.00%
LTR Elements84484342338.02%LTR Elements1093244773308.27%
LTR/MaLR345911566122.17%LTR/MaLR571415678332.89%
LTR/ERVL12164736540.87%LTR/ERVL3372608000.48%
LTR/ERV class I328221273323.93%LTR/ERV class I6144686230.87%
LTR/ERV class II4655742181.06%LTR/ERV class II190113934712.57%
DNA Elements469110033971.85%DNA Elements12922034810.38%
DNA/MER1_type31025859351.08%DNA/MER1_type8981406810.25%
DNA/MER2_type8452931770.54%DNA/MER2_type342565380.10%
Total Interspersed864122901148549.89%Total Interspersed742691661750030.69%
Unclassified1291548640.29%Unclassified317970890.18%
Small RNA208237330.04%Small RNA349280100.05%
Simple Repeats90226698771.24%Simple Repeats1965711433932.11%
Satellites10898116881.50%Satellites9301012980.19%
Low Complexity58534039840.75%Low Complexity66704339100.80%
Total Repeats-- 2889711053.37%Total Repeats--1832236238.55%
Percent GC----47.86%Percent GC----46.41%


Web table C.

Web table D.

Web table E.

Web table F.

Web table G.

Web table H.


Supplemental Figure 1. Web fig. A. Gene duplication at the breakpoint of homology segments VII and VIII. A. Map at bottom of the figure represents genes located at the border of homology segments VII and VIII in HSA19p13.1; top line represents order of genes in human chromosome 21q22.3. Human genes are represented by gene names below or above the human maps, arrows indicate transcriptional direction of each gene. Between the two human chromosome maps are figures that show the order of related genes, with names above each map, in mouse BAC clones located at the ends of homology segment VIII (Mmu10; red line) and VIII (Mmu17, blue line), respectively. Mouse clone maps are arranged to illustrate homologies to the two human chromosomes. Plate-row-well address of the mouse clones is given below each line by boxed numbers. Both mouse BACs were selected for homology to HSA19p13.1 sequences but cross over the region of HSA19 homology into mouse DNA that is related to adjacent segments of H21q22.3. Shaded region highlights the position of gene duplications identified at breakpoint sites on both mouse chromosomes. Shaded boxes are placed over pseudogene copies or gene fragments. The position of single VR2 pseudogene, the counterpart of which is not found in HSA19 or H21 DNA, is represented by the family symbol in a shaded box.


Medium version | Full size version





ADVERTISEMENT
Click Me!

ADVERTISEMENT

To Advertise     Find Products