The human genome browser at UCSC

doi:10.1101/gr.229102

. 2002 Jun;12(6):996-1006.

doi: 10.1101/gr.229102.

The human genome browser at UCSC

W James Kent¹, Charles W Sugnet, Terrence S Furey, Krishna M Roskin, Tom H Pringle, Alan M Zahler, David Haussler

Affiliations

PMID: 12045153
PMCID: PMC186604
DOI: 10.1101/gr.229102

The human genome browser at UCSC

W James Kent et al. Genome Res. 2002 Jun.

. 2002 Jun;12(6):996-1006.

doi: 10.1101/gr.229102.

Authors

W James Kent¹, Charles W Sugnet, Terrence S Furey, Krishna M Roskin, Tom H Pringle, Alan M Zahler, David Haussler

Affiliation

¹ Department of Molecular, Cellular, and Developmental Biology, University of California, Santa Cruz, CA 95064, USA. kent@biology.ucsc.edu

PMID: 12045153
PMCID: PMC186604
DOI: 10.1101/gr.229102

Abstract

As vertebrate genome sequences near completion and research refocuses to their analysis, the issue of effective genome annotation display becomes critical. A mature web tool for rapid and reliable display of any requested portion of the genome at any scale, together with several dozen aligned annotation tracks, is provided at http://genome.ucsc.edu. This browser displays assembly contigs and gaps, mRNA and expressed sequence tag alignments, multiple gene predictions, cross-species homologies, single nucleotide polymorphisms, sequence-tagged sites, radiation hybrid data, transposon repeats, and more as a stack of coregistered tracks. Text and sequence-based searches provide quick and precise access to any region of specific interest. Secondary links from individual features lead to sequence details and supplementary off-site databases. One-half of the annotation tracks are computed at the University of California, Santa Cruz from publicly available sequence data; collaborators worldwide provide the rest. Users can stably add their own custom tracks to the browser for educational or research purposes. The conceptual and technical framework of the browser, its underlying MYSQL database, and overall use are described. The web site currently serves over 50,000 pages per day to over 3000 different users.

PubMed Disclaimer

Figures

**Figure 1**
Part of the *HOXA* cluster as viewed in the University of California, Santa Cruz (UCSC) genome browser. The shortcut bar in blue provides quick access to BLAT searches, the DNA sequence, the annotations as text tables, earlier or later assemblies the genome, the corresponding NCBI and Ensembl views, and the user's guide. The controls directly beneath position the browser over a specific region in the genome. The large white picture in the middle displays various annotations. At the bottom are controls for fine-tuning the display and for the individual tracks. Only the first 15 of 31 available tracks are shown here. This region contains three known genes that are all transcribed on the reverse strand as indicated by the arrowheads in the introns. Note the alternative splicing of *HOXA1* in the Human RNA track. The Spliced EST track indicates that there is active transcription of a region between *HOXA1* and *HOXA2*. Expressed sequence tag evidence for the presence of additional nonannotated genes in well studied regions like this often can be found using the UCSC browser. The Mouse Blat track indicated a high level of conservation between mouse and human in this region. Both the Mouse Blat and the Exofish ecores are based on translated alignments, but in highly conserved regions such as this it is not unusual for even translated alignments to paint conserved noncoding regions. The noncoding regions have diverged considerably more between human and pufferfish than between human and mouse.

**Figure 2**
All of chromosome 17. Generally, people work at smaller scales than this, but the browser is capable of displaying all of the annotations on a chromosome in a reasonable time. The centromere is depicted in red in the chromosome band track. The coverage track shows finished regions in black and draft regions in various shades of gray depending on the depth of coverage. There are two large gene deserts in chromosome bands q22 and q24.3. Tracks based on mRNAs, ESTs, and homology with *Tetraodon* all are quite sparse in these regions, though there is still quite a bit of mouse homology.

**Figure 3**
Chromosome 17 band q21.32. This region spans several million bases and is covered by a mix of finished and draft clones. The large blocks in the gap track indicate gaps between clones, while the small ticks indicate gaps within draft clones. Where there is evidence for the relative order and orientation of the contigs on either side of a gap, a white line is drawn though the gap. Most of the contigs in this region are ordered. At this scale, it is possible to resolve most individual genes but not necessarily individual exons.

**Figure 4**
One million bases in the middle of 17q21.32. This is a scale frequently used when trying to positionally clone a gene. Many of the genes in this region are already known, but the EST, mouse, and fish homology evidence suggest the presence of additional genes as well, particularly between *ITGB3* and *NPEPPS.*

**Figure 5**
A known gene and an unknown gene or two. *ITGB3*, the integrin β chain, β 3 precursor is on the left. To the right is a relatively small gene, C17001176, predicted by the Fgenesh++ program, which is supported by mouse and fish homology. Between *ITGB3* and C17001176 is a region quite likely to contain another gene judging by the EST and mouse homology evidence.

**Figure 6**
Details page on the known gene VLDLR.

**Figure 7**
Binning scheme for optimizing database accesses for genomic annotations that cover a particular region of the genome. This diagram shows bins of three different sizes. Features are put in the smallest bin in which they fit. A feature covering the range indicated by line A would go in bin 1. Similarly, line B goes in bin 4 and line C in bin 20. When the browser needs to access features in a region, it must look in bins of all different sizes. To access all the features that overlapped or were enclosed by line A, the browser looks in bins 1, 2, 3, 7, 8, 9, 10, and 11. For B the browser looks in bins 1, 4, 14, 15, 16, 17. For C, the browser looks in bins 1, 5, and 20.

See this image and copyright information in PMC

Cited by

Liver Characterization of a Cohort of Alpha-1 Antitrypsin Deficiency Patients with and without Lung Disease.
Mohammad N, Oshins R, Gu T, Clark V, Lascano J, Assarzadegan N, Marek G, Brantly M, Khodayari N. Mohammad N, et al. J Clin Transl Hepatol. 2024 Oct 28;12(10):845-856. doi: 10.14218/JCTH.2024.00201. Epub 2024 Sep 14. J Clin Transl Hepatol. 2024. PMID: 39440224 Free PMC article.
Gut microbiome impact on childhood allergic rhinitis and house dust mite IgE responses.
Li J, Shen N, He W, Pan Y, Wu J, Zhao R, Mo X, Li Y. Li J, et al. Pediatr Res. 2024 Oct 21. doi: 10.1038/s41390-024-03645-y. Online ahead of print. Pediatr Res. 2024. PMID: 39433961
Genetic association between germline JAK2 polymorphisms and myeloproliferative neoplasms in Hong Kong Chinese population: a case-control study.
Koh SP, Yip SP, Lee KK, Chan CC, Lau SM, Kho CS, Lau CK, Lin SY, Lau YM, Wong LG, Au KL, Wong KF, Chu RW, Yu PH, Chow EY, Leung KF, Tsoi WC, Yung BY. Koh SP, et al. BMC Genet. 2014 Dec 20;15:147. doi: 10.1186/s12863-014-0147-y. BMC Genet. 2014. PMID: 25526816 Free PMC article.
CCTop: An Intuitive, Flexible and Reliable CRISPR/Cas9 Target Prediction Tool.
Stemmer M, Thumberger T, Del Sol Keyer M, Wittbrodt J, Mateo JL. Stemmer M, et al. PLoS One. 2015 Apr 24;10(4):e0124633. doi: 10.1371/journal.pone.0124633. eCollection 2015. PLoS One. 2015. PMID: 25909470 Free PMC article.
Phylogenomics of strongylocentrotid sea urchins.
Kober KM, Bernardi G. Kober KM, et al. BMC Evol Biol. 2013 Apr 23;13:88. doi: 10.1186/1471-2148-13-88. BMC Evol Biol. 2013. PMID: 23617542 Free PMC article.

See all "Cited by" articles

References

1. Benson DA, Boguski MS, Lipman DJ, Ostell J, Ouellette BF, Rapp BA, Wheeler DL. GenBank. Nucleic Acids Res. 1999;27:12–17. - PMC - PubMed
1. Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. - PMC - PubMed
1. Birney E, Bateman A, Clamp ME, Hubbard TJ. Mining the draft human genome. Nature. 2001;409:827–828. - PMC - PubMed
1. Birney E, Durbin R. Dynamite: A flexible code generating language for dynamic programming methods used in sequence comparison. Ismb. 1997;5:56–64. - PubMed
1. Broman KW, Murray JC, Sheffield VC, White RL, Weber JL. Comprehensive human genetic maps: Individual and sex-specific variation in recombination. Am J Hum Genet. 1998;63:861–869. - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Molecular Biology Databases
- NIAID Data Ecosystem - Find datasets on Infectious and Immune-mediated Diseases
Research Materials
- NCI CPTC Antibody Characterization Program

[1] Benson DA, Boguski MS, Lipman DJ, Ostell J, Ouellette BF, Rapp BA, Wheeler DL. GenBank. Nucleic Acids Res. 1999;27:12–17. - PMC - PubMed

[2] Benson DA, Boguski MS, Lipman DJ, Ostell J, Ouellette BF, Rapp BA, Wheeler DL. GenBank. Nucleic Acids Res. 1999;27:12–17. - PMC - PubMed

[3] Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. - PMC - PubMed

[4] Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. - PMC - PubMed

[5] Birney E, Bateman A, Clamp ME, Hubbard TJ. Mining the draft human genome. Nature. 2001;409:827–828. - PMC - PubMed

[6] Birney E, Bateman A, Clamp ME, Hubbard TJ. Mining the draft human genome. Nature. 2001;409:827–828. - PMC - PubMed

[7] Birney E, Durbin R. Dynamite: A flexible code generating language for dynamic programming methods used in sequence comparison. Ismb. 1997;5:56–64. - PubMed

[8] Birney E, Durbin R. Dynamite: A flexible code generating language for dynamic programming methods used in sequence comparison. Ismb. 1997;5:56–64. - PubMed

[9] Broman KW, Murray JC, Sheffield VC, White RL, Weber JL. Comprehensive human genetic maps: Individual and sex-specific variation in recombination. Am J Hum Genet. 1998;63:861–869. - PMC - PubMed

[10] Broman KW, Murray JC, Sheffield VC, White RL, Weber JL. Comprehensive human genetic maps: Individual and sex-specific variation in recombination. Am J Hum Genet. 1998;63:861–869. - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The human genome browser at UCSC

Affiliation

The human genome browser at UCSC

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Research Materials

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Research Materials