The ENCODE project at UC Santa Cruz

Daryl J. Thomas, Kate R. Rosenbloom, Hiram Clawson, Angie S. Hinrichs, Heather Trumbower, Brian J. Raney, Donna Karolchik, Galt P. Barber, Rachel A. Harte, Jennifer Hillman-Jackson, Robert M. Kuhn, Brooke L. Rhead, Kayla E. Smith, Archana Thakkapallayil, Ann S. Zweig, David Haussler, W. James Kent

Research output: Contribution to journal › Article

69 Citations (Scopus)

Abstract

The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.

Original language: English (US)
Pages (from-to): D663-D667
Journal: Nucleic acids research
Volume: 35
Issue number: SUPPL. 1
DOIs: https://doi.org/10.1093/nar/gkl1017
State: Published - Jan 1 2007

Fingerprint

Encyclopedias
DNA
Human Genome
Untranslated RNA
Pseudogenes
Replication Origin
Sequence Alignment
Chromatin Immunoprecipitation
Deoxyribonuclease I
Hypersensitivity
Research Personnel
Genome
Databases
Genes

All Science Journal Classification (ASJC) codes

  • Genetics

Cite this

Thomas, D. J., Rosenbloom, K. R., Clawson, H., Hinrichs, A. S., Trumbower, H., Raney, B. J., ... Kent, W. J. (2007). The ENCODE project at UC Santa Cruz. Nucleic acids research, 35(SUPPL. 1), D663-D667. https://doi.org/10.1093/nar/gkl1017
Thomas, Daryl J. ; Rosenbloom, Kate R. ; Clawson, Hiram ; Hinrichs, Angie S. ; Trumbower, Heather ; Raney, Brian J. ; Karolchik, Donna ; Barber, Galt P. ; Harte, Rachel A. ; Hillman-Jackson, Jennifer ; Kuhn, Robert M. ; Rhead, Brooke L. ; Smith, Kayla E. ; Thakkapallayil, Archana ; Zweig, Ann S. ; Haussler, David ; Kent, W. James. / The ENCODE project at UC Santa Cruz. In: Nucleic acids research. 2007 ; Vol. 35, No. SUPPL. 1. pp. D663-D667.
@article{d07bf8dc6f414c2da060f57d46b897c5,
title = "The ENCODE project at UC Santa Cruz",
abstract = "The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1{\%} of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.",
author = "Thomas, {Daryl J.} and Rosenbloom, {Kate R.} and Hiram Clawson and Hinrichs, {Angie S.} and Heather Trumbower and Raney, {Brian J.} and Donna Karolchik and Barber, {Galt P.} and Harte, {Rachel A.} and Jennifer Hillman-Jackson and Kuhn, {Robert M.} and Rhead, {Brooke L.} and Smith, {Kayla E.} and Archana Thakkapallayil and Zweig, {Ann S.} and David Haussler and Kent, {W. James}",
year = "2007",
month = "1",
day = "1",
doi = "10.1093/nar/gkl1017",
language = "English (US)",
volume = "35",
pages = "D663--D667",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "SUPPL. 1",

}

Thomas, DJ, Rosenbloom, KR, Clawson, H, Hinrichs, AS, Trumbower, H, Raney, BJ, Karolchik, D, Barber, GP, Harte, RA, Hillman-Jackson, J, Kuhn, RM, Rhead, BL, Smith, KE, Thakkapallayil, A, Zweig, AS, Haussler, D & Kent, WJ 2007, 'The ENCODE project at UC Santa Cruz', Nucleic acids research, vol. 35, no. SUPPL. 1, pp. D663-D667. https://doi.org/10.1093/nar/gkl1017

The ENCODE project at UC Santa Cruz. / Thomas, Daryl J.; Rosenbloom, Kate R.; Clawson, Hiram; Hinrichs, Angie S.; Trumbower, Heather; Raney, Brian J.; Karolchik, Donna; Barber, Galt P.; Harte, Rachel A.; Hillman-Jackson, Jennifer; Kuhn, Robert M.; Rhead, Brooke L.; Smith, Kayla E.; Thakkapallayil, Archana; Zweig, Ann S.; Haussler, David; Kent, W. James.

In: Nucleic acids research, Vol. 35, No. SUPPL. 1, 01.01.2007, p. D663-D667.

TY - JOUR

T1 - The ENCODE project at UC Santa Cruz

AU - Thomas, Daryl J.

AU - Rosenbloom, Kate R.

AU - Clawson, Hiram

AU - Hinrichs, Angie S.

AU - Trumbower, Heather

AU - Raney, Brian J.

AU - Karolchik, Donna

AU - Barber, Galt P.

AU - Harte, Rachel A.

AU - Hillman-Jackson, Jennifer

AU - Kuhn, Robert M.

AU - Rhead, Brooke L.

AU - Smith, Kayla E.

AU - Thakkapallayil, Archana

AU - Zweig, Ann S.

AU - Haussler, David

AU - Kent, W. James

PY - 2007/1/1

Y1 - 2007/1/1

AB - The goal of the Encyclopedia Of DNA Elements (ENCODE) Project is to identify all functional elements in the human genome. The pilot phase is for comparison of existing methods and for the development of new methods to rigorously analyze a defined 1% of the human genome sequence. Experimental datasets are focused on the origin of replication, DNase I hypersensitivity, chromatin immunoprecipitation, promoter function, gene structure, pseudogenes, non-protein-coding RNAs, transcribed RNAs, multiple sequence alignment and evolutionarily constrained elements. The ENCODE project at UCSC website (http://genome.ucsc.edu/ENCODE) is the primary portal for the sequence-based data produced as part of the ENCODE project. In the pilot phase of the project, over 30 labs provided experimental results for a total of 56 browser tracks supported by 385 database tables. The site provides researchers with a number of tools that allow them to visualize and analyze the data as well as download data for local analyses. This paper describes the portal to the data, highlights the data that has been made available, and presents the tools that have been developed within the ENCODE project. Access to the data and types of interactive analysis that are possible are illustrated through supplemental examples.

UR - http://www.scopus.com/inward/record.url?scp=33846072868&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33846072868&partnerID=8YFLogxK

U2 - 10.1093/nar/gkl1017

DO - 10.1093/nar/gkl1017

M3 - Article

C2 - 17166863

AN - SCOPUS:33846072868

VL - 35

SP - D663

EP - D667

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - SUPPL. 1

ER -

Thomas DJ, Rosenbloom KR, Clawson H, Hinrichs AS, Trumbower H, Raney BJ et al. The ENCODE project at UC Santa Cruz. Nucleic acids research. 2007 Jan 1;35(SUPPL. 1):D663-D667. https://doi.org/10.1093/nar/gkl1017