Reconciling the numbers: ESTs versus protein-coding genes

Research output: Contribution to journalArticle

16 Citations (Scopus)

Abstract

The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9% to 14% of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.

Original languageEnglish (US)
Pages (from-to)1278-1282
Number of pages5
JournalMolecular Biology and Evolution
Volume21
Issue number7
DOIs
StatePublished - Jul 1 2004

Fingerprint

Expressed Sequence Tags
Genes
protein
gene
Proteins
genes
proteins
Cluster Analysis
genome
Genome
mice

All Science Journal Classification (ASJC) codes

  • Genetics
  • Biochemistry
  • Genetics(clinical)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Ecology, Evolution, Behavior and Systematics
  • Agricultural and Biological Sciences (miscellaneous)
  • Molecular Biology

Cite this

@article{b8dae2815bd24245ad433b49688c555b,
title = "Reconciling the numbers: ESTs versus protein-coding genes",
abstract = "The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9{\%} to 14{\%} of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.",
author = "Anton Nekrutenko",
year = "2004",
month = "7",
day = "1",
doi = "10.1093/molbev/msh125",
language = "English (US)",
volume = "21",
pages = "1278--1282",
journal = "Molecular Biology and Evolution",
issn = "0737-4038",
publisher = "Oxford University Press",
number = "7",

}

Reconciling the numbers : ESTs versus protein-coding genes. / Nekrutenko, Anton.

In: Molecular Biology and Evolution, Vol. 21, No. 7, 01.07.2004, p. 1278-1282.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Reconciling the numbers

T2 - ESTs versus protein-coding genes

AU - Nekrutenko, Anton

PY - 2004/7/1

Y1 - 2004/7/1

N2 - The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9% to 14% of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.

AB - The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9% to 14% of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.

UR - http://www.scopus.com/inward/record.url?scp=3042682034&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=3042682034&partnerID=8YFLogxK

U2 - 10.1093/molbev/msh125

DO - 10.1093/molbev/msh125

M3 - Article

C2 - 15034132

AN - SCOPUS:3042682034

VL - 21

SP - 1278

EP - 1282

JO - Molecular Biology and Evolution

JF - Molecular Biology and Evolution

SN - 0737-4038

IS - 7

ER -