Reconciling the numbers: ESTs versus protein-coding genes

Research output: Contribution to journalArticlepeer-review

17 Scopus citations


The number of expressed sequences greatly surpasses the estimated number of protein-coding genes in mammalian genomes. An evolutionary approach reveals that only 9% to 14% of human-expressed and mouse-expressed sequences are able to code for proteins. Clustering of these sequences using cross-species relationships suggests that millions of expressed sequences may correspond to only approximately 20,000 distinct protein-coding transcripts.

Original languageEnglish (US)
Pages (from-to)1278-1282
Number of pages5
JournalMolecular biology and evolution
Issue number7
StatePublished - Jul 2004

All Science Journal Classification (ASJC) codes

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics


Dive into the research topics of 'Reconciling the numbers: ESTs versus protein-coding genes'. Together they form a unique fingerprint.

Cite this