Protein structural alignments and functional genomics

James A. Irving, James C. Whisstock, Arthur M. Lesk

Research output: Contribution to journalArticle

65 Citations (Scopus)

Abstract

Structural genomics - the systematic solution of structures of the proteins of an organism - will increasingly often produce molecules of unknown function with no close relative of known function. Prediction of protein function from structure has thereby become a challenging problem of computational molecular biology. The strong conservation of active site conformations in homologous proteins suggests a method for identifying them. This depends on the relationship between size and goodness-of-fit of aligned substructures in homologous proteins. For all pairs of proteins studied, the root-mean-square deviation (RMSD) as a function of the number of residues aligned varies exponentially for large common substructures and linearly for small common substructures. The exponent of the dependence at large common substructures is well correlated with the RMSD of the core as originally calculated by Chothia and Lesk (EMBO J 1986;5:823-826), affording the possibility of reconciling different structural alignment procedures. In the region of small common substructures, reduced aligned subsets define active sites and can be used to suggest the locations of active sites in homologous proteins.

Original languageEnglish (US)
Pages (from-to)378-382
Number of pages5
JournalProteins: Structure, Function and Genetics
Volume42
Issue number3
DOIs
StatePublished - Feb 15 2001

Fingerprint

Genomics
Catalytic Domain
Proteins
Molecular biology
Computational Biology
Conformations
Conservation
Molecules

All Science Journal Classification (ASJC) codes

  • Structural Biology
  • Biochemistry
  • Molecular Biology

Cite this

Irving, James A. ; Whisstock, James C. ; Lesk, Arthur M. / Protein structural alignments and functional genomics. In: Proteins: Structure, Function and Genetics. 2001 ; Vol. 42, No. 3. pp. 378-382.
@article{299aa67e61c84712aeb15d094ac8bba9,
title = "Protein structural alignments and functional genomics",
abstract = "Structural genomics - the systematic solution of structures of the proteins of an organism - will increasingly often produce molecules of unknown function with no close relative of known function. Prediction of protein function from structure has thereby become a challenging problem of computational molecular biology. The strong conservation of active site conformations in homologous proteins suggests a method for identifying them. This depends on the relationship between size and goodness-of-fit of aligned substructures in homologous proteins. For all pairs of proteins studied, the root-mean-square deviation (RMSD) as a function of the number of residues aligned varies exponentially for large common substructures and linearly for small common substructures. The exponent of the dependence at large common substructures is well correlated with the RMSD of the core as originally calculated by Chothia and Lesk (EMBO J 1986;5:823-826), affording the possibility of reconciling different structural alignment procedures. In the region of small common substructures, reduced aligned subsets define active sites and can be used to suggest the locations of active sites in homologous proteins.",
author = "Irving, {James A.} and Whisstock, {James C.} and Lesk, {Arthur M.}",
year = "2001",
month = "2",
day = "15",
doi = "10.1002/1097-0134(20010215)42:3<378::AID-PROT70>3.0.CO;2-3",
language = "English (US)",
volume = "42",
pages = "378--382",
journal = "Proteins: Structure, Function and Genetics",
issn = "0887-3585",
publisher = "Wiley-Liss Inc.",
number = "3",

}

Protein structural alignments and functional genomics. / Irving, James A.; Whisstock, James C.; Lesk, Arthur M.

In: Proteins: Structure, Function and Genetics, Vol. 42, No. 3, 15.02.2001, p. 378-382.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Protein structural alignments and functional genomics

AU - Irving, James A.

AU - Whisstock, James C.

AU - Lesk, Arthur M.

PY - 2001/2/15

Y1 - 2001/2/15

N2 - Structural genomics - the systematic solution of structures of the proteins of an organism - will increasingly often produce molecules of unknown function with no close relative of known function. Prediction of protein function from structure has thereby become a challenging problem of computational molecular biology. The strong conservation of active site conformations in homologous proteins suggests a method for identifying them. This depends on the relationship between size and goodness-of-fit of aligned substructures in homologous proteins. For all pairs of proteins studied, the root-mean-square deviation (RMSD) as a function of the number of residues aligned varies exponentially for large common substructures and linearly for small common substructures. The exponent of the dependence at large common substructures is well correlated with the RMSD of the core as originally calculated by Chothia and Lesk (EMBO J 1986;5:823-826), affording the possibility of reconciling different structural alignment procedures. In the region of small common substructures, reduced aligned subsets define active sites and can be used to suggest the locations of active sites in homologous proteins.

AB - Structural genomics - the systematic solution of structures of the proteins of an organism - will increasingly often produce molecules of unknown function with no close relative of known function. Prediction of protein function from structure has thereby become a challenging problem of computational molecular biology. The strong conservation of active site conformations in homologous proteins suggests a method for identifying them. This depends on the relationship between size and goodness-of-fit of aligned substructures in homologous proteins. For all pairs of proteins studied, the root-mean-square deviation (RMSD) as a function of the number of residues aligned varies exponentially for large common substructures and linearly for small common substructures. The exponent of the dependence at large common substructures is well correlated with the RMSD of the core as originally calculated by Chothia and Lesk (EMBO J 1986;5:823-826), affording the possibility of reconciling different structural alignment procedures. In the region of small common substructures, reduced aligned subsets define active sites and can be used to suggest the locations of active sites in homologous proteins.

UR - http://www.scopus.com/inward/record.url?scp=0035865982&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035865982&partnerID=8YFLogxK

U2 - 10.1002/1097-0134(20010215)42:3<378::AID-PROT70>3.0.CO;2-3

DO - 10.1002/1097-0134(20010215)42:3<378::AID-PROT70>3.0.CO;2-3

M3 - Article

C2 - 11151008

AN - SCOPUS:0035865982

VL - 42

SP - 378

EP - 382

JO - Proteins: Structure, Function and Genetics

JF - Proteins: Structure, Function and Genetics

SN - 0887-3585

IS - 3

ER -