A cross-reference table between the Protein Data Bank of macromolecular structures and the National Biomedical Research Foundation-Protein Identification Resource amino acid sequence data bank.

Arthur Lesk, D. R. Boswell, V. I. Lesk, V. E. Lesk, A. Bairoch

Research output: Contribution to journalArticle

11 Scopus citations


The National Biomedical Research Foundation-Protein Identification Resource (NBRF-PIR) and the Protein Data Bank at Brookhaven National Laboratory (PDB) both contain protein sequences. We have prepared a cross-reference index of the sequences in these data banks, and compared the data. Of the 270 cases of sequences of the same protein appearing in both data bases, for only 31% are the sequences identical. This is often the result of a difference in the state of maturation of the proteins rather than experimental error. Nevertheless is useful to be aware that the sequence information in these two data archives should not be regarded as redundant.

Original languageEnglish (US)
Pages (from-to)295-308
Number of pages14
JournalProtein sequences & data analysis
Issue number4
Publication statusPublished - Jan 1 1989


Cite this