The distributions of amino acids at most-conserved sites nearest catalytic/active centers (C/AC) in 4,645 sequences of ten enzymes of the glycolytic Embden-Meyerhof-Parnas pathway in Archaea, Bacteria and Eukaryota are similar to the proposed temporal order of their appearance on Earth. Glycine, isoleucine, leucine, valine, glutamic acid and possibly lysine often described as prebiotic, i.e., existing or occurring before the emergence of life, were localized in positional and conservational defined aggregations in all enzymes of all Domains. The distributions of all 20 biologic amino acids in most-conserved sites nearest their C/ACs were quite different either from distributions in sites less-conserved and further from their C/ACs or from all amino acids regardless of their position or conservation. The major concentrations of glycine, e.g., perhaps the earliest prebiotic amino acid, occupies ≈16 % of all the most-conserved sites within a volume of ≈7-8 Å radius from their C/ACs and decreases linearly towards the molecule's peripheries. Spatially localized major concentrations of isoleucine, leucine and valine are in the mid-conserved and mid-distant sites from their C/ACs in protein interiors. Lysine and glutamic acid comprise ≈25-30 % of all amino acids within an irregular volume bounded by ≈24-28 Å radii from their C/ACs at the most-distant least-conserved sites. The unreported characteristics of these amino acids: their spatially and conservationally identified concentrations in Archaea, Bacteria and Eukaryota, suggest some common structural organization of glycolytic enzymes that may be relevant to their evolution and that of other proteins. We discuss our data in relation to enzyme evolution, their reported prebiotic putative temporal appearances on Earth, abundances, biological "cost", neighbor-sequence preferences or "ordering" and some thermodynamic parameters.
All Science Journal Classification (ASJC) codes
- Ecology, Evolution, Behavior and Systematics
- Space and Planetary Science