Evolution of amino acid frequencies in proteins over deep time: inferred order of introduction of amino acids into the genetic code

Mol Biol Evol. 2002 Oct;19(10):1645-55. doi: 10.1093/oxfordjournals.molbev.a003988.

Abstract

To understand more fully how amino acid composition of proteins has changed over the course of evolution, a method has been developed for estimating the composition of proteins in an ancestral genome. Estimates are based upon the composition of conserved residues in descendant sequences and empirical knowledge of the relative probability of conservation of various amino acids. Simulations are used to model and correct for errors in the estimates. The method was used to infer the amino acid composition of a large protein set in the Last Universal Ancestor (LUA) of all extant species. Relative to the modern protein set, LUA proteins were found to be generally richer in those amino acids that are believed to have been most abundant in the prebiotic environment and poorer in those amino acids that are believed to have been unavailable or scarce. It is proposed that the inferred amino acid composition of proteins in the LUA probably reflects historical events in the establishment of the genetic code.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Amino Acid Substitution
  • Amino Acids / analysis
  • Computer Simulation
  • Conserved Sequence
  • Evolution, Molecular*
  • Genetic Code*
  • Models, Genetic
  • Probability
  • Proteins / chemistry*
  • Proteins / genetics*
  • Time Factors

Substances

  • Amino Acids
  • Proteins