A new measure for functional similarity of gene products based on Gene Ontology

BMC Bioinformatics. 2006 Jun 15:7:302. doi: 10.1186/1471-2105-7-302.

Abstract

Background: Gene Ontology (GO) is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role.

Results: We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures; simRel and funSim. One measure (simRel) is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim) is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families.

Conclusion: The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Databases, Protein*
  • Genes
  • Information Storage and Retrieval / methods*
  • Multigene Family / physiology*
  • Natural Language Processing*
  • Proteins / classification*
  • Proteins / physiology*
  • Structure-Activity Relationship
  • User-Computer Interface
  • Vocabulary, Controlled

Substances

  • Proteins