TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W7-13. doi: 10.1093/nar/gkq291. Epub 2010 Apr 30.

Abstract

We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

MeSH terms

  • Genetic Code
  • Internet
  • Phylogeny
  • Protein Biosynthesis
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA*
  • Sequence Analysis, Protein*
  • Software*