Ensembl 2020

Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688. doi: 10.1093/nar/gkz966.

Abstract

The Ensembl (https://www.ensembl.org) is a system for generating and distributing genome annotation such as genes, variation, regulation and comparative genomics across the vertebrate subphylum and key model organisms. The Ensembl annotation pipeline is capable of integrating experimental and reference data from multiple providers into a single integrated resource. Here, we present 94 newly annotated and re-annotated genomes, bringing the total number of genomes offered by Ensembl to 227. This represents the single largest expansion of the resource since its inception. We also detail our continued efforts to improve human annotation, developments in our epigenome analysis and display, a new tool for imputing causal genes from genome-wide association studies and visualisation of variation within a 3D protein model. Finally, we present information on our new website. Both software and data are made available without restriction via our website, online tools platform and programmatic interfaces (available under an Apache 2.0 license) and data updates made available four times a year.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology / methods*
  • Computer Graphics
  • Databases, Genetic*
  • Databases, Protein
  • Epigenome*
  • Genetic Variation
  • Genome-Wide Association Study
  • Genomics
  • Histones / metabolism
  • Humans
  • Imaging, Three-Dimensional
  • Internet
  • Ligands
  • Molecular Sequence Annotation*
  • Search Engine
  • Software
  • Species Specificity
  • Transcriptome
  • User-Computer Interface
  • Web Browser

Substances

  • Histones
  • Ligands