Expanded encyclopaedias of DNA elements in the human and mouse genomes

Nature. 2020 Jul;583(7818):699-710. doi: 10.1038/s41586-020-2493-4. Epub 2020 Jul 29.

Abstract

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE¹ and Roadmap Epigenomics² data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9% and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Chromatin / genetics
  • Chromatin / metabolism
  • DNA / chemistry
  • DNA / genetics*
  • DNA Footprinting
  • DNA Methylation / genetics
  • DNA Replication Timing
  • Databases, Genetic*
  • Deoxyribonuclease I / metabolism
  • Genome / genetics*
  • Genome, Human
  • Genomics*
  • Histones / metabolism
  • Humans
  • Mice
  • Mice, Transgenic
  • Molecular Sequence Annotation*
  • RNA-Binding Proteins / genetics
  • Registries*
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Transcription, Genetic / genetics
  • Transposases / metabolism

Substances

  • Chromatin
  • Histones
  • RNA-Binding Proteins
  • DNA
  • Transposases
  • Deoxyribonuclease I

Grants and funding