metaseq: a Python package for integrative genome-wide analysis reveals relationships between chromatin insulators and associated nuclear mRNA

Nucleic Acids Res. 2014 Aug;42(14):9158-70. doi: 10.1093/nar/gku644. Epub 2014 Jul 24.

Abstract

Here we introduce metaseq, a software library written in Python, which enables loading multiple genomic data formats into standard Python data structures and allows flexible, customized manipulation and visualization of data from high-throughput sequencing studies. We demonstrate its practical use by analyzing multiple datasets related to chromatin insulators, which are DNA-protein complexes proposed to organize the genome into distinct transcriptional domains. Recent studies in Drosophila and mammals have implicated RNA in the regulation of chromatin insulator activities. Moreover, the Drosophila RNA-binding protein Shep has been shown to antagonize gypsy insulator activity in a tissue-specific manner, but the precise role of RNA in this process remains unclear. Better understanding of chromatin insulator regulation requires integration of multiple datasets, including those from chromatin-binding, RNA-binding, and gene expression experiments. We use metaseq to integrate RIP- and ChIP-seq data for Shep and the core gypsy insulator protein Su(Hw) in two different cell types, along with publicly available ChIP-chip and RNA-seq data. Based on the metaseq-enabled analysis presented here, we propose a model where Shep associates with chromatin cotranscriptionally, then is recruited to insulator complexes in trans where it plays a negative role in insulator activity.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • 5' Flanking Region
  • Binding Sites
  • Cell Line
  • Cell Nucleus / genetics
  • Chromatin / metabolism*
  • Chromatin Immunoprecipitation
  • Drosophila Proteins / metabolism*
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Immunoprecipitation
  • Insulator Elements
  • RNA, Messenger / metabolism*
  • RNA-Binding Proteins / metabolism*
  • Repressor Proteins / metabolism*
  • Sequence Analysis, RNA
  • Software*
  • Transcription, Genetic

Substances

  • Chromatin
  • Drosophila Proteins
  • RNA, Messenger
  • RNA-Binding Proteins
  • Repressor Proteins
  • Shep protein, Drosophila
  • su(Hw) protein, Drosophila

Associated data

  • GEO/GSE15596
  • GEO/GSE40797
  • GEO/GSE51462
  • GEO/GSE55894