Mechanism for full-length RNA processing of Arabidopsis genes containing intragenic heterochromatin

Saze, Hidetoshi; Kitayama, Junko; Takashima, Kazuya; Miura, Saori; Harukawa, Yoshiko; Ito, Tasuku; Kakutani, Tetsuji

doi:10.1038/ncomms3301

Article
Published: 12 August 2013

Mechanism for full-length RNA processing of Arabidopsis genes containing intragenic heterochromatin

Hidetoshi Saze^1,2,
Junko Kitayama³,
Kazuya Takashima³,
Saori Miura¹,
Yoshiko Harukawa¹,
Tasuku Ito^3,4 &
…
Tetsuji Kakutani^3,4

Nature Communications volume 4, Article number: 2301 (2013) Cite this article

4340 Accesses
69 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Genomes of higher eukaryotes contain many transposable elements, which often localize within the transcribed regions of active genes. Although intragenic transposable elements can be silenced to form heterochromatin, the impact of intragenic heterochromatin on transcription and RNA processing remains largely unexplored. Here we show using a flowering plant, Arabidopsis, that full-length transcript formation over intragenic heterochromatin depends on a protein named IBM2 (Increase in Bonsai Methylation 2), which has a Bromo-Adjacent Homology domain and an RNA recognition motif. Mutation of ibm2 triggers premature termination of transcripts with 3′ RNA processing around intragenic heterochromatin at loci including the H3K9 demethylase gene IBM1. The need for IBM2 is circumvented in variant alleles that lack the heterochromatic domain. Our results reveal a mechanism that masks deleterious effects of intragenic heterochromatin, providing evolutionary sources for genetic and epigenetic variations.

You have full access to this article via your institution.

Download PDF

Epigenetic regulation of spurious transcription initiation in Arabidopsis

Article Open access 26 June 2020

Ngoc Tu Le, Yoshiko Harukawa, … Hidetoshi Saze

An siRNA-guided ARGONAUTE protein directs RNA polymerase V to initiate DNA methylation

Article Open access 08 November 2021

Meredith J. Sigman, Kaushik Panda, … Andrea D. McCue

RNA interference-independent reprogramming of DNA methylation in Arabidopsis

Article 30 November 2020

Taiko Kim To, Yuichiro Nishizawa, … Tetsuji Kakutani

Introduction

Genomes of vertebrates and plants contain a substantial number of transposable elements (TEs)¹, which are silenced by repressive epigenetic modifications, such as cytosine methylation and methylation of lysine 9 of histone H3 (H3K9me)². These modifications are essential for the formation of inactive chromatin structures called heterochromatin. In contrast, these modifications are generally excluded from active genes, which depend on passage of the transcription machinery^3,4,5. A potential complication is that active cellular genes sometimes contain TEs within their transcribed regions, particularly in higher eukaryotes with large genomes^6,7,8. Intragenic TEs can cause modulation of gene expression, truncation of transcripts, and creation of poly(A) sites and alternative splicing sites⁹. In mammals, for example, the intragenic intra-cisternal A particle (IAP) retrotransposon triggers premature termination of transcripts at cryptic poly(A) signals within or even outside the element, depending on differential DNA methylation of the element^10,11.

In plants, heterochromatic epigenetic modifications at TEs are established by RNA interference (RNAi)-mediated mechanisms called RNA-directed DNA methylation, and are maintained by DNA methylases and by chromatin remodelling factor^2,12. In general, TEs associated with heterochromatin accumulate at pericentromeric regions of chromosomes^13,14, whereas a substantial number of TEs are also found within gene units, particularly in introns^8,15,16. Despite their abundance, however, epigenetic regulation of intragenic TEs and their impact on modes of transcription remain largely unexplored.

In this study we show that heterochromatic epigenetic modifications are commonly found within actively transcribed gene units in both the Arabidopsis and rice genomes. We further show that in Arabidopsis, full-length transcription of genes with intragenic heterochromatin, most of which is formed by TE insertions, requires IBM2 (Increase in Bonsai Methylation 2), a protein with a Bromo-Adjacent Homology (BAH) domain and an RNA recognition motif (RRM). Our results reveal an epigenetic mechanism that masks the effects of genetic variations created by TE insertions, allowing evolution of complex genomes with heterochromatic domains having diverse functions.

Results

IBM2 controls full-length transcription of IBM1 locus

In the flowering plant, Arabidopsis, H3K9 methylation and DNA methylation are generally excluded from genic regions by the action of H3K9 demethylase IBM1 (refs 17, 18, 19). ibm1 mutants show developmental defects and ectopic accumulation of H3K9me and DNA methylation at non-CG sites, for thousands of transcribed genes^17,18,19. In the course of this study, we identified a novel factor named IBM2 that is required for proper expression of IBM1. A mutant of IBM2 (AT5G11470) was isolated from a genetic screen as ibm1; however, the gene encodes a previously uncharacterized protein that has a BAH domain near the N terminus and an RRM in the C-terminal part (Fig. 1a, Supplementary Fig. S1a). ibm2 mutants show phenotypes very similar to those of ibm1 mutants. In both mutants, non-CG DNA methylation accumulated in genic regions (Supplementary Figs S1–S3), whereas repeat sequences were unaffected (Supplementary Fig. S4). Ectopic genic non-CG methylation and developmental defects in the mutants depend on the functions of both histone H3K9 methylase KRYPTONITE (KYP) and DNA methylase CHROMOMETHYLASE3 (CMT3; Supplementary Fig. S5). These observations suggest that IBM2 acts on the same genetic pathway as IBM1 (Supplementary Fig. S6).

**Figure 1: IBM2 regulates transcription of *IBM1*.**

Expression profiles of wild-type Columbia (Col), and ibm1 and ibm2 mutants were investigated by mRNA-seq analysis (Supplementary Table S1). Target genes of IBM1 and IBM2 largely overlap, and overall expression changes showed a moderate correlation between ibm1 and ibm2 mutants (Pearson’s r=0.640, P<2.2e–16: Fig. 1b, Supplementary Fig. S7). Most importantly, the ibm2 mutant showed a significant reduction of IBM1 transcription at the 3′ region (P=5.2e–21, Fisher’s exact test: Fig. 1c, Supplementary Data 1), which could account for the observed ibm1-like phenotypes in the ibm2 mutant.

The reduction of IBM1 transcript levels at the 3′ region occurred downstream of the seventh intron of IBM1. This intron contains a 200-bp DNA fragment insertion that is homologous to part of ycf1, encoded by both chloroplast and mitochondrial genomes (Fig. 1c). Bisulphite sequence analysis and database searches revealed that the ycf-like sequence is marked by DNA methylation at both CG and non-CG sites, and associated with small RNA production, generating a constitutive heterochromatic domain longer than 700 bp (Fig. 1c, Supplementary Fig. S8a,b)^14,20. As the reduction of IBM1 transcripts in ibm2 occurred specifically at the 3′ region downstream of the intron containing the heterochromatic domain, we speculated that full-length transcription of IBM1 might be hindered by this sequence. To test that possibility, we generated transgenic plants with an IBM1 construct containing a deletion of this heterochromatic region (FLAG-His-HA-gIBM1Δi7; Fig. 1e). Indeed, the transgene rescued DNA hypermethylation of IBM1 target genes in the ibm2 mutant (Fig. 1f, Supplementary Fig. S8c), demonstrating that phenotypes of ibm2 mutants were mediated by reduced functional transcript of IBM1. A control IBM1 construct containing the heterochromatic domain did not rescue the ibm2 phenotype, confirming that this region inhibits the expression of IBM1 when the function of IBM2 is compromised.

Heterochromatic introns in the Arabidopsis and rice genomes

To further identify the possible IBM2 targets in the Arabidopsis genome, we searched for genes that showed expression changes in response to ibm2 mutation, by analysing mRNA-seq data. The most interesting features of the seventh intron of IBM1 are that it is relatively long compared with the remainder of the Arabidopsis genome and it includes the aforementioned heterochromatic domain. Compared to other well-studied plant species, the intron size in Arabidopsis is generally shorter^8,21. Still, we were able to identify 133 introns that are longer than the seventh intron of IBM1 (1,975 bp). Some of the genes with long introns showed drastic changes in expression patterns in the ibm2 mutant (Supplementary Data 1). Notably, expression changes strongly correlate with the state of DNA methylation of the intron; genes responding to the ibm2 mutation contain long introns associated with DNA methylation at CHG sites (Fig. 2a, Supplementary Data 1). The number of such long heterochromatic introns is low in the genome of Arabidopsis thaliana, but even the rice genome, which is reasonably compact, has a large number of long introns (Fig. 2b), and surprisingly, many of them have features of heterochromatin (Fig. 2c). We further investigated 79 of the 133 long introns in the Arabidopsis genome, listed in Supplementary Data 1, and found that 24 introns have insertions of 66 TEs (ref.²²22). Interestingly, introns with CHG hypermethylation tend to have TE insertions (Supplementary Data 1), indicating that intragenic TEs are the major target of the heterochromatic modifications. The majority of inserted TEs are Class II TEs, such as Helitron (38/66 insertions) and MuDR (18/66), while Class I TEs are relatively uncommon (LTR: 6/66, LINE: 1/66). Introns that significantly affect gene expression in ibm2 (Supplementary Data 1) contain 19 TEs that have an average length of 1,068.4 bp, which is twice as long as the average of the remaining 46 TEs (453.4 bp).

**Figure 2: Heterochromatic introns in the genomes of *Arabidopsis* and rice.**

IBM2 promotes RNA processing over intragenic heterochromatin

In the Arabidopsis ibm2 mutant, amounts of full-length transcripts spanning heterochromatic introns formed by insertion of TEs were significantly reduced (Fig. 3a–d, Supplementary Figs. S9 and S10a, Supplementary Data1, Supplementary Table S2). We tested whether long, heterochromatic TEs in introns are, in fact, responsible for transcription defects of these genes in ibm2, using natural variant alleles without TE insertions. Another natural strain, Landsberg erecta (Ler), does not contain TE insertions in two of these genes (Fig. 3e). In these alleles lacking TEs, the ibm2 mutation did not affect the level of full-length transcripts (Fig. 3f), indicating that, as in the IBM1 locus, intragenic heterochromatin within these active genes is the target of IBM2 activity.

**Figure 3: IBM2 facilitates transcription of genes with intragenic heterochromatin.**

In general, the presence of TEs in a gene unit often results in transcription elongation defects⁹. However, we found that in wild-type Col, the majority of mature poly(A) mRNAs spanned heterochromatic regions (Fig. 4, Supplementary Fig. S9), suggesting that the transcription machinery can produce read-through transcripts of intragenic TEs without disturbing the repressive epigenetic marks (Fig. 3a–d, Supplementary Fig. S9). In the ibm2 mutant, however, transcripts spanning the intragenic heterochromatin were drastically reduced, and instead, proportions of shorter forms of transcript variants were increased (Fig. 4, Supplementary Data 1). Changes in 3′ processing patterns of these genes in the ibm2 mutant suggest that IBM2 might facilitate passage of transcription machinery through the heterochromatic domain marked by repressive epigenetic modifications. Alternatively, IBM2 might have a post-transcriptional function for RNA processing. To test these possibilities, we analysed RNA polymerase II (Pol II) density at the IBM1 locus and TE-enclosing genes by chromatin immunoprecipitation (ChIP)-quantitative PCR. Importantly, loss of IBM2 function did not abolish localization of Pol II at regions downstream of intragenic heterochromatin (Fig. 5a–d, Supplementary Fig. S10b,c), suggesting that IBM2 does not affect passage of transcription machinery, but rather is involved in post-transcriptional RNA processing (Fig. 5f). IBM2 has an RRM, which is found in proteins functioning in the 3′ end processing of mRNAs, such as FPA and FCA in Arabidopsis^23,24. fpa and fca mutants induce an extensive transcriptional read-through of the 3′ region of genes^25,26. Although the functions of IBM2 and FCA / FPA in 3′ processing may have a mechanistic overlap, ibm2 mutation rather induces premature termination around heterochromatic domains.

**Figure 4: 3′ Rapid Amplification of cDNA Ends (RACE) of *IBM1* and a gene with an intragenic TE.**

**Figure 5: Pol II and IBM2 densities at *IBM1* and intragenic heterochromatin.**

In addition to RRM, IBM2 has a BAH domain, which is often found in proteins associated with chromatin^27,28,29. As intragenic heterochromatin seems to be a general target of IBM2 (Fig. 2a), we tested whether IBM2 is localized at TEs. For that purpose, we generated transgenic lines that express IBM2 protein fused with FLAG and haemagglutinin (HA) epitopes (Supplementary Fig. S11a,b). ChIP-quantitative PCR experiments with anti-HA antibody demonstrated that IBM2 protein was indeed enriched at intragenic TEs (Fig. 5e). Intriguingly, however, the signal was much weaker in TEs outside transcription units, suggesting that intragenic heterochromatin has a unique property that recruits IBM2 when combined with canonical heterochromatic epigenetic modifications.

Discussion

In this study, we identified a novel cellular mechanism that promotes production of full-length mRNAs over intragenic heterochromatin (Fig. 5f). The mechanism involving IBM2, which allows the presence of long TE sequences within transcription units, should be important especially in higher eukaryotic species with many more heterochromatic TEs in the genome^1,30. In addition, considering that the heterochromatic domain in the IBM1 is associated with an organelle genome sequence, there may be additional targets for IBM2 besides intragenic TEs (Supplementary Data 1). Maintenance of heterochromatic epigenetic states at the domains seems to be important for full-length transcription of the genes including IBM1, as erasure of heterochromatic epigenetic modifications results in a reduction of full-length transcripts of genes with heterochromatic domains, as observed in ibm2 (refs 14, 20, 31). However, ibm2 mutation affects expression of IBM1 without changing DNA methylation at the intron (Supplementary Fig. S8a,b). Therefore, IBM2 may act downstream of repressive epigenetic modifications by binding to the heterochromatic domain or to proteins required for transcriptional repression, via its BAH domain (Fig. 5e)^27,28,29. Notably, IBM2 is recruited more efficiently to intragenic heterochromatin than to heterochromatin outside genes (Fig. 5e). On the other hand, IBM2 is not directly associated with transcription machinery (Supplementary Fig. S11c). These results suggest that epigenetic marks or proteins other than those in canonical heterochromatin may be involved in recruiting IBM2 to targeted loci.

Our results demonstrate that the transcription machinery passes over intragenic TEs without changing the epigenetic marks of silent chromatin (Fig. 3a–c, Supplementary Fig. S9). That was surprising to us, because transcription elongation is generally associated with recruitment of chromatin-modifying enzymes, resulting in accumulation of active chromatin marks such as histone H3K4 methylation and H3K36 methylation^3,4,5. In budding yeast, for example, transcription elongation through heterochromatic domains resulted in displacement of silencing proteins and loss of heterochromatic structure³². On the other hand, establishment of heterochromatin at pericentromeric repeats in fission yeast requires a Pol II read-through of the regions, in which Pol II transcripts recruit RNAi complex and histone H3K9 methylase^33,34. Similarly, it has been reported that Pol II-dependent transcription in Arabidopsis in an intergenic region can induce small interfering RNA-directed gene silencing^35,36. In the case of the PAI locus in Arabidopsis, H3K9 methylation and DNA methylation are maintained despite read-through transcription of the locus, which depend on small RNAs produced by Dicers³⁷. Therefore, it would be interesting to know if Pol II transcripts generated from intragenic TEs reinforce heterochromatic structures by recruiting RNA-directed DNA methylation machinery¹².

Mechanisms that trigger premature transcriptional termination around the intragenic heterochromatin in ibm2 are presently unknown. In mammals, insertion of intragenic IAP elevates usage of cryptic poly(A) signals present in gene units, which could be due to ectopic transcription of antisense transcripts from internal promoters of TE^10,11. Therefore, it is possible that IBM2 prevents premature transcriptional termination by suppressing antisense transcription from cryptic promoters. Indeed, the production of antisense transcripts was detected from the heterochromatic domain of the IBM1 locus²⁰. Alternatively, binding of the RRM domain to pre-mRNA might directly suppress utilization of alternative poly(A) signals near or within heterochromatic domain, which would eventually promote splicing of the intron containing heterochromatin³⁸.

During evolution, or in response to perturbations of the environment, heterochromatic elements might manifest themselves via the changes in epigenetic states³⁹. It has been demonstrated that full-length IBM1 mRNA is reduced in the met1 mutant, due to a loss of DNA methylation at the heterochromatinc domain^20,40. Because IBM1 encodes a histone H3K9 demethylase¹⁹, the heterochromatic domain in IBM1 may function as a genome-wide fine-tuning mechanism, balancing active and repressive states of genes and TEs, through the control of full-length transcription of IBM1²⁰. Interestingly, the organelle-like sequence is also present in the Arabidopsis lyrata orthologue of IBM1 (Supplementary Fig. S12), suggesting that its insertion originated before the separation of these species. Moreover, one of the targets of IBM2 is a recently identified TE family that becomes mobile when plants are exposed to heat stress (Fig. 3a)^41,42. Thus, mechanisms to cope with heterochromatin allow the evolution of complex gene controls in higher eukaryotes, and could be the origin of diverse epigenetic phenomena^{43,44,45,46,47,48,49}. IBM2 genes are conserved in other plant species (Supplementary Fig. S13), and the BAH and RRM domains are evolutionarily conserved motifs in eukaryotes. It will be interesting to learn the significance of this pathway in other species possessing more intragenic TEs.

Methods

Screening and isolation of the Arabidopsis ibm2-1 mutant

Screening of ibm mutants was performed with an ethyl methanesulfonate (EMS)-mutagenized Arabidopsis population¹⁷. A Col line containing the ibm2-1 allele was crossed to a wild-type Ler strain, and ~200 F2 plants and F3 descendants were used for fine mapping. The ibm2-1 mutation was mapped to the top arm of chromosome 5, at an interval of ~330 kb between the polymorphisms PERL0881671 and PERL0884446. Candidate genes were sequenced, and we found that the gene, AT5G11470, has a single-base deletion of G, 1,603-bp downstream from the ATG translation start site, which results in a frameshift and creation of a TGA translation stop codon 39 bp downstream. The ibm2-1 mutation was complemented by a 6.6-kb genomic DNA fragment containing the AT5G11470 gene (see below), confirming that the gene is identical to IBM2. ibm2-1 and wild-type IBM2 alleles were distinguished by digestability of the PCR product by EcoRV, using the primers IBM2-CAPs P1 and IBM2-CAPs P2. PCR primers used in this study are listed in Supplementary Table S3.

Plant growth conditions and plant strains

Plants were grown under long-day conditions (16 h, light; 8 h, dark) at 22 °C. ddm1-1, cmt3-i11, kyp, ibm1-1 and ibm1-4 mutations have been reported^17,50,51,52. ibm2-2 (GABI-Kat: 045A07: NASC Code N404231) and ibm2-3 (GABI-Kat: 484B11) were obtained from NASC⁵² or GABI-Kat⁵³, respectively. Heterozygous ibm2-2 and ibm2-3 plants were backcrossed at least once to wild-type Col and used after confirming segregation of additional T-DNAs by Southern analysis. The ibm2-1 mutant was used after backcrossing either to wild-type Col or to wild-type Ler more than six times.

DNA methylation analysis

For PCR-based DNA methylation analysis⁵⁴, 200 ng of genomic DNA was digested with BglII and EcoRI, in a 20-μl reaction mix. Control ‘undigested’ sample was digested with EcoRI alone. After digestion, PCR was performed using 1 μl of digested sample as a template. For bisulphite sequencing analysis⁵⁵, 0.25–1 μg of heat-denatured genomic DNA in 20 μl H₂O was incubated with 1/9 vol of 3 M NaOH for 20 min at 37 °C. Next, 275 μl of 10 M bisulphite solution was added to the denatured DNA sample and incubated at 70 °C for 1 h. Bisulphite-treated DNA was purified and desulphonated using an EZ DNA Methylation Kit (Zymo Research) according to the manufacturer’s instructions, and eluted with 20 μl of elution buffer. PCR used 2 μl of DNA as a template. A total of 16–20 clones were sequenced for each region.

Genome-wide DNA methylation analysis

To estimate genome-wide DNA methylation levels in introns, sequence data were downloaded from GEO (accession GSE10966 for Arabidopsis Col-0 (ref. 14) and GSE22591 for rice shoot⁵⁶). Reads were mapped to reference genomes (Release 10 of the Arabidopsis Information Resources; Release 7 of the MSU Rice Genome Annotation Project) using the Bowtie alignment algorithm⁵⁷ under conditions described⁵⁸. Only uniquely mapped reads were used for the analysis.

Reverse transcription–PCR

Total RNA of wild-type and mutant plants was isolated from Arabidopsis leaves or flowers, using RNAiso (Takara) according to the manufacturer’s instructions. RNA was further treated with DNaseI (Takara) followed by phenol/chloroform extraction and ethanol precipitation. cDNA was synthesized using a TAKARA RNA PCR kit (AMV) Ver.3.0 (Takara) with oligo-(dT) primer. A total of 1 μg RNA in the reverse transcription (RT) reaction mixture (total 20 μl) was reverse transcribed at 42–50 °C for 1 h, followed by heat inactivation at 95 °C for 5 min. 1 μl of the RT reaction was used as a template for PCR (total 20 μl)⁵⁴. 3′ Rapid Amplification of cDNA Ends was performed using oligo-dT primers followed by the first round of PCR using gene-specific primers and the first oligo-dT-specific primer (Supplementary Table S3). Amplified fragments were diluted and further used for the second round of PCR using another gene-specific primer and a second oligo-dT-specific primer (Supplementary Table S3). Quantitative RT–PCR (qRT-PCR) was performed using SYBR Premix Ex Taq (Takara).

mRNA-seq analysis

Total RNA of wild-type Col, ibm1-4 and ibm2-2 plants homozygous for three generations (F4) was isolated from unopened flower buds using RNAiso (Takara), followed by a cleanup using an RNAeasy Minikit (Qiagen). RNA samples were sent to Hokkaido System Science (Hokkaido, Japan), treated with the mRNA-seq sample prep kit (Illumina), and sequenced by illumina GAII. mRNA-seq data (75-base reads) were aligned to the Arabidopsis genome (TAIR10) using Tophat 1.0.14.OSX (ref. 59). Read coverage of gene models was visualized using Integrated Genome Browser 6.5 (ref. 60) and CLC Bio Genomics Workbench software (CLC bio).

Estimating effects of ibm2 on transcripts over introns

Transcript reads were counted before and after respective introns. The effect of the ibm2 mutation is shown by –log_e(((observed post-intron read number in ibm2)+1)/(expected post-intron read number in ibm2)+1). The expected post-intron read number in ibm2 is calculated by (pre-intron read number in ibm2) × (post-intron read number in wild type)/(pre-intron read number in wild type). If at least one of these three parameters was 0, then that intron was not included in the analysis. For long introns, At1g24880.1 and At1g25054.1 were not included because the methylation level could not be examined, most likely due to inefficient mapping of the repeats. Reads for At1g11270.2 were not mapped, but At1g11270.2 was included in the ‘High mCHG’ class, because of its high non-CG methylation (Fig. 3a).

Generation of transgenic plants

A genomic DNA fragment including AT5G11470 was designed to express a protein fused with 2xFLAG and 2xHA epitopes to the N terminus of IBM2 by the endogenous promoter (FLAG–HA–gIBM2), cloned into the pRI909 vector (Takara) and transformed into ibm2-1 heterozygous plants. Complementation of the ibm2-1 mutation was confirmed by the absence of DNA methylation in T2–T4 ibm2 homozygous plants at several loci, including the BNS locus, using DNA methylation PCR (Supplementary Fig. S11a). The presence of FLAG–HA–IBM2 protein in the nuclear extract of transgenic plants was confirmed by western blotting with an anti-HA antibody (Roche; 2013819) (Supplementary Fig. S11b). For complementation of ibm2-2 with IBM1, a 7.8-kb piece of the genomic IBM1 DNA fragment was designed to express a protein fused with FLAG, 6xHis and HA epitopes to the N-terminal of IBM1 with the endogenous IBM1 promoter (FLAG–His–HA–gIBM1) (Fig. 1e). The FLAG–His–HA–gIBM1 was able to rescue ibm1 phenotypes. In addition, a FLAG–His–HA–gIBM1 transgene with a 720-bp deletion of the sequence similar to the organelle genome in the seventh intron was constructed (FLAG–His–HA–gIBM1Δi7) (Fig. 1e). These transgenes were cloned into pRI909 vector and transformed into ibm2-2 heterozygous plants.

Immunoprecipitation and ChIP assay

Nuclei of wild-type Col and FLAG–HA–gIBM2 transgenic plants were isolated using a CelLyticTM PN Isolation/Extraction kit (Sigma-Aldrich) according to the manufacturer’s instructions. Nuclei were suspended in buffer A (10 mM HEPES (pH 7.9)/1.5 mM MgCl₂/10 mM KCl/0.5 mM dithiothreitol/0.1 mM phenylmethylsulphonyl fluoride). After centrifugation, nuclei were suspended in buffer C (20 mM HEPES (pH 7.9)/25% (v/v) glycerol/0.42 M NaCl/1.5 mM MgCl₂/0.2 mM EDTA/0.5 mM dithiothreitol/0.1 mM phenylmethylsulphonyl fluoride) with vortexing for 40 min. After centrifugation, supernatant was used as the nuclear extract for further experiments.

For the Pol II pull-down assay, nuclear extract from FLAG–HA–gIBM2 transgenic plants was incubated with either 2 μg of anti-Pol II antibody (Abcam, ab817) or 20 μl of anti-HA affinity matrix (Roche, 1815016). Anti-Pol II antibody complex was further precipitated with 20 μl of Dynabeads M-280 anti-mouse IgG (Invitrogen, 112-01D). Beads were washed and boiled for 10 min in SDS loading buffer, and proteins were subjected to SDS–PAGE, followed by western blotting. Pol II protein and IBM2 protein fused with HA tag were detected with anti-Pol II antibody (Abcam, ab817; 1:500 dilution) and anti-HA antibody (Roche, 2013819; 1:500 dilution), respectively.

The ChIP assay was essentially performed as described¹⁷, with minor modifications. Briefly, 3 g of leaves were fixed with fixation buffer (10 mM Tris-HCl, pH 7.5, 0.1 M sucrose, 1% formaldehyde). Nuclei of wild-type Col and ibm2 plants were isolated using a CelLyticTM PN Isolation/Extraction kit (Sigma-Aldrich) according to the manufacturer’s instructions, and were lysed and sonicated. Chromatin was immunoprecipitated by overnight incubation with 2 μg of anti-Pol II antibody (Abcam, ab817) bound to Dynabeads M-280 anti-mouse IgG (Invitrogen, 112-01D). Purified DNA was analysed by quantitative RT–PCR, using SYBR Premix Ex Taq (Takara). For precipitation of chromatin bound with FLAG–HA–IBM2, nuclei of wild-type Col and ibm2-1 plants complimented with FLAG–HA–gIBM2 were isolated as described above. Sonicated chromatin was precleared with Rat-IgG (Abcam, ab37361) and protein G magnetic beads (Invitrogen, 100-03D), and 2 μg of anti-HA antibody (Roche, 2013819) or 2 μg of control IgG were added for overnight incubation. Antibody complexes were precipitated with protein G magnetic beads and washed. Purified DNA was analysed by quantitative RT–PCR as described above.

Additional information

Accession code: Sequencing data have been deposited in the DDBJ Sequence Read Archive under accession code DRP001015.

How to cite this article: Saze, H. et al. Mechanism for full-length RNA processing of Arabidopsis genes containing intragenic heterochromatin. Nat. Commun. 4:2301 doi: 10.1038/ncomms3301 (2013)

Accession codes

Accessions

Gene Expression Omnibus

GSE10966

Sequence Read Archive

DRP001015

References

Kazazian, H. H. Jr Mobile elements: drivers of genome evolution. Science 303, 1626–1632 (2004).
Article CAS ADS Google Scholar
Law, J. A. & Jacobsen, S. E. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat. Rev. Genet. 11, 204–220 (2010).
Article CAS Google Scholar
Li, J. Moazed, D. & Gygi, S. P. Association of the histone methyltransferase Set2 with RNA polymerase II plays a role in transcription elongation. J. Biol. Chem. 277, 49383–49388 (2002).
Article CAS Google Scholar
Ng, H. H. Robert, F. Young, R. A. & Struhl, K. Targeted recruitment of Set1 histone methylase by elongating Pol II provides a localized mark and memory of recent transcriptional activity. Mol. Cell 11, 709–719 (2003).
Article CAS Google Scholar
Oh, S. Park, S. & van Nocker, S. Genic and global functions for Paf1C in chromatin modification and gene expression in Arabidopsis. PLoS Genet. 4, e1000077 (2008).
Article Google Scholar
Yoder, J. A. Walsh, C. P. & Bestor, T. H. Cytosine methylation and the ecology of intragenomic parasites. Trends Genet. 13, 335–340 (1997).
Article CAS Google Scholar
Sela, N. et al. Comparative analysis of transposed element insertion within human and mouse genomes reveals Alu's unique role in shaping the human transcriptome. Genome Biol. 8, R127 (2007).
Article Google Scholar
Schnable, P. S. et al. The B73 maize genome: complexity, diversity, and dynamics. Science 326, 1112–1115 (2009).
Article CAS ADS Google Scholar
Cordaux, R. & Batzer, M. A. The impact of retrotransposons on human genome evolution. Nat. Rev. Genet. 10, 691–703 (2009).
Article CAS Google Scholar
Druker, R. Bruxner, T. J. Lehrbach, N. J. & Whitelaw, E. Complex patterns of transcription at the insertion site of a retrotransposon in the mouse. Nucleic Acids Res. 32, 5800–5808 (2004).
Article CAS Google Scholar
Li, J. et al. Mouse endogenous retroviruses can trigger premature transcriptional termination at a distance. Genome Res. 22, 870–884 (2012).
Article CAS Google Scholar
Matzke, M. A. & Birchler, J. A. RNAi-mediated pathways in the nucleus. Nat. Rev. Genet. 6, 24–35 (2005).
Article CAS Google Scholar
Bernatavichute, Y. V. Zhang, X. Cokus, S. Pellegrini, M. & Jacobsen, S. E. Genome-wide association of histone H3 lysine nine methylation with CHG DNA methylation in Arabidopsis thaliana. PLoS One 3, e3156 (2008).
Article ADS Google Scholar
Lister, R. et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536 (2008).
Article CAS Google Scholar
Shu, Y. et al. Identification and characterization of a new member of the SINE Au retroposon family (GmAu1) in the soybean, Glycine max (L.) Merr., genome and its potential application. Plant Cell Rep. 30, 2207–2213 (2011).
Article CAS Google Scholar
Lu, C. et al. Miniature inverted-repeat transposable elements (MITEs) have been accumulated through amplification bursts and play important roles in gene expression and species diversity in Oryza sativa. Mol. Biol. Evol. 29, 1005–1017 (2012).
Article CAS Google Scholar
Saze, H. Shiraishi, A. Miura, A. & Kakutani, T. Control of genic DNA methylation by a jmjC domain-containing protein in Arabidopsis thaliana. Science 319, 462–465 (2008).
Article CAS ADS Google Scholar
Miura, A. et al. An Arabidopsis jmjC domain protein protects transcribed genes from DNA methylation at CHG sites. EMBO J. 28, 1078–1086 (2009).
Article CAS Google Scholar
Inagaki, S. et al. Autocatalytic differentiation of epigenetic modifications within the Arabidopsis genome. EMBO J. 29, 3496–3506 (2010).
Article CAS Google Scholar
Rigal, M. Kevei, Z. Pelissier, T. & Mathieu, O. DNA methylation in an intron of the IBM1 histone demethylase gene stabilizes chromatin modification patterns. EMBO J. 31, 2981–2993 (2012).
Article CAS Google Scholar
Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815 (2000).
Buisine, N. Quesneville, H. & Colot, V. Improved detection and annotation of transposable elements in sequenced genomes using multiple reference sequence sets. Genomics 91, 467–475 (2008).
Article CAS Google Scholar
Macknight, R. et al. FCA, a gene controlling flowering time in Arabidopsis, encodes a protein containing RNA-binding domains. Cell 89, 737–745 (1997).
Article CAS Google Scholar
Schomburg, F. M. Patton, D. A. Meinke, D. W. & Amasino, R. M. FPA, a gene involved in floral induction in Arabidopsis, encodes a protein containing RNA-recognition motifs. Plant Cell 13, 1427–1436 (2001).
Article CAS Google Scholar
Hornyik, C. Terzi, L. C. & Simpson, G. G. The spen family protein FPA controls alternative cleavage and polyadenylation of RNA. Dev. Cell 18, 203–213 (2010).
Article CAS Google Scholar
Sonmez, C. et al. RNA 3′ processing functions of Arabidopsis FCA and FPA limit intergenic transcription. Proc. Natl Acad. Sci. USA 108, 8508–8513 (2011).
Article CAS ADS Google Scholar
Onishi, M. Liou, G. G. Buchberger, J. R. Walz, T. & Moazed, D. Role of the conserved Sir3-BAH domain in nucleosome binding and silent chromatin assembly. Mol. Cell 28, 1015–1028 (2007).
Article CAS Google Scholar
Du, J. et al. Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell 151, 167–180 (2012).
Article CAS Google Scholar
Kuo, A. J. et al. The BAH domain of ORC1 links H4K20me2 to DNA replication licensing and Meier-Gorlin syndrome. Nature 484, 115–119 (2012).
Article CAS ADS Google Scholar
Springer, N. M. et al. Characterization of a novel maize retrotransposon family SPRITE that shows high levels of variability among maize inbred lines. Maydica 54, 417–428 (2009).
Google Scholar
Stroud, H. Greenberg, M. V. Feng, S. Bernatavichute, Y. V. & Jacobsen, S. E. Comprehensive analysis of silencing mutants reveals complex regulation of the Arabidopsis methylome. Cell 152, 352–364 (2013).
Article CAS Google Scholar
Varv, S. et al. Acetylation of H3 K56 is required for RNA polymerase II transcript elongation through heterochromatin in yeast. Mol. Cell Biol. 30, 1467–1477 (2010).
Article CAS Google Scholar
Djupedal, I. et al. RNA Pol II subunit Rpb7 promotes centromeric transcription and RNAi-directed chromatin silencing. Genes Dev. 19, 2301–2306 (2005).
Article CAS Google Scholar
Kato, H. et al. RNA polymerase II is required for RNAi-dependent heterochromatin assembly. Science 309, 467–469 (2005).
Article CAS ADS Google Scholar
Zheng, B. et al. Intergenic transcription by RNA polymerase II coordinates Pol IV and Pol V in siRNA-directed transcriptional gene silencing in Arabidopsis. Genes Dev. 23, 2850–2860 (2009).
Article CAS Google Scholar
Earley, K. W. et al. Mechanisms of HDA6-mediated rRNA gene silencing: suppression of intergenic Pol II transcription and differential effects on maintenance versus siRNA-directed cytosine methylation. Genes Dev. 24, 1119–1132 (2010).
Article CAS Google Scholar
Enke, R. A. Dong, Z. & Bender, J. Small RNAs prevent transcription-coupled loss of histone H3 lysine 9 methylation in Arabidopsis thaliana. PLoS Genet. 7, e1002350 (2011).
Article CAS Google Scholar
Kaida, D. et al. U1 snRNP protects pre-mRNAs from premature cleavage and polyadenylation. Nature 468, 664–668 (2010).
Article CAS ADS Google Scholar
McClintock, B. The significance of responses of the genome to challenge. Science 226, 792–801 (1984).
Article CAS ADS Google Scholar
Deleris, A. et al. Loss of the DNA methyltransferase MET1 induces H3K9 hypermethylation at PcG target genes and redistribution of H3K27 trimethylation to transposons in Arabidopsis thaliana. PLoS Genet. 8, e1003062 (2012).
Article CAS Google Scholar
Pecinka, A. et al. Epigenetic regulation of repetitive elements is attenuated by prolonged heat stress in Arabidopsis. Plant Cell 22, 3118–3129 (2010).
Article CAS Google Scholar
Ito, H. et al. An siRNA pathway prevents transgenerational retrotransposition in plants subjected to stress. Nature 472, 115–119 (2011).
Article CAS ADS Google Scholar
McClintock, B. Chromosome organization and genic expression. Cold Spring Harb. Symp. Quant. Biol. 16, 13–47 (1951).
Article CAS Google Scholar
Slotkin, R. K. & Martienssen, R. Transposable elements and the epigenetic regulation of the genome. Nat. Rev. Genet. 8, 272–285 (2007).
Article CAS Google Scholar
Wood, A. J. et al. Regulation of alternative polyadenylation by genomic imprinting. Genes Dev. 22, 1141–1146 (2008).
Article CAS ADS Google Scholar
Chandler, V. L. Paramutation’s properties and puzzles. Science 330, 628–629 (2010).
Article CAS ADS Google Scholar
Settles, A. M. Baron, A. Barkan, A. & Martienssen, R. A. Duplication and suppression of chloroplast protein translocation genes in maize. Genetics 157, 349–360 (2001).
CAS PubMed PubMed Central Google Scholar
Varagona, M. J. Purugganan, M. & Wessler, S. R. Alternative splicing induced by insertion of retrotransposons into the maize waxy gene. Plant Cell 4, 811–820 (1992).
Article CAS Google Scholar
Schwarz-Sommer, Z. Gierl, A. Berndtgen, R. & Saedler, H. Sequence comparison of ‘states’ of a1-m1 suggests a model of Spm (En) action. EMBO J. 4, 2439–2443 (1985).
Article CAS Google Scholar
Vongs, A. Kakutani, T. Martienssen, R. A. & Richards, E. J. Arabidopsis thaliana DNA methylation mutants. Science 260, 1926–1928 (1993).
Article CAS ADS Google Scholar
Bartee, L. Malagnac, F. & Bender, J. Arabidopsis cmt3 chromomethylase mutations block non-CG methylation and silencing of an endogenous gene. Genes Dev. 15, 1753–1758 (2001).
Article CAS Google Scholar
Scholl, R. L. May, S. T. & Ware, D. H. Seed and molecular resources for Arabidopsis. Plant Physiol. 124, 1477–1480 (2000).
Article CAS Google Scholar
Li, Y. Rosso, M. G. Viehoever, P. & Weisshaar, B. GABI-Kat SimpleSearch: an Arabidopsis thaliana T-DNA mutant database with detailed information for confirmed insertions. Nucleic Acids Res. 35, D874–D878 (2007).
Article CAS Google Scholar
Saze, H. & Kakutani, T. Heritable epigenetic mutation of a transposon-flanked Arabidopsis gene due to lack of the chromatin-remodeling factor DDM1. EMBO J. 26, 3641–3652 (2007).
Article CAS Google Scholar
Shiraishi, M. & Hayatsu, H. High-speed conversion of cytosine to uracil in bisulfite genomic sequencing analysis of DNA methylation. DNA Res. 11, 409–415 (2004).
Article CAS Google Scholar
Zemach, A. McDaniel, I. E. Silva, P. & Zilberman, D. Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science 328, 916–919 (2010).
Article CAS ADS Google Scholar
Langmead, B. Trapnell, C. Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article Google Scholar
Chen, P. Y. Cokus, S. J. & Pellegrini, M. BS Seeker: precise mapping for bisulfite sequencing. BMC Bioinformatics 11, 203 (2010).
Article CAS Google Scholar
Trapnell, C. Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Article CAS Google Scholar
Nicol, J. W. Helt, G. A. Blanchard, S. G. Jr Raja, A. & Loraine, A. E. The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets. Bioinformatics 25, 2730–2731 (2009).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Akiko Terui for technical assistance, Yasushi Hiromi, Yoshiki Habu and Atsushi Shimada for critical comments on the manuscript, and Steven D. Aird for editing the manuscript. Jong-Myong Kim and Motoaki Seki provided the ChIP-PCR protocol, and the Arabidopsis Biological Resource Center, GABI-KAT and the European Arabidopsis Stock Centre furnished seed stocks. This work was supported by JST PRESTO program (H.S.).

Author information

Authors and Affiliations

Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna-son, 904-0412, Okinawa, Japan
Hidetoshi Saze, Saori Miura & Yoshiko Harukawa
PRESTO, Japan Science and Technology Agency (JST), Saitama, Japan
Hidetoshi Saze
Department of Integrated Genetics, National Institute of Genetics, Yata1111, Mishima, 411-8540, Shizuoka, Japan
Junko Kitayama, Kazuya Takashima, Tasuku Ito & Tetsuji Kakutani
Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Hongo, Bunkyo-ku, 113-0033, Tokyo, Japan
Tasuku Ito & Tetsuji Kakutani

Authors

Hidetoshi Saze
View author publications
You can also search for this author in PubMed Google Scholar
Junko Kitayama
View author publications
You can also search for this author in PubMed Google Scholar
Kazuya Takashima
View author publications
You can also search for this author in PubMed Google Scholar
Saori Miura
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiko Harukawa
View author publications
You can also search for this author in PubMed Google Scholar
Tasuku Ito
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuji Kakutani
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Experiments were designed by H.S. and T.K., and were performed by H.S., J.K., K.T., S.M. and Y.H. The RNA-seq experiment was conducted by H.S. Bioinformatic analyses of RNA-seq and DNA methylation were performed by T.I. and T.K. The manuscript was prepared by H.S. and T.K.

Corresponding author

Correspondence to Hidetoshi Saze.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Figures, Tables and References

Supplementary Figures S1-S13, Supplementary Tables S1-S3 and Supplementary References (PDF 2359 kb)

Supplementary Dataset 1

List of genes with long introns and the effect of ibm2 mutation on transcription (XLS 40 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Saze, H., Kitayama, J., Takashima, K. et al. Mechanism for full-length RNA processing of Arabidopsis genes containing intragenic heterochromatin. Nat Commun 4, 2301 (2013). https://doi.org/10.1038/ncomms3301

Download citation

Received: 28 January 2013
Accepted: 12 July 2013
Published: 12 August 2013
DOI: https://doi.org/10.1038/ncomms3301

This article is cited by

Small RNAs and Karma methylation in Elaeis guineensis mother palms are linked to high clonal mantling
- Siew-Eng Ooi
- Norashikin Sarpan
- Tatiana V. Tatarinova
Plant Molecular Biology (2023)
DNA methylation-free Arabidopsis reveals crucial roles of DNA methylation in regulating gene expression and development
- Li He
- Huan Huang
- Jian-Kang Zhu
Nature Communications (2022)
Epigenetic regulation of spurious transcription initiation in Arabidopsis
- Ngoc Tu Le
- Yoshiko Harukawa
- Hidetoshi Saze
Nature Communications (2020)
Genomic re-assessment of the transposable element landscape of the potato genome
- Diego Zavallo
- Juan Manuel Crescente
- Sebastian Asurmendi
Plant Cell Reports (2020)
Genome defense against integrated organellar DNA fragments from plastids into plant nuclear genomes through DNA methylation
- Takanori Yoshida
- Hazuka Y. Furihata
- Akira Kawabe
Scientific Reports (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.