Introduction

The CCR4-NOT complex is a large macromolecular assembly involved in many different aspects of mRNA processing in the nucleus and in the cytoplasm, including chromatin modification, transcription elongation, mRNA degradation, miRNA gene silencing, RNA nuclear surveillance and nuclear export1,2. Its vital role in gene expression is reflected in its high conservation across eukaryotes. The CCR4-NOT complex was initially characterized as a global (positive and negative) regulator of transcription3, but its functions are now known to be much broader. CCR4-NOT is a cytoplasmic deadenylase that determines the half-life of mRNA molecules, and is responsible for the distributive shortening of the poly(A) tail before mRNA is degraded by the 3′-5′ or 5′-3′ mRNA decay pathways. Due to its ubiquitinating activity, it also regulates protein solubility and co-translational quality control4,5.

The CCR4-NOT complex consists of eight main subunits. Not1 (negative on TATA 1) is the largest of these (237 kDa) and forms a scaffold for the complex assembly. Ccr4 (76 kDa) and Caf1 (usually termed Pop2 in Schizosaccharomyces pombe; 37.5 kDa) act as deadenylases6. Not4 (54 kDa) is responsible for the second enzyme activity, ubiquitination4. The remaining subunits, Not2, Not3 and Not5 (34, 73 and 66 kDa, respectively), form the so-called Not module, with no clear function yet assigned7. Caf40 (Ccr4-associated factor 40; 32 kDa) was recently reported to have a role in RNA degradation as part of the miRNA machinery8,9. The composition of the complex nonetheless varies between species. For instance, Saccharomyces cerevisiae CCR4-NOT has an additional subunit, Caf130, and its probable functional analogue is reported to be part of the Drosophila (Not10) (ref. 10) and human (Cnot10) (ref. 11) CCR4-NOT complexes. In metazoans, Cnot10 interacts with Cnot11 and Cnot1, forming a separate module of the complex10,11. In S. pombe, a nuclear form of the CCR4-NOT complex interacts with Mmi1, a 54 kDa YTH family RNA-binding protein (A. Siwaszek and A. Dziembowski, in preparation) that is responsible for degradation and silencing of meiotic transcripts during vegetative growth and that also recruits the complex to its target mRNA12.

As a multifunctional assembly, CCR4-NOT interacts with a variety of cofactors that help recruit the complex to its mRNA targets. A CCR4-NOT-mediated mechanism to induce degradation and inhibit translation of the miRNA targets was recently proposed8,9. Cnot9 (the human homologue of yeast Caf40) binds to the central region of human Not1 (Cnot1) and interacts with GW182 (TNRC6), one of the main effectors of the miRNA degradation pathway. The entire CCR4-NOT complex is thus recruited to the mRNA directed to degradation. Moreover, the Cnot1 MIF4G domain interacts with the translation repressor and decapping activator DDX6 (refs 8, 9). CCR4-NOT uses a similar mechanism to interact with different RNA-binding proteins that recruit the complex to the target mRNA; the messenger RNA is degraded in a CCR4-NOT-mediated, deadenylation-dependent manner. The best-characterized of these binding partners are tristetraprolin (TTP)13, Nanos1 (ref. 14) and Tob/BTG15.

CCR4-NOT was recently defined as a molecular linker between the transcription and translation processes16,17. Initial studies showed that CCR4-NOT complex subunits crosslink to genes during transcription18, interact with RNA Pol II, and regulate transcription elongation19. A connection was proposed between CCR4-NOT subunit Not5 and the Rpb4/7 module of RNA Pol II16,17. The physical interaction has now been demonstrated, and seems to be crucial for Rpb4 cytoplasmic function, its association with mRNA and regulation of mRNA degradation and translation. CCR4-NOT was also implicated in rescuing the arrested RNA Pol II elongation complex in an Rpb4/7 heterodimer-dependent manner16. Not5 contributes to the cytoplasmic assembly of RNA Pol II16. Finally, the Not4 ubiquitinating activity widens the spectrum of CCR4-NOT functions in the cell, and includes general and co-translational quality control4,20,21 and proteasome assembly.

The cooperation mechanism among all CCR4-NOT functions remains largely unknown, due in part to the lack of structural information for the whole complex, although data for some of its components are available; most of the currently known structural data have been comprehensively reviewed by Xu et al.22 High-resolution structures have been reported for several CCR4-NOT subunits and/or domains, including the nuclease domain of Ccr4 (ref. 23), and the Not4 RING24 and RNA recognition motif domains. Structures of functional assemblies have been determined, such as that of the nuclease module consisting of Ccr4, Caf1 and a small Not1 fragment that incorporates the nucleases into the complex25, and the Not module (Not2-Not5-Not1) (ref. 7), as have structures of complete subunits such as Pop2/Caf1 (refs 26, 27) and Caf40 (ref. 28). The recent work of Bhaskar et al.29 presented the crystal structure of the yeast Not4 C-terminal fragment bound to the Not1 C-terminal domain, together with the Not4 N-terminal RING domain that interacts with its functional partner, Ubc4. The lack of a high-resolution structure for the entire CCR4-NOT complex is due to the current inability to obtain crystals, mainly because of the large size of the complex, the flexibility of some of its components, and the presence of unstructured regions.

Electron microscopy (EM) offers an alternative approach to solving the structures of complex macromolecular assemblies, but the only structural data for the whole assembly thus far are derived from a low-resolution (30 Å) structure of the S. cerevisiae complex30. Combining medium/low-resolution 3D information from EM of a macromolecular complex with docking of the high-resolution structures of its components is nonetheless a powerful tool that can be used to generate a pseudo-atomic model of the complex under study31,32.

In our analysis, the CCR4-NOT complex from S. pombe was expressed endogenously, purified, and its structure generated using cryo-EM and single-particle 3D reconstruction. The combination of immunomicroscopy and RNA-nanogold labelling techniques, coupled with the pseudo-atomic model of all components of the complex based on available high-resolution structures of subunits and domains, allows us to provide the first full molecular architecture of CCR4-NOT.

Results

Purification of CCR4-NOT and in vitro enzymatic activity

To analyze the structure of the S. pombe CCR4-NOT complex, we devised a purification protocol that combines Protein A tag and IgG affinity chromatography, followed by glycerol gradient centrifugation (Fig. 1a). The purification conditions allowed maintenance of the complex in a near-native state throughout the procedure, and high salt conditions were used to avoid non-specific protein–protein and/or protein–RNA interactions. Mass spectrometry analysis confirmed the presence of all canonical subunits described in other model systems (Not1-4, Ccr4, Caf1, Caf40) and an S. pombe-specific subunit, Mmi1. Some degradation was also visible, in particular of Not1, Not4 and Caf40, together with some minor contaminants.

Figure 1: Purification of the active CCR4-NOT complex from fission yeast S. pombe.
(a) SDS-PAGE analysis of the purified CCR4-NOT complex (left) and the CCR4-NOT complex lacking Mmi1 (Mmi1ΔCCR4-NOT complex; right). (b) In vitro deadenylation assay. CCR4-NOT deadenylation was assessed using polyadenylated RNA substrate (34 nucleotides) and the purified CCR4-NOT complex. Samples were collected at 0, 15, 30 and 60 min. The control reaction had no added protein.

To test the function of the purified CCR4-NOT complex, we assayed one of its enzymatic activities, the deadenylation of polyadenylated mRNA substrates by the two active exoribonuclease subunits, Ccr4 and Caf1. We observed gradual degradation of the polyadenylated RNA substrate (Fig. 1b), characteristic of distributive enzymes. The assay confirmed conservation of enzyme activity in the purified complex, which indicates that the complex retains its native state, essential for reliable structural analysis.

Structural characterization of CCR4-NOT by cryo-EM

The glycerol gradient fraction with the highest CCR4-NOT concentration and purity was analyzed by EM, initially using negative staining. A total of 10,303 particles were selected and classified using ML2D and CL2D methods33,34 (Supplementary Fig. 1a,b). The classes selected were used for 3D reconstruction (see Online Methods). The final structure obtained has a curved L-like shape, with one arm 150 Å long and another slightly shorter (140 Å), and a cavity between these two masses (Fig. 2a). The overall shape of the complex is similar to that obtained for S. cerevisiae30, which indicates a common structure for these complexes.

Figure 2: Architecture of the CCR4-NOT complex from S. pombe.
(a) Four orthogonal views of the CCR4-NOT 3D reconstruction by negative staining EM (23 Å resolution). (b) The views as in a of the complex by cryo-EM (20 Å resolution). Scale bar, 50 Å.

To generate a more faithful, detailed structure of CCR4-NOT, we analyzed the sample using cryo-EM. The standard purification procedure imposed certain limitations such as low concentration as well as glycerol in the sample that could not be removed completely, which reduced contrast in the cryo-EM analysis. To stabilize the complex and increase sample homogeneity, we applied the GraFix method, a gradient technique for purifying and stabilizing macromolecular complexes at low glutaraldehyde concentration, particularly suitable for EM35. The stabilized sample was concentrated by centrifugation using Amicon centrifuge membranes and subsequently vitrified (Supplementary Fig. 1c). A total of 60,480 particles were selected manually and classified using a CL2D procedure33. The 2D classes generated showed the typical orientations observed in negative staining EM analysis (Supplementary Fig. 1d and Fig. 2a). A subset of 20,500 best-quality particles was selected for 3D reconstruction (for the protocol, see Online Methods and Supplementary Figs 2 and 3). The volume generated (20 Å resolution; Fig. 2b) is similar in structure and dimensions to that obtained using a heavy metal salt, except that it shows more detailed features and a cavity in the centre of the shorter arm.

Mapping of Mmi1 into the CCR4-NOT complex

In addition to the canonical subunits described for the complex in other organisms11,36,37, the S. pombe CCR4-NOT complex contains the Mmi1 nuclear RNA-binding protein. This protein is responsible for removing meiosis-specific transcripts during vegetative growth38. It recognizes the hexanucleotide motif U(U/C)AAAC, called the determinant of selective removal (DSR), which is located in the 3′-untranslated region of meiosis-specific mRNA38. After binding, the transcript is degraded. Mmi1 is a stable component of the S. pombe CCR4-NOT complex, although the biological relevance of this interaction remains unclear.

To locate the position of Mmi1 in the 3D reconstruction of the complex, we carried out EM analysis of a CCR4-NOT complex lacking Mmi1 (ΔMmi1CCR4-NOT). The Mmi1 gene is essential, which precludes simple knockout strategies; we therefore used a subtraction strategy to purify the CCR4-NOT complex without Mmi1. We constructed an S. pombe strain in which the Not2 subunit was fused to protein A with a TEV protease cleavage site, and Mmi1 was fused to a protein A tag lacking the TEV protease cleavage site. This strategy allowed purification of CCR4-NOT complex lacking Mmi1, since CCR4-NOT is released after TEV cleavage, while Mmi1 remains attached to resin beads. The purified complex was analyzed by SDS-polyacrylamide gel electrophoresis (PAGE), and the lack of Mmi1 protein was confirmed by the absence of a band migrating at 55 kDa (Fig. 1a). Negatively stained micrographs of the complex showed a homogeneous population of particles. A total of 10,331 particles were selected, aligned and classified. The 2D classification showed a set of different views comparable to the classes obtained for the entire CCR4-NOT complex. The 3D reconstruction procedure rendered a volume very similar to that of wild-type CCR4-NOT (Fig. 3a,b). There was nonetheless a clear difference between the two volumes; a small channel in the shorter mass of the ΔMmi1CCR4-NOT complex was linked to the absence of Mmi1, and was observed clearly when the two volumes were superimposed (Fig. 3b). To confirm Mmi1 localization, we labelled CCR4-NOT with an RNA oligonucleotide consisting of four tandem DSR motifs fused to Nanogold (termed oligo-DSR). Negative staining EM analysis of the oligo-DSR-Nanogold-bound complex showed considerable heterogeneity, probably due to the presence of RNA. A total of 8,500 Nanogold-bound particles (visible as black dots, due to high electron scattering of the Nanogold particles) were selected manually and classified by CL2D and ML2D.
To determine Mmi1 location based on the position of Nanogold in the particles, we selected the characteristic L-shape view from the 2D classes of the CCR4-NOT complex, which we used as a reference for alignment of the Nanogold-bound particles. The 2D average obtained shows the position of the Nanogold and thus, indirectly, that of Mmi1 (Fig. 3c) in the centre of the shorter arm, in accordance with the result of the 3D reconstruction of ΔMmi1CCR4-NOT.
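The comparison of the two reconstructions can be illustrated with a simple difference map. The following is a minimal sketch, not the procedure used in this work: it assumes that both volumes are already aligned and sampled on the same grid, and the function names and toy volumes are hypothetical.

```python
import numpy as np


def normalize(volume):
    """Rescale a density volume to zero mean and unit standard deviation."""
    volume = np.asarray(volume, dtype=float)
    std = volume.std()
    if std == 0:
        raise ValueError("volume has no density variation")
    return (volume - volume.mean()) / std


def difference_peak(full_map, deletion_map):
    """Return (difference map, voxel index of its largest positive value).

    Assumes both maps are already aligned and sampled on the same grid.
    """
    full_map = np.asarray(full_map, dtype=float)
    deletion_map = np.asarray(deletion_map, dtype=float)
    if full_map.shape != deletion_map.shape:
        raise ValueError("maps must share the same grid")
    diff = normalize(full_map) - normalize(deletion_map)
    peak = tuple(int(i) for i in np.unravel_index(np.argmax(diff), diff.shape))
    return diff, peak


# Toy data (hypothetical): a cube stands in for the complex without Mmi1,
# and a small extra block stands in for the missing subunit.
deletion = np.zeros((16, 16, 16))
deletion[4:12, 4:12, 4:12] = 1.0
full = deletion.copy()
full[2:4, 7:9, 7:9] = 1.0

diff_map, peak_voxel = difference_peak(full, deletion)
print(peak_voxel)  # a voxel inside the extra block
```

With real reconstructions, the maps would first need to be aligned and low-pass filtered to a common resolution before subtraction; those steps are omitted here.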

Figure 3: Location of the Mmi1 subunit in the CCR4-NOT complex.
(a) Three orthogonal views of the 3D reconstruction of the Mmi1ΔCCR4-NOT complex (20 Å resolution). (b) Superimposition of the 3D reconstruction of Mmi1ΔCCR4-NOT (pink, solid volume) on the CCR4-NOT complex (transparent, grey volume). Comparison of the two reconstructions shows an additional mass in the CCR4-NOT complex (black arrow). (c) 2D classes representing the front view of the CCR4-NOT complex. Top, 2D class of the complex used as a reference for particle alignment. Bottom, 2D class of the complex incubated with the nanogold-labelled Mmi1-binding RNA. Black dot, a nanogold particle visualized by EM. Red arrow, position of the Mmi1 subunit. Scale bar, 50 Å.

Localization of the CCR4-NOT subunits by immunomicroscopy

We then sought to locate the CCR4-NOT complex subunits. Since the low-resolution 3D model of the entire complex does not allow their direct localization, we used immuno-EM to locate each subunit within the 3D reconstruction of CCR4-NOT. The complex was purified from several yeast strains (Supplementary Table 1), in which Not2 was fused to an affinity tag, and green fluorescent protein (GFP) to the C terminus of the distinct target subunits. The growth rate of the GFP-fusion S. pombe strains was comparable to that of wild type. We confirmed the correct translational fusion and expression of modified complex subunits using western blot analysis with an anti-GFP antibody. After the first purification step, the complexes were incubated with the anti-GFP monoclonal antibody or Fab fragment and immunocomplexes were loaded onto a glycerol gradient to separate the immunocomplex from unbound antibody and impurities. Glycerol gradient fractions were analyzed by SDS-PAGE. All eight CCR4-NOT complex subunits were found in the middle fractions (Supplementary Fig. 4), with two additional bands that migrated at 50 and 25 kDa, corresponding to antibody heavy and light chains. Immunocomplex migration within the density gradient was comparable to that of wild-type CCR4-NOT, which confirms correct assembly despite the presence of the GFP tag and bound antibody.

The glycerol gradient fraction containing the immunocomplexes was analyzed by negative staining EM. For each immunocomplex, >10,000 particles were selected manually (see Supplementary Table 2) and subjected to KerDenSom39, which classifies prealigned particles by focusing mainly on a specific region of the image, defined by a predesigned mask. Particles from the selected classes were extracted and aligned using a 2D average of the CCR4-NOT representing the typical L-shaped front view (Fig. 4a,b) and classified by KerDenSom. Application of specific masks that focused on different areas of the 2D image identified protrusions in different parts when compared with the 2D average of the complex without antibody (Fig. 4b). Using this protocol, we located nucleases Ccr4 and Caf1 in the central part of the structure and in the shorter arm of the complex, respectively (Fig. 4c,d). They are near one another, which coincides with the high-resolution structure of the nuclease module25. The C terminus of the large scaffolding Not1 was found in the middle part of the front view (Fig. 4e), in a position similar to that of Ccr4, which suggests that the C-terminal ends of both subunits are located in a similar region of CCR4-NOT. In the case of Not3, the antibody binds at the top of the longer arm (Fig. 4f), opposite Caf1 and the rest of the nuclease module. As shown in the atomic structure, Not5 (a Not3 orthologue40) interacts with Not2 and with the Not1 C terminus7, which suggests that Not2 and the whole Not module are located in the long arm of the L-shaped complex. Antibody localization of Caf40 in CCR4-NOT (Fig. 4g) indicates the upper region of the long arm, near Not2 and Not3. Finally, Not4 was found in the central part of the structure, opposite the Not1 and Ccr4 C termini (Fig. 4h).

Figure 4: Immunolocalization of the CCR4-NOT complex subunits.
(a) Front view of the CCR4-NOT 3D reconstruction. (b) 2D class average used as a reference for alignment of the selected particles with bound antibody. (c–h) The same views of the immunocomplexes formed by CCR4-NOT subunits and antibody. The red arrow indicates the additional density in each immunocomplex, which corresponds to the antibody or Fab fragment bound.

Structural characterization of the Not2-Not5 heterodimer

In S. pombe, man, Drosophila and Trypanosoma, a single gene encodes Not3 protein. It is an orthologue of the S. cerevisiae Not3 and Not5 genes, which are products of a gene duplication. The proteins share a homologous C-terminal domain, the NOT-box, whereas their N termini are very different40. The baker’s yeast CCR4-NOT Not2 (23 kDa) and Not5 (73 kDa) subunits form a stable heterodimer (Not2-Not5 in S. cerevisiae; Not2-Not3 in S. pombe, man, Drosophila and Trypanosoma). The S. cerevisiae dimer was overexpressed in Escherichia coli and purified, and complex formation was confirmed by size-exclusion chromatography. Negative staining EM analysis of the Not2-Not5 dimer showed a population of particles homogeneous in size and shape (Supplementary Fig. 5a). A total of 20,055 particles were then selected and classified by CL2D and ML2D methods. The 2D classes, which confirmed sample homogeneity and showed general features of the complex (Supplementary Fig. 5b), were used for 3D reconstruction. The 3D model showed two connected masses, one 110 Å long and the other 70 Å long (Fig. 5a). The central part of the volume encompasses a spacious cavity, and this space could be occupied by the C-terminal part of Not1, in accordance with the crystal structure of the yeast Not1-Not2-Not5 module7.

Figure 5: 3D reconstruction of the yeast Not2-Not5 heterodimer.
(a) Four orthogonal views of the 3D reconstruction of the Not2-Not5 heterodimer (16 Å resolution). (b) Manual docking of the 3D reconstruction of Not2-Not5 (yellow) within the CCR4-NOT cryo-EM 3D reconstruction (purple). Scale bar, 50 Å.

The immunolocalization experiments indicated the position of the S. pombe Not3 subunit (Fig. 4f), which is an orthologue of S. cerevisiae Not5, a component of the reconstructed Not2-Not5 heterodimer. We used this information to locate the heterodimer in the cryo-EM 3D reconstruction of CCR4-NOT, which was further improved by manual fitting using Chimera software (Fig. 5b). The suggested Not2-Not5 position occupies most of the CCR4-NOT long arm, leaving the shorter arm for the nuclease module.

Pseudo-atomic model building of the CCR4-NOT complex

To obtain a structural model of the entire CCR4-NOT complex, we applied a computational modelling approach. First, we built homology models of all the CCR4-NOT components based on the S. pombe sequences and available high-resolution structures of orthologous proteins (Supplementary Fig. 6) and domains. We then docked all experimentally determined or modelled structures of all components into the electron density map of the complex using Situs41 and ADP_EM42 software. This docking confirmed the positions of Ccr4-Caf1-Not1, Caf40-Not1, Not2-Not3-Not1 and Not4-Not1 modules in the volume identified in the immunolabeling experiments, but did not allow us to build a model of the entire complex. We therefore applied a hybrid modelling approach using PyRy3D software (http://genesilico.pl/pyry3d/), which allows simultaneous docking of structures into the density map using spatial restraints and by modelling disordered and flexible regions during complex assembly. Spatial restraints of the complex subunits were derived from the EM analysis, supported by information on binary interactions between CCR4-NOT components7,9,25 (see the Methods section for details). Finally, we provided Situs-generated positions of Ccr4-Caf1-Not1, Caf40-Not1, Not2-Not3-Not1 and Not4-Not1 modules as starting positions for PyRy3D, and subsequently applied Monte Carlo simulations to optimize these positions in the EM map and to determine orientations of the remaining proteins (Mmi1, part of Not1 residues 1,500–1,850). We clustered the 100 best-scored models from 1,000 independent PyRy3D runs and filtered them based on the ability to fulfil known interactions not included in the initial spatial restraints. In this way, we obtained a cluster of 12 models, from which we selected the representative model with the highest PyRy3D score. This model (Fig. 6a) snugly occupies the available volume of the density map (the cross-correlation coefficient was equal to 0.81).
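The cross-correlation coefficient quoted above measures how well the density simulated from a model matches the experimental map. The sketch below shows one common, mean-subtracted definition of this coefficient; it is an illustration under stated assumptions and is not the scoring function of PyRy3D, Situs or ADP_EM. The function name and the random toy maps are hypothetical.

```python
import numpy as np


def cross_correlation(map_a, map_b):
    """Mean-subtracted cross-correlation coefficient between two density grids."""
    a = np.asarray(map_a, dtype=float).ravel()
    b = np.asarray(map_b, dtype=float).ravel()
    if a.shape != b.shape:
        raise ValueError("maps must be sampled on the same grid")
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    if denom == 0:
        raise ValueError("a constant map has no defined correlation")
    return float((a * b).sum() / denom)


# Toy data (hypothetical): score two candidate model densities against a
# target map and keep the better one.
rng = np.random.default_rng(0)
target = rng.random((8, 8, 8))
candidates = {
    "good_fit": target + 0.1 * rng.random((8, 8, 8)),  # target plus mild noise
    "poor_fit": rng.random((8, 8, 8)),                 # unrelated density
}
scores = {name: cross_correlation(target, density)
          for name, density in candidates.items()}
best = max(scores, key=scores.get)
print(best)
```

The coefficient lies between -1 and 1 and is insensitive to linear rescaling of either map, which is why it is convenient for comparing a simulated density with an experimental reconstruction.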

Figure 6: Functional implications of CCR4-NOT subunit organization.
(a) Pseudo-atomic model generated after model building. (Highly disordered regions were removed from the model, but their location for each subunit is suggested by the colour codes in b and c.) (b) Map of the CCR4-NOT complex subunits. Expanded coloured regions of Not2, Not3, Not4 and Mmi1 represent the disordered fragments. (c) Left, binding sites of the known CCR4-NOT-interacting partners Tob/BTG (which binds to Caf1) and Nanos, TTP and DDX6 (which bind to different parts of Not1). The Caf1 deadenylase active site is indicated in yellow and Caf40 W-binding pockets in pink. Centre, proposed mechanism of miRNA substrate degradation. Caf40 recruits RITS complex through GW182 binding; the mRNA 5′ end interacts with DDX6, which enhances its decapping activity, while the 3′ end is deadenylated by Caf1. Right, mechanism for degradation of mRNA recruited by the mRNA-binding proteins Nanos and TTP. These proteins bind to the 3′-UTR of mRNA, which they recruit to the CCR4-NOT complex.

Discussion

We carried out a comprehensive structural characterization of the S. pombe CCR4-NOT complex, a macromolecular assembly with no symmetry, relatively low stability and probable high flexibility. Despite these limitations, we obtained a medium-resolution (20 Å) structure of the entire complex, which has not yet been obtained using high-resolution techniques such as X-ray crystallography. We located all individual subunits using a variety of methods, from which we built a pseudo-atomic model of the entire assembly in accordance with previous biochemical and structural data; this included the proposed location for nuclease active sites and the binding surfaces of known interacting partners. Our model supports previous functional information and provides new structural data that could be useful for understanding the mechanism of action of the CCR4-NOT complex.

Not1 is a large protein that acts as a scaffold for the complex and spans its entire volume (Fig. 6 and Supplementary Fig. 7). The N-terminal alpha-helical segment (residues 36–815), located in the shorter arm of the 3D reconstruction, forms a platform accessible for protein–protein interactions. The central part of S. cerevisiae Not1 interacts with the nuclease module (residues 754–1,000, called the MIF4G domain) and with Caf40 (residues 1,071–1,282, Caf40-binding domain)8,9. Immunolocalization placed Caf40 in the upper region of the long arm of the L-shaped complex, near the Not2 and Not3 subunits. Azzouz et al.43 showed that Caf40 and Not5 (the yeast Not3 orthologue) are involved in nuclear RNA quality control by physical and functional connection to the nuclear exosome and the TRAMP complex; the proximity of the subunits reported here further supports these findings. A role was recently reported for human Caf40 in recruitment of GW182 to the miRNA silencing machinery8. The proximity of Caf40 to Caf1 deadenylase (50 Å from the Caf1 active site) bears out recent experiments showing that Caf40 recruits GW182 protein and the miRNA targets to CCR4-NOT to initiate their degradation through deadenylation8,9. The Not1 Caf1-binding MIF4G domain can interact simultaneously with DDX6, a DEAD-box helicase that further recruits the decapping factors and represses translation8,9. The known crystal structure of Not1-DDX6 was superimposed on the pseudo-atomic model of the CCR4-NOT complex (Supplementary Fig. 8). DDX6 and Caf1 were positioned on opposite sides of the Not1 MIF4G domain. This spatial separation provides a suitable platform for binding the mRNA molecules, whose 3′ poly(A) tail is degraded by the Caf1 nuclease, while the 5′ cap-containing end is bound by DDX6 and possibly by other decapping factors (Fig. 6b,c).
Deadenylation is thought to occur first, followed by decapping44; the structural information reported here suggests that these processes could be simultaneous, in agreement with co-translational decapping.

The molecular architecture of the CCR4-NOT complex suggests that the nuclease module is located near the S. pombe-specific subunit Mmi1. Although Mmi1 function in the regulation of S. pombe meiosis is well established, the reason it is incorporated into CCR4-NOT and its functional implications are less clear. The fact that Mmi1 is a CCR4-NOT component suggests a role for the complex in the decay of Mmi1-bound meiotic mRNA. The electrostatic potential calculation showed a positively charged patch in the region where Mmi1 is located; in accordance with the general Mmi1 function in the elimination of meiosis-specific mRNA45, we pinpointed this as a possible mRNA-binding region (Supplementary Fig. 9). Mmi1 is also involved in facultative heterochromatin formation mediated by the RITS (RNA-induced transcriptional silencing) complex46,47,48 and, interestingly, Mmi1 association with the CCR4-NOT complex is essential for this process12. Although no GW182 homologue has been characterized in S. pombe, another GW repeat-containing protein, Tas3, is involved in RITS formation49. Caf40 proximity to the Not module thus suggests that, in analogy to D. melanogaster, the interaction of GW repeat-containing protein in S. pombe includes additional CCR4-NOT complex components. RITS is composed of Ago1 (which binds siRNA), the chromodomain protein Chp1 and Tas3. Mmi1 guides RNAi to specific meiotic mRNAs and genes47, although direct physical interaction between RITS components and Mmi1 has not been shown. It is thus possible that CCR4-NOT complex components such as Caf40 or Not2 mediate RITS recruitment to the meiotic genes. Moreover, RITS recognizes methylation on lysine 9 of histone 3 (ref. 47), which provides a link to the second enzymatic activity of the complex, ubiquitination. Not4 (an E3 ubiquitin ligase), which in our structural model is located near Mmi1 and Caf40 (Fig. 6), regulates histone modifications indirectly through degradation of the histone demethylase Jhd2 (ref. 50).
This spatial proximity explains the finding that Not4 is indeed involved in heterochromatin formation12. ChIP-seq experiments showed a striking reduction in H3K9 dimethylation in the heterochromatin regions of not4 mutants12. The organization of CCR4-NOT complex subunits illustrates the cooperation between the CCR4-NOT complex, Mmi1 and RITS in S. pombe sexual development.

In man, the central part of Not1 (residues 820–999) binds TTP, which in turn binds the AU-rich region in the 3′-untranslated region of the target mRNA, leading to its deadenylation and decay13. Our structural information aids understanding of the underlying mechanisms of this process (Fig. 6c). In this model, the mRNA molecule is bound to TTP and is thus positioned near the active site of the Caf1 nuclease, which mediates deadenylation of the mRNA. It is tempting to speculate that DDX6 helicase, located nearby, binds simultaneously to the 5′ end of the mRNA, represses its translation, and induces decapping and further degradation.

The HEAT domain in the scaffolding Not1 C terminus binds cooperatively to the NOT boxes located in the two Not2-Not5 dimer proteins. The entire module is located in the long arm of the L-shaped complex (Figs 5 and 6). The NOT boxes occupy the space at the top of the volume, and the remainder of Not3 stretches downwards at the back of the complex, and with Not1, forms an accessible binding platform. Analysis of CCR4-NOT electrostatic potential (Supplementary Fig. 9) and previous biochemical data7,51 suggest RNA-binding properties of the Not module, which might be responsible for recruitment of the complex to mRNA targets. Not4, which is involved in protein quality control, interacts with the C terminus of the yeast Not1 (refs 6, 29). The binding region shows an extensive interaction network between Not4 and Not1, which is independent of the Not module29. In our pseudo-atomic model, Not4 was positioned at the central part and accessible back side of the volume, between the nuclease and Not modules. The relative proximity of Not4 to the nuclease module of the complex on one hand contradicts the finding that Not4 enzymatic activity is independent of deadenylation5. On the other hand, it supports recent results showing that in humans, the E3 ligase MEX-3C regulates Cnot7 deadenylation activity by ubiquitination, but not by degradation52. By analogy, yeast Not4 might also have a role in deadenylase activation in the CCR4-NOT complex.

Finally, based on this pseudo-atomic model, the Not1 segment whose structure and function have not yet been characterized (amino acids 1,500–1,850), is located at the back of the 3D reconstruction. In this position, an open, accessible surface is formed for potential protein–protein interactions as well as a binding site for the regulatory factors.

In summary, we propose the molecular architecture of the CCR4-NOT complex and outline possible mechanistic aspects that could assist functional studies that will improve understanding of the machinery that controls so vital a process as mRNA processing within the cell.

Methods

Yeast strains

The S. pombe strains used are listed in Supplementary Table 1. The transformation procedure followed was as described53. A pku80Δ strain was used for transformation to minimize non-homologous recombination53. To generate spores, strains were crossed on EMM-N plates, resuspended in sterile water, incubated (30 min, 55 °C) and streaked on selective media54.

Purification of the CCR4-NOT complex

CCR4-NOT was purified in two steps. The first used a modified tandem affinity purification procedure55,56 with Protein A as an affinity tag and the TEV protease cleavage site (ProtA-TEV-Not2) fused to the bait protein (Not2 subunit C terminus). S. pombe cells were cultured in 4 l of 4× YE medium to A600=8. Cells were collected by centrifugation (2,400g, 4 min, 4 °C), washed with water, the pellet resuspended (100 g 30 ml−1) in storage buffer (50 mM HEPES, pH 8.0, 1 mM dithiothreitol), frozen by plunging into liquid nitrogen and stored at −80 °C. To extract proteins, frozen yeast cells were disrupted mechanically in a laboratory blender cooled with dry ice to avoid sample melting. Cells were thawed on ice in lysis buffer (1 mM dithiothreitol, 40 mM HEPES, pH 8.0, 1 M NaCl, two tablets EDTA-free complete protease inhibitor cocktail, 2 mM phenylmethanesulfonylfluoride; 50 ml 100 g−1). The homogenate was clarified by centrifugation (48,000g, 20 min, 4 °C) and the supernatant centrifuged (131,000g, 40 min, 4 °C). The extract was dialyzed (3 h, 4 °C) against buffer D (1 mM dithiothreitol, 40 mM HEPES, pH 8.0, 0.5 M NaCl, 1 mM phenylmethanesulfonylfluoride, 20% glycerol) using 16 mm dialysis tubes (Merck). Dialyzed protein extract was incubated (overnight, 4 °C) with 1 ml IgG beads (IgG Sepharose™ 6 Fast Flow, GE Healthcare), then centrifuged (2,400g, 4 min, 4 °C). Beads were transferred into a poly-prep chromatography column (Bio-Rad), washed with 30 ml IPP500 buffer (500 mM NaCl, 10 mM Tris-HCl, pH 8.0, 0.1% Triton X-100), followed by washing with 20 ml TEV protease buffer (500 mM NaCl, 10 mM Tris-HCl, pH 8.0, 1 mM dithiothreitol, 0.5 mM EDTA). CCR4-NOT complex-bound beads were incubated with 60 μl TEV protease (5 mg ml−1) in 1 ml TEV buffer (2.5 h, 18 °C). The complex was eluted with 0.8 ml TEV buffer.
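The buffer volumes in this protocol are given per 100 g of cell pellet, so they scale linearly with the harvest. The short sketch below works through that arithmetic. It assumes that "100 g 30 ml−1" means 30 ml of storage buffer per 100 g of pellet and that "50 ml 100 g−1" means 50 ml of lysis buffer per 100 g; the 250 g pellet mass and the function name are hypothetical.

```python
def buffer_volume_ml(pellet_g, ml_per_100_g):
    """Scale a buffer volume quoted per 100 g of wet cell pellet."""
    if pellet_g < 0 or ml_per_100_g < 0:
        raise ValueError("mass and ratio must be non-negative")
    return pellet_g * ml_per_100_g / 100.0


# Ratios as read from the protocol (see the assumptions stated above).
STORAGE_ML_PER_100_G = 30.0
LYSIS_ML_PER_100_G = 50.0

pellet_g = 250.0  # hypothetical wet pellet mass
storage_ml = buffer_volume_ml(pellet_g, STORAGE_ML_PER_100_G)
lysis_ml = buffer_volume_ml(pellet_g, LYSIS_ML_PER_100_G)
print(storage_ml, lysis_ml)  # 75.0 125.0
```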

The TEV eluate was loaded onto a 10–30% glycerol gradient (10 mM HEPES, pH 8.0, 500 mM NaCl) and centrifuged (274,000g, 18 h, 4 °C). Gradient fractions were collected starting from the top (300 μl per fraction), after which 150 μl of each fraction was mixed with the same volume of the neighbouring fraction (total 300 μl), precipitated with trichloroacetic acid, resuspended in 20 μl 5× Laemmli loading buffer (50 mM Tris-HCl, pH 7.6, 2 mM EDTA, 2% (w/v) SDS, 25% glycerol, 0.01% (w/v) bromophenol blue, 5% (v/v) β-mercaptoethanol, in H2O) and analyzed by 10% SDS-PAGE. For cryo-EM analysis, the sample was loaded onto a glycerol gradient as described, except that the 30% solution contained 0.15% glutaraldehyde, according to the GraFix procedure35.

Expression and purification of the Not2-Not5 dimer

S. cerevisiae Not2 was cloned into a pET28-derived vector carrying a kanamycin resistance marker as an N-terminal HIS-SUMO tag fusion protein cleavable with the SUMO-specific protease Ulp1. S. cerevisiae Not5 was cloned into a pET28-derived vector with an ampicillin resistance marker as an N-terminal Strep-tag fusion protein. Proteins were coexpressed in E. coli BL21-RIL in Super Broth Auto Induction Media (Formedium, Norfolk, UK; 48 h, 18 °C). Cells were resuspended in lysis buffer (150 mM NaCl, 50 mM Tris, pH 8.0, 10 mM β-mercaptoethanol, 10 mM imidazole, 300 mM urea, 50 mM arginine, 50 mM glutamine) supplemented with protease inhibitors, and lysed with lysozyme in an EmulsiFlex-C3 homogenizer (Avestin Europe GmbH, Mannheim, Germany). The Not2-Not5 heterodimer was purified on an ÄktaXpress FPLC system (GE Healthcare). The following protocol was used for automated purification: (i) nickel affinity chromatography with on-column SUMO protease cleavage (buffer: 150 mM NaCl, 10 mM Tris, pH 8.0, 10 mM β-mercaptoethanol, 10 mM or 600 mM imidazole), (ii) desalting followed by a second round of nickel affinity chromatography, (iii) ion exchange chromatography (RESOURCE Q column, buffer: 100–1,000 mM NaCl gradient, 10 mM Tris, pH 8.0).

In vitro deadenylation assay

A 40-nt RNA oligonucleotide with a poly(A) tail (5′-cgacgauugcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa-3′), 32P-labelled on the 5′ end, was used as a substrate in the in vitro deadenylation reaction. The reaction was performed with 10 pmol RNA substrate and 1.5 pmol (0.1 μg) purified CCR4-NOT complex in reaction buffer (10 mM Tris-Cl pH 8.0, 10 mM dithiothreitol, 50 mM KCl, 5 mM MgCl2, 5 mM MnCl2) and incubated at 30 °C. Samples were collected at 0, 15, 30 and 60 min. To terminate the reaction, 5 μl loading buffer was added (7 M urea, 0.1% bromophenol blue in TBE). Reaction products were analyzed by denaturing electrophoresis in a 19% urea-acrylamide gel (1 h, 300 V), and visualized with a phosphorimager (Bio-Rad).

Electron microscopy

Sample preparation and data collection. For negative staining experiments, 5 μl aliquots of samples were applied to 400 mesh grids (Maxtaform Cu/Rh HR26) coated with a thin (8 nm) carbon layer and glow-discharged for 20 s. The grids were then stained (2 min) with 2% uranyl acetate and air-dried before transmission EM analysis. Images were taken using a JEM 1200 electron microscope (JEOL) operated at 100 kV in low-dose conditions (10 e/Å2) at × 60,000 magnification and recorded on Kodak-electron SO-163 film. Micrographs were digitized using a Photoscan TD Zeiss-Intergraph scanner (pixel size 2.33 Å/px) or a Super CoolScan 9000 ED (Nikon; pixel size 2.12 Å/px). Alternatively, images were taken with a JEOL 1010 JEM electron microscope operating at 80 kV with a CCD camera (4k × 4k TemCam-F416, TVIPS). Images were recorded at a sampling rate of 2.97 Å/px.

For cryo-EM sample preparation, we used Quantifoil R 1.2/1.3 300 mesh grids (ref. Q09684) covered with a thin carbon layer (4 nm) and glow-discharged for 20 s. Aliquots (5 μl) of purified concentrated CCR4-NOT were incubated (2–5 min) with the grid, blotted and plunged into a liquid ethane chamber. All operations were performed on a Leica CPC manual plunger. Cryo-EM samples were analyzed with a Tecnai F20 transmission EM (200 kV). Images were acquired under low-dose conditions (10 e Å−2) at × 62,000 magnification and defocus values ranging from −2 to −4 μm. A 4k × 4k FEI Eagle CCD camera was used for image recording with a sampling rate of 1.78 Å px−1.
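As a quick check on the sampling rates quoted above, the finest spacing a digitized image can represent is twice its pixel size (the Nyquist limit). The short Python sketch below tabulates that limit for each detector; the function name and dictionary labels are illustrative and do not come from the original protocol.

```python
def nyquist_limit(pixel_size):
    """Smallest representable spacing (Å) for a given pixel size (Å per pixel)."""
    return 2.0 * pixel_size


# Pixel sizes quoted in the text (Å per pixel); the labels are illustrative.
PIXEL_SIZES = {
    "Photoscan TD scanner": 2.33,
    "Super CoolScan 9000 ED": 2.12,
    "TemCam-F416 CCD": 2.97,
    "FEI Eagle CCD (cryo-EM)": 1.78,
}

for detector, size in PIXEL_SIZES.items():
    print(f"{detector}: {size:.2f} Å/px, Nyquist limit {nyquist_limit(size):.2f} Å")
```

Every limit computed this way is below 6 Å, so pixel sampling is not what restricts the level of detail at the 20 Å map resolution used later for docking.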

Image processing, particle selection and 2D classification. In all cases, the CTF (contrast transfer function) was corrected using the CTFFIND3 program57, which also calculates potential astigmatism. Micrographs with visible drift and astigmatism were discarded. Single particles were selected manually, extracted from micrographs and normalized using the XMIPP software package58. Three types of algorithms implemented in XMIPP were used to classify single images: ML2D (ref. 34), CL2D (ref. 33) and, for immune complexes, KerDenSom39.

3D reconstruction. Several initial models were tested in the first step of the 3D reconstruction procedure using EMAN software59: artificial noise, blob, a model created by a ‘common lines’ algorithm based on previously obtained 2D classes and, for 3D reconstruction of the CCR4-NOT complex based on cryo-EM data and ‘Mmi1-delta’ data, a negative staining 3D volume low-pass filtered to 40 Å resolution. Refinement was performed until the 3D reconstructions from these initial models converged to stable, similar 3D volumes (Supplementary Fig. 3). To obtain more structural detail, the 3D reconstruction from EMAN refinement was subjected to Projection Matching using XMIPP. Resolution of the final 3D models was estimated based on the FSC criterion (Fourier shell correlation60). The spatial frequency at 0.5 correlation was taken as the resolution of the model (Supplementary Fig. 2). Visualization of the 3D models and docking of the atomic structures into EM volumes were performed manually using UCSF Chimera61.
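The FSC 0.5 criterion can be illustrated with a minimal NumPy sketch. It assumes two independently reconstructed cubic half-maps of equal size and is not the EMAN or XMIPP implementation; the function names, the integer-radius shell binning and the absence of interpolation between shells are all simplifications chosen for illustration.

```python
import numpy as np


def fourier_shell_correlation(vol_a, vol_b):
    """FSC curve (one value per integer-radius shell) for two cubic volumes."""
    assert vol_a.shape == vol_b.shape and len(set(vol_a.shape)) == 1
    n = vol_a.shape[0]
    fa = np.fft.fftshift(np.fft.fftn(vol_a))
    fb = np.fft.fftshift(np.fft.fftn(vol_b))

    # Integer shell index of every voxel in the centred Fourier transform.
    freq = np.arange(n) - n // 2
    kx, ky, kz = np.meshgrid(freq, freq, freq, indexing="ij")
    shell = np.rint(np.sqrt(kx**2 + ky**2 + kz**2)).astype(int)

    n_shells = n // 2
    mask = shell < n_shells  # ignore the corners beyond the Nyquist shell
    idx = shell[mask]

    # Per-shell sums of the cross term and of the two power spectra.
    cross = np.bincount(idx, weights=(fa * np.conj(fb)).real[mask], minlength=n_shells)
    power_a = np.bincount(idx, weights=(np.abs(fa) ** 2)[mask], minlength=n_shells)
    power_b = np.bincount(idx, weights=(np.abs(fb) ** 2)[mask], minlength=n_shells)

    denom = np.sqrt(power_a * power_b)
    return np.divide(cross, denom, out=np.zeros_like(cross), where=denom > 0)


def resolution_at_threshold(fsc, box_size, pixel_size, threshold=0.5):
    """Resolution (Å) at the first shell where the FSC drops below the threshold."""
    below = np.nonzero(fsc[1:] < threshold)[0]
    if below.size == 0:
        return 2.0 * pixel_size  # curve never crosses: limited by Nyquist
    first_shell = below[0] + 1
    # Shell k corresponds to spatial frequency k / (box_size * pixel_size).
    return box_size * pixel_size / first_shell
```

In practice the two volumes would be reconstructions from separate halves of the particle set, and `pixel_size` would be the sampling rate of the images (for example 1.78 Å per pixel for the cryo-EM data).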

Immunoelectron microscopy. The subunits of the CCR4-NOT complex were located using immunolabeling. Complexes were purified as described above using the Protein A tag in the Not2 subunit. The C terminus of each target subunit (Caf1, Ccr4, Not1, Caf40, Not3/5 or Not4) was fused to GFP. After affinity chromatography using IgG beads and TEV protease cleavage, the TEV eluate was incubated (30 min, 4 °C) with 50 μg anti-GFP monoclonal antibody (1 mg ml−1; G6795, Sigma) or a Fab fragment from the same antibody (Mouse IgG1 Fab and F(ab')2 Micro Preparation Kit; Pierce) and loaded onto a glycerol gradient. After separation of gradient fractions and SDS-PAGE analysis, the fraction with the highest concentration and purity of the immune complex (CCR4-NOT+IgG) was analyzed by negative staining EM.

Nanogold labelling. A 34-nt RNA oligonucleotide was covalently coupled to Nanogold (ref. 2021; Nanoprobes) containing a primary amine functional group on its surface, reactive to aldehyde groups. In the first step, 500 pmol RNA oligonucleotide was oxidized with 1,000 nM NaIO4 in 100 mM PIPES buffer pH 7.0 (90 min, 4 °C). The reaction was terminated by incubation with 2 μl glycerol (5 min). Oxidized RNA was precipitated with 96% cold ethanol. The RNA pellet was resuspended in 250 μl 0.1% ethylene glycol solution to remove excess NaIO4 and incubated (30 min, 0 °C). Finally, the RNA product was again precipitated with 96% ethanol.

Aliquots (30 nM) of monoamino Nanogold resuspended in 100 μl dimethyl sulphoxide were incubated with 60 μl oxidized RNA from the previous step in 100 mM PIPES pH 7.0 (60 min, 4 °C, with gentle mixing). Aliquots (3 μl) of freshly prepared sodium borohydride (20 mg ml−1) were added to the gold-coupling reaction and incubated (30 min, 4 °C). The reaction was terminated by adding 2 μl acetone. Nanogold-labelled RNA was purified on an S200 illustra MicroSpin column (GE Healthcare). Aliquots (1–10 μl) of purified Nanogold-labelled RNA were mixed with 5–10 μl purified CCR4-NOT complex and incubated (30 min, on ice). Samples were prepared and analyzed by negative staining EM.

Pseudo-atomic model building

Homology modelling. Homology structures of all CCR4-NOT components were modelled based on the S. pombe sequences from the fission yeast database (http://www.pombase.org/). For template selection and domain identification, we used the GeneSilico Metaserver (https://genesilico.pl/meta2/) (ref. 62). Intrinsically disordered regions were identified with MetaDisorder63. Comparative models of the CCR4-NOT subunits were built using the ‘FRankenstein’s Monster’ modelling approach64, which comprises cycles of model building, evaluation, realignment of poorly scored regions and merging of the best-scoring fragments to obtain the best possible model. For model building we used MODELLER 8v1 (ref. 65). For regions without a template or poorly modelled, we used the REFINER66 program, which performs local refinement with restraints on predicted secondary structure. Finally, we built models of Not1-Not2-Not3 (Not1: 1,583–2,072, Not2: 128–306, Not3: 491–634), Not1-Caf1-Ccr4 (Not1: 841–1,090 and Caf1: 23–273 and Ccr4: entire), Not1-Caf40 (Rcd1) (Not1: 1,097–1,328 and Caf40: entire) and Not1-Not4 (Not1: 1,568–2,078, Not4: 420–469) complexes in which specific components were oriented according to the 4by6 (ref. 7), 4b8c (ref. 25), 4cru (ref. 8) and 5aje (ref. 29) crystal structures, respectively. All models obtained were evaluated with the MetaMQAPII program67. Values of predicted GDT_TS (global distance test total score) were used to evaluate global accuracy of models (the higher the GDT_TS, the better) and the predicted deviation of individual residues as a measure of predicted local accuracy (the lower the predicted deviation, the better). MQAP scores only predict the deviation of a model from the real structure; the real deviation can be calculated only by comparison to the real structures, which are not available for this complex. The scores reported here as ‘very good models’ must thus be interpreted as estimations or predictions that our models are ‘very good’, rather than as an ultimate validation of model quality. The final models of individual proteins, with quality evaluations, are publicly available at: ftp://ftp.genesilico.pl/iamb/models/ccr4not.

Complex assembly. To predict the structure of the CCR4-NOT complex and the orientation of its components within the electron density map, rigid body docking methods were first applied to confirm the positions of the components identified by immunolocalization experiments. The PyRy3D method developed in the Bujnicki laboratory (http://genesilico.pl/pyry3d/, standalone version available from the authors on request) was used to optimize positions obtained based on other data available for the complex.

Rigid body docking was applied to all CCR4-NOT components, according to available crystal structures and models generated in this study. The Situs41 and ADP_EM42 programs were then used, and the best-scored solutions that coincided with immunolocalization results for specific components were selected. Both programs were used with default parameters. Map resolution was set to 20 Å and the density threshold to 0.7.

Positions for Ccr4-Caf1-Not1, Caf40-Not1 and Not2-Not3-Not1 modules returned by Situs were used as starting points for conformational space sampling with PyRy3D. Protein sequence regions predicted to be disordered were modelled in a coarse-grained representation and treated as flexible shapes able to change conformation during complex assembly. To define interactions between components of the complex, we used the following constraints:

  • Residues 16–76 and 113–200 of Not4 were constrained to be in proximity to the Not1 C terminus (residues 1,326–2,072).

  • Mmi1 was constrained to interact with Caf1 (at least one contact between any residues of these proteins was required).

  • Mmi1 (any residue) was constrained to be in contact with residues 1–1,200 of Not1.

  • The Not1 domains were constrained to remain connected to one another.

PyRy3D was used with default parameters (Monte Carlo simulated annealing method; starting temperature T0 = 10 in dimensionless units; temperature decrease during the simulation according to the scheme Tn = T0 × 0.999^n, where n is the simulation step number; 100,000 steps; PyRy3D grid size 1.5 Å). The 0.7 density threshold was used to define the map volume.
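The cooling scheme can be written out as a short Python sketch. Only the schedule itself (T0 = 10, factor 0.999 per step, 100,000 steps) comes from the text; the Metropolis acceptance test shown alongside it is the generic textbook criterion, assumed here for illustration, with a hypothetical penalty score for which lower is better. It is not PyRy3D's own code or scoring convention.

```python
import math
import random


def temperature(step, t0=10.0, decay=0.999):
    """Temperature after `step` simulation steps: T_n = T_0 * decay**n."""
    return t0 * decay ** step


def metropolis_accept(delta_score, temp, rng=random):
    """Generic Metropolis test for a move that changes a penalty score
    by `delta_score` (assumption: lower scores are better)."""
    if delta_score <= 0:
        return True  # improvements are always accepted
    return rng.random() < math.exp(-delta_score / temp)


# The temperature falls to roughly a third of its starting value after
# 1,000 steps and is effectively zero long before step 100,000.
for n in (0, 1_000, 10_000, 100_000):
    print(n, temperature(n))
```

With this schedule, moves that worsen the score are accepted with appreciable probability only during the first few thousand steps; the remainder of the run behaves essentially as a greedy local optimization.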

A total of 1,000 models of the CCR4-NOT complex were generated from 1,000 runs of PyRy3D; they were clustered according to RMSD values between the models to obtain groups of most similar solutions. The resulting clusters of models were filtered according to their ability to fulfil interactions identified for human proteins (Cnot1-TTP and Nanos1 (pdb:4j8s), Cnot9-W, DDX6 (pdb:4ct7,4ct4), Caf1 with Tob/Btg family proteins (pdb:2d5r)) and for yeast Not4 with Ubc4 (pdb:4ajd). A group of CCR4-NOT models was selected in which the regions responsible for these interactions were exposed to the solvent. As the final model, we chose the best-scored model (according to PyRy3D score) from the filtered cluster of solutions. Goodness of fit of the models into the electron density map was measured with the cross-correlation coefficient implemented in the ‘Fit In Map’ procedure available via the Chimera viewer. The final ensembles of models are available at ftp://ftp.genesilico.pl/iamb/models/ccr4not. Molecular structure graphics were produced with PyMOL68 and Chimera61.
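Grouping models by pairwise RMSD can be sketched as follows. The text does not state which clustering algorithm or cutoff was used, so the greedy ‘leader’ scheme and the cutoff value below are assumptions made purely for illustration. Because all models are docked into the same map, they are compared here without prior superposition, which is also an assumption.

```python
import numpy as np


def rmsd(coords_a, coords_b):
    """RMSD between two (N, 3) coordinate arrays sharing the same frame."""
    diff = np.asarray(coords_a) - np.asarray(coords_b)
    return float(np.sqrt(np.mean(np.sum(diff ** 2, axis=1))))


def cluster_by_rmsd(models, cutoff):
    """Greedy leader clustering: a model joins the first cluster whose
    representative (its first member) lies within `cutoff`; otherwise it
    starts a new cluster. Returns lists of model indices."""
    clusters = []
    for i, model in enumerate(models):
        for members in clusters:
            if rmsd(model, models[members[0]]) <= cutoff:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


# Toy example: two pairs of near-identical models, 3 Å apart.
base = np.zeros((5, 3))
shift = np.array([3.0, 0.0, 0.0])
nudge = np.array([0.1, 0.0, 0.0])
toy_models = [base, base + nudge, base + shift, base + shift + nudge]
print(cluster_by_rmsd(toy_models, cutoff=1.0))
```

The result of leader clustering depends on the order of the models; hierarchical clustering on the full RMSD matrix would avoid this at the cost of computing all pairwise distances.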

Additional information

Accession codes: The 3D reconstruction of the CCR4-NOT complex generated by cryo-EM has been deposited in the EMDB (code EMD-3232).

How to cite this article: Ukleja, M. et al. The architecture of the Schizosaccharomyces pombe CCR4-NOT complex. Nat. Commun. 7:10433 doi: 10.1038/ncomms10433 (2016).