Introduction

The YidC/Oxa1/Alb3 family members are conserved in bacteria, mitochondria and chloroplasts and are involved in the biogenesis of membrane proteins1, especially respiratory chain complexes. In bacteria, YidC facilitates the folding and assembly of membrane proteins inserted by a conserved protein-conducting channel, the Sec translocon and also inserts proteins into the membrane independently2. YidC contains the five core transmembrane (TM) helices conserved among the family members. Recently, we reported the crystal structure of YidC from Bacillus halodurans (BhYidC) at 2.4 Å resolution3. The structure revealed that the core TM helices form a hydrophilic groove that contains many conserved hydrophilic residues and is open toward both the lipid bilayer and the cytoplasm. Structure-based in vivo analyses revealed the interactions between the hydrophilic groove and a substrate protein, as well as the importance of an arginine residue in the groove for membrane protein insertion mediated by Bacillus subtilis YidC.

Escherichia coli YidC (EcYidC) is essential for cell viability4 and one of the best-studied YidC/Oxa1/Alb3 family members1. In E. coli, YidC associates with the SecYEG complex and the SecDFYajC complex5,6,7 to form the holo-translocon, a hetero-complex composed of single copies of SecYEG, SecDFYajC and YidC, which contributes to efficient membrane protein biogenesis in the Sec-dependent pathway7. As YidC is much more abundant than the SecYEG complex8, it is likely that only a portion of the YidC protein pool forms the holo-translocon. The YidC proteins from Gram-negative bacteria possess an additional TM helix, which functions as a signal sequence and a large periplasmic domain (P1) that protrudes from the N-termini of the core TM helices. Although the P1 domain is not essential for the YidC function9, it interacts with multiple components of the Sec machinery and is considered to facilitate stable complex formation6,10. Previous structural and biochemical analyses suggested assembly models of the YidC-Sec complex6,7, but the structural details, including the orientation of the P1 domain, still remain uncertain. A high-resolution structure of full-length EcYidC is required to understand the molecular mechanism underlying Sec-dependent membrane protein integration.

Results

Crystal structure of EcYidC

We crystallized EcYidC in the lipidic cubic phase and determined the structure by molecular replacement, using the P1 domain of EcYidC (PDB ID: 3BS6) and the TM region of BhYidC (PDB ID: 3WO6) as search models. The final model was refined at 3.2 Å resolution to Rwork = 22.1% and Rfree = 27.7% (Table 1, Fig. 1A and Supplementary Fig. S1). In the crystal lattice, the molecules appeared to exist as monomers and did not form an effective oligomer as proposed previously (Supplementary Fig. S2)11,12, similar to the crystal structures of BhYidC3. This observation suggested that YidC functions as a monomer in the cytoplasmic membrane, consistent with the recent findings that monomeric EcYidC binds to translating ribosomes in a detergent solution and in lipid membranes13,14,15.

Table 1 Data collection and refinement statistics
Figure 1
figure 1

Structure of EcYidC.

(A), Ribbon representations of the EcYidC structure, viewed transversely through the membrane. (B), Close-up view of the hydrophobic core of EcYidC and the interaction surface between the P1 domain and the TM region (stereo view). Hydrogen bonds are shown as dashed lines. (C), Surface model representations of EcYidC, sliced through the center of the groove. The structure is colored according to the electrostatic potential, ranging from blue (+10 kT/e) to red (−10 kT/e). (D), Close-up view of the hydrophilic groove of EcYidC (colored), showing hydrophilic residues. BhYidC (grey) is superimposed on the TM region of EcYidC. The side chain of Arg72 in BhYidC (corresponding to Arg366 in EcYidC) is shown. (E), Surface model representations of EcYidC colored according to sequence conservation, ranging from cyan (low) to maroon (high). Dotted regions highlight the conserved surfaces in the P1-TM interface (yellow) and the TM3 helix (light green).

The EcYidC structure comprises five TM α-helices (TM2–6) and the protruding P1 and C1 regions (Fig. 1A). The P1 domain is connected to the TM region by the PH1 helix, which is amphipathic and lies parallel to the plane of the membrane. The C2 region and the C-terminal C3 tail are structurally disordered, as also found in the BhYidC structures3. The architecture of the core TM region of EcYidC is also similar to that of BhYidC3. The TM2 and TM3 helices are kinked at the conserved proline residues (Pro371 and Pro431, respectively) in the middle of the lipid bilayer. The five TM α-helices are tightly packed together in the periplasmic leaflet (Fig. 1B), but loosely interact with each other in the cytoplasmic leaflet. This results in a hydrophilic groove, opened toward both the cytoplasmic and membrane sides through a gap between the TM3 and TM5 helices (Fig. 1C). The groove contains many hydrophilic residues, including Thr362, Arg366, Thr373, Gln429, Thr474, Ser520, Asn521 and Gln527 (Fig. 1D). In particular, the highly conserved Arg366 generates a positively-charged surface in the groove (Fig. 1C and Supplementary Fig. S3).

The YidC proteins from Gram-negative bacteria have the TM1 helix and the P1 domain, in addition to the core TM region. In the EcYidC structure, the TM1 helix and the following N-terminal region of the P1 domain were disordered, although N-terminal sequencing confirmed that these TM1 helix residues were intact in the purified YidC. This suggested that the TM1 helix in the YidC proteins from Gram-negative bacteria does not specifically interact with the other TM helices, consistent with the fact that the helix is not important for the YidC activity9.

The P1 domain consists of two anti-parallel β-sheets, which form a tightly packed β-supersandwich fold. The P1 domain of the full-length EcYidC structure is essentially identical to the reported crystal structures of the isolated P1 domains (PDB ID 3BS6, rmsd = 0.660 and PDB ID 3BLC, rmsd = 0.672, over the Cα atoms of residues 59 to 320)16,17. The P1 domain is connected to the PH1 helix by a short loop. Thus, the P1 domain is likely to be located on the extracellular surface of the membrane, which is consistent with a previous report that the P1 domain alone interacts with E. coli membrane lipids17. In addition, the P1 domain forms several hydrogen bonds with the P2 region, the loop connecting the TM3 and TM4 helices and the TM3 helix. The main-chain carbonyl groups of Pro88 and Met441 hydrogen bond with the guanidinium group of Arg447 and the side-chain amide group of Gln258 and the side-chain hydroxyl group of Tyr259, respectively (Fig. 1B). These interaction sites are highly conserved in the YidC proteins from Gram-negative bacteria (Fig. 1E). Taken together, these observations suggest that the membrane binding and several hydrogen bonds stabilize the orientation of the P1 domain of YidC.

Characteristics of the YidC structure

A comparison of the EcYidC and BhYidC structures revealed a difference in the C1 region (Fig. 2A, B). The arrangement of two antiparallel helices in the C1 region of EcYidC is rotated by 35° with respect to the core region, as compared with the BhYidC structure. In the crystal, the C1 region contacts the P1 domains of the adjacent molecules (Supplementary Fig. S2). A short loop between the CH1 and CH2 helices interacts with the β7 strand and a loop between the β6 strand and the η1 helix. The CH1 helix interacts with the η1 helix. Despite these crystal contacts, the crystallographic B-factors of the C1 region are higher than those of the other regions in the EcYidC structure (Fig. 2C). Considering our previous results showing the importance of the flexible C1 region of BhYidC3, the structural difference of the C1 region and the higher crystallographic B-factors suggested that the flexibility of the C1 region is a universally conserved feature of YidC.

Figure 2
figure 2

Flexibility of the C1 region.

(A), Superimposition of the crystal structures of EcYidC (colored) and BhYidC (PDB ID: 3WO6, grey). (B), Close-up view of the C1 region. The arrangement of the helices in the C1 region is rotated by ~35°, as compared with those in BhYidC, with respect to the core region. (C), The crystallographic B-factors are colored in a gradient ranging from blue (10 Å2) to red (100 Å2).

By contrast, the conserved hydrophilic residues in EcYidC and especially the highly conserved arginine residue (Arg366) in the hydrophilic groove (Fig. 1D and Supplementary Fig. S3), were located at exactly the same places as those in BhYidC, suggesting that the arginine residue is important for the EcYidC function, as found for BhYidC3.

Positively-charged residue in the groove

To assess the importance of the highly conserved arginine residue, we performed a growth complementation assay using a YidC-depletable strain, JS71314, in which YidC expression is controlled by the arabinose promoter. In this assay, the YidC mutants were expressed under the control of the lac promoter from the pTV118N-based plasmids. All of the mutants with mutations in the hydrophilic groove, except for the reportedly-inactive T362A mutant15, rescued lethal phenotype at 37°C in the absence of arabinose (Fig. 3A), which is consistent with recent studies15,18. In contrast, the R366A and R366M mutants could not rescue the lethal phenotype at 20°C, whereas the R366K mutant could (Fig. 3B). These results indicated that a positively-charged residue, an arginine or a lysine, at position 366 is not essential, but play an important role for the YidC function under certain conditions. A positively-charged residue at this position is conserved among the YidC/Oxa1/Alb3 family members (Supplementary Fig. S3). The corresponding mutation of R366A in yeast Oxa1 (R140A) also led to a cold-sensitive phenotype19 (Supplementary Fig. S3). Altogether, we concluded that the YidC/Oxa1/Alb3 family members possess the evolutionarily conserved positively charged groove, which are related to their functions.

Figure 3
figure 3

Growth complementation of EcYidC mutants.

(A), (B), Growth complementation of JS7131 (ΔyidC, Para-yidC)4 by EcYidC mutants under conditions of wild-type EcYidC depletion (0.4% glucose) at 37°C (A) and 20°C (B). The cellular accumulation of the mutants was detected by immunoblotting, using anti-YidC antibody. The strain JS71 (yidC+, Para-yidC)4 possesses two functional yidC genes.

Substrate-contacting sites

A number of substrate-contacting residues were reported for EcYidC and BhYidC, as identified by cross-linking analyses with Sec-independent substrates, the Pf3 coat protein20, Foc21 and MifM3. These residues mapped mainly to one face of the exterior surface of the TM region, as well as the interior surface of the hydrophilic groove (Fig. 4A, B), revealing that the substrate-contacting residues in EcYidC are clustered in the vicinity of the groove opening toward the membrane in the TM3 and TM5 helices. These observations highlighted the importance of both the hydrophobic and hydrophilic interactions3 between substrates and YidC for the Sec-independent insertion. Moreover, the contact sites with a Sec-dependent substrate, FtsQ, share the same regions with the Sec-independent substrates21 (Fig. 4C), supporting the notion that the hydrophilic groove and the TM helices adjacent to the groove opening are also important for the function of EcYidC, as a membrane protein chaperone in Sec-dependent membrane protein integration.

Figure 4
figure 4

Structural mapping of substrate-contacting residues.

(A), Mapping of the substrate-contacting residues. (B), (C), Close-up views of the boxed areas in A. The residues bound to Sec-independent substrates are shown as spheres, colored green (the Pf3 coat protein), pink (MifM), brown (Foc) and dark green (both the Pf3 coat protein and Foc) and viewed from the membrane (upper) and periplasmic (lower) sides (B). The residues bound to the Sec-independent substrate, FtsQ, are shown as orange spheres (C). The P1 domain was removed for clarity.

Discussion

We have presented the crystal structure of full-length EcYidC, which reveals the orientation of the P1 domain with respect to the TM segments and allows accurate mapping of the previous crosslinking results. The structure also shows that a hydrophilic groove formed by the TM helices is a common structural feature among the YidC proteins.

The P1 domain shares structural similarity with galactose mutarotase and possesses a large cleft located at the position corresponding to its sugar binding pocket, although the domain lacks a sugar binding motif16,17. This cleft has been proposed to be a binding site for substrate proteins or periplasmic molecules16,17. Our crystal structure revealed that the cleft is oriented away from the membrane (Fig. 5), suggesting that the cleft could bind periplasmic molecules or proteins, rather than substrate proteins. However, further biological experiments are needed to clarify the function of the P1 domain in vivo.

Figure 5
figure 5

Structural insight into the role of the P1 domain.

(A), The regions interacting with SecF are colored blue, with Lys249 highlighted in red. (B), (C), Surface model representations of the boxed areas of EcYidC in A (B) and viewed from the other side (C). The cleft in the P1 domain is indicated by arrows.

The holo-translocon, composed of single copies of YidC, SecYEG and SecDFYajC, is necessary for effective protein translocation and integration5,7. Our EcYidC structure and previous studies6,10,22 have provided insights into the structural organization of the holo-translocon. SecF interacts with part of the P1 domain of EcYidC (residues 215–265), in which one residue (Lys249) was cross-linked with SecG, SecD and YajC6 (Fig. 5), suggesting that this surface provides the interface between YidC and the SecYEG-SecDFYajC complex. The highly conserved surface formed by the TM3 helices, which are located on the lateral edge of the hydrophilic groove, may provide interaction sites for the TM regions of the Sec proteins (Fig. 1E). Indeed, given that YidC was cross-linked with the lateral gate of SecY6 and that nascent TM segments of the substrate sequentially interact with SecY and YidC, it is likely that the lateral gate of SecY faces the hydrophilic groove of YidC in Sec-dependent membrane protein integration. This complex formation may facilitate the folding of multi-spanning membrane proteins, such as LacY23. In this process, YidC could form a transient complex with the nascent TM segment, due to hydrophilic interactions between the charged or polar residues of the substrate and the hydrophilic groove of YidC in the lipid bilayer, until an assembly partner emerges from the Sec translocon. Chaperones for cytosolic proteins prevent non-specific intramolecular hydrophobic interactions in the substrates, whereas YidC can provide a hydrophilic binding surface for immature membrane proteins to accomplish proper folding. Therefore, the hydrophilic groove may be important for the chaperone activity of YidC, as well as for its insertase activity.

Methods

Purification and crystallization of EcYidC

E. coliyidc, from JCM20135 genomic DNA (provided by RIKEN), was cloned into a modified pTV118N (TAKARA) vector. The resulting plasmid, encoding EcYidC1-540-LESSGENLYFQGQFTS-H8, was introduced into E. coli strain BL21(DE3). The cells were cultivated at 37°C in a 10 l culture, supplemented with 50 µg ml−1 ampicillin, to an absorbance at 600 nm of ~0.6, induced with 1 mM isopropyl-ß-D-thiogalactopyranoside for 18 h and harvested by centrifugation (4,500 g, 10 min, 4°C). Total membranes were prepared as described previously3. EcYidC was solubilized for 1 h at 4°C, in a buffer containing 20 mM N-(2-acetamido)iminodiacetic acid (ADA)-NaOH, pH 5.6, 300 mM NaCl, 2% n-decyl-ß-D-maltoside (DM) and 0.1 mM phenylmethylsulfonyl fluoride. After ultracentrifugation (138,000 g, 30 min, 4°C), the supernatant was mixed with 10 ml Ni Sepharose excel (GE Healthcare), equilibrated with buffer A (20 mM ADA-NaOH, pH 5.6, 300 mM NaCl and 0.25% DM) containing 30 mM imidazole-HCl, pH 6.6, for 30 min at 4°C. The resin was washed with buffer A containing 50 mM imidazole-HCl, pH 6.6 and then EcYidC was eluted with a gradient of 100–500 mM imidazole-HCl, pH 6.6, in buffer A. The eluate was mixed with the same volume of a buffer containing 500 mM ADA-NaOH, pH 5.6, 300 mM NaCl and 0.25% DM and then repeatedly concentrated and diluted with buffer A, using an Amicon Ultra filter (50 kDa molecular weight cutoff, Merck Millipore), to remove the imidazole. The C-terminal His8-tag was cleaved by His-tagged TEV protease (laboratory stock) in buffer A supplemented with 10 mM imidazole-HCl, pH 6.6, for 12 h at 4°C. The sample was loaded on a 10 ml Ni Sepharose excel column, equilibrated with buffer A supplemented with 10 mM imidazole-HCl and the flow-through was concentrated and loaded on a HiLoad 16/600 Superdex 200 column (GE Healthcare), which was eluted with buffer A. The N-terminal five amino acid sequence of the purified YidC, Met-Asp-Ser-Gln-Arg, was confirmed by Edman sequencing with a PPSQ-31A protein sequencer (SHIMADZU). For crystallization, the purified EcYidC was concentrated to approximately 15 mg ml−1. The crystals of EcYidC appeared from the mixture of the protein and monoolein in a reservoir solution containing 25% poly(ethyleneglycol) 600, 100 mM MES-NaOH, pH 5.9, 50 mM potassium thiocyanate and 12 mM triglycine, prepared by the twin-syringe mixing and crystallization method described previously3. The crystals grew to dimensions of 50 × 10 × 5 µm in 2 weeks and were then flash-cooled and stored in liquid nitrogen.

Data collection and structure determination

The X-ray diffraction data were collected at SPring-8 beamline BL32XU24 and processed using HKL2000 (HKL Research Inc.). Initial phases were calculated by molecular replacement with the crystal structure of the P1 domain of EcYidC (PDB ID: 3BS6) using PHASER25, which provided a partial model and weak electron density maps for the TM region of EcYidC. However, the electron density map was not sufficiently interpretable to build the model. To improve the initial phases, molecular replacement was performed using the P1 domain of EcYidC and the TM region of BhYidC (PDB ID: 3WO6)3 as search models. The model was manually rebuilt using COOT26 and refined using PHENIX27. To reduce the model bias and improve the phases, simulated annealing refinement was performed at an early stage of the model building, followed by several cycles of positional refinement combined with individual B-factor refinement. The final stage of refinement was performed by using X-ray/stereochemistry weight and X-ray/ADP weight optimization. Finally, the structure of EcYidC was refined to Rwork = 22.1% and Rfree = 27.7% at 3.2 Å resolution. The refinement statistics are summarized in Table 1. The Ramachandran plots were calculated with RAMPAGE28. The crystal contains two molecules in the asymmetric unit (Mol A and Mol B). The overall structures of Mol A and Mol B are almost identical (root-mean-square deviation, 0.884 Å over the Cα atoms of residues 56–528). The atomic coordinates and structure factors have been deposited in the Protein Data Bank, under the accession code 3WVF. The molecular graphics were illustrated with CueMol (http://www.cuemol.org/). The sequence conservation among 218 EcYidC orthologs, which share at least 30% amino acid sequence identity with EcYidC, was colored on the surface of EcYidC using the ConSurf server (http://consurf.tau.ac.il/). Electrostatic potentials were calculated using APBS29.

Complementation test of EcYidC mutants

Mutations were introduced into the EcYidC-expression plasmid by site-directed mutagenesis. To adjust the accumulation of YidC, we additionally mutated the initial codon, ATG, to GTG30. The resultant plasmid, encoding the mutant V-EcYidC2-548-LESSGENLYFQGQFTS-H8, was introduced into the JS7131 strain4. The E. coli cells were cultured at 37°C in LB medium supplemented with 0.2% arabinose and 50 µg ml-1 ampicillin until they attained an A600 of 0.4, diluted, spotted onto LB-agar plates supplemented with 0.4% glucose and 50 µg ml−1 ampicillin and incubated at 37°C for 24 h or at 20°C for ~120 h. To determine the accumulation level of YidC at 20°C, the preculture cells, grown in LB medium supplied with 0.2% arabinose and 50 µg ml−1 ampicillin at 37°C, were cultured in LB medium supplied with 0.4% glucose and 50 µg ml−1 ampicillin for 12 h at 37°C or for 12 h at 20°C. EcYidC accumulation was detected by immunoblotting using anti-YidC antiserum, raised in a rabbit immunized with the synthetic peptide (NH2-C-TPDEKYEKYKFDTIADN-COOH), corresponding to amino acids 225–241 of EcYidC.