Skip to main content
Advertisement

Main menu

  • Home
  • Articles
    • Newest Articles
    • Current Issue
    • Methods & Resources
    • Archive
    • Subjects
  • Collections
  • Submit
    • Submit a Manuscript
    • Author Guidelines
    • License, Copyright, Fee
    • FAQ
    • Why Submit
  • About
    • About Us
    • Editors & Staff
    • Board Members
    • Licensing and Reuse
    • Reviewer Guidelines
    • Privacy Policy
    • Advertise
    • Contact Us
    • LSA LLC
  • Alerts
  • Other Publications
    • EMBO Press
    • The EMBO Journal
    • EMBO reports
    • EMBO Molecular Medicine
    • Molecular Systems Biology
    • Rockefeller University Press
    • Journal of Cell Biology
    • Journal of Experimental Medicine
    • Journal of General Physiology
    • Cold Spring Harbor Laboratory Press
    • Genes & Development
    • Genome Research

User menu

  • My alerts

Search

  • Advanced search
Life Science Alliance
  • Other Publications
    • EMBO Press
    • The EMBO Journal
    • EMBO reports
    • EMBO Molecular Medicine
    • Molecular Systems Biology
    • Rockefeller University Press
    • Journal of Cell Biology
    • Journal of Experimental Medicine
    • Journal of General Physiology
    • Cold Spring Harbor Laboratory Press
    • Genes & Development
    • Genome Research
  • My alerts
Life Science Alliance

Advanced Search

  • Home
  • Articles
    • Newest Articles
    • Current Issue
    • Methods & Resources
    • Archive
    • Subjects
  • Collections
  • Submit
    • Submit a Manuscript
    • Author Guidelines
    • License, Copyright, Fee
    • FAQ
    • Why Submit
  • About
    • About Us
    • Editors & Staff
    • Board Members
    • Licensing and Reuse
    • Reviewer Guidelines
    • Privacy Policy
    • Advertise
    • Contact Us
    • LSA LLC
  • Alerts
  • Follow lsa Template on Twitter
Research Article
Transparent Process
Open Access

Phylogenetic profiling resolves early emergence of PRC2 and illuminates its functional core

View ORCID ProfileAbdoallah Sharaf  Correspondence email, Mallika Vijayanathan, Miroslav Oborník, View ORCID ProfileIva Mozgová  Correspondence email
Abdoallah Sharaf
1Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
2Genetic Department, Faculty of Agriculture, Ain Shams University, Cairo, Egypt
Roles: Conceptualization, Data curation, Validation, Investigation, Visualization, Methodology, Writing—original draft, review, and editing
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Abdoallah Sharaf
  • For correspondence: iva.mozgova@umbr.cas.cz abdoallah.sharaf@umbr.cas.cz
Mallika Vijayanathan
1Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
Roles: Investigation, Visualization, Methodology, Writing—original draft
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Miroslav Oborník
3Biology Centre, Czech Academy of Sciences, Institute of Parasitology, České Budějovice, Czech Republic
4University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic
Roles: Conceptualization, Validation, Writing—review and editing
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Iva Mozgová
1Biology Centre, Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
4University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic
Roles: Conceptualization, Funding acquisition, Validation, Investigation, Writing—original draft, review, and editing
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Iva Mozgová
  • For correspondence: iva.mozgova@umbr.cas.cz abdoallah.sharaf@umbr.cas.cz
Published 19 April 2022. DOI: 10.26508/lsa.202101271
  • Article
  • Figures & Data
  • Info
  • Metrics
  • Reviewer Comments
  • PDF
Loading

Article Figures & Data

Figures

  • Supplementary Materials
  • Figure S1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S1. Multiple sequence alignment identity matrix of PcG-finder and experimentally identified orthologs.

    (A, B, C) The analyses were performed using the Clustal Omega tool (https://www.ebi.ac.uk/Tools/msa/clustalo/) for (A) Su(z)12 full-length sequence, (B) VEFS-Box domain sequence, and (C) ESC full-length sequence.

  • Figure 1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 1. Species cladogram showing the distribution of PRC2 subunit orthologs identified using PcG-finder across eukaryotic lineages.

    Of the 283 species analyzed using the PcG-finder, only species with the BUSCO score C ≥ 50 (181 species) are plotted (Table S2), and species with score C ≥ 75 (61 species) are marked in bold. Empty symbols indicate one identified homolog, filled symbols indicate multiple identified homologs. Homologs may originate from one or more genomic loci. In case of organisms in which only VEFS-domain (Su(z)12-like) subunits were identified, Su(z)12 is depicted using a gray dotted-line triangle. Cladogram topology is based on the recently proposed topology of the eukaryotic phylogenetic tree (Burki et al, 2020). SAR = Stramenopila, Alveolata, Rhizaria.

  • Figure S2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S2. A fast phylogeny tree and domain organization of the fungal VEFS-Box [Su(z)12-like] proteins.

    The tree was generated using the iTOL tool (https://itol.embl.de) based on the SMART domains database (http://smart.embl.de).

  • Figure S3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S3. ML-eukaryotic phylogenetic trees of PRC2 subunits show the evolutionary relationship between orthologs.

    (A, B, C) The trees were constructed using the whole protein sequences of (A) Su(z)12, (B) ESC, and (C) NURF55 sequences, and the maximum likelihood branch support values are given in % (IQ-TREE/RAxML-NG).

  • Figure 2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 2. A maximum likelihood (ML) eukaryotic phylogenetic tree of E(z) orthologs showing separation into five clades (clade I–clade V).

    The tree was constructed using the whole protein sequence and the ML branch support values are given in % (IQ-TREE/RAxML-NG). The number of leaves of the collapsed clade is indicated between parentheses. Higher taxa names are indicated in bold font and species names in normal font.

  • Figure S4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S4. Venn diagrams.

    (A, B) Showing (A) overlap of species represented in the five clades of E(z) orthologs and (B) the coexistence of the E(z), conserved Su(z)12, other VEFS-Box proteins, and ESC subunits in the studied species.

  • Figure S5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S5. Domain architecture of proteins representing each of the five clades of full-length E(z) orthologs.

    The domain architectures were drawn based on the sequences representatives of each clade and a search against SMART and Pfam databases.

  • Figure 3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 3.

    A maximum likelihood (ML) phylogenetic tree showing the evolutionary relationships of prokaryotic and different eukaryotic subfamilies of SET-domain proteins.

    The tree was constructed using the whole protein sequence, and the ML branch support values are given in % (IQ-TREE/RAxML-NG). The number of leaves of the collapsed clade is mentioned between parentheses. The eukaryotic groups are labeled and the SET-domain subfamilies as well, whereas the uncharacterized SET-domain is not labeled.

  • Figure S6.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S6. ML phylogenetic tree based on SET-domain sequences showing the evolutionary and functional relationships of prokaryote SET-domain proteins and different eukaryote subfamilies of SET-domain proteins.

    The tree was rooted with the Asgard SET-domain protein, and the maximum likelihood branch support values are given in % (IQ-TREE/RAxML-NG).

  • Figure 4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 4.

    Sequence structure comparison of representative SET-domains from five E(z) clades.

    (A) Alignment of SET-domains of different clade representatives. Invariant (absolutely conserved) residues are highlighted in red background with white text. Conserved residues are highlighted in red font. Conserved signature motifs (GxG, YxG, RFINHxCxPN, and ELxFDY) (Green box) and hydrophobic FLF (contribute to lysine binding pocket) (Blue box), salt bridge that causes intramolecular interaction, pseudoknot, F/Y switch controlling methylation (mono-, di-, or tri-methylation), catalytic sites (red asterisk), cofactor binding sites (blue asterisk) are highlighted. (B) Cartoon/surface view of the SET domain of each clade. Human (H. sapiens) EZH1 SET-domain (PDB ID: 7KSR, EZH1) was used as a reference model structure. Cofactor (light blue), and the substrate analogs (H3K27M-peptide inhibitor; orange color) are represented by stick and sphere forms. The position of “K27M” and lysine binding channel is indicated by a black asterisk. Peptide-binding cleft area is highlighted in light orange. Conserved signature motifs are highlighted in green and the FLF motif in blue. Interactions of domain with peptides (hydrogen bonds) are represented by red dashes.

  • Figure S7.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S7. Catalytic SET domains of E(z) clades and other SET domain subfamilies showing the sequence structure alignment of different subfamilies and E(z) clades.

    The analyses are performed using ESpript v. 3.0 (https://espript.ibcp.fr) and ENDscript v. 2.0 (https://endscript.ibcp.fr), and the conserved residues are highlighted in red and highly conserved motifs (GxG, YxG, FLF, RFINHxCxPN, and ELxFDY) are highlighted in light-green and dark blue blocks.

  • Figure S8.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S8. Superimposed SET-domains of E(z) from the five clades (left).

    The one peptide deviating from others represents the interaction of peptide substrate and the SET domain of clade II. In surface view peptide/cofactor binding areas and methyl transfer pores are highlighted (center and right).

  • Figure S9.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S9. Sequence-structure of Cyanoptyche gloeocystis SET-domain.

    (A) Sequence alignment of Cyanoptyche gloeocystis SET-domain (Glaucophyte) with other E(z)-SET domains. Conserved motifs are highlighted in green. (B) Cartoon representation of C. gloeocystis SET-domain showing interaction pattern with peptide substrate analogs; H3K9M interaction suggests possible methylation events, as K9M (red) is directed towards the lysine channel while K27M (orange) is away from it. (C) P. tetraurelia SET domain showing the productive binding pattern with the substrate analogues H3K9M (red) and H3K27M (orange). (D) Superimposed SET-domains of P. tetraurelia (pink) and C. gloeocystis (green).

  • Figure 5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 5. Schematic of the eukaryotic tree of life shows the summary of PRC2 subunit diversity.

    The conservation of each subunit within a lineage is indicated as percentage of analyzed species in which the subunit was identified. The list (and total number) of species analyzed in each lineage is given in Table S2. In the case of lineages/groups where only VEFS-Box proteins but no Su(z)12 orthologs were identified, the Su(z)12 sector is depicted in transparent green. The depicted tree topology is designed based on the recently proposed topology of the eukaryotic phylogenetic tree (Burki et al, 2020).

  • Figure S10.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure S10. Schematic flow chart of the PRC2 subunits prediction pipeline (PcG-finder).

Supplementary Materials

  • Figures
  • Table S1 List of PcG-finder-identified and previously identified/validated PRC2 core-subunits. The red-labeled orthologs are orthologs where PcG-finder results differ from previously validated orthologs.

  • Table S2 A list of all studied species showing taxonomic information, data source, and quality.

  • Table S3 A list of species where all four PRC2 subunits were identified.

  • Table S4 A list of numbers, names and domain architecture of identified E(z) orthologs. Sequences were assigned to Clades I - V based on the topology of the rooted tree separating full-length sequences or SET domains of the E(z) homologs.

  • Table S5 A list of numbers, names, and domain architecture of identified canonical Su(z)12 orthologs.

  • Table S6 A list of numbers, names, and domain architecture of identified ESC orthologs.

  • Table S7 A list of numbers, names, and domain architecture of identified NURF55 orthologs.

  • Table S8 A list of numbers of PcG finder-identified Su(z)12 orthologs and numbers, names, and domain architecture of other VEFS-domain proteins.

  • Table S9 A list of species where E(z) orthologs belonging to both clade IV and clade V were found.

  • Table S10 A list of E(z) sequences representative of the clades I–V, showing taxonomic information, sequences names, and domain architecture.

  • Table S11 A list of numbers, names, and domain architecture of identified E(z) orthologs in all prokaryotes (Bacteria, Archaea, and Asgard).

PreviousNext
Back to top
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Life Science Alliance.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Phylogenetic profiling resolves early emergence of PRC2 and illuminates its functional core
(Your Name) has sent you a message from Life Science Alliance
(Your Name) thought you would like to see the Life Science Alliance web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
PRC2 core subunit diversity in eukaryotes
Abdoallah Sharaf, Mallika Vijayanathan, Miroslav Oborník, Iva Mozgová
Life Science Alliance Apr 2022, 5 (7) e202101271; DOI: 10.26508/lsa.202101271

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
PRC2 core subunit diversity in eukaryotes
Abdoallah Sharaf, Mallika Vijayanathan, Miroslav Oborník, Iva Mozgová
Life Science Alliance Apr 2022, 5 (7) e202101271; DOI: 10.26508/lsa.202101271
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
Issue Cover

In this Issue

Volume 5, No. 7
July 2022
  • Table of Contents
  • Cover (PDF)
  • About the Cover
  • Masthead (PDF)
Advertisement

Jump to section

  • Article
    • Abstract
    • Introduction
    • Results and Discussion
    • Materials and Methods
    • Data Availability
    • Acknowledgments
    • References
  • Figures & Data
  • Info
  • Metrics
  • Reviewer Comments
  • PDF

Subjects

  • Genomics & Functional Genomics
  • Evolution

Related Articles

  • No related articles found.

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • MD-related proteins at the centrosome
  • Prion-like α-synuclein toxicity via proteoglycans in vivo
  • Risk variants in regulatory elements
Show more Research Article

Similar Articles

EMBO Press LogoRockefeller University Press LogoCold Spring Harbor Logo

Content

  • Home
  • Newest Articles
  • Current Issue
  • Archive
  • Subject Collections

For Authors

  • Submit a Manuscript
  • Author Guidelines
  • License, copyright, Fee

Other Services

  • Alerts
  • Twitter
  • RSS Feeds

More Information

  • Editors & Staff
  • Reviewer Guidelines
  • Feedback
  • Licensing and Reuse
  • Privacy Policy

ISSN: 2575-1077
© 2022 Life Science Alliance LLC

Life Science Alliance is registered as a trademark in the U.S. Patent and Trade Mark Office and in the European Union Intellectual Property Office.