Spliceosome Database: a tool for tracking components of the spliceosome

Cvitkovic, Ivan; Jurica, Melissa S.

doi:10.1093/nar/gks999

Abstract

The spliceosome is the extremely complex macromolecular machine responsible for pre-mRNA splicing. It assembles from five U-rich small nuclear RNAs (snRNAs) and over 200 proteins in a highly dynamic fashion. One important challenge to studying the spliceosome is simply keeping track of all these proteins, a situation further complicated by the variety of names and identifiers that exist in the literature for them. To facilitate studies of the spliceosome and its components, we created a database of spliceosome-associated proteins and snRNAs, which is available at http://spliceosomedb.ucsc.edu and can be queried through a simple browser interface. In the database, we cataloged the various names, orthologs and gene identifiers of spliceosome proteins to navigate the complex nomenclature of spliceosome proteins. We also provide links to gene and protein records for the spliceosome components in other databases. To navigate spliceosome assembly dynamics, we created tools to compare the association of spliceosome proteins with complexes that form at specific stages of spliceosome assembly based on a compendium of mass spectrometry experiments that identified proteins in purified splicing complexes. Together, the information in the database provides an easy reference for spliceosome components and will support future modeling of spliceosome structure and dynamics.

INTRODUCTION

Pre-mRNA splicing is carried out by the spliceosome, which is one of the cell's most complex and dynamic molecular machineries (1). The spliceosome assembles on each intron to be spliced from over 200 individual components (2–4). Many of these components join the spliceosome in subcomplexes, the most well known of which are the U small nuclear ribonucleoproteins (snRNPs). U snRNPs contain structured U-rich small nuclear RNAs (snRNAs) along with seven shared and several unique proteins. Assembly of U snRNPs and other proteins into the spliceosome is modeled as an ordered evolution of intermediate splicing complexes originally designated as E (early), A (pre-spliceosome), B (fully assembled) and C (catalytic). However, as new conformations of the spliceosome have been identified, additional intermediate splicing complexes (e.g. B^act and B*) have been added to the assembly pathway (3,4). Likely, many more intermediate conformations of the spliceosome remain to be characterized.

The intermediate splicing complexes vary significantly in their composition, size and arrangement of components. Over the past 15 years, several research groups have used mass spectrometry peptide sequencing (MS/MS) to identify proteins that associate with different intermediate splicing complexes and subcomplexes (5–40). Each study generated lists of dozens to hundreds of proteins, and comparisons between the lists provide insight into how the spliceosome evolves between earlier and later assembly stages (2–4). For example, A complex contains U1 and U2 snRNPs and proteins involved in early recognition of the splice sites, whereas C complex contains U2, U5 and U6 snRNPs and proteins involved in promoting the second step of splicing chemistry. At different assembly stages, the spliceosomes’ composition can change by well over 50 proteins, which is certainly too many for a simple mental map of the complex. The MS studies covered splicing complexes from a wide variety of species including humans, yeasts, flies and parasites. In comparing spliceosome-associated proteins between organisms, we can begin to delineate a conserved core splicing machinery, as well as potentially interesting species-specific elaborations. However, given the large number of proteins associated with the different splicing intermediate complexes and their subcomplexes, these comparisons can be challenging.

Another difficulty with comparing lists of proteins from different MS experiment is different nomenclatures. For example, while genetic studies in Saccharomyces cerevisiae have been particularly important in characterizing proteins that function in splicing, only a subset of protein orthologs share the same name between yeast and humans. Furthermore, many proteins have commonly used historical names and/or multiple aliases that differ from the official gene names that have been designated by genome consortiums. One example is a subunit of the SF3a subcomplex of U2 snRNP, which was reported in different MS experiments as SAP 62, SF3a66, SF3A2 or Prp11. Another example is a Prp19 complex protein that has gone by a variety of names, including Syf1, XAB2, HCRN, Cwf3 and Ntc90. With this myriad of names, it is nearly impossible to make a straightforward comparison of the different MS analyses of splicing complexes reported in the literature.

We designed the Spliceosome Database (SpliceosomeDB) with these issues in mind. It provides a simple web interface to search for spliceosome genes/proteins based on several characteristics including name(s), complex designation, identification in particular MS experiments, source organism and conserved motif/domain signatures. For each gene/protein we provide links to other databases that have amassed a great deal of additional data including links to literature, post-translational modifications, etc. Orthologous genes in several model systems are also linked. Other key features include tools for users to compare composition of different intermediate splicing complexes across several species and to directly examine the lists of proteins identified in different MS experiments. The database is a ready resource for researchers looking for information on individual spliceosome components and provides a uniquely helpful view of the dynamic assembly process of the complex.

DATABASE FEATURES

Currently, there is no simple way to query comprehensive gene and protein databases [such as Entrez (41), Ensemble (42) or UniProt (43)] to identify proteins by their association with spliceosomes exclusively. This situation makes it impossible to easily identify spliceosome proteins with specific characteristics or by their association with specific intermediate complexes. For example, a search of the Entrez gene database with “spliceosome ‘C complex’ ” yields only three gene results and a search for “spliceosome ‘B complex’ ” returns no hits, even though there are nearly 100 individual protein components in these splicing complexes. Likewise, it is not feasible to query within the group of spliceosome proteins by parameters such as molecular weight range, sequence motifs, domains and/or availability of structural information.

We established a database platform that mitigates these problems by cataloging spliceosome-associated components along with a variety of features. We entered components into the database based on one of the three criteria: (i) previous experimental evidence for a role in spliceosome function, (ii) homology to a known splicing factor and/or (iii) MS/MS identification of the protein product in isolated splicing complexes. Over 3600 genes/proteins from several model species, along with a variety of key attributes, are currently recorded in the system. To allow free access from all over the world, we developed a series of web pages to query the database and return useful information. In most cases, this information can be downloaded for further off-line analysis. We envision that investigators in the pre-mRNA processing field, in particular, will utilize SpliceosomeDB for a variety of applications. In the following sections, we outline the functionalities of different database tools and their potential uses.

Searching for spliceosome components

From the ‘Component Search’ page of SpliceosomeDB, users can perform a quick general search of spliceosome components and their attributes or a more defined query of specific individual attributes (Figure 1A). Searchable attributes include protein and gene names and aliases, accession numbers in external databases, host organism, features of the gene’s protein product such as molecular weight and conserved motifs, association with a specific snRNP or intermediate splicing complex and membership of a general protein class or family. To designate intermediate splicing complexes, we use the common E, A, B, B^act, C complex nomenclature associated with spliceosomes that assemble in human extracts. The general ‘class/family’ heading groups proteins by molecular features (e.g. SM proteins, SR proteins), association with stable spliceosome subcomplexes like the snRNPs or PRP19 complex, or other common designations (e.g. hnRNP, second step factor).

Figure 1.

Spliceosome ‘Component Search’. (A) Quick Search queries all SpliceosomeDB data, whereas additional parameters can be used to limit results. The search results in a table showing matching components that can be further refined using the ‘Filter current table’ tool or sorted by columns. Gene/protein names are linked to a page displaying additional information about the gene/protein. Checkboxes can be used in conjunction with the ‘Mass Spec Comparison’ button to generate a table showing MS experiments in which the selected proteins were identified. (B) Portion of browser view displaying individual gene/protein information linked to additional sources of information. At the bottom of the page, MS experiments identifying the protein are listed along with the number of unique peptides by which the protein was identified.

Open in new tab Download slide

Spliceosome ‘Component Search’. (A) Quick Search queries all SpliceosomeDB data, whereas additional parameters can be used to limit results. The search results in a table showing matching components that can be further refined using the ‘Filter current table’ tool or sorted by columns. Gene/protein names are linked to a page displaying additional information about the gene/protein. Checkboxes can be used in conjunction with the ‘Mass Spec Comparison’ button to generate a table showing MS experiments in which the selected proteins were identified. (B) Portion of browser view displaying individual gene/protein information linked to additional sources of information. At the bottom of the page, MS experiments identifying the protein are listed along with the number of unique peptides by which the protein was identified.

Searches result in a list of genes that match the requested parameter, and if requested, their orthologs in several model organisms. To help inspect the list, we also display basic information for the genes, including host organism, complex association, molecular weight and aliases, any of which can be sorted or filtered. Importantly, gene lists can be exported to a spreadsheet file that includes many of the individual attributes recorded in the database.

Information for individual spliceosome genes/proteins

Each gene in a search results list is linked to a page that summarizes key attributes that fall under the headings of ‘Nomenclature’, ‘Other Resources’ and ‘Gene Product Info’ (Figure 1B). ‘Nomenclature’ includes an official gene symbol, other gene symbols, full protein name and other names, all of which have been obtained from a wide variety of sources. For ‘Other Resources’, we provide links to several external databases including the NCBI Entrez gene database (41), UniProt protein database (43), applicable model organism-specific databases like FlyBase (44) or SGD (45), BioGrid interaction database (46), the Protein Data Bank (47,48) and SpliProt3D database of human spliceosome protein structural models recently generated by the Bujnicki Laboratory (49). ‘Gene Product Info’ includes molecular weight, domains, motifs, association with different intermediate splicing complexes or snRNPs and general classification.

The page also displays a list of orthologous genes from a number of model systems. Because splicing is studied in several model organisms, it is often helpful to connect gene/protein orthologs, especially given the different naming systems employed. For most genes, we used the NCBI Homologene database (50) to identify orthologous genes. We also link orthologs to their corresponding gene page in our database and to the gene family entry in Homologene. Because Homologene does not fully cover all genes or organisms, we curated several database entries manually based on BLAST searches and extant literature.

Many pages for human genes also display a table of curated protein/protein interactions from the recent publication by the Stelzl group and recorded in SPPIR (Human Spliceosome Protein-Protein Interaction Resource) (51). The data from Hegele et al. (51) are derived from manual literature searches, directed yeast-two-hybrid analyses and co-precipitation experiments, and we provide links to the relevant data sources. There are other databases that provide interaction profiles for proteins, but we chose not to display these data because the interactions are not limited to spliceosome components and are not always well vetted. Often, many proteins not related to splicing appear in those lists, and although the interactions may indicate some functional linkage, they do not fully reflect what has been observed by biochemical purification and analysis of the core splicing machinery. As noted above, however, we do provide a link to BioGrid protein interaction database (46) for interested users.

Finally, each gene page displays a list of experiments in which the encoded protein (or ortholog) was identified, usually by MS, along with the number of peptides used to identify the protein in each experiment when reported. Each MS experiment is linked to its own page, which will be described in a following section.

Displaying and comparing spliceosomal complexes

Because the spliceosome is a dynamic machine, it is important to understand its composition at the different stages of complex assembly. The point at which a component joins or leaves the spliceosome indicates a potential function in the complex. SpliceosomeDB makes it possible to quickly compare the components of different conformations of the spliceosome across several model species. From the ‘Compare Complexes’ page, comparisons can be made via curated component lists that we generated for different U snRNPs and spliceosome intermediate complexes. For these, we again use the complex nomenclature associated with spliceosomes that assemble in human extracts (i.e. E, A, B, B^act, B*, C). Users select two or more complexes and host organism, and the site will then generate a table where columns represent the selected complexes and rows display proteins present in those complexes, grouped by their general classification in the database (Figure 2). For comparisons between different organisms, orthologs are displayed in the same row. To designate proteins as belonging to a particular complex, we primarily drew from a recent publication by the Agafonov et al. (39). In that study a series of human splicing complexes were isolated under similar conditions. Associated proteins were separated by 2D electrophoresis and identified by MS/MS sequencing. The proteins were also quantified by direct staining in the gels. For each splicing complex designation, we include proteins that appear to be abundant as indicated by a high stain index. Because the association of proteins that have more dynamic interactions with the spliceosome is often not clearly stoichiometric, we also considered how consistently the proteins have been found in spliceosome complexes at a particular stage. One such example is the SF3B subcomplex, which appears less abundant in spliceosomes captured just prior and after first step chemistry (B^act and C complexes) but was detected nonetheless, and we therefore include its proteins as components of both of those complexes (13,19,21,39). For these general complex association lists, we did not include proteins that appear to associate with RNA in a splicing independent manner, such as many general RNA binding proteins.

Figure 2.

Open in new tab Download slide

Comparing the composition of spliceosome complexes. Portion of ‘Compare Complexes’ browser view displaying the components of selected complexes grouped by classification. Each component is linked to its individual information page.

MS analyses of spliceosome complexes

An ever-growing number of studies have reported MS/MS analyses of different spliceosome complexes and subunits (5–40). Indeed, these studies have gone far in helping to define the components of the spliceosome at different stages of assembly. In SpliceosomeDB, we recorded the results of over 135 individual MS experiments from 40 publications reporting analysis of endogenous splicing complexes isolated from cells or complexes assembled in vitro. The samples studied were derived from several different organisms, including human, chicken, fruit flies, yeasts and parasites. From this extensive effort by the wider splicing research community, a large portion of known splicing intermediate complexes and subunits are represented in the database. We expect that intermediates that have not yet been captured for detailed proteomic analysis will eventually succumb to analysis, and we will add those data as they are reported.

Through the ‘Mass Spec Experiments’ page users can find specific experiments that are identified by sample type, first author of the publication reporting the experiment or their lab head, year of publication and source organism (Figure 3A). For sample names, we use the reported assembly intermediate designation when applicable. Several samples represent less defined or mixed populations of spliceosomes, which we designated either as ‘mixed-spliceosomes’ or by the component target by which they were purified (i.e. ‘SMD3 pulldown’). A search from this page will produce a list of matching experiments that includes an internal SpliceosmeDB id number, basic information about the sample and associated publication, along with a PubMed link to the original publication. Each experiment is linked to its own page, which displays a list of genes encoding the proteins that were identified, each of which is then linked back to its individual gene information page and, when reported, the measure of confidence/quantification associated with that designation (Figure 3B). Typically, this measure is the number of unique peptides used to identify the protein, but may also refer to the total number of peptides sequenced from the protein or band intensity from gel staining analysis. For a subset of MS experiments, proteins were simply reported as present in a sample. In such cases, we use a filled box to denote the identification of the protein in the sample. To help parse the list, we also display some basic protein information including host organism, complex association, aliases and molecular weight. Again, as with most data returned by database searches, these results can be exported to a spreadsheet file.

Figure 3.

Open in new tab Download slide

‘Mass Spec Experiments’ (A) MS experiments can be queried by a general ‘Quick Search’ or by specific attributes to return a list of matching experiments. Each experiment is linked to a page displaying the entire experimental results and to the PubMed entry of the corresponding publication. Checkboxes can be used in conjunction with the ‘Mass Spec Comparison’ button to generate a table comparing the results of the selected MS experiments. (B) Portion of browser view displaying an individual MS experiment result. Gene/protein names of identified proteins are linked to a page displaying additional information and the number of unique peptides by which proteins were identified is given.

Comparing MS experiment results

To allow users to compare results from several experiments, we created a ‘Mass Spec Comparison’ tool. Checkboxes are used to select a subset of genes or experiments returned from a search results returned by ‘Component Search’ or ‘Mass Spec Experiments’. The selected items are then returned in a new page as a exportable table of genes in rows versus MS experiments in columns (Figure 4). The number of identifying peptides listed (or filled box) designates that a gene’s protein product was identified in the experiment.

Figure 4.

Open in new tab Download slide

Comparison of MS experiment results. Generated with the ‘Mass Spec Comparison’ tool, columns in the comparison table display results of individual MS experiments, typically shown as the number of unique peptides used to identify the proteins represented in the different rows. Descriptions for each MS ‘Experiment ID’ are displayed in a legend and linked to individual experiment results.

Documentation, discussion forum and feedback

To help users with SpliceosomeDB, the ‘About’ page and associated links summarize features of the database components and available tools. Also in the different search forms, definitions and examples for many of the elements and fields appear when a cursor is held over them. A ‘comments/report a problem’ link is available for direct feedback, and we set up a forum for moderated discussion. We welcome any comments, questions and suggestions for new features. An active dialog with users will be important for us to identify and correct errors in the database and to keep information up to date.

DATABASE ARCHITECTURE AND WEB INTERFACE

SpliceosomeDB is backed by a MySQL relational database consisting of nine primary tables storing information for (i) individual genes/protein attributes, (ii) MS experiment details, (iii) MS results, (iv) designating protein orthologs, (v) spliceosome complexes, (vi) designating complex composition, (vii) protein classes, (viii) designating proteins to classes and (ix) taxonomy information for species represented in the database. The tables are linked as shown in Supplementary Figure S1. By leveraging relationships between tables, we have been able to cross-reference data in novel and informative ways. Data in Tables ii, iii and v–viii were entered manually. Most data in Tables i, iv and ix were downloaded from the NCBI Entrez Gene (41), UniProt (43) and NCBI Homologene (50) databases, with some attributes being manually curated.

Automatic maintenance and update is a key feature of the SpliceosomeDB. Scripts regularly populate information across database tables after the upload of new mass spectrometry experiments. Additionally, public database entries are checked for updates, with local copies stored as XML files in the event that the database schema changes, which seamlessly provides links to the previously mentioned resources. At this time, there are 3636 gene/protein entries from seven different organisms: Homo sapiens, Gallus gallus, Drosophila melanogaster, S. cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, Plasmodium falicparum, Leishmania major, and Trypanosoma brucei. We recorded results of 135 published MS analyses of various splicing related complexes and will add more as they are reported (5–40).

Web interface

A web interface for the database is located at http://spliceosomedb.ucsc.edu. The interface is built using the Django web framework with Apache serving pages. Django follows the Model, View, Template (MTV) framework of development. Data is stored in MySQL, defined by its representation (Model or schema), accessed by Python scripts called views (View) and rendered into HTML templates (Template) to be displayed through web browsers.

DISCUSSION

To aid studies of the spliceosome, which is one of the most complicated macromolecular machines in eukaryotic cells, we have established an important information source freely available to the public. Our goal was not to replicate the information already available in other databases for spliceosome components, but instead was to create a tool for navigating the complexities of spliceosome assembly dynamics and nomenclature with easy access to other information sources. To that end, SpliceosomeDB is organized from two primary perspectives: (i) information pertaining to specific spliceosome components and (ii) grouped results of mass spectrometry analysis of splicing-related samples. By allowing for cross-reference of information from these perspectives, SpliceosomeDB offers a unique and powerful tool for exploring spliceosome structure and dynamics.

Our lab has been using an in-house form of this database for several years to compare and interpret long lists of MS analyses of spliceosome complexes. From the first MS analysis of human C complex spliceosome, we identified many proteins with homologs in yeast for which genetic and biochemical studies established roles in spliceosome functions (11,18,19). However, many additional proteins with no clear ortholog in S. cerevisiae were also associated with human spliceosomes. Whether these proteins have a role in the spliceosome or were contaminants in our preparation was not known. By comparing results across several experimental systems, it was clear that a number of these proteins are consistently identified with splicing complexes and thereby have a higher likelihood of being bona fide splicing factors.

Comparisons of MS results have also been important for understanding the dynamics of spliceosome assembly and function. Differences in the composition of spliceosomes arrested at different stages of assembly likely reflect the joining and leaving of components to and from the complex, which may suggest when they function in the spliceosome. With SpliceosomeDB, other researchers are able to easily make such comparisons across a larger number of MS studies, with the ability to focus on genes of their particular interest.

One caveat of the MS data is that they are not strictly quantitative, which is to say that all proteins reported to associate with a given complex are not necessarily stoichiometric. Some clue to the relative abundance of a given protein in a complex can be derived by the number of unique peptides used to identify the protein, which a higher number indicating significant representation of the protein in the sample. However, peptide numbers also depend on protein size, with larger proteins yielding more peptides than smaller peptides. The total amount of samples analyzed and the sample complexity, also significantly affect the number of peptides sequenced for a given protein. Therefore, it is important to look at the MS data across an entire experiment to get a feel for the number of peptides that indicate likely stoichiometric presence of a protein. In that same vein, one cannot directly compare peptide numbers from different experiments, so again looking at data en masse is key to making judgments about the relative abundance of proteins. Fortunately, SpliceosomeDB makes it possible to display the entire data from MS experiments to provide this context.

Finally, SpliceosomeDB is useful as an organizational tool to keep track of the hundreds of spliceosome proteins and quickly find them by a number of key features. For example, we needed the list of spliceosome C complex components that are in a particular molecular weight range. Without the database, answering that question would have been very time-consuming, but with SpliceosomeDB, a straightforward query immediately returned the desired list. Furthermore, ready access to gene, protein and homolog information advances discussion with other researchers in, for example, recalling the name of a yeast homolog or association of a protein in at particular stage of spliceosome assembly. We expect that other scientists in the splicing community and beyond will also find the database useful in propagating their own studies and conversations.

Looking toward the future, we will continue to add MS data to SpliceosomeDB as they are published and plan to record additional attributes for MS experiments, such as the details of purification conditions used in isolating the samples analyzed. We welcome feedback and requests for additional features. For example, based on user input, we are currently gathering interaction data for yeast proteins including genetic interactions. SpliceosomeDB is, and will continue to be an important resource for researchers studying this complicated cellular machine.

FUNDING

Funding for open access charge: National Institutes of Health [5R01GM72649 to M.S.J.].

Conflict of interest statement. None declared.

ACKNOWLEDGEMENTS

We thank Roger Jungemann and Denise Playdle for aid in designing database architecture and graphics, and members of the Jurica lab for discussion and testing.

REFERENCES

1

Nilsen

TW

.

The spliceosome: the most complex macromolecular machine in the cell?

,

Bioessays

,

2003

, vol.

25

(pg.

1147

-

1149

)

2

Jurica

MS

,

Moore

MJ

.

Pre-mRNA splicing: awash in a sea of proteins

,

Mol. Cell

,

2003

, vol.

12

(pg.

5

-

14

)

3

Wahl

MC

,

Will

CL

,

Luhrmann

R

.

The spliceosome: design principles of a dynamic RNP machine

,

Cell

,

2009

, vol.

136

(pg.

701

-

718

)

4

Will

CL

,

Luhrmann

R

.

Spliceosome structure and function

,

Cold Spring Harb. Perspect. Biol.

,

2011

, vol.

3

pg.

a003707

5

Ajuh

P

,

Kuster

B

,

Panov

K

,

Zomerdijk

JC

,

Mann

M

,

Lamond

AI

.

Functional analysis of the human CDC5L complex and identification of its components by mass spectrometry

,

EMBO J.

,

2000

, vol.

19

(pg.

6569

-

6581

)

6

Behzadnia

N

,

Golas

MM

,

Hartmuth

K

,

Sander

B

,

Kastner

B

,

Deckert

J

,

Dube

P

,

Will

CL

,

Urlaub

H

,

Stark

H

, et al.

Composition and three-dimensional EM structure of double affinity-purified, human prespliceosomal A complexes

,

EMBO J.

,

2007

, vol.

26

(pg.

1737

-

1748

)

7

Bessonov

S

,

Anokhina

M

,

Krasauskas

A

,

Golas

MM

,

Sander

B

,

Will

CL

,

Urlaub

H

,

Stark

H

,

Luhrmann

R

.

Characterization of purified human Bact spliceosomal complexes reveals compositional and morphological changes during spliceosome activation and first step catalysis

,

RNA

,

2008

, vol.

16

(pg.

2384

-

2403

)

Google Scholar

Crossref

WorldCat

8

Boehringer

D

,

Makarov

EM

,

Sander

B

,

Makarova

OV

,

Kastner

B

,

Luhrmann

R

,

Stark

H

.

Three-dimensional structure of a pre-catalytic human spliceosomal complex B

,

Nat. Struct. Mol. Biol.

,

2004

, vol.

11

(pg.

463

-

468

)

9

Carnahan

RH

,

Feoktistova

A

,

Ren

L

,

Niessen

S

,

Yates

JR

3rd

,

Gould

KL

.

Dim1p is required for efficient splicing and export of mRNA encoding lid1p, a component of the fission yeast anaphase-promoting complex

,

Eukaryot Cell.

,

2005

, vol.

4

(pg.

577

-

587

)

10

Chen

YI

,

Moore

RE

,

Ge

HY

,

Young

MK

,

Lee

TD

,

Stevens

SW

.

Proteomic analysis of in vivo-assembled pre-mRNA splicing complexes expands the catalog of participating factors

,

Nucleic Acids Res.

,

2007

, vol.

35

(pg.

3928

-

3944

)

11

Coltri

P

,

Effenberger

K

,

Chalkley

RJ

,

Burlingame

AL

,

Jurica

MS

.

Breaking up the C complex spliceosome shows stable association of proteins with the lariat intron intermediate

,

PLoS One

,

2011

, vol.

6

pg.

e19061

12

Deckert

J

,

Hartmuth

K

,

Boehringer

D

,

Behzadnia

N

,

Will

CL

,

Kastner

B

,

Stark

H

,

Urlaub

H

,

Luhrmann

R

.

Protein composition and electron microscopy structure of affinity-purified human spliceosomal B complexes isolated under physiological conditions

,

Mol. Cell. Biol.

,

2006

, vol.

26

(pg.

5528

-

5543

)

13

Fabrizio

P

,

Dannenberg

J

,

Dube

P

,

Kastner

B

,

Stark

H

,

Urlaub

H

,

Luhrmann

R

.

The evolutionarily conserved core design of the catalytic activation step of the yeast spliceosome

,

Mol. Cell

,

2009

, vol.

36

(pg.

593

-

608

)

14

Gottschalk

A

,

Neubauer

G

,

Banroques

J

,

Mann

M

,

Luhrmann

R

,

Fabrizio

P

.

Identification by mass spectrometry and functional analysis of novel proteins of the yeast [U4/U6.U5] tri-snRNP

,

EMBO J.

,

1999

, vol.

18

(pg.

4535

-

4548

)

15

Gottschalk

A

,

Tang

J

,

Puig

O

,

Salgado

J

,

Neubauer

G

,

Colot

HV

,

Mann

M

,

Seraphin

B

,

Rosbash

M

,

Luhrmann

R

, et al.

A comprehensive biochemical and genetic analysis of the yeast U1 snRNP reveals five novel proteins

,

RNA

,

1998

, vol.

4

(pg.

374

-

393

)

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

16

Hartmuth

K

,

Urlaub

H

,

Vornlocher

HP

,

Will

CL

,

Gentzel

M

,

Wilm

M

,

Luhrmann

R

.

Protein composition of human prespliceosomes isolated by a tobramycin affinity-selection method

,

Proc. Natl Acad. Sci. USA

,

2002

, vol.

99

(pg.

16719

-

16724

)

Google Scholar

Crossref

WorldCat

17

Herold

N

,

Will

CL

,

Wolf

E

,

Kastner

B

,

Urlaub

H

,

Luhrmann

R

.

Conservation of the protein composition and electron microscopy structure of Drosophila melanogaster and human spliceosomal complexes

,

Mol. Cell. Biol.

,

2009

, vol.

29

(pg.

281

-

301

)

18

Ilagan

J

,

Yuh

P

,

Chalkley

RJ

,

Burlingame

AL

,

Jurica

MS

.

The role of exon sequences in C complex spliceosome structure

,

J. Mol. Biol.

,

2009

, vol.

394

(pg.

363

-

375

)

19

Jurica

MS

,

Licklider

LJ

,

Gygi

SR

,

Grigorieff

N

,

Moore

MJ

.

Purification and characterization of native spliceosomes suitable for three-dimensional structural analysis

,

RNA

,

2002

, vol.

8

(pg.

426

-

439

)

20

Khanna

M

,

Van Bakel

H

,

Tang

X

,

Calarco

JA

,

Babak

T

,

Guo

G

,

Emili

A

,

Greenblatt

JF

,

Hughes

TR

,

Krogan

NJ

, et al.

A systematic characterization of Cwc21, the yeast ortholog of the human spliceosomal protein SRm300

,

RNA

,

2009

, vol.

15

(pg.

2174

-

2185

)

21

Lardelli

RM

,

Thompson

JX

,

Yates

JR

3rd

,

Stevens

SW

.

Release of SF3 from the intron branchpoint activates the first step of pre-mRNA splicing

,

RNA

,

2010

, vol.

16

(pg.

516

-

528

)

22

Luz Ambrosio

D

,

Lee

JH

,

Panigrahi

AK

,

Nguyen

TN

,

Cicarelli

RM

,

Gunzl

A

.

Spliceosomal proteomics in Trypanosoma brucei reveal new RNA splicing factors

,

Eukaryot. Cell

,

2009

, vol.

8

(pg.

990

-

1000

)

23

Makarov

EM

,

Makarova

OV

,

Urlaub

H

,

Gentzel

M

,

Will

CL

,

Wilm

M

,

Luhrmann

R

.

Small nuclear ribonucleoprotein remodeling during catalytic activation of the spliceosome

,

Science

,

2002

, vol.

298

(pg.

2205

-

2208

)

24

Merz

C

,

Urlaub

H

,

Will

CL

,

Luhrmann

R

.

Protein composition of human mRNPs spliced in vitro and differential requirements for mRNP protein recruitment

,

RNA

,

2007

, vol.

13

(pg.

116

-

128

)

25

Newo

AN

,

Lutzelberger

M

,

Bottner

CA

,

Wehland

J

,

Wissing

J

,

Jansch

L

,

Kaufer

NF

.

Proteomic analysis of the U1 snRNP of Schizosaccharomyces pombe reveals three essential organism-specific proteins

,

Nucleic Acids Res.

,

2007

, vol.

35

(pg.

1391

-

1401

)

26

Ohi

MD

,

Gould

KL

.

Characterization of interactions among the Cef1p-Prp19p-associated splicing complex

,

RNA

,

2002

, vol.

8

(pg.

798

-

815

)

27

Palfi

Z

,

Jae

N

,

Preusser

C

,

Kaminska

KH

,

Bujnicki

JM

,

Lee

JH

,

Gunzl

A

,

Kambach

C

,

Urlaub

H

,

Bindereif

A

.

SMN-assisted assembly of snRNP-specific Sm cores in trypanosomes

,

Genes Dev.

,

2009

, vol.

23

(pg.

1650

-

1664

)

28

Peng

R

,

Hawkins

I

,

Link

AJ

,

Patton

JG

.

The splicing factor PSF is part of a large complex that assembles in the absence of pre-mRNA and contains all five snRNPs

,

RNA Biol.

,

2006

, vol.

3

(pg.

69

-

76

)

29

Rappsilber

J

,

Ryder

U

,

Lamond

AI

,

Mann

M

.

Large-scale proteomic analysis of the human spliceosome

,

Genome Res.

,

2002

, vol.

12

(pg.

1231

-

1245

)

30

Sharma

S

,

Falick

AM

,

Black

DL

.

Polypyrimidine tract binding protein blocks the 5′ splice site-dependent assembly of U2AF and the prespliceosomal E complex

,

Mol. Cell

,

2005

, vol.

19

(pg.

485

-

496

)

31

Sharma

S

,

Kohlstaedt

LA

,

Damianov

A

,

Rio

DC

,

Black

DL

.

Polypyrimidine tract binding protein controls the transition from exon definition to an intron defined spliceosome

,

Nat. Struct. Mol. Biol.

,

2008

, vol.

15

(pg.

183

-

191

)

32

Stevens

SW

,

Barta

I

,

Ge

HY

,

Moore

RE

,

Young

MK

,

Lee

TD

,

Abelson

J

.

Biochemical and genetic analyses of the U5, U6, and U4/U6 x U5 small nuclear ribonucleoproteins from Saccharomyces cerevisiae

,

RNA

,

2001

, vol.

7

(pg.

1543

-

1553

)

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

33

Stevens

SW

,

Ryan

DE

,

Ge

HY

,

Moore

RE

,

Young

MK

,

Lee

TD

,

Abelson

J

.

Composition and functional characterization of the yeast spliceosomal penta-snRNP

,

Mol. Cell

,

2002

, vol.

9

(pg.

31

-

44

)

34

Tkacz

ID

,

Gupta

SK

,

Volkov

V

,

Romano

M

,

Haham

T

,

Tulinski

P

,

Lebenthal

I

,

Michaeli

S

.

Analysis of spliceosomal proteins in Trypanosomatids reveals novel functions in mRNA processing

,

J. Biol. Chem.

,

2010

, vol.

285

(pg.

27982

-

27999

)

35

Wang

Q

,

Hobbs

K

,

Lynn

B

,

Rymond

BC

.

The Clf1p splicing factor promotes spliceosome assembly through N-terminal tetratricopeptide repeat contacts

,

J. Biol. Chem.

,

2003

, vol.

278

(pg.

7875

-

7883

)

36

Will

CL

,

Schneider

C

,

Hossbach

M

,

Urlaub

H

,

Rauhut

R

,

Elbashir

S

,

Tuschl

T

,

Luhrmann

R

.

The human 18S U11/U12 snRNP contains a set of novel proteins not found in the U2-dependent spliceosome

,

RNA

,

2004

, vol.

10

(pg.

929

-

941

)

37

Zhang

C

,

Dowd

DR

,

Staal

A

,

Gu

C

,

Lian

JB

,

van Wijnen

AJ

,

Stein

GS

,

MacDonald

PN

.

Nuclear coactivator-62 kDa/Ski-interacting protein is a nuclear matrix-associated coactivator that may couple vitamin D receptor-mediated transcription and RNA splicing

,

J. Biol. Chem.

,

2003

, vol.

278

(pg.

35325

-

35336

)

38

Zhou

Z

,

Licklider

LJ

,

Gygi

SP

,

Reed

R

.

Comprehensive proteomic analysis of the human spliceosome

,

Nature

,

2002

, vol.

419

(pg.

182

-

185

)

39

Agafonov

DE

,

Deckert

J

,

Wolf

E

,

Odenwalder

P

,

Bessonov

S

,

Will

CL

,

Urlaub

H

,

Luhrmann

R

.

Semiquantitative proteomic analysis of the human spliceosome via a novel two-dimensional gel electrophoresis method

,

Mol. Cell. Biol.

,

2011

, vol.

31

(pg.

2667

-

2682

)

40

Makarov

EM

,

Owen

N

,

Bottrill

A

,

Makarova

OV

.

Functional mammalian spliceosomal complex E contains SMN complex proteins in addition to U1 and U2 snRNPs

,

Nucleic Acids Res.

,

2011

, vol.

40

(pg.

2639

-

2652

)

41

Maglott

D

,

Ostell

J

,

Pruitt

KD

,

Tatusova

T

.

Entrez Gene: gene-centered information at NCBI

,

Nucleic Acids Res.

,

2011

, vol.

39

(pg.

D52

-

D57

)

42

Flicek

P

,

Amode

MR

,

Barrell

D

,

Beal

K

,

Brent

S

,

Carvalho-Silva

D

,

Clapham

P

,

Coates

G

,

Fairley

S

,

Fitzgerald

S

, et al.

Ensembl 2012

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D84

-

D90

)

43

Magrane

M

,

Consortium

U

.

UniProt Knowledgebase: a hub of integrated protein data

,

Database

,

2011

, vol.

2011

pg.

bar009

44

McQuilton

P

,

St Pierre

SE

,

Thurmond

J

.

FlyBase 101–the basics of navigating FlyBase

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D706

-

D714

)

45

Cherry

JM

,

Hong

EL

,

Amundsen

C

,

Balakrishnan

R

,

Binkley

G

,

Chan

ET

,

Christie

KR

,

Costanzo

MC

,

Dwight

SS

,

Engel

SR

, et al.

Saccharomyces Genome Database: the genomics resource of budding yeast

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D700

-

D705

)

46

Stark

C

,

Breitkreutz

BJ

,

Chatr-Aryamontri

A

,

Boucher

L

,

Oughtred

R

,

Livstone

MS

,

Nixon

J

,

Van Auken

K

,

Wang

X

,

Shi

X

, et al.

The BioGRID Interaction Database: 2011 update

,

Nucleic Acids Res.

,

2011

, vol.

39

(pg.

D698

-

D704

)

47

Bernstein

FC

,

Koetzle

TF

,

Williams

GJ

,

Meyer

EF

Jr

,

Brice

MD

,

Rodgers

JR

,

Kennard

O

,

Shimanouchi

T

,

Tasumi

M

.

The Protein Data Bank

,

A computer-based archival file for macromolecular structures. Eur. J. Biochem.

,

1977

, vol.

80

(pg.

319

-

324

)

Google Scholar

OpenURL Placeholder Text

WorldCat

48

Rose

PW

,

Beran

B

,

Bi

C

,

Bluhm

WF

,

Dimitropoulos

D

,

Goodsell

DS

,

Prlic

A

,

Quesada

M

,

Quinn

GB

,

Westbrook

JD

, et al.

The RCSB Protein Data Bank: redesigned web site and web services

,

Nucleic Acids Res.

,

2011

, vol.

39

(pg.

D392

-

D401

)

49

Korneta

I

,

Magnus

M

,

Bujnicki

JM

.

Structural bioinformatics of the human spliceosomal proteome

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

7046

-

7065

)

50

Sayers

EW

,

Barrett

T

,

Benson

DA

,

Bolton

E

,

Bryant

SH

,

Canese

K

,

Chetvernin

V

,

Church

DM

,

Dicuccio

M

,

Federhen

S

, et al.

Database resources of the National Center for Biotechnology Information

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D13

-

D25

)

51

Hegele

A

,

Kamburov

A

,

Grossmann

A

,

Sourlis

C

,

Wowro

S

,

Weimann

M

,

Will

CL

,

Pena

V

,

Luhrmann

R

,

Stelzl

U

.

Dynamic protein-protein interaction wiring of the human spliceosome

,

Mol. Cell

,

2012

, vol.

45

(pg.

567

-

580

)

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.

Download all slides

Month:	Total Views:
November 2016	1
December 2016	3
January 2017	11
February 2017	17
March 2017	9
April 2017	4
May 2017	14
June 2017	7
July 2017	1
August 2017	9
September 2017	13
October 2017	6
November 2017	14
December 2017	24
January 2018	30
February 2018	18
March 2018	32
April 2018	56
May 2018	50
June 2018	25
July 2018	281
August 2018	28
September 2018	26
October 2018	32
November 2018	29
December 2018	25
January 2019	26
February 2019	31
March 2019	41
April 2019	40
May 2019	38
June 2019	36
July 2019	26
August 2019	30
September 2019	23
October 2019	26
November 2019	38
December 2019	18
January 2020	27
February 2020	32
March 2020	21
April 2020	22
May 2020	20
June 2020	24
July 2020	24
August 2020	26
September 2020	23
October 2020	33
November 2020	41
December 2020	50
January 2021	30
February 2021	38
March 2021	61
April 2021	42
May 2021	61
June 2021	38
July 2021	48
August 2021	21
September 2021	43
October 2021	89
November 2021	78
December 2021	61
January 2022	67
February 2022	47
March 2022	84
April 2022	80
May 2022	57
June 2022	34
July 2022	59
August 2022	46
September 2022	61
October 2022	76
November 2022	46
December 2022	39
January 2023	58
February 2023	62
March 2023	55
April 2023	57
May 2023	36
June 2023	50
July 2023	42
August 2023	86
September 2023	75
October 2023	83
November 2023	77
December 2023	77
January 2024	99
February 2024	86
March 2024	95
April 2024	41

Article Contents

Spliceosome Database: a tool for tracking components of the spliceosome

Abstract

INTRODUCTION

DATABASE FEATURES

Searching for spliceosome components

Information for individual spliceosome genes/proteins

Displaying and comparing spliceosomal complexes

MS analyses of spliceosome complexes

Comparing MS experiment results

Documentation, discussion forum and feedback

DATABASE ARCHITECTURE AND WEB INTERFACE

Web interface

DISCUSSION

FUNDING

ACKNOWLEDGEMENTS

REFERENCES

Supplementary data

Comments

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Spliceosome Database: a tool for tracking components of the spliceosome

Abstract

INTRODUCTION

DATABASE FEATURES

Searching for spliceosome components

Information for individual spliceosome genes/proteins

Displaying and comparing spliceosomal complexes

MS analyses of spliceosome complexes

Comparing MS experiment results

Documentation, discussion forum and feedback

DATABASE ARCHITECTURE AND WEB INTERFACE

Web interface

DISCUSSION

FUNDING

ACKNOWLEDGEMENTS

REFERENCES

Supplementary data

Comments

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only