STRING v10: protein–protein interaction networks, integrated over the tree of life

Szklarczyk, Damian; Franceschini, Andrea; Wyder, Stefan; Forslund, Kristoffer; Heller, Davide; Huerta-Cepas, Jaime; Simonovic, Milan; Roth, Alexander; Santos, Alberto; Tsafou, Kalliopi P.; Kuhn, Michael; Bork, Peer; Jensen, Lars J.; von Mering, Christian

doi:10.1093/nar/gku1003

Abstract

The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein–protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein–protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.

INTRODUCTION

For a full description of a protein's function, knowledge about its specific interaction partners is an important prerequisite. The concept of protein ‘function’ is somewhat hierarchical (1–4), and at all levels in this hierarchy, interactions between proteins help to describe and narrow down a protein's function: its three-dimensional structure may become meaningful only in the context of a larger protein assembly, its molecular actions may be regulated by co-operative binding or allostery, and its cellular context may be controlled by a multitude of transport, sequestering, and signaling interactions. Given this importance of interactions, many protein annotation and classification schemes assign groups of interacting proteins into functional sets, designated either as physical complexes, signaling pathways or tightly linked ‘modules’ (1,5–7). However, the partitioning of interactions into distinct pathways or complexes can be somewhat arbitrary, and may not do justice to the prevalence of crosstalk and dynamic variation in the interaction landscape (8). A widely used concept that avoids partitioning of function arbitrarily is the protein network, i.e. the topological summary of all known or predicted protein interactions in an organism. For functional studies, arguably the most useful networks are those that integrate all types of interactions: stable physical associations, transient binding, substrate chaining, information relay and others. The STRING database (Search Tool for the Retrieval of Interacting Genes/Proteins) is dedicated to such functional associations between proteins, on a global scale.

Protein–protein interaction information can already be retrieved from a number of online resources. First, primary interaction databases (e.g. 9–13) which are largely collaborating (14,15) provide curated experimental data originating from a variety of biochemical, biophysical and genetic techniques. Second, since protein–protein interactions can also be predicted computationally, a number of resources have their main focus on interaction prediction, using a variety of algorithms (e.g. 16–20). Lastly, a group of online resources is providing an integration of both known and predicted interactions, thus aiming for high comprehensiveness and coverage. These include STRING, as well as GeneMANIA (21), FunCoup (18), I2D (22), ConsensusPathDB (22) and others. Within this landscape of online resources, STRING places its focus on interaction confidence scoring, comprehensive coverage (in terms of number of proteins, organisms and prediction methods), intuitive user interfaces and on a commitment to maintain a long-term, stable resource (since 2000).

The basic interaction unit in STRING is the functional association, i.e. a specific and productive functional relationship between two proteins, likely contributing to a common biological purpose. Interactions are derived from multiple sources: (i) known experimental interactions are imported from primary databases, (ii) pathway knowledge is parsed from manually curated databases, (iii) automated text-mining is applied to uncover statistical and/or semantic links between proteins, based on Medline abstracts and a large collection of full-text articles, (iv) interactions are predicted de novo by a number of algorithms using genomic information (23–25) as well as by co-expression analysis and (v) interactions that are observed in one organism are systematically transferred to other organisms, via pre-computed orthology relations. STRING centers on protein-coding gene loci—alternative splice isoforms or post-translationally modified forms are not resolved, but are instead collapsed at the level of the gene locus. All sources of interaction evidence are benchmarked and calibrated against previous knowledge, using the high-level functional groupings provided by the manually curated Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway maps (5).

As of the current update to version 10.0, the number of organisms covered by STRING has increased to 2031, almost doubling over the previous release. The update also encompassed importing and processing all primary data sources again, re-running all prediction algorithms and re-executing the entire text-mining pipeline with new dictionaries and extended text collections. Many of the features and interfaces of STRING have already been described previously (26–28). Below, we have given a short overview of the resource and describe recent additions and modifications.

User interface

The main entry point into the STRING website is the protein search box on its start page. It supports queries for multiple proteins, can be restricted to certain organisms or clades of organisms, and uses a weighted scheme to rank annotation text matches and identifier matches. Users can also arrive via a number of external websites (29–32) that maintain cross-links with STRING, including the partner resources Search Tool for Interactions of Chemicals (STITCH; 33) and eggNOG (34)—the latter both share protein sequences, annotations and name-spaces with STRING. A third way to enter STRING is via logging on to the My Data section; this allows users to upload gene-lists, create identifier mappings, view their browsing history and provide additional ‘payload’ data to be displayed alongside the interactions.

Once a protein or set of proteins is identified, users proceed to the network view (Figure 1). From there, it is possible to inspect the interaction evidence, to re-adjust the score-cutoffs and network size limits and to view detailed information about the interacting proteins. Upon switching to the ‘advanced’ mode (via the tool panel below the network), users can also cluster and rearrange the network and test for statistical enrichments in the network. The latter feature has been enhanced for the current version 10.0 of STRING: enrichment detection now also covers human disease associations and tissue annotations, which might be statistically enriched in a given network. For this feature, STRING connects with the partner databases TISSUES (http://tissues.jensenlab.org) and DISEASES (http://diseases.jensenlab.org), which also share sequence and name spaces with STRING, and which annotate proteins to tissues or to disease entities based on a combination of automated text-mining and knowledge imports.

Figure 1.

Open in new tab Download slide

The STRING network view. Combined screenshots from the STRING website, which has been queried with a subset of proteins belonging to two different protein complexes in yeast (the COP9 signalosome, as well as the proteasome). Colored lines between the proteins indicate the various types of interaction evidence. Protein nodes which are enlarged indicate the availability of 3D protein structure information. Inset top right: for each protein, accessory information is available which includes annotations, cross-links and domain structures. Inset bottom right: the same network is shown after the addition of a user-configurable ‘payload’-dataset (26). In this case, the payload corresponds to color-coded protein abundance information, and reveals systematic differences in the expression strength of both complexes.

Interaction transfer between organisms

Since version 6.0 of STRING, a significant source of interactions for any given organism has been the transfer of interaction knowledge from orthologous proteins observed to be interacting in another organism. Since version 9.1, these so-called ‘interolog’ transfers were based on pre-computed orthology relations imported from the eggNOG database (34). Orthologs in eggNOG are provided in a hierarchical and nested fashion, allowing the transfer of interactions by traversing up and down along the hierarchy of clades in the tree of life (26). For this purpose, the nested orthology assignments should ideally be fully self-consistent: proteins assigned to an orthologous group for a given phylogenetic clade should be grouped together in all higher-level clades too. In past versions of the orthologous groups, this has not always been the case for technical reasons (orthology assignments are computed independently for each clade). However, for STRING v10, a post-processing pipeline has been devised that makes the orthology setup fully self-consistent. It implements consistency by iteratively splitting and merging orthologous groups at the various clades and levels, until a fully consistent state is achieved. As of now, this post-processed set of orthologs forms the basis for all interaction-transfers in STRING v10. In future releases, the same hierarchical and consistent set of protein families and orthologs will be used also for more intuitive navigation and search features on the user interface.

Co-expression analysis

It has long been established that co-expression is a proxy for co-regulation (35,36) and a strong indicator of functional associations. The co-expression scores in STRING v10 are computed using a revised and improved pipeline (Figure 2), making use of all microarray gene expression experiments deposited in NCBI Gene Expression Omnibus (NCBI GEO) (37). As of March 2014, GEO consisted of more than 12 000 different platforms (GPL), 45 000 experiments (GSE) and over 1 million matrices (GSM). By including the large amount of diverse arrays in the analysis we can decrease the bias of individual platforms and experiments, and reduce the impact of non-informative matrices. Prior to the analysis, 22 organisms were identified as providing sufficient data (at least 50 experiments each). The first step of the pipeline maps probe identifiers from each platform file (GPL) to STRING genes, using dictionaries from the text-mining pipeline. Samples with less than 100 map-able genes and experiments with less than three samples are excluded from further analysis. The microarray expression values (extracted from the GSE files) are then normalized (z-value normalization) and values for each probe merged into single vectors (separately for single-channel and dual-channel arrays). Additionally, single-channel array values are log₂-transformed and their mean is subtracted, to make them compatible with fold-change values in the two-channel case. Expression values of genes measured by more than one probe are averaged. In order to remove the redundancy and to increase information density between the arrays, the gene expression vectors are correlated with one another (using Spearman's rank correlation) and the full set of arrays is pruned using the Hobohm-2 algorithm (38) with similarity thresholds of 0.7 and 0.95, for single-channel and dual-channel arrays, respectively. The new gene expression values are then correlated gene-by-gene (Pearson correlation) and the resulting values are calibrated against common membership in KEGG pathway maps (release 2014-07-21) in order to compute STRING scores. Lastly, the scores from single- and dual-channel arrays are combined in a probabilistic manner to get the final scores. KEGG benchmark performance clearly improves relative to STRING v9.1 (Figure 2). The improvements can be attributed to the increased size of the GEO repository (experiments added since 2011) and to changes in our pipeline, namely: (i) the additional step to prune highly correlated samples using the Hobohm-2 algorithm and (ii) several minor improvements and bug fixes.

Figure 2.

Open in new tab Download slide

Improved Co-expression analysis. STRING v10 features a completely re-designed pipeline for accessing and processing gene expression information. Left: overview of the individual steps; note that redundant expression experiments are now detected and pruned automatically. Right: improved benchmark performance of the resulting co-expression links, relative to the previous version of STRING, in four model organisms (ROC curves). The benchmark is based on the KEGG pathway maps; predicted interactions are considered to be true positives when both interacting proteins are annotated to the same KEGG map.

R/Bioconductor access

Apart from directly browsing and searching the website, data access in STRING is possible also via a REST-based API (application programing interface) and via wholesale data download. With version 10.0, we have introduced a further option: direct access from the R programming environment, following the Bioconductor standard (39). The corresponding package is named STRINGdb (Figure 3), and can be downloaded from the Bioconductor repository (http://www.bioconductor.org/packages/release/bioc/html/STRINGdb.html). The package interacts with the STRING server via the REST API and via additional, dedicated web services. To optimize the speed of subsequent accesses, the entire interaction network and associated data for a given organism are downloaded from the server and cached locally in the R environment, whenever possible. The package is built around the iGraph framework (40), which handles the complexity of the network data structures and provides fast query/analysis functions. Once a network is loaded/cached into an iGraph object, high-level functions facilitate the most common user tasks, such as mapping protein names onto their corresponding STRING identifiers, retrieving the neighbors of a protein of interest, retrieving PubMed IDs for publications that support a given interaction, finding clusters of proteins in the network and generating stable links back to the STRING website.

Figure 3.

Open in new tab Download slide

Access to STRING from R/Bioconductor. Left: example session describing how to initialize a human protein network from the STRING database backend, and how to map a set of gene names against it. A subset of the proteins is then plotted as a STRING network (right), complete with auxiliary numerical payload-information highlighting some nodes of interest (red color halos).

The plot_network function can be used to display a native STRING network of proteins in R (Figure 3). Functions are also available to augment a given network with user-provided node colorings (‘payload information’, see also Figure 1), such that subsets of proteins can be tagged and visually highlighted. Statistical enrichment tests can be executed on gene lists within the STRING namespace, covering Gene Ontology and pathway annotations, as well as tissue and diseases annotations. Results can be visualized as lists of enriched terms and/or heatmaps. The R-package proves particularly valuable for users arriving with a very large set of genes, for which the web-based interface of STRING has previously been a major bottleneck.

The authors wish to thank Yan P. Yuan (EMBL Heidelberg) for excellent technical support with the STRING backend servers. Prof. Dr Thomas Rattei and his SIMAP team (University of Vienna) are gratefully acknowledged for extensive technical support during access to their systematic protein–protein similarity data.

FUNDING

Swiss Institute of Bioinformatics; Novo Nordisk Foundation Center for Protein Research (Copenhagen); European Molecular Biology Laboratory (EMBL, Heidelberg). Funding for open access charges: University of Zurich.

Conflict of interest statement. None declared.

REFERENCES

1.

Ashburner

M.

,

Ball

C.A.

,

Blake

J.A.

,

Botstein

D.

,

Butler

H.

,

Cherry

J.M.

,

Davis

A.P.

,

Dolinski

K.

,

Dwight

S.S.

,

Eppig

J.T.

, et al.

Gene ontology: tool for the unification of biology. The Gene Ontology Consortium

,

Nat. Genet.

,

2000

, vol.

25

(pg.

25

-

29

)

2.

Lee

D.

,

Redfern

O.

,

Orengo

C.

.

Predicting protein function from sequence and structure

,

Nat. Rev. Mol. Cell Biol.

,

2007

, vol.

8

(pg.

995

-

1005

)

3.

Ouzounis

C.A.

,

Coulson

R.M.

,

Enright

A.J.

,

Kunin

V.

,

Pereira-Leal

J.B.

.

Classification schemes for protein structure and function

,

Nat. Rev. Genet.

,

2003

, vol.

4

(pg.

508

-

519

)

4.

Bairoch

A.

,

Boeckmann

B.

.

The SWISS-PROT protein sequence data bank: current status

,

Nucleic Acids Res.

,

1994

, vol.

22

(pg.

3578

-

3580

)

5.

Kanehisa

M.

,

Goto

S.

,

Sato

Y.

,

Kawashima

M.

,

Furumichi

M.

,

Tanabe

M.

.

Data, information, knowledge and principle: back to metabolism in KEGG

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

D199

-

D205

)

6.

Croft

D.

,

Mundo

A.F.

,

Haw

R.

,

Milacic

M.

,

Weiser

J.

,

Wu

G.

,

Caudy

M.

,

Garapati

P.

,

Gillespie

M.

,

Kamdar

M.R.

, et al.

The Reactome pathway knowledgebase

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

D472

-

D477

)

7.

Sherman

B.T.

,

Huang da

W.

,

Tan

Q.

,

Guo

Y.

,

Bour

S.

,

Liu

D.

,

Stephens

R.

,

Baseler

M.W.

,

Lane

H.C.

,

Lempicki

R.A.

.

DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis

,

BMC Bioinformatics

,

2007

, vol.

8

(pg.

426

-

437

)

8.

Gibson

T.J.

.

Cell regulation: determined to signal discrete cooperation

,

Trends Biochem. Sci.

,

2009

, vol.

34

(pg.

471

-

482

)

9.

Kerrien

S.

,

Aranda

B.

,

Breuza

L.

,

Bridge

A.

,

Broackes-Carter

F.

,

Chen

C.

,

Duesbury

M.

,

Dumousseau

M.

,

Feuermann

M.

,

Hinz

U.

, et al.

The IntAct molecular interaction database in 2012

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D841

-

D846

)

10.

Licata

L.

,

Briganti

L.

,

Peluso

D.

,

Perfetto

L.

,

Iannuccelli

M.

,

Galeota

E.

,

Sacco

F.

,

Palma

A.

,

Nardozza

A.P.

,

Santonico

E.

, et al.

MINT, the molecular interaction database: 2012 update

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D857

-

D861

)

11.

Chatr-Aryamontri

A.

,

Breitkreutz

B.J.

,

Heinicke

S.

,

Boucher

L.

,

Winter

A.

,

Stark

C.

,

Nixon

J.

,

Ramage

L.

,

Kolas

N.

,

O'Donnell

L.

, et al.

The BioGRID interaction database: 2013 update

,

Nucleic Acids Res.

,

2013

, vol.

41

(pg.

D816

-

D823

)

12.

Salwinski

L.

,

Miller

C.S.

,

Smith

A.J.

,

Pettit

F.K.

,

Bowie

J.U.

,

Eisenberg

D.

.

The Database of Interacting Proteins: 2004 update

,

Nucleic Acids Res.

,

2004

, vol.

32

(pg.

D449

-

D451

)

13.

Schaefer

M.H.

,

Fontaine

J.F.

,

Vinayagam

A.

,

Porras

P.

,

Wanker

E.E.

,

Andrade-Navarro

M.A.

.

HIPPIE: Integrating protein interaction networks with experiment based quality scores

,

PloS One

,

2012

, vol.

7

pg.

e31826

14.

Orchard

S.

,

Kerrien

S.

,

Abbani

S.

,

Aranda

B.

,

Bhate

J.

,

Bidwell

S.

,

Bridge

A.

,

Briganti

L.

,

Brinkman

F.S.

,

Cesareni

G.

, et al.

Protein interaction data curation: the International Molecular Exchange (IMEx) consortium

,

Nat. Methods

,

2012

, vol.

9

(pg.

345

-

350

)

15.

Orchard

S.

,

Ammari

M.

,

Aranda

B.

,

Breuza

L.

,

Briganti

L.

,

Broackes-Carter

F.

,

Campbell

N.H.

,

Chavali

G.

,

Chen

C.

,

del-Toro

N.

, et al.

The MIntAct project–IntAct as a common curation platform for 11 molecular interaction databases

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

D358

-

D363

)

16.

Luo

Q.

,

Pagel

P.

,

Vilne

B.

,

Frishman

D.

.

DIMA 3.0: Domain Interaction Map

,

Nucleic Acids Res.

,

2011

, vol.

39

(pg.

D724

-

D729

)

17.

McDowall

M.D.

,

Scott

M.S.

,

Barton

G.J.

.

PIPs: human protein-protein interaction prediction database

,

Nucleic Acids Res.

,

2009

, vol.

37

(pg.

D651

-

D656

)

18.

Schmitt

T.

,

Ogris

C.

,

Sonnhammer

E.L.

.

FunCoup 3.0: database of genome-wide functional coupling networks

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

D380

-

D388

)

19.

Zhang

Q.C.

,

Petrey

D.

,

Garzon

J.I.

,

Deng

L.

,

Honig

B.

.

PrePPI: a structure-informed database of protein-protein interactions

,

Nucleic Acids Res.

,

2013

, vol.

41

(pg.

D828

-

D833

)

20.

Baspinar

A.

,

Cukuroglu

E.

,

Nussinov

R.

,

Keskin

O.

,

Gursoy

A.

.

PRISM: a web server and repository for prediction of protein-protein interactions and modeling their 3D complexes

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

W285

-

W289

)

21.

Zuberi

K.

,

Franz

M.

,

Rodriguez

H.

,

Montojo

J.

,

Lopes

C.T.

,

Bader

G.D.

,

Morris

Q.

.

GeneMANIA prediction server 2013 update

,

Nucleic Acids Res.

,

2013

, vol.

41

(pg.

W115

-

W122

)

22.

Niu

Y.

,

Otasek

D.

,

Jurisica

I.

.

Evaluation of linguistic features useful in extraction of interactions from PubMed; application to annotating known, high-throughput and predicted interactions in I2D

,

Bioinformatics

,

2010

, vol.

26

(pg.

111

-

119

)

23.

Valencia

A.

,

Pazos

F.

.

Computational methods for the prediction of protein interactions

,

Curr. Opin. Struct. Biol.

,

2002

, vol.

12

(pg.

368

-

373

)

24.

Huynen

M.A.

,

Snel

B.

,

von Mering

C.

,

Bork

P.

.

Function prediction and protein networks

,

Curr. Opin. Struct. Biol.

,

2003

, vol.

15

(pg.

191

-

198

)

Google Scholar

Crossref

WorldCat

25.

Lewis

A.C.

,

Saeed

R.

,

Deane

C.M.

.

Predicting protein-protein interactions in the context of protein evolution

,

Mol. Biosyst.

,

2010

, vol.

6

(pg.

55

-

64

)

26.

Franceschini

A.

,

Szklarczyk

D.

,

Frankild

S.

,

Kuhn

M.

,

Simonovic

M.

,

Roth

A.

,

Lin

J.

,

Minguez

P.

,

Bork

P.

,

von Mering

C.

, et al.

STRING v9.1: protein-protein interaction networks, with increased coverage and integration

,

Nucleic Acids Res.

,

2013

, vol.

41

(pg.

D808

-

D815

)

27.

Szklarczyk

D.

,

Franceschini

A.

,

Kuhn

M.

,

Simonovic

M.

,

Roth

A.

,

Minguez

P.

,

Doerks

T.

,

Stark

M.

,

Muller

J.

,

Bork

P.

, et al.

The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored

,

Nucleic Acids Res.

,

2011

, vol.

39

(pg.

D561

-

D568

)

28.

Jensen

L.J.

,

Kuhn

M.

,

Stark

M.

,

Chaffron

S.

,

Creevey

C.

,

Muller

J.

,

Doerks

T.

,

Julien

P.

,

Roth

A.

,

Simonovic

M.

, et al.

STRING 8–a global view on proteins and their functional interactions in 630 organisms

,

Nucleic Acids Res.

,

2009

, vol.

37

(pg.

D412

-

D416

)

29.

Letunic

I.

,

Doerks

T.

,

Bork

P.

.

SMART 7: recent updates to the protein domain annotation resource

,

Nucleic Acids Res.

,

2012

, vol.

40

(pg.

D302

-

D305

)

30.

Gaudet

P.

,

Argoud-Puy

G.

,

Cusin

I.

,

Duek

P.

,

Evalet

O.

,

Gateau

A.

,

Gleizes

A.

,

Pereira

M.

,

Zahn-Zabal

M.

,

Zwahlen

C.

, et al.

neXtProt: organizing protein knowledge in the context of human proteome projects

,

J. Proteome Res.

,

2013

, vol.

12

(pg.

293

-

298

)

31.

Safran

M.

,

Dalah

I.

,

Alexander

J.

,

Rosen

N.

,

Iny Stein

T.

,

Shmoish

M.

,

Nativ

N.

,

Bahir

I.

,

Doniger

T.

,

Krug

H.

, et al.

GeneCards Version 3: the human gene integrator

,

Database

,

2010

, vol.

2010

(pg.

1

-

16

)

Google Scholar

Crossref

WorldCat

32.

UniProt Consortium

X

.

Activities at the Universal Protein Resource (UniProt)

,

Nucleic acids research

,

2014

, vol.

42

(pg.

D191

-

D198

)

33.

Kuhn

M.

,

Szklarczyk

D.

,

Pletscher-Frankild

S.

,

Blicher

T.H.

,

von Mering

C.

,

Jensen

L.J.

,

Bork

P.

.

STITCH 4: integration of protein-chemical interactions with user data

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

D401

-

D407

)

34.

Powell

S.

,

Forslund

K.

,

Szklarczyk

D.

,

Trachana

K.

,

Roth

A.

,

Huerta-Cepas

J.

,

Gabaldon

T.

,

Rattei

T.

,

Creevey

C.

,

Kuhn

M.

, et al.

eggNOG v4.0: nested orthology inference across 3686 organisms

,

Nucleic Acids Res.

,

2014

, vol.

42

(pg.

D231

-

D239

)

35.

Marcotte

E.M.

,

Pellegrini

M.

,

Thompson

M.J.

,

Yeates

T.O.

,

Eisenberg

D.

.

A combined algorithm for genome-wide prediction of protein function

,

Nature

,

1999

, vol.

402

(pg.

83

-

86

)

36.

Eisen

M.B.

,

Spellman

P.T.

,

Brown

P.O.

,

Botstein

D.

.

Cluster analysis and display of genome-wide expression patterns

,

Proc. Natl. Acad. Sci. U.S.A.

,

1998

, vol.

95

(pg.

14863

-

14868

)

37.

Barrett

T.

,

Wilhite

S.E.

,

Ledoux

P.

,

Evangelista

C.

,

Kim

I.F.

,

Tomashevsky

M.

,

Marshall

K.A.

,

Phillippy

K.H.

,

Sherman

P.M.

,

Holko

M.

, et al.

NCBI GEO: archive for functional genomics data sets–update

,

Nucleic Acids Res.

,

2013

, vol.

41

(pg.

D991

-

D995

)

38.

Hobohm

U.

,

Scharf

M.

,

Schneider

R.

,

Sander

C.

.

Selection of representative protein data sets

,

Protein Sci.

,

1992

, vol.

1

(pg.

409

-

417

)

39.

Gentleman

R.C.

,

Carey

V.J.

,

Bates

D.M.

,

Bolstad

B.

,

Dettling

M.

,

Dudoit

S.

,

Ellis

B.

,

Gautier

L.

,

Ge

Y.

,

Gentry

J.

, et al.

Bioconductor: open software development for computational biology and bioinformatics

,

Genome Biol.

,

2004

, vol.

5

pg.

R80

40.

Csárdi

G.

,

Nepusz

T.

.

The igraph software package for complex network research

,

Inter. J. Comp. Syst.

,

2006

, vol.

1695

(pg.

1

-

9

)

Google Scholar

OpenURL Placeholder Text

WorldCat

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Download all slides

Month:	Total Views:
November 2016	14
December 2016	47
January 2017	189
February 2017	394
March 2017	335
April 2017	295
May 2017	273
June 2017	280
July 2017	247
August 2017	235
September 2017	227
October 2017	241
November 2017	258
December 2017	470
January 2018	610
February 2018	509
March 2018	561
April 2018	610
May 2018	603
June 2018	612
July 2018	517
August 2018	496
September 2018	498
October 2018	558
November 2018	732
December 2018	674
January 2019	604
February 2019	717
March 2019	757
April 2019	815
May 2019	873
June 2019	747
July 2019	1,061
August 2019	652
September 2019	658
October 2019	493
November 2019	489
December 2019	376
January 2020	396
February 2020	464
March 2020	417
April 2020	273
May 2020	349
June 2020	486
July 2020	431
August 2020	433
September 2020	533
October 2020	477
November 2020	531
December 2020	366
January 2021	445
February 2021	437
March 2021	490
April 2021	495
May 2021	442
June 2021	403
July 2021	339
August 2021	462
September 2021	453
October 2021	404
November 2021	424
December 2021	405
January 2022	447
February 2022	420
March 2022	554
April 2022	504
May 2022	544
June 2022	435
July 2022	404
August 2022	383
September 2022	442
October 2022	451
November 2022	467
December 2022	439
January 2023	455
February 2023	497
March 2023	548
April 2023	507
May 2023	578
June 2023	345
July 2023	479
August 2023	376
September 2023	449
October 2023	479
November 2023	508
December 2023	555
January 2024	650
February 2024	689
March 2024	775
April 2024	329

Article Contents

STRING v10: protein–protein interaction networks, integrated over the tree of life

Abstract

INTRODUCTION

User interface

Interaction transfer between organisms

Co-expression analysis

R/Bioconductor access

FUNDING

REFERENCES

Comments

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

STRING v10: protein–protein interaction networks, integrated over the tree of life

Abstract

INTRODUCTION

User interface

Interaction transfer between organisms

Co-expression analysis

R/Bioconductor access

FUNDING

REFERENCES

Comments

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only