STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets

Damian Szklarczyk; Annika L Gable; David Lyon; Alexander Junge; Stefan Wyder; Jaime Huerta-Cepas; Milan Simonovic; Nadezhda T Doncheva; John H Morris; Peer Bork; Lars J Jensen; Christian von Mering

doi:10.1093/nar/gky1131

Disease Systems Biology Program

Abstract

Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.

Original language	English
Journal	Nucleic Acids Research
Volume	47
Issue number	D1
Pages (from-to)	D607-D613
Number of pages	7
ISSN	0305-1048
DOIs	https://doi.org/10.1093/nar/gky1131
Publication status	Published - 8 Jan 2019

Access to Document

10.1093/nar/gky1131 (Licence: CC BY)

STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets (Final published version, 1.72 MB; Licence: CC BY)

Cite this

Szklarczyk, D., Gable, A. L., Lyon, D., Junge, A., Wyder, S., Huerta-Cepas, J., Simonovic, M., Doncheva, N. T., Morris, J. H., Bork, P., Jensen, L. J., & von Mering, C. (2019). STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research, 47(D1), D607-D613. https://doi.org/10.1093/nar/gky1131

Szklarczyk, D, Gable, AL, Lyon, D, Junge, A, Wyder, S, Huerta-Cepas, J, Simonovic, M, Doncheva, NT, Morris, JH, Bork, P, Jensen, LJ & von Mering, C 2019, 'STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets', Nucleic Acids Research, vol. 47, no. D1, pp. D607-D613. https://doi.org/10.1093/nar/gky1131

@article{03fa253c850f43eaa50b9da2dd4fdcf4,

title = "STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets",

abstract = "Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.",

author = "Damian Szklarczyk and Gable, {Annika L} and David Lyon and Alexander Junge and Stefan Wyder and Jaime Huerta-Cepas and Milan Simonovic and Doncheva, {Nadezhda T} and Morris, {John H} and Peer Bork and Jensen, {Lars J} and {von Mering}, Christian",

year = "2019",

month = jan,

day = "8",

doi = "10.1093/nar/gky1131",

language = "English",

volume = "47",

pages = "D607--D613",

journal = "Nucleic Acids Research",

issn = "0305-1048",

publisher = "Oxford University Press",

number = "D1",

}

TY - JOUR

T1 - STRING v11

T2 - protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets

AU - Szklarczyk, Damian

AU - Gable, Annika L

AU - Lyon, David

AU - Junge, Alexander

AU - Wyder, Stefan

AU - Huerta-Cepas, Jaime

AU - Simonovic, Milan

AU - Doncheva, Nadezhda T

AU - Morris, John H

AU - Bork, Peer

AU - Jensen, Lars J

AU - von Mering, Christian

PY - 2019/1/8

Y1 - 2019/1/8

AB - Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.

U2 - 10.1093/nar/gky1131

DO - 10.1093/nar/gky1131

M3 - Journal article

C2 - 30476243

SN - 0305-1048

VL - 47

SP - D607

EP - D613

JO - Nucleic Acids Research

JF - Nucleic Acids Research

IS - D1

ER -
