Abstract
We discuss the ramifications of noisy and incomplete observations of network data on the existence of a giant connected component (GCC). The existence of a GCC in a random graph can be described in terms of a percolation process, and building on general results for classes of random graphs with specified degree distributions we derive percolation thresholds above which GCCs exist. We show that sampling and noise can have a profound effect on the perceived existence of a GCC and find that both processes can destroy it. We also show that the absence of a GCC puts a theoretical upper bound on the false-positive rate and relate our percolation analysis to experimental protein-protein interaction data.
Originalsprog | Engelsk |
---|---|
Tidsskrift | Journal of the Royal Society Interface |
Vol/bind | 7 |
Udgave nummer | 51 |
Sider (fra-til) | 1411-1419 |
Antal sider | 9 |
ISSN | 1742-5689 |
DOI | |
Status | Udgivet - 6 okt. 2010 |
Udgivet eksternt | Ja |