An algebro-topological description of protein domain structure

Robert Clark Penner, Michael Knudsen, Carsten Wiuf*, Jørgen Ellegaard Andersen

*Corresponding author for this work
10 Citations (Scopus)

Abstract

The space of possible protein structures appears vast and continuous, and the relationship between primary, secondary and tertiary structure levels is complex. Protein structure comparison and classification is therefore a difficult but important task, since structure is a determinant for molecular interaction and function. We introduce a novel mathematical abstraction based on geometric topology to describe protein domain structure. Using the locations of the backbone atoms and the hydrogen bonds, we build a combinatorial object, a so-called fatgraph. The description is discrete yet gives rise to a 2-dimensional mathematical surface. Thus, each protein domain corresponds to a particular mathematical surface with characteristic topological invariants, such as the genus (number of holes) and the number of boundary components. Both invariants are global fatgraph features reflecting the interconnectivity of the domain by hydrogen bonds. We introduce the notion of robust variables, that is, variables that are robust to minor changes in the structure/fatgraph, and show that the genus and the number of boundary components are robust. Further, we investigate the distribution of different fatgraph variables and show that only four variables are capable of distinguishing different folds. We use local (secondary) and global (tertiary) fatgraph features to describe domain structures and illustrate that they are useful for classification of domains in CATH. In addition, we combine our method with two other methods, thereby using primary, secondary, and tertiary structure information, and show that we can identify a large percentage of new and unclassified structures in CATH.
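
As an illustration of the two invariants named in the abstract, the following minimal sketch computes the genus and the number of boundary components of a generic fatgraph. It is not the authors' construction from backbone atoms and hydrogen bonds. It assumes a connected fatgraph with no twisted edges (so the surface is orientable), given as half-edges grouped into cyclically ordered vertices and paired into edges. The names `count_cycles`, `fatgraph_invariants`, `vertex_cycles` and `edge_pairs` are hypothetical. Boundary components are counted as cycles of the permutation obtained by crossing an edge and then stepping to the next half-edge at the vertex reached, and the genus g follows from the Euler characteristic V - E = 2 - 2g - r.

```python
def count_cycles(perm):
    """Count the cycles of a permutation given as a dict {x: perm(x)}."""
    seen = set()
    cycles = 0
    for start in perm:
        if start in seen:
            continue
        cycles += 1
        d = start
        while d not in seen:
            seen.add(d)
            d = perm[d]
    return cycles


def fatgraph_invariants(vertex_cycles, edge_pairs):
    """Return (genus, boundary_components) of a fatgraph.

    Assumptions (not taken from the paper): the fatgraph is connected and
    has no twisted edges, so the associated surface is orientable.

    vertex_cycles: list of tuples, each the cyclic order of half-edges
                   around one vertex.
    edge_pairs:    list of (a, b) pairs, each joining two half-edges
                   into one edge.
    """
    # sigma: next half-edge in the cyclic order at the same vertex
    sigma = {}
    for cyc in vertex_cycles:
        for i, d in enumerate(cyc):
            sigma[d] = cyc[(i + 1) % len(cyc)]

    # alpha: the other half-edge of the same edge
    alpha = {}
    for a, b in edge_pairs:
        alpha[a] = b
        alpha[b] = a

    # phi: cross the edge, then step to the next half-edge at that vertex;
    # each cycle of phi traces one boundary component
    phi = {d: sigma[alpha[d]] for d in sigma}

    v = len(vertex_cycles)
    e = len(edge_pairs)
    r = count_cycles(phi)
    # Euler characteristic of the surface: v - e = 2 - 2g - r
    genus = (2 - v + e - r) // 2
    return genus, r


# Two vertices joined by three edges, with two different cyclic orders
edges = [(0, 3), (1, 4), (2, 5)]
planar = fatgraph_invariants([(0, 1, 2), (3, 5, 4)], edges)
twisted_order = fatgraph_invariants([(0, 1, 2), (3, 4, 5)], edges)
print(planar, twisted_order)
```

The two calls use the same underlying graph and differ only in the cyclic order at the second vertex, which shows that genus and boundary count depend on the fatgraph structure and not on the graph alone.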

Original language: English
Article number: e19670
Journal: PLoS ONE
Volume: 6
Issue number: 5
ISSN: 1932-6203
DOI
Status: Published - 27 May 2011
Published externally: Yes
