A Probabilistic Programming Approach to Protein Structure Superposition

Lys Sanz Moreta*, Ahmad Salim Al-Sibahi, Douglas Theobald, William Bullock, Basile Nicolas Rommes, Andreas Manoukian, Thomas Hamelryck

*Corresponding author for this work
1 Citation (Scopus)

Abstract

Optimal superposition of protein structures is crucial for understanding their structure, function, dynamics and evolution. We investigate the use of probabilistic programming to superimpose protein structures guided by a Bayesian model. Our model, THESEUS-PP, is based on the THESEUS model, a probabilistic model of protein superposition based on rotation, translation and perturbation of an underlying, latent mean structure. The model was implemented in the deep probabilistic programming language Pyro. Unlike conventional methods that minimize the sum of the squared distances, THESEUS takes into account correlated atom positions and heteroscedasticity (i.e., atom positions can feature different variances). THESEUS performs maximum likelihood estimation using iterative expectation-maximization. In contrast, THESEUS-PP allows automated maximum a posteriori (MAP) estimation using suitable priors over rotation, translation, variances and latent mean structure. The results indicate that probabilistic programming is a powerful new paradigm for the formulation of Bayesian probabilistic models concerning biomolecular structure. Specifically, we envision the use of the THESEUS-PP model as a suitable error model or likelihood in Bayesian protein structure prediction using deep probabilistic programming.
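For orientation, the conventional baseline the abstract contrasts against (minimizing the sum of squared distances) can be sketched with the Kabsch algorithm. This is a minimal illustration in NumPy, not the THESEUS or THESEUS-PP model: it treats every atom as uncorrelated and equally variable, which is exactly the assumption those models relax. The function name `kabsch_superpose` and the toy coordinates are hypothetical and chosen only for this example.

```python
import numpy as np


def kabsch_superpose(mobile, target):
    """Least-squares superposition of `mobile` onto `target` (both N x 3 arrays).

    Returns the fitted coordinates, the rotation matrix R, and the RMSD.
    """
    mobile = np.asarray(mobile, dtype=float)
    target = np.asarray(target, dtype=float)

    # Remove the translation by centering both coordinate sets.
    target_center = target.mean(axis=0)
    P = mobile - mobile.mean(axis=0)
    Q = target - target_center

    # SVD of the 3 x 3 cross-covariance matrix between the two sets.
    H = P.T @ Q
    U, _, Vt = np.linalg.svd(H)

    # Flip the last axis if needed so R is a proper rotation (det = +1),
    # not a reflection.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T

    # Rotate the centered mobile set and move it onto the target centroid.
    fitted = P @ R.T + target_center
    rmsd = np.sqrt(np.mean(np.sum((fitted - target) ** 2, axis=1)))
    return fitted, R, rmsd


# Toy usage: a randomly generated "structure" and a rotated, shifted copy.
rng = np.random.default_rng(0)
target = rng.normal(size=(10, 3))
angle = 0.7
rot_z = np.array([
    [np.cos(angle), -np.sin(angle), 0.0],
    [np.sin(angle), np.cos(angle), 0.0],
    [0.0, 0.0, 1.0],
])
mobile = target @ rot_z.T + np.array([1.0, -2.0, 0.5])
fitted, R, rmsd = kabsch_superpose(mobile, target)
```

Because the toy copy differs from the target only by a rigid-body motion, the recovered RMSD should be zero up to floating-point error. With real structures, a few highly variable atoms can dominate this unweighted fit, which is the motivation the abstract gives for modelling heteroscedasticity and correlation.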

Original language: English
Title: 2019 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2019
Editors: Giacomo Baruzzo, Sebastian Daberdaku, Barbara Di Camillo, Simone Furini, Emanuele Domenico Giordano, Giuseppe Nicosia
Number of pages: 5
Publisher: IEEE
Publication date: Jul. 2019
Article number: 8791469
ISBN (Electronic): 9781728114620
Status: Published - Jul. 2019
Event: 16th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2019 - Certosa di Pontignano, Siena, Italy
Duration: 9 Jul. 2019 - 11 Jul. 2019

Conference

Conference: 16th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2019
Country/Territory: Italy
City: Certosa di Pontignano, Siena
Period: 09/07/2019 - 11/07/2019
Sponsor: GlaxoSmithKline (GSK), IEEE, IEEE Computational Intelligence Society
