Unknown

Dataset Information

0

Minimization-Aware Recursive K*: A Novel, Provable Algorithm that Accelerates Ensemble-Based Protein Design and Provably Approximates the Energy Landscape.


ABSTRACT: Protein design algorithms that model continuous sidechain flexibility and conformational ensembles better approximate the in vitro and in vivo behavior of proteins. The previous state of the art, iMinDEE-A*-K*, computes provable ɛ-approximations to partition functions of protein states (e.g., bound vs. unbound) by computing provable, admissible pairwise-minimized energy lower bounds on protein conformations, and using the A* enumeration algorithm to return a gap-free list of lowest-energy conformations. iMinDEE-A*-K* runs in time sublinear in the number of conformations, but can be trapped in loosely-bounded, low-energy conformational wells containing many conformations with highly similar energies. That is, iMinDEE-A*-K* is unable to exploit the correlation between protein conformation and energy: similar conformations often have similar energy. We introduce two new concepts that exploit this correlation: Minimization-Aware Enumeration and Recursive K*. We combine these two insights into a novel algorithm, Minimization-Aware Recursive K* (MARK*), which tightens bounds not on single conformations, but instead on distinct regions of the conformation space. We compare the performance of iMinDEE-A*-K* versus MARK* by running the Branch and Bound over K* (BBK*) algorithm, which provably returns sequences in order of decreasing K* score, using either iMinDEE-A*-K* or MARK* to approximate partition functions. We show on 200 design problems that MARK* not only enumerates and minimizes vastly fewer conformations than the previous state of the art, but also runs up to 2 orders of magnitude faster. Finally, we show that MARK* not only efficiently approximates the partition function, but also provably approximates the energy landscape. To our knowledge, MARK* is the first algorithm to do so. We use MARK* to analyze the change in energy landscape of the bound and unbound states of an HIV-1 capsid protein C-terminal domain in complex with a camelid VHH, and measure the change in conformational entropy induced by binding. Thus, MARK* both accelerates existing designs and offers new capabilities not possible with previous algorithms.

SUBMITTER: Jou JD 

PROVIDER: S-EPMC7185359 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Minimization-Aware Recursive <i>K*:</i> A Novel, Provable Algorithm that Accelerates Ensemble-Based Protein Design and Provably Approximates the Energy Landscape.

Jou Jonathan D JD   Holt Graham T GT   Lowegard Anna U AU   Donald Bruce R BR  

Journal of computational biology : a journal of computational molecular cell biology 20191206 4


Protein design algorithms that model continuous sidechain flexibility and conformational ensembles better approximate the <i>in vitro</i> and <i>in vivo</i> behavior of proteins. The previous state of the art, iMinDEE-<i>A</i>*-<i>K</i>*, computes provable <i>ɛ</i>-approximations to partition functions of protein states (e.g., bound vs. unbound) by computing provable, admissible pairwise-minimized energy lower bounds on protein conformations, and using the <i>A</i>* enumeration algorithm to retu  ...[more]

Similar Datasets

| S-EPMC4904165 | biostudies-literature
| S-EPMC7270206 | biostudies-literature
| S-EPMC8510975 | biostudies-literature
| S-EPMC9295780 | biostudies-literature
| S-EPMC11784930 | biostudies-literature
| S-EPMC6074059 | biostudies-literature
| S-EPMC10311297 | biostudies-literature
| S-EPMC8749547 | biostudies-literature
| S-EPMC6248979 | biostudies-literature
| S-EPMC11822027 | biostudies-literature