Assessing the current state of computational models for protein-protein interfaces

ORAL

Abstract



Computational prediction and design of protein-protein interfaces (PPI) is a difficult task that results in models with a wide range of quality. Thus, PPI scoring algorithms have been designed to classify computational models as high- or low-quality without direct knowledge of the experimentally determined structures. To assess PPI scoring algorithms, we scored models of heterodimeric proteins generated with a rigid-body docking algorithm (ZDOCK) and compared these scores to a measure of structural similarity (i.e. DockQ score) to the associated heterodimeric x-ray crystal structures. We generated hundreds of thousands of computational models, sampled uniformly over the DockQ score, to create a balanced dataset of redocked models. We scored this dataset using seven state-of-the-art PPI model scoring algorithms. We show that only a small fraction of PPI targets possesses strong correlations (ρ > 0.8) between the model scores and DockQ. We use three physical features to differentiate easy- and hard-to-score targets: interface flatness, fractional interface surface area, and number of interfacial contacts. We show that there are strong correlations between model and DockQ scores for PPI targets with large, intertwined interfaces, yet weak correlations for targets with small, flat interfaces. We create a support vector regression score using these physical features that matches or exceeds the performance of current state-of-the-art PPI scoring functions.

*NIH Training Grant No: 1T32GM145452

Presenters

  • Jake Sumner

    • Yale University

Authors

  • Jake Sumner

    • Yale University
  • Grace Meng

    • Yale University
  • Naomi Brandt

    • Yale University
  • Andrés Córdoba

    • Duke University
  • Alex T Grigas

    • Yale University
  • Lynne Regan

    • University of Edinburgh
  • Corey S O'Hern

    • Yale University