Insights in connecting phenotypes in bacteria to coevolutionary information.

ORAL

Abstract

It has long been known that protein sequences are far from random. These sequences have been evolutionarily selected to maintain their ability to fold into stable, three-dimensional folded structures as well as their ability to form macromolecular assemblies, perform catalytic functions, etc. For these reasons, there exist quantifiable mutational patterns in the collection of sequence data for a protein family arising from the need to maintain favorable residue-residue interactions to facilitate folding as well as cellular function. Here, we focus on studying the correlated mutational patterns that give rise to interaction specificity in bacterial two-component signaling (TCS) systems. TCS proteins have evolved to be able to preferentially bind and transfer a phosphate group to their signaling partner while avoiding phosphotransfer with non-partners. We infer a Potts model Hamiltonian governing the correlated mutational patterns that are observed in the sequence data of TCS partners and apply this model to recently published in vivo mutational data. Our findings further support the notion that statistical models built from sequence data can be used to predict bacterial phenotypes as well as engineer interaction specificity between non-partner TCS proteins.

*This research has been supported by the NSF INSPIRE award (MCB-1241332) and by the CTBP sponsored by the NSF (Grant PHY- 1427654)

Authors

  • Ryan Cheng

    • Rice University
  • Faruck Morcos

    • University of Texas at Dallas
  • Ryan Hayes

    • University of Michigan
  • Rodney Helm

    • University of Houston
  • Herbert Levine

    • Rice University
  • Jose Onuchic

    • Rice University