TY - JOUR
T1 - Learning the pattern of epistasis linking genotype and phenotype in a protein
AU - Poelwijk, Frank J.
AU - Socolich, Michael
AU - Ranganathan, Rama
N1 - Funding Information:
We thank O. Rivoire, V. Krishna, M. Stiffler, M. Weigt, R. Monasson, and S. Cocco for discussions, Ciyue Shen for implementation of the constraint-released optimization, and members of the Ranganathan and Sander laboratories for critical review of the manuscript. We also thank the High Performance Computing Group (BioHPC) and the Genomics Core at UT Southwestern for providing computational resources and sequencing, as well as the Research Computing Core at Harvard Medical School. This work was supported by NIH Grant RO1GM12345 (to R.R.), a Robert A. Welch Foundation Grant I-1366 (to R.R.), and the Green Center for Systems Biology at UT Southwestern Medical Center. F.J.P was an HHMI fellow of the Helen Hay Whitney Foundation.
Publisher Copyright:
© 2019, The Author(s).
PY - 2019/12/1
Y1 - 2019/12/1
N2 - Understanding the pattern of epistasis—the non-independence of mutations—is critical for relating genotype and phenotype. However, the combinatorial complexity of potential epistatic interactions has severely limited the analysis of this problem. Using new mutational approaches, we report a comprehensive experimental study of all 213 mutants that link two phenotypically distinct variants of the Entacmaea quadricolor fluorescent protein—an opportunity to examine epistasis up to the 13th order. The data show the existence of many high-order epistatic interactions between mutations, but also reveal extraordinary sparsity, enabling novel experimental and computational strategies for learning the relevant epistasis. We demonstrate that such information, in turn, can be used to accurately predict phenotypes in practical situations where the number of measurements is limited. Finally, we show how the observed epistasis shapes the solution space of single-mutation trajectories between the parental fluorescent proteins, informative about the protein’s evolutionary potential. This work provides conceptual and experimental strategies to profoundly characterize epistasis in a protein, relevant to both natural and laboratory evolution.
AB - Understanding the pattern of epistasis—the non-independence of mutations—is critical for relating genotype and phenotype. However, the combinatorial complexity of potential epistatic interactions has severely limited the analysis of this problem. Using new mutational approaches, we report a comprehensive experimental study of all 213 mutants that link two phenotypically distinct variants of the Entacmaea quadricolor fluorescent protein—an opportunity to examine epistasis up to the 13th order. The data show the existence of many high-order epistatic interactions between mutations, but also reveal extraordinary sparsity, enabling novel experimental and computational strategies for learning the relevant epistasis. We demonstrate that such information, in turn, can be used to accurately predict phenotypes in practical situations where the number of measurements is limited. Finally, we show how the observed epistasis shapes the solution space of single-mutation trajectories between the parental fluorescent proteins, informative about the protein’s evolutionary potential. This work provides conceptual and experimental strategies to profoundly characterize epistasis in a protein, relevant to both natural and laboratory evolution.
UR - http://www.scopus.com/inward/record.url?scp=85072280101&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072280101&partnerID=8YFLogxK
U2 - 10.1038/s41467-019-12130-8
DO - 10.1038/s41467-019-12130-8
M3 - Article
C2 - 31527666
AN - SCOPUS:85072280101
SN - 2041-1723
VL - 10
JO - Nature communications
JF - Nature communications
IS - 1
M1 - 4213
ER -