TY - JOUR
T1 - AlphaFold2 models indicate that protein sequence determines both structure and dynamics
AU - Guo, Hao Bo
AU - Perminov, Alexander
AU - Bekele, Selemon
AU - Kedziora, Gary
AU - Farajollahi, Sanaz
AU - Varaljay, Vanessa
AU - Hinkle, Kevin
AU - Molinero, Valeria
AU - Meister, Konrad
AU - Hung, Chia
AU - Dennis, Patrick
AU - Kelley-Loughnane, Nancy
AU - Berry, Rajiv
N1 - Publisher Copyright:
© 2022, This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.
PY - 2022/6/23
Y1 - 2022/6/23
N2 - AlphaFold 2 (AF2) has placed Molecular Biology in a new era where we can visualize, analyze and interpret the structures and functions of all proteins solely from their primary sequences. We performed AF2 structure predictions for various protein systems, including globular proteins, a multi-domain protein, an intrinsically disordered protein (IDP), a randomized protein, two larger proteins (> 1000 AA), a heterodimer and a homodimer protein complex. Our results show that along with the three dimensional (3D) structures, AF2 also decodes protein sequences into residue flexibilities via both the predicted local distance difference test (pLDDT) scores of the models, and the predicted aligned error (PAE) maps. We show that PAE maps from AF2 are correlated with the distance variation (DV) matrices from molecular dynamics (MD) simulations, which reveals that the PAE maps can predict the dynamical nature of protein residues. Here, we introduce the AF2-scores, which are simply derived from pLDDT scores and are in the range of [0, 1]. We found that for most protein models, including large proteins and protein complexes, the AF2-scores are highly correlated with the root mean square fluctuations (RMSF) calculated from MD simulations. However, for an IDP and a randomized protein, the AF2-scores do not correlate with the RMSF from MD, especially for the IDP. Our results indicate that the protein structures predicted by AF2 also convey information of the residue flexibility, i.e., protein dynamics.
AB - AlphaFold 2 (AF2) has placed Molecular Biology in a new era where we can visualize, analyze and interpret the structures and functions of all proteins solely from their primary sequences. We performed AF2 structure predictions for various protein systems, including globular proteins, a multi-domain protein, an intrinsically disordered protein (IDP), a randomized protein, two larger proteins (> 1000 AA), a heterodimer and a homodimer protein complex. Our results show that along with the three dimensional (3D) structures, AF2 also decodes protein sequences into residue flexibilities via both the predicted local distance difference test (pLDDT) scores of the models, and the predicted aligned error (PAE) maps. We show that PAE maps from AF2 are correlated with the distance variation (DV) matrices from molecular dynamics (MD) simulations, which reveals that the PAE maps can predict the dynamical nature of protein residues. Here, we introduce the AF2-scores, which are simply derived from pLDDT scores and are in the range of [0, 1]. We found that for most protein models, including large proteins and protein complexes, the AF2-scores are highly correlated with the root mean square fluctuations (RMSF) calculated from MD simulations. However, for an IDP and a randomized protein, the AF2-scores do not correlate with the RMSF from MD, especially for the IDP. Our results indicate that the protein structures predicted by AF2 also convey information of the residue flexibility, i.e., protein dynamics.
KW - Amino Acid Sequence
KW - Furylfuramide
KW - Intrinsically Disordered Proteins/chemistry
KW - Molecular Dynamics Simulation
KW - Protein Conformation
UR - http://www.scopus.com/inward/record.url?scp=85132683689&partnerID=8YFLogxK
U2 - 10.1038/s41598-022-14382-9
DO - 10.1038/s41598-022-14382-9
M3 - Article
C2 - 35739160
AN - SCOPUS:85132683689
SN - 2045-2322
VL - 12
SP - 10696
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 10696
ER -