The use of model selection in the model-free analysis of protein dynamics

Journal of Biomolecular NMR

Abstract

Model-free analysis of NMR relaxation data, which is widely used for the study of protein dynamics, consists of the separation of the global rotational diffusion from internal motions relative to the diffusion frame and the description of these internal motions by amplitude and timescale. Five model-free models exist, each of which describes a different type of motion. Model-free analysis requires the selection of the model which best describes the dynamics of the NH bond. It will be demonstrated that the model selection technique currently used has two significant flaws: under-fitting, and not selecting a model when one ought to be selected. Under-fitting breaks the principle of parsimony, causing bias in the final model-free results, visible as an overestimation of S² and an underestimation of τe and Rex. As a consequence, the protein falsely appears to be more rigid than it actually is. Model selection has been extensively developed in other fields. The techniques known as Akaike's Information Criteria (AIC), small sample size corrected AIC (AICc), Bayesian Information Criteria (BIC), bootstrap methods, and cross-validation will be compared to the currently used technique. To analyse the variety of techniques, synthetic noisy data covering all model-free motions was created. The data consists of two types of three-dimensional grid: the Rex grids covering single motions with chemical exchange {S², τe, Rex}, and the Double Motion grids covering two internal motions {S²f, S²s, τs}. The conclusion of the comparison is that for accurate model-free results, AIC model selection is essential. As the method neither under- nor over-fits, AIC is the best tool for applying Occam's razor and has the additional benefits of simplifying and speeding up model-free analysis.
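As a rough illustration of how the information criteria named in the abstract rank competing models, the sketch below scores five candidate models from their fitted chi-squared values. It assumes Gaussian measurement errors, so that chi-squared stands in for minus twice the log-likelihood up to a constant. The labels m1 to m5, the parameter counts assigned to them, the function names, and the chi-squared values are all hypothetical and are not taken from the article.

```python
import math

# Assumed parameter counts for five model-free models (labels are hypothetical):
# m1 {S2}, m2 {S2, te}, m3 {S2, Rex}, m4 {S2, te, Rex}, m5 {S2f, S2s, ts}
MODEL_PARAMS = {"m1": 1, "m2": 2, "m3": 2, "m4": 3, "m5": 3}


def aic(chi2, k):
    """Akaike criterion, with chi-squared used in place of -2 ln(likelihood)."""
    return chi2 + 2 * k


def aicc(chi2, k, n):
    """Small-sample corrected AIC; undefined (returned as inf) when n <= k + 1."""
    if n - k - 1 <= 0:
        return math.inf
    return chi2 + 2 * k + 2 * k * (k + 1) / (n - k - 1)


def bic(chi2, k, n):
    """Bayesian (Schwarz) criterion for n data points."""
    return chi2 + k * math.log(n)


def select_model(chi2_by_model, n, criterion="aic"):
    """Return (best_model, scores); the lowest criterion value wins."""
    scores = {}
    for name, chi2 in chi2_by_model.items():
        k = MODEL_PARAMS[name]
        if criterion == "aic":
            scores[name] = aic(chi2, k)
        elif criterion == "aicc":
            scores[name] = aicc(chi2, k, n)
        elif criterion == "bic":
            scores[name] = bic(chi2, k, n)
        else:
            raise ValueError(f"unknown criterion: {criterion}")
    best = min(scores, key=scores.get)
    return best, scores


# Made-up chi-squared values for one residue with n = 6 relaxation data points.
example_chi2 = {"m1": 25.0, "m2": 8.0, "m3": 20.0, "m4": 7.5, "m5": 7.9}
for crit in ("aic", "aicc", "bic"):
    best, _ = select_model(example_chi2, n=6, criterion=crit)
    print(crit, "selects", best)
```

With these made-up numbers, the three-parameter models fit slightly better than m2, but not by enough to offset the penalty for the extra parameter, so each criterion settles on the two-parameter model.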

Author information

Correspondence to Edward J. d'Auvergne.

Cite this article

d'Auvergne, E.J., Gooley, P.R. The use of model selection in the model-free analysis of protein dynamics. J Biomol NMR 25, 25–39 (2003). https://doi.org/10.1023/A:1021902006114

  • DOI: https://doi.org/10.1023/A:1021902006114
