Skip to main content
Log in

Investigation of the widely applicable Bayesian information criterion

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

The widely applicable Bayesian information criterion (WBIC) is a simple and fast approximation to the model evidence that has received little practical consideration. WBIC uses the fact that the log evidence can be written as an expectation, with respect to a powered posterior proportional to the likelihood raised to a power \(t^*\in {(0,1)}\), of the log deviance. Finding this temperature value \(t^*\) is generally an intractable problem. We find that for a particular tractable statistical model that the mean squared error of an optimally-tuned version of WBIC with correct temperature \(t^*\) is lower than an optimally-tuned version of thermodynamic integration (power posteriors). However in practice WBIC uses the a canonical choice of \(t=1/\log (n)\). Here we investigate the performance of WBIC in practice, for a range of statistical models, both regular models and singular models such as latent variable models or those with a hierarchical structure for which BIC cannot provide an adequate solution. Our findings are that, generally WBIC performs adequately when one uses informative priors, but it can systematically overestimate the evidence, particularly for small sample sizes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Bash, P.A., Singh, U.C., Langridge, R., Kollman, P.A.: Free energy calculations by computer simulation. Science 236(4801), 564–568 (1987)

    Article  Google Scholar 

  • Burrows, B.: A new approach to numerical integration. IMA J. Appl. Math. 26(2), 151–173 (1980)

    Article  MathSciNet  MATH  Google Scholar 

  • Calderhead, B., Girolami, M.: Estimating Bayes factors via thermodynamic integration and population MCMC. Comput. Stat. Data Anal. 53, 4028–4045 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  • Chickering, D.M., Heckerman, D.: Efficient approximations for the marginal likelihood of Bayesian networks with hidden variables. Mach. Learn. 29(2–3), 181–212 (1997)

    Article  MATH  Google Scholar 

  • Chipot, C., Pohorille, A.: Free Energy Calculations: Theory and Applications in Chemistry and Biology, vol. 86. Springer, Berlin (2007)

    Google Scholar 

  • Chopin, N., Robert, C.: Contemplating evidence: properties, extensions of, and alternatives to nested sampling. Technical report, 2007-46, Ceremade, Université Paris Dauphine (2007)

  • Drton, M., Plummer, M.: A Bayesian information criterion for singular models. arXiv preprint (2013)

  • Friel, N., Hurn, M., Wyse, J.: Improving power posterior estimation of statistical evidence. Stat. Comput. 24, 709–723 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  • Friel, N., Pettitt, A.N.: Marginal likelihood estimation via power posteriors. J. R. Stat. Soc. Ser. B 70, 589–607 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  • Friel, N., Wyse, J.: Estimating the evidence a review. Statistica Neerlandica 66(3), 288–308 (2012)

    Article  MathSciNet  Google Scholar 

  • Gelfand, A.E., Dey, D.K.: Bayesian model choice: asymptotics and exact calculations. J. R. Stat. Soc. Ser. B (Methodological) 56, 501–514 (1994)

    MathSciNet  MATH  Google Scholar 

  • Gelman, A., Hwang, J., Vehtari, A.: Understanding predictive information criteria for Bayesian models. Stat. Comput. 24, 1–20 (2013)

    MathSciNet  MATH  Google Scholar 

  • Gelman, A., Meng, X.-L.: Simulating normalizing constants: from importance sampling to bridge sampling to path sampling. Stat. Sci. 13, 163–185 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  • Green, P.J.: Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82, 711–732 (1995)

  • Hug, S., Schwarzfischer, M., Hasenauer, J., Marr, C., Theis, F.J.: An adaptive scheduling scheme for calculating Bayes factors with thermodynamic integration using Simpsons rule. Stat.Comput. 26, 663–677 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  • Kass, R.E., Wasserman, L.: A reference Bayesian test for nested hypotheses and its relationship to the Schwarz criterion. J. Am. Stat. Assoc. 90(431), 928–934 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  • Kirkwood, J.G.: Statistical mechanics of fluid mixtures. J. Chem. Phys. 3(5), 300–313 (1935)

    Article  MATH  Google Scholar 

  • Mononen, T.: A case study of the widely applicable Bayesian information criterion and its optimality. Stat. Comput. 25, 929–940 (2015)

    Article  MathSciNet  MATH  Google Scholar 

  • Neal, R.M.: Probabilistic inference using Markov chain Monte Carlo methods. Technical report, CRG-TR-93-1 Department of Computer Science, University of Toronto Toronto, Ontario (1993)

  • Oates, C.J., Papamarkou, T., Girolami, M.: The controlled thermodynamic integral for Bayesian model comparison. Journal of the American Statistical Association (to appear) (2016)

  • Raftery, A.E.: Bayes factors and BIC. Sociol. Methods. Res. 27(3), 411–417 (1999)

    Article  Google Scholar 

  • Robert, C., Wraith D.: Computational methods for Bayesian model choice. In: Bayesian Inference and maximum entropy methods in Science and Engineering: The 29th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, vol. 1193, pp. 251–262 (2009)

  • Schwarz, G.E.: Estimating the dimension of a model. Ann. Stat. 6(2), 461–464 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  • Skilling, J.: Nested sampling for general Bayesian computation. Bayesian Anal. 1(4), 833–859 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  • Vitoratou, S., Ntzoufras, I.: Thermodynamic assessment of probability distribution divergencies and Bayesian model comparison. arXiv preprint (2013)

  • Volinsky, C.T., Raftery, A.E.: Bayesian information criterion for censored survival models. Biometrics 56(1), 256–262 (2000)

    Article  MATH  Google Scholar 

  • Watanabe, S.: A widely applicable Bayesian information criterion. J. Mach. Learn. Res. 14, 867–897 (2013)

    MathSciNet  MATH  Google Scholar 

  • Williams, E.: Regression Analysis. Wiley, New York (1959)

    MATH  Google Scholar 

Download references

Acknowledgments

The Insight Centre for Data Analytics is supported by Science Foundation Ireland under Grant Number SFI/12/RC/2289. Nial Friel’s research was also supported by an Science Foundation Ireland Grant: 12/IP/1424. James McKeone is grateful for the support of an Australian Postgraduate Award (APA). Tony Pettitt’s research was supported by the Australian Research Council Discovery Grant, DP1101000159.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to N. Friel.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Friel, N., McKeone, J.P., Oates, C.J. et al. Investigation of the widely applicable Bayesian information criterion. Stat Comput 27, 833–844 (2017). https://doi.org/10.1007/s11222-016-9657-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11222-016-9657-y

Keywords

Navigation