Abstract
Autologistic regression models are suitable for relating spatial binary responses in ecology to covariates such as environmental factors. For big ecological data, pseudolikelihood estimation is appealing due to its ease of computation, but at least two challenges remain. Although an important issue, it is unclear how model selection may be carried out under pseudolikelihood. In addition, for assessing the variation of pseudolikelihood estimates, parametric bootstrap using Monte Carlo simulation is often used but may be infeasible for very large data sizes. Here both these issues are addressed by developing a penalized pseudolikelihood estimation method and an approximation of the variance of the parameter estimates. A simulation study is conducted to evaluate the performance of the proposed method, followed by a data example in a study of land cover in relation to land ownership characteristics. Extension of these models and methods to spatial-temporal binary data is further discussed. This article has supplementary material online.
Similar content being viewed by others
References
Banerjee, S., Carlin, B. P., and Gelfand, A. E. (2004), Hierarchical Modeling and Analysis for Spatial Data, Boca Raton: Chapman and Hall.
Besag, J. (1972), “Nearest-Neighbour Systems and the Auto-Logistic Model for Binary Data,” Journal of the Royal Statistical Society. Series B, 34, 75–83.
— (1974), “Spatial Interaction and the Statistical Analysis of Lattice Systems (with Discussion),” Journal of the Royal Statistical Society. Series B, 36, 192–236.
Caragea, P. C., and Kaiser, M. S. (2009), “Autologistic Models with Interpretable Parameters,” Journal of Agricultural, Biological, and Environmental Statistics, 14, 281–300.
Comets, F., and Janžura, M. (1998), “A Central Limit Theorem for Conditionally Centred Random Fields with an Application to Markov Fields,” Journal of Applied Probability, 35, 608–621.
Cressie, N. (1993), Statistics for Spatial Data (Rev. ed.), New York: Wiley.
Crow, T. R., Host, G. E., and Mladenoff, D. J. (1999), “Ownership and Ecosystem as Sources of Spatial Heterogeneity in a Forested Landscape, Wisconsin USA,” Landscape Ecology, 14, 449–463.
Diggle, P. J., and Ribeiro, P. J. (2007), Model-Based Geostatistics, New York: Springer.
Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R. (2004), “Least Angle Regression (with Discussion),” The Annals of Statistics, 32, 407–499.
Friel, N., Pettitt, A. N., Reeves, R., and Wit, E. (2009), “Bayesian Inference in Hidden Markov Random Fields for Binary Data Defined on Large Lattices,” Journal of Computational and Graphical Statistics, 18, 243–261.
Gaetan, C., and Guyon, X. (2010), Spatial Statistics and Modeling, New York: Springer.
Geyer, C. J. (1994), “On the Convergence of Monte Carlo Maximum Likelihood Calculations,” Journal of the Royal Statistical Society. Series B, 56, 261–274.
Gumpertz, M. L., Graham, J. M., and Ristaino, J. B. (1997), “Autologistic Model of Spatial Pattern of Phytophthora Epidemic in Bell Pepper: Effects of Soil Variables on Disease Presence,” Journal of Agricultural, Biological, and Environmental Statistics, 2, 131–156.
He, F., Zhou, J., and Zhu, H. (2003), “Autologistic Regression Model for the Distribution of Vegetation,” Journal of Agricultural, Biological, and Environmental Statistics, 8, 205–222.
Huang, H.-C., Hsu, N.-J., Theobald, D. M., and Breidt, F. J. (2010), “Spatial LASSO with Applications to GIS Model Selection,” Journal of Computational and Graphical Statistics, 19, 963–983.
Huffer, F. W., and Wu, H. (1998), “Markov Chain Monte Carlo for Autologistic Regression Models with Application to the Distribution of Plant Species,” Biometrics, 54, 509–524.
Hughes, J., and Haran, M. (2013), “Dimension Reduction and Alleviation of Confounding for Spatial Generalized Linear Mixed Models,” Journal of the Royal Statistical Society. Series B, 75, 139–159.
Hughes, J., Haran, M., and Caragea, P. C. (2011), “Autologistic Models for Binary Data on a Lattice,” Environmetrics, 22, 857–871.
Jin, C., Zhu, J., Steen-Adams, M. M., Sain, S. R., and Gangnon, R. E. (2013), “Spatial Multinomial Regression Models for Nominal Categorical Data: A Study of Land Cover in Northern Wisconsin, USA,” Environmetrics, 24, 98–108.
Møller, J., Pettitt, A. N., Reeves, R., and Berthelsen, K. K. (2006), “An Efficient Markov Chain Monte Carlo Method for Distributions with Intractable Normalising Constants,” Biometrika, 93, 451–458.
Nocedal, J., and Wright, S. J. (2000), Numerical Optimization (2nd ed.), New York: Springer.
Paciorek, C. J. (2010), “The Importance of Scale for Spatial-Confounding Bias and Precision of Spatial Regression Estimators,” Statistical Science, 25, 107–125.
R Development Core Team (2011), R: A Language and Environment for Statistical Computing, Vienna: R Foundation for Statistical Computing. ISBN 3-900051-07-0 http://www.R-project.org/.
Rue, H., Martino, S., and Chopin, N. (2009), “Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested Laplace,” Journal of the Royal Statistical Society, 71, 319–392.
Stanfield, B. J., Bliss, J. C., and Spies, T. A. (2002), “Land Ownership and Landscape Structure: A Spatial Analysis of Sixty-Six Oregon (USA) Coast Range Watersheds,” Landscape Ecology, 17, 685–697.
Steen-Adams, M. M., Mladenoff, D. J., Langston, N. E., Liu, F., and Zhu, J. (2011), “Influence of Biophysical Factors and Differences in Ojibwe Reservation Versus Euro-American Social Histories on Forest Landscape Change in Northern Wisconsin, USA,” Landscape Ecology, 26, 1165–1178.
Sun, L., and Clayton, M. K. (2008), “Bayesian Analysis of Cross-Classified Spatial Data with Autocorrelation,” Biometrics, 64, 74–84.
Tibshirani, R. (1996), “Regression Shrinkage and Selection Via the Lasso,” Journal of the Royal Statistical Society. Series B, 58, 267–288.
Turner, M. G., Wear, D. N., and Flamm, R. O. (1996), “Land Ownership and Land-Cover Change in the Southern Appalachian Highlands and the Olympic Peninsula,” Ecological Applications, 6, 1150–1172.
Wasserman, L. (2003), All of Statistics: A Concise Course in Statistical Inference, New York: Springer.
Wang, Z., and Zheng, Y. (2013), “Analysis of Binary Data Via a Centered Spatial–Temporal Autologistic Regression Model,” Environmental and Ecological Statistics, 20, 37–57.
Xue, L., Zou, H., and Cai, T. (2012), “Nonconcave Penalized Composite Conditional Likelihood Estimation of Sparse Ising Models,” The Annals of Statistics, 40, 1403–1429.
Zhang, Y., Li, R., and Tsai, C.-L. (2010), “Regularization Parameter Selections Via Generalized Information Criterion,” Journal of the American Statistical Association, 105, 312–323.
Zheng, Y., and Zhu, J. (2008), “Markov Chain Monte Carlo for Spatial–Temporal Autologistic Regression Model,” Journal of Computational and Graphical Statistics, 17, 123–127.
Zhu, Z., and Liu, Y. (2009), “Estimating Spatial Covariance Using Penalized Likelihood with Weighted L 1 Penalty,” Journal of Nonparametric Statistics, 21, 925–942.
Zhu, J., Huang, H.-C., and Wu, J.-P. (2005), “Modeling Spatial–Temporal Binary Data Using Markov Random Fields,” Journal of Agricultural, Biological, and Environmental Statistics, 10, 212–225.
Zhu, J., Zheng, Y., Carroll, A. L., and Aukema, B. H. (2008), “Autologistic Regression Analysis of Spatial–Temporal Binary Data Via Monte Carlo Maximum Likelihood,” Journal of Agricultural, Biological, and Environmental Statistics, 13, 84–98.
Zhu, J., Huang, H.-C., and Reyes, P. E. (2010), “On Selection of Spatial Linear Models for Lattice Data,” Journal of the Royal Statistical Society. Series B, 72, 389–402.
Zou, H. (2006), “The Adaptive LASSO and Its Oracle Properties,” Journal of the American Statistical Association, 101, 1418–1429.
Zou, H., and Li, R. (2008), “One-Step Sparse Estimates in Nonconcave Penalized Likelihood Models,” The Annals of Statistics, 36, 1509–1533.
Author information
Authors and Affiliations
Corresponding author
Additional information
An erratum to this article is available at http://dx.doi.org/10.1007/s13253-017-0281-x.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Fu, R., Thurman, A.L., Chu, T. et al. On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood. JABES 18, 429–449 (2013). https://doi.org/10.1007/s13253-013-0144-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13253-013-0144-z