On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood

Fu, Rao; Thurman, Andrew L.; Chu, Tingjin; Steen-Adams, Michelle M.; Zhu, Jun

doi:10.1007/s13253-013-0144-z

On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood

Published: 30 May 2013

Volume 18, pages 429–449, (2013)
Cite this article

Journal of Agricultural, Biological, and Environmental Statistics Aims and scope Submit manuscript

Rao Fu¹,
Andrew L. Thurman¹,
Tingjin Chu³,
Michelle M. Steen-Adams⁴ &
…
Jun Zhu²

366 Accesses
10 Citations
Explore all metrics

An Erratum to this article was published on 04 May 2017

Abstract

Autologistic regression models are suitable for relating spatial binary responses in ecology to covariates such as environmental factors. For big ecological data, pseudolikelihood estimation is appealing due to its ease of computation, but at least two challenges remain. Although an important issue, it is unclear how model selection may be carried out under pseudolikelihood. In addition, for assessing the variation of pseudolikelihood estimates, parametric bootstrap using Monte Carlo simulation is often used but may be infeasible for very large data sizes. Here both these issues are addressed by developing a penalized pseudolikelihood estimation method and an approximation of the variance of the parameter estimates. A simulation study is conducted to evaluate the performance of the proposed method, followed by a data example in a study of land cover in relation to land ownership characteristics. Extension of these models and methods to spatial-temporal binary data is further discussed. This article has supplementary material online.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis

Article Open access 26 March 2021

Urban biodiversity: State of the science and future directions

Article 21 February 2022

Applications of structural equation modeling (SEM) in ecological studies: an updated review

Article Open access 22 November 2016

References

Banerjee, S., Carlin, B. P., and Gelfand, A. E. (2004), Hierarchical Modeling and Analysis for Spatial Data, Boca Raton: Chapman and Hall.
MATH Google Scholar
Besag, J. (1972), “Nearest-Neighbour Systems and the Auto-Logistic Model for Binary Data,” Journal of the Royal Statistical Society. Series B, 34, 75–83.
MathSciNet MATH Google Scholar
— (1974), “Spatial Interaction and the Statistical Analysis of Lattice Systems (with Discussion),” Journal of the Royal Statistical Society. Series B, 36, 192–236.
MathSciNet MATH Google Scholar
Caragea, P. C., and Kaiser, M. S. (2009), “Autologistic Models with Interpretable Parameters,” Journal of Agricultural, Biological, and Environmental Statistics, 14, 281–300.
Article MathSciNet MATH Google Scholar
Comets, F., and Janžura, M. (1998), “A Central Limit Theorem for Conditionally Centred Random Fields with an Application to Markov Fields,” Journal of Applied Probability, 35, 608–621.
Article MathSciNet MATH Google Scholar
Cressie, N. (1993), Statistics for Spatial Data (Rev. ed.), New York: Wiley.
MATH Google Scholar
Crow, T. R., Host, G. E., and Mladenoff, D. J. (1999), “Ownership and Ecosystem as Sources of Spatial Heterogeneity in a Forested Landscape, Wisconsin USA,” Landscape Ecology, 14, 449–463.
Article Google Scholar
Diggle, P. J., and Ribeiro, P. J. (2007), Model-Based Geostatistics, New York: Springer.
MATH Google Scholar
Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R. (2004), “Least Angle Regression (with Discussion),” The Annals of Statistics, 32, 407–499.
Article MathSciNet MATH Google Scholar
Friel, N., Pettitt, A. N., Reeves, R., and Wit, E. (2009), “Bayesian Inference in Hidden Markov Random Fields for Binary Data Defined on Large Lattices,” Journal of Computational and Graphical Statistics, 18, 243–261.
Article MathSciNet Google Scholar
Gaetan, C., and Guyon, X. (2010), Spatial Statistics and Modeling, New York: Springer.
Book MATH Google Scholar
Geyer, C. J. (1994), “On the Convergence of Monte Carlo Maximum Likelihood Calculations,” Journal of the Royal Statistical Society. Series B, 56, 261–274.
MathSciNet MATH Google Scholar
Gumpertz, M. L., Graham, J. M., and Ristaino, J. B. (1997), “Autologistic Model of Spatial Pattern of Phytophthora Epidemic in Bell Pepper: Effects of Soil Variables on Disease Presence,” Journal of Agricultural, Biological, and Environmental Statistics, 2, 131–156.
Article MathSciNet Google Scholar
He, F., Zhou, J., and Zhu, H. (2003), “Autologistic Regression Model for the Distribution of Vegetation,” Journal of Agricultural, Biological, and Environmental Statistics, 8, 205–222.
Article Google Scholar
Huang, H.-C., Hsu, N.-J., Theobald, D. M., and Breidt, F. J. (2010), “Spatial LASSO with Applications to GIS Model Selection,” Journal of Computational and Graphical Statistics, 19, 963–983.
Article MathSciNet Google Scholar
Huffer, F. W., and Wu, H. (1998), “Markov Chain Monte Carlo for Autologistic Regression Models with Application to the Distribution of Plant Species,” Biometrics, 54, 509–524.
Article MATH Google Scholar
Hughes, J., and Haran, M. (2013), “Dimension Reduction and Alleviation of Confounding for Spatial Generalized Linear Mixed Models,” Journal of the Royal Statistical Society. Series B, 75, 139–159.
Article MathSciNet Google Scholar
Hughes, J., Haran, M., and Caragea, P. C. (2011), “Autologistic Models for Binary Data on a Lattice,” Environmetrics, 22, 857–871.
Article MathSciNet Google Scholar
Jin, C., Zhu, J., Steen-Adams, M. M., Sain, S. R., and Gangnon, R. E. (2013), “Spatial Multinomial Regression Models for Nominal Categorical Data: A Study of Land Cover in Northern Wisconsin, USA,” Environmetrics, 24, 98–108.
Article MathSciNet Google Scholar
Møller, J., Pettitt, A. N., Reeves, R., and Berthelsen, K. K. (2006), “An Efficient Markov Chain Monte Carlo Method for Distributions with Intractable Normalising Constants,” Biometrika, 93, 451–458.
Article MathSciNet MATH Google Scholar
Nocedal, J., and Wright, S. J. (2000), Numerical Optimization (2nd ed.), New York: Springer.
MATH Google Scholar
Paciorek, C. J. (2010), “The Importance of Scale for Spatial-Confounding Bias and Precision of Spatial Regression Estimators,” Statistical Science, 25, 107–125.
Article MathSciNet MATH Google Scholar
R Development Core Team (2011), R: A Language and Environment for Statistical Computing, Vienna: R Foundation for Statistical Computing. ISBN 3-900051-07-0 http://www.R-project.org/.
Google Scholar
Rue, H., Martino, S., and Chopin, N. (2009), “Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested Laplace,” Journal of the Royal Statistical Society, 71, 319–392.
Article MathSciNet MATH Google Scholar
Stanfield, B. J., Bliss, J. C., and Spies, T. A. (2002), “Land Ownership and Landscape Structure: A Spatial Analysis of Sixty-Six Oregon (USA) Coast Range Watersheds,” Landscape Ecology, 17, 685–697.
Article Google Scholar
Steen-Adams, M. M., Mladenoff, D. J., Langston, N. E., Liu, F., and Zhu, J. (2011), “Influence of Biophysical Factors and Differences in Ojibwe Reservation Versus Euro-American Social Histories on Forest Landscape Change in Northern Wisconsin, USA,” Landscape Ecology, 26, 1165–1178.
Article Google Scholar
Sun, L., and Clayton, M. K. (2008), “Bayesian Analysis of Cross-Classified Spatial Data with Autocorrelation,” Biometrics, 64, 74–84.
Article MathSciNet MATH Google Scholar
Tibshirani, R. (1996), “Regression Shrinkage and Selection Via the Lasso,” Journal of the Royal Statistical Society. Series B, 58, 267–288.
MathSciNet MATH Google Scholar
Turner, M. G., Wear, D. N., and Flamm, R. O. (1996), “Land Ownership and Land-Cover Change in the Southern Appalachian Highlands and the Olympic Peninsula,” Ecological Applications, 6, 1150–1172.
Article Google Scholar
Wasserman, L. (2003), All of Statistics: A Concise Course in Statistical Inference, New York: Springer.
MATH Google Scholar
Wang, Z., and Zheng, Y. (2013), “Analysis of Binary Data Via a Centered Spatial–Temporal Autologistic Regression Model,” Environmental and Ecological Statistics, 20, 37–57.
Article MathSciNet Google Scholar
Xue, L., Zou, H., and Cai, T. (2012), “Nonconcave Penalized Composite Conditional Likelihood Estimation of Sparse Ising Models,” The Annals of Statistics, 40, 1403–1429.
Article MathSciNet MATH Google Scholar
Zhang, Y., Li, R., and Tsai, C.-L. (2010), “Regularization Parameter Selections Via Generalized Information Criterion,” Journal of the American Statistical Association, 105, 312–323.
Article MathSciNet MATH Google Scholar
Zheng, Y., and Zhu, J. (2008), “Markov Chain Monte Carlo for Spatial–Temporal Autologistic Regression Model,” Journal of Computational and Graphical Statistics, 17, 123–127.
Article MathSciNet Google Scholar
Zhu, Z., and Liu, Y. (2009), “Estimating Spatial Covariance Using Penalized Likelihood with Weighted L ₁ Penalty,” Journal of Nonparametric Statistics, 21, 925–942.
Article MathSciNet MATH Google Scholar
Zhu, J., Huang, H.-C., and Wu, J.-P. (2005), “Modeling Spatial–Temporal Binary Data Using Markov Random Fields,” Journal of Agricultural, Biological, and Environmental Statistics, 10, 212–225.
Article Google Scholar
Zhu, J., Zheng, Y., Carroll, A. L., and Aukema, B. H. (2008), “Autologistic Regression Analysis of Spatial–Temporal Binary Data Via Monte Carlo Maximum Likelihood,” Journal of Agricultural, Biological, and Environmental Statistics, 13, 84–98.
Article MathSciNet MATH Google Scholar
Zhu, J., Huang, H.-C., and Reyes, P. E. (2010), “On Selection of Spatial Linear Models for Lattice Data,” Journal of the Royal Statistical Society. Series B, 72, 389–402.
Article MathSciNet Google Scholar
Zou, H. (2006), “The Adaptive LASSO and Its Oracle Properties,” Journal of the American Statistical Association, 101, 1418–1429.
Article MathSciNet MATH Google Scholar
Zou, H., and Li, R. (2008), “One-Step Sparse Estimates in Nonconcave Penalized Likelihood Models,” The Annals of Statistics, 36, 1509–1533.
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, University of Wisconsin, Madison, WI, 53706, USA
Rao Fu & Andrew L. Thurman
Department of Statistics and Department of Entomology, University of Wisconsin, Madison, WI, 53706, USA
Jun Zhu
School of Statistics, Renmin University, Beijing, 100872, China
Tingjin Chu
Department of Environmental Studies, University of New England, Biddeford, ME, 04005, USA
Michelle M. Steen-Adams

Authors

Rao Fu
View author publications
You can also search for this author in PubMed Google Scholar
Andrew L. Thurman
View author publications
You can also search for this author in PubMed Google Scholar
Tingjin Chu
View author publications
You can also search for this author in PubMed Google Scholar
Michelle M. Steen-Adams
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jun Zhu.

Additional information

An erratum to this article is available at http://dx.doi.org/10.1007/s13253-017-0281-x.

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

(PDF 252 kB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fu, R., Thurman, A.L., Chu, T. et al. On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood. JABES 18, 429–449 (2013). https://doi.org/10.1007/s13253-013-0144-z

Download citation

Published: 30 May 2013
Issue Date: September 2013
DOI: https://doi.org/10.1007/s13253-013-0144-z

Key Words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood

Abstract

Access this article

Similar content being viewed by others

The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis

Urban biodiversity: State of the science and future directions

Applications of structural equation modeling (SEM) in ecological studies: an updated review

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic Supplementary Material

(PDF 252 kB)

Rights and permissions

About this article

Cite this article

Key Words

Navigation

On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood

Abstract

Access this article

Similar content being viewed by others

The accuracy of crime statistics: assessing the impact of police data bias on geographic crime analysis

Urban biodiversity: State of the science and future directions

Applications of structural equation modeling (SEM) in ecological studies: an updated review

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic Supplementary Material

(PDF 252 kB)

Rights and permissions

About this article

Cite this article

Share this article

Key Words

Search

Navigation