Skip to main content
Log in

Kolmogorov–Smirnov test for spatially correlated data

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

The Kolmogorov–Smirnov test is a convenient method for investigating whether two underlying univariate probability distributions can be regarded as undistinguishable from each other or whether an underlying probability distribution differs from a hypothesized distribution. Application of the test requires that the sample be unbiased and the outcomes be independent and identically distributed, conditions that are violated in several degrees by spatially continuous attributes, such as topographical elevation. A generalized form of the bootstrap method is used here for the purpose of modeling the distribution of the statistic D of the Kolmogorov–Smirnov test. The innovation is in the resampling, which in the traditional formulation of bootstrap is done by drawing from the empirical sample with replacement presuming independence. The generalization consists of preparing resamplings with the same spatial correlation as the empirical sample. This is accomplished by reading the value of unconditional stochastic realizations at the sampling locations, realizations that are generated by simulated annealing. The new approach was tested by two empirical samples taken from an exhaustive sample closely following a lognormal distribution. One sample was a regular, unbiased sample while the other one was a clustered, preferential sample that had to be preprocessed. Our results show that the p-value for the spatially correlated case is always larger that the p-value of the statistic in the absence of spatial correlation, which is in agreement with the fact that the information content of an uncorrelated sample is larger than the one for a spatially correlated sample of the same size.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Bogaert P (1999) On the optimal estimation of the cumulative distribution function in presence of spatial dependence. Math Geol 31(2):213–239

    Google Scholar 

  • Bourgault G (1997) Spatial declustering weights. Math Geol 29(2):277–290

    Article  Google Scholar 

  • Chernick MR (2008) Bootstrap methods: a guide for practitioners and researcher, 2nd edn. Wiley Interscience, Hoboken, 369 pp

    Google Scholar 

  • de Berg M, Cheonq O, van Kreveld M, Overmars M (2008) Computational geometry, 3rd edn. Springer, Berlin, 386 pp

  • Deutsch CV (1989) DECLUS: a FORTRAN 77 program for determining optimum spatial declustering weights. Comput Geosci 15(3):325–332

    Article  Google Scholar 

  • Deutsch CV (2002) Geostatistical reservoir modeling. Oxford University Press, New York, 376 pp

    Google Scholar 

  • Deutsch CV, Journel AG (1992) GSLIB—geostatistical software library and user’s guide. Oxford University Press, New York, 340 pp

    Google Scholar 

  • Deutsch CV, Journel AG (1998) GSLIB—geostatistical software library and user’s guide, 2nd edn. Oxford University Press, New York, 384 pp

    Google Scholar 

  • Emery X, Ortiz JM (2005) Histogram and variogram inference in the multigaussian model. Stoch Env Res Risk A 19(1):48–58

    Article  Google Scholar 

  • Field A (2001) Discovering statistics using SPSS. Sage Publications, London, 779 pp, 1 CD

  • Goovaerts P (1997) Geostatistics for natural resources evaluation. Oxford University Press, New York, 483 pp

    Google Scholar 

  • Holmgren EB (1995) The P–P plot as a method for comparing treatment effects. J Am Stat Assoc 90(429):360–365

    Article  Google Scholar 

  • Isaaks EH, Srivastava RM (1989) Introduction to applied geostatistics. Oxford University Press, New York, 561 pp

    Google Scholar 

  • Journel AG (1983) Nonparametric estimation of spatial distributions. Math Geol 15(3):445–468

    Article  Google Scholar 

  • Journel AG (1994) Resampling from stochastic simulations. Environ Ecol Stat 1:63–91

    Article  Google Scholar 

  • Kendall M, Stuart A, Ord JK, Arnold S (1999) Kendall’s advanced theory of statistics, vol 2A—classical inference and the linear model. Hodder Arnold, London, 912 pp

  • Kirkpatrick S, Gellat CD Jr, Vecchi MP (1983) Optimization by simulated annealing. Science 220(4598):671–680

    Article  Google Scholar 

  • Lilliefors HW (1967) On the Kolmogorov–Smirnov test for normality with mean and variance unknown. J Am Stat Assoc 62(318):399–402

    Article  Google Scholar 

  • Liu, JS (2008) Monte Carlo strategies in scientific computing, paperback 2nd printing. Springer, New York, 346 pp

  • Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E (1953) Equation of state calculations for fast computing machines. J Chem Phys 21(6):1087–1092

    Article  CAS  Google Scholar 

  • Olea RA (2007) Declustering of clustered preferential sampling for histogram and semivariogram inference. Math Geol 39(5):453–467

    Article  Google Scholar 

  • Pardo-Igúzquiza E, Dowd PA (2004) Normality test for spatially correlated data. Math Geol 36(6):659–681

    Article  Google Scholar 

  • Stephens MA (1974) EDF statistics for goodness of fit and some comparisons. J Am Stat Assoc 69(347):730–737

    Article  Google Scholar 

  • Switzer P (1977) Estimation of spatial distributions from point sources with applications to air pollution measurement. Proceeding of the 41st ISI Session, New Delhi, Bulletin of the International Statistical Institute 47(2):123–137

    Google Scholar 

Download references

Acknowledgments

We are grateful to the insightful reviews of Paul Switzer, John C. Davis, and four anonymous reviewers, who contributed to a significant improvement of the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ricardo A. Olea.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Olea, R.A., Pawlowsky-Glahn, V. Kolmogorov–Smirnov test for spatially correlated data. Stoch Environ Res Risk Assess 23, 749–757 (2009). https://doi.org/10.1007/s00477-008-0255-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-008-0255-1

Keywords

Navigation