Temporal trend evaluation in monitoring programs with high spatial resolution and low temporal resolution using geographically weighted regression models

von Brömssen, Claudia; Fölster, Jens; Eklöf, Karin

doi:10.1007/s10661-023-11172-2

Temporal trend evaluation in monitoring programs with high spatial resolution and low temporal resolution using geographically weighted regression models

Research
Open access
Published: 10 April 2023

Volume 195, article number 547, (2023)
Cite this article

Download PDF

You have full access to this open access article

Environmental Monitoring and Assessment Aims and scope Submit manuscript

Temporal trend evaluation in monitoring programs with high spatial resolution and low temporal resolution using geographically weighted regression models

Download PDF

Claudia von Brömssen¹,
Jens Fölster² &
Karin Eklöf²

1535 Accesses
Explore all metrics

Abstract

Data from monitoring programs with high spatial resolution but low temporal resolution are often overlooked when assessing temporal trends, as the data structure does not permit the use of established trend analysis methods. However, the data include uniquely detailed information about geographically differentiated temporal trends driven by large-scale influences, such as climate or airborne deposition. In this study, we used geographically weighted regression models, extended with a temporal component, to evaluate linear and nonlinear trends in environmental monitoring data. To improve the results, we tested approaches for station-wise pre-processing of data and for validation of the resulting models. To illustrate the method, we used data on changes in total organic carbon (TOC) obtained in a monitoring program of around 4800 Swedish lakes observed once every 6 years between 2008 and 2021. On applying the methods developed here, we identified nonlinear changes in TOC from consistent negative trends over most of Sweden around 2010 to positive trends during later years in parts of the country.

Spatial–temporal modellization of the $$\hbox {NO}_{2}$$ concentration data through geostatistical tools

Article 23 November 2015

High resolution spatiotemporal modeling of long term anthropogenic nutrient discharge in China

Article Open access 09 March 2024

PatagoniaMet: A multi-source hydrometeorological dataset for Western Patagonia

Article Open access 02 January 2024

Introduction

Surveillance monitoring has two main aims: (i) to assess the status of a target object and (ii) to detect long-term trends over time (Carvalho et al., 2019; Fölster et al., 2014b). Typically, for the work to be cost-effective, different types of monitoring programs with varying temporal and spatial resolution are established. A few selected stations are monitored frequently (at least once a year) to evaluate and quantify trends, while general status is assessed at a larger number of stations, but with lower temporal frequency (e.g., once or twice in a 6-year-period; Carvalho et al., 2019). The data from the latter are often disregarded in trend evaluations or only used to provide auxiliary information. In this study, we examined whether geographically weighted regression models can reveal the spatially detailed and unique information provided by monitoring programs with high spatial resolution and low temporal resolution and if this information can be utilized to quantify regional trends, i.e., trends driven by large-scale phenomena such as atmospheric deposition or climate.

Rotating panel, also called serially alternating, designs consist of a large number of monitoring sites that are split into a few panels, which are revisited at regular intervals, but with differing start years. Some panels may be revisited annually, in what is called an augmented rotating design (McDonald, 2003). Several approaches based on linear mixed models for analyzing trends in the data obtained in such programs have been suggested (Starcevich et al., 2018a, b; Urquhart & Kincaid, 1999). For example, Dauwalter et al. (2010) estimated trends individually for each site and, using these, computed a mean slope across a larger area. Starcevich et al. (2018a) used a mixed model comprising a fixed joint trend slope, random site-specific trend slopes, random effects for sites and years, and a random site-year interaction (Piepho & Ogutu, 2002) for this purpose. These methods are designed to evaluate a single population-level trend, under the assumption that this trend has the same form and similar magnitude over the entire geographical region under assessment, while site-specific trends also can be evaluated using the random-effect site-level trend slopes. This assumption is reasonable for smaller areas or for regions where a single large-scale driver affects all sites uniformly. However, in many cases, it is naïve and could conceal important information. Moreover, due to the low temporal resolution of the data, conventional station-wise trend detection methods, such as Mann–Kendall tests (Hirsch & Slack, 1984; Hirsch et al., 1982), are not applicable, as they require more than a few data points per series. There is thus a lack of appropriate statistical methodology for evaluating temporal trends that vary over space in data with sparse temporal resolution.

Spatially varying relationships without a temporal component can be studied using geographically weighted regression (GWR) models (Brunsdon et al., 1998a). Such models allow the relationship between two variables to change smoothly over space and have been applied widely in social, economic, and medical studies. In environmental studies, Tu and Xia (2008) and Taghipour Javi et al. (2014) used GWR models to connect water quality or quantity to land use indicators, while Koh et al. (2020) studied nutrient contamination in groundwater in relation to hydrological and land use variables. In many applications of GWR, the analysis is based on single observations per site, object, or administrative unit. To counterbalance that, the regression model for a specific object uses a geographical window. Observed objects within this window, adjusted by weights that represent the distance to the object of interest, are used to build the regression model. GWR has also been adapted to include additional spatial correlations (GWR-SAR; Brunsdon et al., 1998b) and both spatial and temporal weights (GTWR; Fotheringham et al., 2015; Huang et al., 2010). For space–time data also geographically weighted panel regression models have been suggested (Yu, 2010). While the latter methods make use of temporal data, the focus was on the spatial representation and not changes in time. Since GWR is a computationally expensive method, several solutions for parallel computations have been suggested and implemented (Harris et al., 2010a, b; Murakami et al., 2021; Wang et al., 2020).

Geographically weighted regression models are a useful tool for exploring geographically diverse temporal trends in temporally sparse data as long as these trends are influenced by large-scale drivers, i.e., can be assumed to be spatially homogeneous to some extent. In this study, we examined whether GWR models can be modified by adding time as an explanatory variable and adjusting the approach used for pre-processing of data. This would permit the evaluation of spatially differentiated trends and thereby uncover new information about patterns of prevailing trends. Specifically, in this study, we sought to determine whether:

1.
Pre-processing of data could be performed to lower the influence of outliers and remove small-scale variation in observed levels of the variable of interest.
2.
The size of the chosen k-nearest-neighborhood (knn) influences observed large-scale geographical patterns and which size works best for the problem at hand.
3.
Datasets with higher temporal resolution can be added to increase the available information in sparse datasets, in order to produce more reliable temporal trends.
4.
Nonlinear temporal trends can be identified using an approach with temporally moving windows.

For illustration purposes, we used data from the Swedish Lake Survey (SLS), a monitoring program covering around 4800 lakes with a revisit interval of 6 years, that has not been utilized previously to study temporal trends. Brownification of surface waters was selected as an example of an issue that would benefit from the development of relevant statistical tools to evaluate spatial–temporal data. Brownification may be driven by recovery from acidification (Evans et al., 2005; Monteith et al., 2007) and by changes in climate, such as increased temperature or precipitation (de Wit et al., 2016; Weyhenmeyer & Karlsson, 2009; Weyhenmeyer et al., 2012). However, there are also alternative theories on the causes of brownification with more local effects, including, e.g., changes in land use and increasing coniferous forest cover (Kritzberg, 2017; Meyer-Jacob et al., 2019; Škerlep et al., 2020). An ability to compute regionally differentiated trends for brownification variables on a dense spatial grid would provide a clearer picture of which of these hypotheses is most important. As the focus of this study was on the statistical methodology, we used only one variable, total organic carbon (TOC). A more extensive analysis to evaluate potential drivers of brownification with the proposed statistical methods was conducted in a parallel study and will be reported elsewhere.

Methods

Geographically weighted regression models

Geographically weighted regression models are used to model a response variable as a function of one or several explanatory variables ${x}_{1}, {x}_{2}, \dots , {x}_{p}$ (Brunsdon et al., 1998a; Comber et al., 2023):

$${Y}_{i}={\beta }_{i0}+\sum_{k=1}^{p}{\beta }_{ik}{x}_{ik}+{\varepsilon }_{i}$$

where ${\beta }_{ik}$ are the regression coefficients for the variable ${x}_{ik}$, k denotes the different variables, and i the spatial location. This formula is valid for each of the n locations and contains usually no temporal component. The error term in the model is assumed to follow a normal distribution with a mean zero.

This means that for each spatial location, a regression model is defined. In order to achieve spatial smoothing of the regression coefficients, the coefficient for a specific location is computed from observations within a spatial neighborhood. Such a neighborhood can be defined using a fixed distance band or as a group of stations formed by the k nearest neighbors. Within the neighborhood, weights are assigned to individual observations, depending on the distance of the observations to the location modeled. Then, a weighted least squares regression is computed using the regression equation and the defined weight matrix. A typical choice for the weights makes use of the Bisquare weighting function (Brunsdon et al., 1998a):

$$w_{ij}=\left\{\begin{array}{c}\left(1-\left(d_{ij}/b\right)^2\right)^2\;\;\;if\left|d_{ij}<b\right|\\\;\;\;\;\;\;\;\;\;0\;\;\;\;\;\;\;\;\;\;\;\;\;otherwise\end{array}\right.$$

where ${d}_{ij}$ denotes the distance between observations i and j, while b represents the physical distance from location i to the k^th nearest neighbor. Standard errors for the estimated GWR coefficients correspond to standard errors in a weighted least squares regression (Fotheringham et al., 2002; Harris et al., 2010a, b).

To make this method available for use in trend analysis, we reformulated the model with respect to two features: (i) we allowed several observations instead of single observations per location and (ii) we used time explicitly as an explanatory variable. While it is possible to add additional explanatory variables, we concentrated on the basic trend model. The model is thus:

$${Y}_{ij}={\beta }_{i0}+{\beta }_{i}{t}_{ij}+{\varepsilon }_{ij}$$

where i again denotes the geographical location and j the j^th observation at that location, and ${t}_{ij}$ is a time variable, typically a year of observation.

Including time as an explanatory variable requires adjusted pre-processing of the data (see section “Data pre-processing”). We set the size of the neighborhood to be adaptive, as the spatial sampling density varied and used a Bi-square kernel to spatially weight the observations. We created the GWR model using the package GWmodel (Gollini et al., 2015; Lu et al., 2014) in the free statistical software R (R Core Team, 2022). As the dataset was rather large and GWR models are computationally expensive, we opted to perform computations in parallel on a standard PC using the default option provided in GWmodel. The size of the k-nearest-neighborhood (knn) was determined by cross-validation and also compared to user-defined sizes of the neighborhood. R scripts and data used in this study are available online (von Brömssen et al., 2023).

Data

The Swedish Lake Survey has monitored approximately 4800 lakes since 2007 (Fölster et al., 2014a, b). This program has a revisit time of 6 years, and each year a panel consisting of about 800 lakes is monitored, leading to time series of two or three observations per location so far. The spatial monitoring density is higher in southern Sweden. Mid-lake surface samples are taken during autumn circulation, usually between September and December, since these measurements are regarded as representative of the entire lake (Göransson et al., 2004). The selection of lakes is made randomly within geographical strata and accounting for lake size. Only lakes larger than 0.01 km² are included (Grandin, 2007). Occasional late observations (up to January) are attributed to the autumn of the previous year. During autumn 2018 (including January 2019), only 463 lakes were sampled due to a technical breakdown of the helicopter used for monitoring. The remaining lakes were sampled together with the panel planned for 2019, leading to a total of 1080 lake measurements in 2019. Some data quality problems were detected for the first year of monitoring in the SLS (2007), so our analysis was based on data from 2008 to 2021. As the goal was to identify larger regions with common trends, the 11 lakes on the island of Gotland, which lies off the south-east coast of Sweden, were omitted from the analysis.

A parallel monitoring program, the Swedish Trend Lake (STL) program, initially covered small acidification-sensitive forest lakes, which were not treated by liming to mitigate acidification. Over time, the scope was broadened several times, and it now includes lakes that are unaffected by local point source pollution in general. Its monitoring sites have a good geographical representation in Sweden, but with a higher density in southern Sweden. The current program includes 107 lakes (excluding lakes on the island of Gotland). Most of the lakes are monitored four times a year, while some have a more frequent sampling regime (Fölster et al., 2014a). For the purposes of the present study, where STL data were co-analyzed with SLS data, STL surface water observations (≤ 1 m) during autumn, i.e., between September and December, in the years 2008 to 2021 were selected. When evaluating nonlinear trends, all available STL data between 2008 and 2021 were used for a supporting analysis (section “Nonlinear temporal trends”).

All data used were downloaded from the environmental data internet service at the Swedish University of Agricultural Sciences https://miljodata.slu.se/MVM/.

Data pre-processing

Basic pre-processing

The goal of our study is to conduct temporal trend evaluation and we removed, thus, sites with less than two observations to ensure that only time series type of data were included. This led to the removal of 214 observations/stations. All TOC data were log₁₀-transformed before analysis to account for the skewed data distribution and to decrease the effect of potential outliers.

Detection of series with high variation and outliers

Local outliers can strongly influence GWR estimates (Comber et al., 2023; de Bellefon & Floch, 2018; Fotheringham et al., 2002). A number of adjusted methods have been suggested to reduce the potential influences of single observations, by identifying and eliminating or downweighting them (Fotheringham et al., 2002; Harris et al., 2010a, b), by using robust estimation (Sugasawa & Murakami, 2021; Zhang & Mei, 2011), or by robust cross-validation (Farber & Páez, 2007; Harris et al., 2010a, b). In this study, the log₁₀-transformation of the data before analysis already decreased the effect of single deviating observations. Additionally, we took advantage of the fact that we had several observations per site, which is usually not the case for GWR. We were thus able to identify sites with unusually large variation relatively to observed levels by computing the coefficient of variation (CV) for each station as defined for log-normal data (Koopmans et al., 1964):

$${CV}_{log-normal}=\sqrt{{e}^{{s}_{ln}^{2}}-1}$$

where ${s}_{ln}^{2}$ is the variance in the response on the scale of the natural logarithm. Here, we used a logarithm with base 10, so this was substituted with ${s}_{ln}={s}_{10}\bullet ln\left(10\right)$, where ${s}_{10}$ is the standard deviation of the response transformed with a logarithm of base 10. Station-wise variances and station-wise mean values were also computed and compared.

Assessing the geographical heterogeneity of temporal changes

To evaluate whether a geographically differentiated trend model was necessary, we first fitted a global model, i.e., we assumed a homogeneous trend slope for all of Sweden and fitted a traditional linear regression on time while ignoring spatial dependence. For this model, we studied the geographical heterogeneity of the residuals using plots and summary statistics. To further assess the spatial autocorrelation of a feature, Moran’s I, which is a measure for clustering of spatial data, can be computed (Comber et al., 2023). Here, we wanted to determine whether the predominant trend differed in different parts of Sweden and thus we determined the spatial clustering using the change over time, rather than the level of the variable of interest. For this, we determined the change per year for each individual station as:

$${change}_{year}=\frac{{y}_{j}-{y}_{i}}{j-i}$$

where ${y}_{i}$ is the first log-transformed observation at a site and ${y}_{j}$ is the last observation, while i and j indicate the years of observation. We then computed Moran’s I for the variable change_year using Euclidian distances in space, with the package ape in R (Paradis & Schliep, 2019).

Station-wise centering of data

When data are collected from a number of sites, the general level of the variable of interest can vary due to factors such as catchment land uses and the depth, type, or size of the lake. While we assumed that the prevailing temporal trends in a neighborhood are similar, we could not make this assumption for the levels of the variable of interest. Since GWR is based on the spatial weighting of observations and the data analyzed here were collected with a low temporal resolution, varying station-wise levels might lead to undesired effects on the trend analysis. To illustrate this, observations of log-transformed TOC from nine stations within a geographical window are shown in Fig. 1 (left panel). Both decreasing and increasing trends are evident at individual stations, but several of the stations with higher mean levels were monitored during later years. A joint trend estimated for the data (here without geographical weighting) led to the conclusion that there is an apparent upward trend, with an estimated magnitude of 0.018 units per year (standard error: 0.02). To decrease the effect of station-specific mean levels, we applied station-wise mean-centering of the data, i.e., we subtracted the mean level of each station from the individual observations, resulting in a series with zero mean (Fig. 1, right). When we then computed the trend slope for these series, the result was a slightly negative trend estimate (−0.005; standard error: 0.002).

Model validation

For a GWR model, the residual sum of the square, local R², and p-values can be computed to assess its fit (Comber et al., 2023). The p-values can be corrected for multiple testing using, e.g., Benjamini and Hochberg False Discovery rates or the Bonferroni method (da Silva & Fotheringham, 2016). In our application, p-values were only used to identify areas with potential trends and exact p-values were not relevant. Therefore, we followed the recommendation of Brunsdon et al. (1998a) and only considered areas with a large proportion of locally very distinct trends, while smaller areas and single stations indicating geographically limited deviations were ignored. Stations that were identified in the pre-processing step as containing much variation compared with the overall level of the measurements were examined by computing the differences between the estimated trend slopes obtained for models with and without these stations. While we generally used the size of the geographical neighborhood suggested by the cross-validation approach, we also compared these results with a model assuming a much smaller neighborhood, both visually and by computing the difference between the estimated trend slopes for the two neighborhood sizes.

Combining datasets

Geographically weighted regression involves the weighting of observations from geographical windows, which makes it possible to combine data from different monitoring programs even if their temporal and spatial resolution vary. Here, we merged SLS and STL data, with SLS being a random sample of lakes and STL a sample of lakes unaffected by local pollution sources. It was thus important to verify that the same type of overall trend was monitored, especially since the STL monitors with substantially higher temporal frequency than the SLS. The effect of including the STL data in the analysis was evaluated by inspecting the changes obtained in slope estimates.

Nonlinear temporal trends

GWR allowed us to estimate linear trends in time, but for longer time series, this would not be adequate to identify ongoing changes. Therefore, in addition to the overall GWR analysis, we also applied GWR on temporal subsets, so-called moving windows, of the SLS data. We used coefficients estimated for 10-year-long windows and results plotted side-by-side to identify any clear changes in the trend slope coefficients over time, i.e., any indications of nonlinearity.

For comparison, we evaluated nonlinear trends in the STL data, which have a higher temporal resolution, using generalized additive mixed models (Hastie & Tibshirani, 1986; Wood, 2017) and the visualization methods suggested by von Brömssen et al. (2021). For this analysis, we used all available data in the STL, usually at least one observation per season. We modeled the overall trend by a thin plate spline and accounted for seasonal variation in the model by a seasonal indicator variable with four levels: spring (March–May), summer (June–August), autumn (September–November), and winter (December–February). The error term is modeled as an autoregressive process of order 1 to account for the temporal autocorrelation in these time series.

Results

Data quality assurance and pre-processing

High levels of TOC were generally recorded in southern Sweden, but also on the north-east coast (Fig. 2, left). Using the coefficient of variation, we were able to locate stations with relatively high variation compared with the station mean (Fig. 2, right). Especially, eleven lakes had a value higher than 1 (Table S1 in Supporting Information (SI)), i.e., the standard deviation of the log-transformed observations was at least as large as their local mean. An unusually high number of such lakes (five) were observed in the Swedish alpine region (north-west of the country), where the overall level of TOC in lake water is generally low (≤ 1 mg/L). At four of these five stations, unusually high TOC levels were observed as the first recorded value, which could introduce an artificial trend if these observations were faulty. However, the measurements were made during different years, and there is no obvious reason to believe that these observations were not correct. The remaining stations with the highest coefficient of variation were distributed randomly over the country and included stations that so far only have been visited twice. While these stations most probably have some effect on trend estimation in their immediate region, at this stage we were unable to determine whether these were invalid or erroneous measurements and they needed to be revisited instead in model validation. Station-wise variances were computed, but did not contribute further to the analysis of data quality.

Trends in TOC

Moran’s I was calculated on the change per year based on individual stations and showed a spatial pattern (p < 0.001). This pattern was confirmed by fitting a global linear trend model and analyzing its residuals (not shown). Thus, both analyses indicated a need for geographically differentiated analysis.

Temporal trends in TOC were estimated by GWR for log-transformed TOC values that were mean-centered within the station. The size of the k-nearest neighborhood was determined to be 370 by cross-validation, indicating that approximately 125 lakes (about 2.5% of all lakes), with varying weights, were considered in the estimation of the trend slope for an individual lake. Clear decreasing trends with the magnitude of about 0.02 were observed in the Swedish alpine region (Fig. 3, left) and trends with slightly lower magnitude in south-east Sweden. Increasing trends were seen in south-west Sweden and to some extent in the region of Gävleborg in central Sweden. A plot illustrating where trends were significant at an unadjusted significance level of p < 0.05 was used to enhance this geographical information (Fig. 3, right).

For comparison, GWR was also computed for data without station-wise mean-centering, where the cross-validated k-nearest neighborhood was determined to be 23, i.e., only about 8 lakes. While the overall picture of positive and negative trend slope estimates in some regions still could be identified (Fig. S2, left) local deviations were much larger due to the small size of the selected neighborhood and standard errors of trend slopes were approximately 10 times higher than in the centered case (Figs. S1 and S2, right). Only few stations in the alpine region showed significant trends. Similarly, using a larger neighborhood (knn = 370) few regions showed significant trends due to the increased local variation when the TOC level of stations differed strongly and the standard errors of the trend slopes were still about 4 times larger compared to the model for mean-centered data. The model using non-centered data was not considered further.

Model validation

The resulting model had an overall adjusted R² value of 0.12, with local R² values up to about 0.4 in the region with the strongest trends, i.e., the alpine region. The standard error values computed in GWR showed the expected pattern of higher variation in regions with the sparser geographical resolution, especially along the north-east coast, central-west Sweden, and southernmost Sweden (Fig. S1 in SI). Regions with high trend slopes and significant trends were mostly well connected. Some smaller areas with significant trends were observed, especially in central Sweden (Fig. 3, right), but these trends should not be interpreted unless supported by additional data, expert knowledge, or auxiliary information from relevant background variables. The removal of 24 lakes with CV_log-normal > 0.8 obviously led to a change in trend estimates, but the change was seldom larger than 0.001 (Fig. S3 in SI, left). This was mainly observed in the alpine region, where the negative trends appeared weaker on removing high-variation sites. Due to the lower variation in some other regions, new stations appeared to be significant after the removal of several high-variation sites. However, these were randomly scattered throughout Sweden and trends were thus not eligible for interpretation.

It was expected that single lakes would not have any substantial influence on the results of this model, as the neighborhood used was rather large. However, the large neighborhood also made it more difficult to observe local trends that could arise due to climatic or atmospheric deposition effects on a smaller scale (e.g., rainy alpine regions, high deposition on south-west coast) or to detect whether smaller regional drivers had more effect than large-scale drivers. We evaluated the effect of neighborhood by running a GWR with a smaller k-nearest-neighborhood of only 90 observations (approx. 30 lakes). The overall picture was unchanged, with decreasing trends in the alpine region, central-west Sweden, and south-east Sweden and increasing trends mainly in the south-west (Fig. 4, left). However, local trends were much more pronounced, with maximum slope estimates as high as 0.037 (for negative trends in the alpine region; standard error: 0.002) and 0.015 (for positive trends around Hanöbukten; standard error: 0.002) and local R² values above 0.7. Consequently, fewer locations showed significant trends (Fig. 4, right), and significant trends were more often observed for small clusters of lakes, especially around the edges of the fitted area, i.e., the coastline. Even for the smaller neighborhood (knn = 90), the removal of lakes with high CV_log-normal did not change the overall picture (results not shown).

Combining data from different monitoring programs

The STL program has a higher temporal frequency, but much fewer stations than the SLS program. We merged data from these two programs and conducted a joint GWR. Cross-validation determined the optimal k-nearest neighborhood to be 400 for the combined data (about 130 lakes or 3% of all stations). While the overall picture of prevailing trends was quite similar to that observed for the SLS data alone (Fig. 5, left), there were some differences in the estimated trend coefficients (Figs. 5, right; S3, right). The greatest difference in the estimated slope coefficient was observed around Lake Hindsen in the extreme south of Sweden, but since this is an area with few lakes on the edge of the observed geographical area, we did not interpret this change further. A similar difference in slope estimate was observed around Lake Kupesjön, which lies near three STL lakes, Älgarydssjön, Fiolen, and Stora Skärsjön. All three STL lakes showed some decrease in TOC concentration over the years (Fig. 6, upper left), while SLS lakes mainly indicated increased TOC level or no change. This area connects with lakes north-east of Halmstad (Fig. 6, upper right), where a similar change was observed (Fig. 5 (right), red circle). In both areas, the overall estimated trend magnitude was still mainly positive, but was lessened by the addition of the STL lakes. The approximate decrease in trend slope estimate was 0.0035 units. A third area with decreased slope coefficient after the inclusion of the STL lakes was identified south of Stockholm, around Södertälje. There, most of the STL lakes exhibited high values during the first years in the series (Fig. 5 (right), green circle; Fig. 6, lower left), a finding which needs to be studied further. Clear confirmation of an increasing trend in TOC was given by the STL lakes Svinarydsjön and Örsjön outside Karlshamn, close to Hanöbukten (Fig. 6, lower right). The estimated trend slope for this region was around 0.002 units steeper when STL lakes were included compared with only SLS lakes.

Nonlinear trends in TOC

To study nonlinear trends in the SLS data, we fitted separate GWR models in five different 10-year temporal windows (2008–2017, 2009–2018, 2010–2019, 2011–2020, 2012–2021). We observed primarily downward trends in TOC during the first few 10-year periods (Fig. 7, left), while, especially during the last 10 years, we observed increasing TOC levels in south-west and north-east Sweden (Fig. 7, right). Negative trends prevailed in the northern Swedish alpine regions and, to a lesser extent, in the south-east during the entire observation period. Even though the periods used in the analysis were only 10 years long, many of the observed changes identified were significant (Fig. S4 in SI).

To validate the results, we again used the sites in the STL program. As the data had a higher temporal resolution, we performed the analysis of nonlinear trends station-wise for the 107 stations, using generalized additive models. We observed that for TOC, significant downward trends were present even in this analysis for several lakes in the north, as well as the south east (e.g., Glimmingen, Fjärasjö, and Hökesjön), and several series exhibited upward trends towards the end of the series, both in north-east (e.g., Täftesträsket, Hällvattnet) and in south Sweden (Fig. S5 in SI). In the south, the most prominent increases happened in Västergötland, between the large lakes Vänern and Vättern, while no or even negative trends were observed in the south-east, as already observed in the combined SLS and STL analysis. An exception was again the area around Hanöbukten where lakes Svinarydsjön and Örsjön showed significant increases.

Discussion

GWR as a method for trend evaluation for datasets with a sparse temporal resolution

Large amounts of data are collected in environmental monitoring to determine the status of lakes, water courses, grasslands, forests, or other types of objects. A typical choice for such programs is to keep the temporal resolution of the series low in favor of obtaining a better spatial representation and including more objects in the survey. Still, such extensive data sets are also very interesting for temporal trend analysis. GWR has so far not been used to evaluate temporal trends but is a much-used tool to quantify spatially varying relationships. Including time as an explanatory variable in GWR, we can conduct a trend analysis as long as trends can be assumed to be consistent over a reasonably large region. In the brownification of surface waters, climate and current and previous acid deposition are expected to influence the level and change in TOC (de Wit et al., 2016; Evans et al., 2005; Monteith et al., 2007; Weyhenmeyer & Karlsson, 2009; Weyhenmeyer et al., 2012), and Sweden has a clear gradient both in deposition and climate. Therefore, TOC in Swedish lakes is a suitable study area for geographically differentiated temporal trends as identified by GWR.

Even though GWR is quite straightforward to use for temporal trend evaluation by including time of observation as an explanatory variable, there are some additional features to be considered. For example, the level of observed quantities is typically not influenced by a single driver, but by a combination of characteristics of the site in question. This is most probably also the cause for TOC, as trends of TOC have been suggested to be driven by land use (Škerlep et al., 2020) and soil processes related to recovery from acidification (Evans et al., 2012). Such local differences in confounding factors can easily disguise existing trends or create artificial trends, especially if the geographical window chosen is small. To avoid such effects, we used two approaches: (i) station-wise mean-centering of data as a pre-processing step to rescale data at a mean of zero and thus remove site observed levels as an influencing factor; and (ii) evaluation of the chosen window size. We used adaptive kernels defined as k-nearest neighbors (knn), as the data had a varying spatial resolution, which makes a fixed bandwidth problematic (Comber et al., 2023). Cross-validation is recommended to choose the best local size of the knn. In our case, the cross-validation results suggested a neighborhood of approximately 372 observations or 125 lakes. Smaller neighborhoods allow more local trends to manifest, which can be of interest if the spatial resolution is sparser, i.e., if a large neighborhood oversmooths results or if drivers are expected to work on a smaller regional scale. With very sparse temporal data, as in the present case, it will generally not be possible to identify trends on a small geographical scale. Instead, the goal must be the rough identification of areas with increasing and decreasing TOC levels, while the exact borders of these regions are not important. In this study, analyses with larger and smaller knn produced very similar results, with clear decreases in TOC in the northern alpine region and in south-east Sweden and with increasing TOC levels primarily in south-west Sweden.

It needs to be pointed out that for our approach GWR is a hypothesis-generating rather than a hypothesis-testing method. Provided results include standard errors for trend estimates as well as p-values, but both of these will be underestimated due to several reasons. First, temporal autocorrelation might be present in addition to the temporal trend, but is not possible to estimate in time series that only consist of two or three observations. Similarly, spatial correlation might still be present in the error term even after geographical smoothing. The presence of either of these correlations will lead to underestimation of standard errors, which in turn will cause too low p-values. Additionally, as we have a large number of locations, adjustment for multiple testing (da Silva & Fotheringham, 2016) would be needed to decrease the risk of false positives. To avoid these issues, we chose to only interpret patterns of trends and with that lay the foundation to generate hypotheses on where trends are present and, consequently, which drivers might be of importance.

Pre-processing and model validation

As GWR models are sensitive to outliers (Brunsdon et al., 1998a), pre-processing and model validation are important. We did this by identifying stations with a high coefficient of variation before model fitting and analyzing the effect of the removal of such stations on the model results. A high coefficient of variation may mean stations that have faulty high or low values, but can also indicate stations that have a steep temporal trend. In the latter case, their removal would obviously be disadvantageous for trend evaluation. Since the removal of such stations with a high coefficient of variation had little effect in the present analysis, this indicates that they were stations with a steep trend located in areas that generally showed such a trend, or single stations, which would not be interpreted as trends anyway. For small knn in particular, we found several limited areas with significant trends along the Swedish coastline and in the alpine region. This is not unexpected, as it is generally more difficult to estimate reliable trend coefficients at the edge of the observable area (Leong & Yue, 2017), and single stations with a distinct increase or decrease can have larger effects.

The overall and local R² values can be used to determine the goodness of fit. For our main model, the overall R² was only 0.12. However, this was not informative as there were large regions in Sweden that did not exhibit a trend and therefore contributed to a low coefficient of determination. The local R² values are more interesting to quantify, and those reached 0.4 for the model with a large neighborhood size and 0.7 when a small neighborhood was chosen, indicating that there are rather strong temporal trends in some geographical areas. Consequently, the GWR approach is deemed successful as it can identify regions with trends as well as such without. The inclusion of local covariates might improve the overall and local R² values additionally and will be studied separately. Care needs, however, to be taken as high R² values in models with very small neighborhoods or with many covariates might be a sign of overfitting due to the small number of observations available per station.

Additionally, GWR models can be validated by comparing results with studies with a similar objective but using datasets with higher temporal resolution. For example, Eklöf et al. (2021) studied TOC concentrations and absorbance in 164 watercourses distributed over Sweden and found a smaller number of streams with ongoing increasing TOC trends in the period 2011–2020, after a period of more general positive trends. Most of these streams were located in south-west Sweden and the along the north-east coast, in agreement with the findings in our study. Eklöf et al. (2021) also identified single stations in the south-east with decreasing trends in TOC during the past decade, but in that region and in the alpine region only a few water courses were sampled and results are more difficult to compare.

To validate the results of a trend analysis, it could be interesting to conduct a power study. Pregler et al. (2019) conducted a power study for different revisit designs and found that serially alternating designs have similar power to always revisited designs if the total amount of observations is the same. Similarly, Starcevich et al. (2018a) found that a revisit design with more unique sites is preferable to annual revisit panels of sites when detecting trends over at least 12 years of monitoring with a linear mixed model. However, both assume that a common trend is present over the area of interest. In our case, it is reasonable to assume that trends differ in different parts of Sweden and the power to detect spatially differentiated trends would thus be more difficult to quantify.

Combining datasets with a low and high temporal resolution

By using a separate dataset, the STL data, we were able to add data from 107 additional stations in computing the GWR model. The data for the STL stations have a higher temporal resolution, but do not represent randomly selected stations, which the SLS does. Instead, the STL program monitors stations that are mainly unaffected by local pollution sources. As the main goal in the present case was to study trends caused by large-scale drivers, such as climate, and recovery from acid deposition, the different objectives of the two surveys were not expected to affect the trend evaluation results to any substantial degree. Rather, the STL data helped make a distinction between large-scale drivers that were of interest here and more local pollution sources. We compared the results obtained in combined analysis with those based on the SLS alone and found only a few differences. The most noticeable change occurred in south-west Sweden, between Halmstad (approx. 56.692748 lat., 13.112771 long.) and east of Värnamo (approx. 57.208242 lat., 14.142123 long.), where the estimated trend slope for TOC decreased by 0.0035 units compared to the SLS dataset alone, meaning that local pollution sources might be prevalent for lakes that are part of the SLS. However, the combined analysis results still indicated an increase in TOC levels. For comparison, in the well-studied Lake Bolmen water color has been observed to decrease during the past decade (Klante et al., 2021), following high levels observed after a strong storm and large amounts of tree felling in 2005. Such nonlinear temporal relationships could also be a reason why different monitoring programs with different objectives provide information that is not completely in line.

In other regions of Sweden, the STL data supported or enforced the general direction of the SLS trends. For example, near Karlshamn, the STL lakes showed clear increasing trends, which was also observed in the SLS lakes. Similarly, the downward trend in the Helags alpine region was confirmed by data from the STL program. This is an area where results based on the SLS alone must be considered uncertain, due to large variations and low absolute values of TOC and its situation at the edge of the observable space.

Nonlinear temporal trends

In the present study, we assumed that the temporal trend in log-transformed TOC can be approximated by a line, but is allowed to vary smoothly over space. Linear temporal trends can be reasonable if time series are short or, as in our case, data are sparse and linearity is a helpful simplification. However, to understand trends and what is driving these, it is necessary to investigate in nonlinearity whenever possible. We addressed this using a moving window approach, i.e., we subdivided the data into five 10-year intervals, moving the interval 1 year forward in each step. This provided at least some information about how estimated trend slopes changed over time. In particular, we found that during the last 10-year period studied (2012–2021), lakes increasingly showed positive trends after a period of an overall decrease in TOC. The reliability of the results of this nonlinear analysis was confirmed on comparing them to the results from trend analysis based on higher frequency data. Here, we used a screening of trends with generalized additive models (von Brömssen et al., 2021), which showed increases in TOC at a number of stations during the latest years of monitoring.

The possible presence of nonlinear trends in data puts our initial analysis into question. A combination of data with sparse temporal resolution and nonlinear trends will require some trade-off in modeling. Here, the goal was to demonstrate the usefulness of GWR in such situations, and we present both the linear and the nonlinear trends. Which of the two alternatives to use (linear with more data or nonlinear with subsetted datasets) needs to be addressed in the individual modeling situation. For TOC, the change in trend slope estimates over time seems strong, and therefore, a nonlinear approach would probably be the appropriate choice.

Including other explanatory variables

In the present study, we did not include any other explanatory variable except time. GWR models are equipped to include additional explanatory variables and can also be fitted with geographical windows of different sizes (so-called mixed GWR; Fotheringham et al., 2002). This would provide the possibility to adjust or normalize data with the aid of local conditions for individual lakes, e.g., local land use or levels of important background variables such as water temperature. How well this works when data are sparse needs to be evaluated in a separate study.

Large-scale drivers of brownification

While we did not try to make a comprehensive analysis of trends in brownification in Swedish lakes, the modeling results obtained using TOC as a proxy revealed clear large-scale patterns in the prevailing trends. In particular, downward trends emerged in the alpine region and in south-east Sweden, while in the south-west and north-east, decreasing trends were observed during the first years of the study period, but in the last decade, positive changes in TOC were predominant. Although the trends in TOC were generally nonlinear, the GWR analyses managed to identify a large-scale geographical pattern of areas of increasing versus decreasing trends. In the next stage of the analysis, this geographical pattern can be related to levels or trends of land-use activities, catchment characteristics, climatic variables, and acid deposition patterns.

Conclusions

Data from monitoring programs with low temporal frequency, but covering many sites, are often not considered in trend evaluation. Here, we show that such programs can carry unique and important information about regionally differentiated trends, which will be valuable guidance to distinguish between potential causes of the trend and allow better management decisions.

Data availability

The data and all used R code that supports the findings of this study are available online (von Brömssen et al., 2023).

References

Brunsdon, C., Fotheringham, S., & Charlton, M. (1998a). Geographically weighted regression. Journal of the Royal Statistical Society, 47, 431–443. https://doi.org/10.1111/1467-9884.00145
Article Google Scholar
Brunsdon, C., Fotheringham, S., & Charlton, M. (1998b). Spatial nonstationarity and autoregressive models. Environment and Planning A: Economy and Space, 30, 957–973. https://doi.org/10.1068/a300957
Article Google Scholar
Carvalho, L., Mackay, E. B., Cardoso, A. C., Baattrup-Pedersen, A., Birk, S., Blackstock, K. L., Borics, G., Borja, A., Feld, C. K., Ferreira, M. T., Globevnik, L., Grizzetti, B., Hendry, S., Hering, D., Kelly, M., Langaas, S., Meissner, K., Panagopoulos, Y., Penning, E., … Solheim, A. L. (2019). Protecting and restoring Europe’s waters: An analysis of the future development needs of the Water Framework Directive. Science of the Total Environment, 658, 1228–1238. https://doi.org/10.1016/j.scitotenv.2018.12.255
Article CAS Google Scholar
Comber, A., Brunsdon, C., Charlton, M., Dong, G., Harris, R., Lu, B., Lü, Y., Murakami, D., Nakaya, T., Wang, Y., & Harris, P. (2023). A route map for successful applications of geographically weighted regression. Geographical Analysis, 55, 155–178. https://doi.org/10.1111/gean.12316
Article Google Scholar
da Silva, A. R., & Fotheringham, A. S. (2016). The multiple testing issue in geographically weighted regression. Geographical Analysis, 48, 233–247. https://doi.org/10.1111/gean.12084
Article Google Scholar
Dauwalter, D. C., Rahel, F. J., & Gerow, K. G. (2010). Power of revisit monitoring designs to detect forestwide declines in trout populations. North American Journal of Fisheries Management, 30, 1462–1468. https://doi.org/10.1577/M10-048.1
Article Google Scholar
de Bellefon, M. -P., & Floch, J. -M. (2018). Geographically weighted regression, in: Handbook of Spatial Analysis, Insee Methodes. Insee, Eurostat.
de Wit, H. A., Valinia, S., Weyhenmeyer, G. A., Futter, M. N., Kortelainen, P., Austnes, K., Hessen, D. O., Räike, A., Laudon, H., & Vuorenmaa, J. (2016). Current browning of surface waters will be further promoted by wetter climate. Environmental Science & Technology Letters, 3, 430–435. https://doi.org/10.1021/acs.estlett.6b00396
Article CAS Google Scholar
Eklöf, K., von Brömssen, C., Amvrosiadi, N., Fölster, J., Wallin, M. B., & Bishop, K. (2021). Brownification on hold: What traditional analyses miss in extended surface water records. Water Research, 203, 117544. https://doi.org/10.1016/j.watres.2021.117544
Evans, C. D., Jones, T. G., Burden, A., Ostle, N., Zieliński, P., Cooper, M. D. A., Peacock, M., Clark, J. M., Oulehle, F., Cooper, D., & Freeman, C. (2012). Acidity controls on dissolved organic carbon mobility in organic soils. Global Change Biology, 18, 3317–3331. https://doi.org/10.1111/j.1365-2486.2012.02794.x
Article Google Scholar
Evans, C. D., Monteith, D. T., & Cooper, D. M. (2005). Long-term increases in surface water dissolved organic carbon: Observations, possible causes and environmental impacts. Environmental pollution, Barking, Essex, 1987(137), 55–71. https://doi.org/10.1016/j.envpol.2004.12.031
Article CAS Google Scholar
Farber, S., & Páez, A. (2007). A systematic investigation of cross-validation in GWR model estimation: Empirical analysis and Monte Carlo simulations. Journal of Geographical Systems, 9, 371–396. https://doi.org/10.1007/s10109-007-0051-3
Article Google Scholar
Fölster, J., Hallstan, S., & Johnson, R. (2014a). Utvärdering av de nationella miljöövervakningsprogrammen av sjöar (Rapport No. 2014a:3). Uppsala.
Fölster, J., Johnson, R. K., Futter, M. N., & Wilander, A. (2014b). The Swedish monitoring of surface waters: 50 years of adaptive monitoring. Ambio, 43, 3–18. https://doi.org/10.1007/s13280-014-0558-z
Article CAS Google Scholar
Fotheringham, A. S., Brunsdon, C., & Charlton, M. (2002). Geographically weighted regression: The analysis of spatially varying relationships. Wiley.
Google Scholar
Fotheringham, A. S., Crespo, R., & Yao, J. (2015). Geographical and temporal weighted regression (GTWR). Geographical Analysis, 47, 431–452. https://doi.org/10.1111/gean.12071
Article Google Scholar
Gollini, I., Lu, B., Charlton, M., Brunsdon, C., & Harris, P. (2015). GWmodel: An R package for exploring spatial heterogeneity using geographically weighted models. Journal of Statistical Software, 63, 1–50. https://doi.org/10.18637/jss.v063.i17
Göransson, E., Johnson, R. K., & Wilander, A. (2004). Representativity of a mid-lake surface water chemistry sample. Environmental Monitoring and Assessment, 95, 221–238. https://doi.org/10.1023/B:EMAS.0000029905.07072.b0
Article Google Scholar
Grandin, U. (2007). Strategier för urval av sjöar som ska ingå i den sexåriga omdrevsinventeringen av vattenkvalitet i svenska sjöar (Rapport No. 2007:10). Uppsala.
Harris, P., Fotheringham, A. S., & Juggins, S. (2010a). Robust geographically weighted regression: A technique for quantifying spatial relationships between freshwater acidification critical loads and catchment attributes. American Association of Geographers Annals, 100, 286–306. https://doi.org/10.1080/00045600903550378
Article Google Scholar
Harris, R., Singleton, A., Grose, D., Brunsdon, C., & Longley, P. (2010b). Grid-enabling geographically weighted regression: A case study of participation in higher education in England. Transactions in GIS, 14, 43–61. https://doi.org/10.1111/j.1467-9671.2009.01181.x
Article Google Scholar
Hastie, T., & Tibshirani, R. (1986). Generalized additive models. Statistical Science, 1, 297–310. https://doi.org/10.1214/ss/1177013604
Article Google Scholar
Hirsch, R. M., & Slack, J. R. (1984). A nonparametric trend test for seasonal data with serial dependence. Water Resources Research, 20, 727–732. https://doi.org/10.1029/WR020i006p00727
Article Google Scholar
Hirsch, R. M., Slack, J. R., & Smith, R. A. (1982). Techniques of trend analysis for monthly water quality data. Water Resources Research, 18, 107–121. https://doi.org/10.1029/WR018i001p00107
Article Google Scholar
Huang, B., Wu, B., & Barry, M. (2010). Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. International Journal of Geographical Information Science, 24, 383–401. https://doi.org/10.1080/13658810802672469
Article Google Scholar
Klante, C., Larson, M., & Persson, K. M. (2021). Brownification in Lake Bolmen, Sweden, and its relationship to natural and human-induced changes. Journal of Hydrology: Regional Studies, 100863. https://doi.org/10.1016/j.ejrh.2021.100863
Koh, E. -H., Lee, E., & Lee, K. -K. (2020). Application of geographically weighted regression models to predict spatial characteristics of nitrate contamination: Implications for an effective groundwater management strategy. Journal of Environmental Management, 268, 110646. https://doi.org/10.1016/j.jenvman.2020.110646
Koopmans, L. H., Owen, D. B., & Rosenblatt, J. I. (1964). Confidence intervals for the coefficient of variation for the normal and log normal distributions. Biometrika, 51, 25–32. https://doi.org/10.1093/biomet/51.1-2.25
Article Google Scholar
Kritzberg, E. S. (2017). Centennial-long trends of lake browning show major effect of afforestation. Limnol. Oceanogr. Lett., 2, 105–112. https://doi.org/10.1002/lol2.10041
Article Google Scholar
Leong, Y. -Y., & Yue, J. C. (2017). A modification to geographically weighted regression. International Journal of Health Geographics, 16, 11. https://doi.org/10.1186/s12942-017-0085-9
Article Google Scholar
Lu, B., Harris, P., Charlton, M., & Brunsdon, C. (2014). The GWmodel R package: Further topics for exploring spatial heterogeneity using geographically weighted models. Geo-Spat. Inf. Sci., 17, 85–101. https://doi.org/10.1080/10095020.2014.917453
Article Google Scholar
McDonald, T. L. (2003). Review of environmental monitoring methods: Survey designs. Environmental Monitoring and Assessment, 85, 277–292. https://doi.org/10.1023/A:1023954311636
Article Google Scholar
Meyer-Jacob, C., Michelutti, N., Paterson, A. M., Cumming, B. F., Keller, W., & Smol, J. P. (2019). The browning and re-browning of lakes: Divergent lake-water organic carbon trends linked to acid deposition and climate change. Science and Reports, 9, 16676. https://doi.org/10.1038/s41598-019-52912-0
Article CAS Google Scholar
Monteith, D. T., Stoddard, J. L., Evans, C. D., de Wit, H. A., Forsius, M., Høgåsen, T., Wilander, A., Skjelkvåle, B. L., Jeffries, D. S., Vuorenmaa, J., Keller, B., Kopácek, J., & Vesely, J. (2007). Dissolved organic carbon trends resulting from changes in atmospheric deposition chemistry. Nature, 450, 537–540. https://doi.org/10.1038/nature06316
Article CAS Google Scholar
Murakami, D., Tsutsumida, N., Yoshida, T., Nakaya, T., & Lu, B. (2021). Scalable GWR: A linear-time algorithm for large-scale geographically weighted regression with polynomial kernels. American Association of Geographers Annals, 111, 459–480. https://doi.org/10.1080/24694452.2020.1774350
Article Google Scholar
Paradis, E., & Schliep, K. (2019). ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics, 35, 526–528.
Article CAS Google Scholar
Piepho, H. -P., & Ogutu, J. O. (2002). A simple mixed model for trend analysis in wildlife populations. Journal of Agricultural, Biological, and Environmental Statistics, 7, 350. https://doi.org/10.1198/108571102366
Article Google Scholar
Pregler, K. C., Hanks, R. D., Childress, E. S., Hitt, N. P., Hocking, D. J., Letcher, B. H., Wagner, T., & Kanno, Y. (2019). State-space analysis of power to detect regional brook trout population trends over time. Canadian Journal of Fisheries and Aquatic Sciences, 76, 2145–2155. https://doi.org/10.1139/cjfas-2018-0241
Article Google Scholar
R Core Team. (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
Škerlep, M., Steiner, E., Axelsson, A., & Kritzberg, E. S. (2020). Afforestation driving long-term surface water browning. Global Change Biology, 26, 1390–1399. https://doi.org/10.1111/gcb.14891
Article Google Scholar
Starcevich, L. A. H., Irvine, K. M., & Heard, A. M. (2018a). Impacts of temporal revisit designs on the power to detect trend with a linear mixed model: An application to long-term monitoring of Sierra Nevada lakes. Ecological Indicators, 93, 847–855. https://doi.org/10.1016/j.ecolind.2018.05.087
Article Google Scholar
Starcevich, L. A. H., McDonald, T., Chung-MacCoubrey, A., Heard, A., Nesmith, J., & Philippi, T. (2018b). Trend estimation for complex survey designs of water chemistry indicators from Sierra Nevada Lakes. Environmental Monitoring and Assessment, 190, 596. https://doi.org/10.1007/s10661-018-6963-1
Article CAS Google Scholar
Sugasawa, S., & Murakami, D. (2021). Adaptively robust geographically weighted regression. ArXiv210615811 Stat.
Taghipour Javi, S., Malekmohammadi, B., & Mokhtari, H. (2014). Application of geographically weighted regression model to analysis of spatiotemporal varying relationships between groundwater quantity and land use changes (case study: Khanmirza Plain, Iran). Environmental Monitoring and Assessment, 186, 3123–3138. https://doi.org/10.1007/s10661-013-3605-5
Article Google Scholar
Tu, J., & Xia, Z. -G. (2008). Examining spatially varying relationships between land use and water quality using geographically weighted regression I: Model design and evaluation. Science of the Total Environment, 407, 358–378. https://doi.org/10.1016/j.scitotenv.2008.09.031
Article CAS Google Scholar
Urquhart, N. S., & Kincaid, T. M. (1999). Designs for detecting trend from repeated surveys of ecological resources. Journal of Agricultural, Biological, and Environmental Statistics, 4, 404–414. https://doi.org/10.2307/1400498
Article Google Scholar
von Brömssen, C., Betnér, S., Fölster, J. & Eklöf, K. (2021). A toolbox for visualizing trends in large-scale environmental data. Environmental Modelling & Software, 136, 104949. https://doi.org/10.1016/j.envsoft.2020.104949
von Brömssen, C., Fölster, J., & Eklof, K. (2023). Data and Script: GWR for Trend Analysis. https://doi.org/10.5281/zenodo.7664622
Article Google Scholar
Wang, D., Yang, Y., Qiu, A., Kang, X., Han, J., & Chai, Z. (2020). A CUDA-based parallel geographically weighted regression for large-scale geographic data. ISPRS International Journal of Geo-Information, 9, 653. https://doi.org/10.3390/ijgi9110653
Article Google Scholar
Weyhenmeyer, G. A., Fröberg, M., Karltun, E., Khalili, M., Kothawala, D., Temnerud, J., & Tranvik, L. J. (2012). Selective decay of terrestrial organic carbon during transport from land to sea. Global Change Biology, 18, 349–355. https://doi.org/10.1111/j.1365-2486.2011.02544.x
Article Google Scholar
Weyhenmeyer, G. A., & Karlsson, J. (2009). Nonlinear response of dissolved organic carbon concentrations in boreal lakes to increasing temperatures. Limnology and Oceanography, 54, 2513–2519. https://doi.org/10.4319/lo.2009.54.6_part_2.2513
Article CAS Google Scholar
Wood, S. N. (2017). Generalized additive models: An introduction with R, Second edition. ed, Chapman & Hall/CRC texts in statistical science. CRC Press/Taylor & Francis Group, Boca Raton.
Yu, D. (2010). Exploring spatiotemporally varying regressed relationships: The geographically weighted panel regression analysis. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 134–139.
Zhang, H., & Mei, C. (2011). Local least absolute deviation estimation of spatially varying coefficient models: Robust geographically weighted regression approaches. International Journal of Geographical Information Science, 25, 1467–1489. https://doi.org/10.1080/13658816.2010.528420
Article Google Scholar

Download references

Funding

Open access funding provided by Swedish University of Agricultural Sciences. This study was funded by the Swedish Research Council for Environment, Agricultural Sciences and Spatial Planning (Formas) (Grant No. 2022–00942).

Author information

Authors and Affiliations

Division of Applied Statistics and Mathematics, Department of Energy and Technology, Swedish University of Agricultural Sciences, PO Box 7032, 750 07, Uppsala, Sweden
Claudia von Brömssen
Section for Geochemistry and Hydrology, Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, PO Box 7050, 750 07, Uppsala, Sweden
Jens Fölster & Karin Eklöf

Authors

Claudia von Brömssen
View author publications
You can also search for this author in PubMed Google Scholar
Jens Fölster
View author publications
You can also search for this author in PubMed Google Scholar
Karin Eklöf
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Claudia von Brömssen: conceptualization, methodology, formal analysis, writing—original draft preparation. Jens Fölster: data curation, writing—reviewing and editing. Karin Eklöf: writing—reviewing and editing.

Corresponding author

Correspondence to Claudia von Brömssen.

Ethics declarations

Ethical approval

All authors have read, understood, and have complied as applicable with the statement on “Ethical responsibilities of Authors” as found in the Instructions for authors and are aware that with minor exceptions, no changes can be made to authorship once the paper is submitted.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 1277 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

von Brömssen, C., Fölster, J. & Eklöf, K. Temporal trend evaluation in monitoring programs with high spatial resolution and low temporal resolution using geographically weighted regression models. Environ Monit Assess 195, 547 (2023). https://doi.org/10.1007/s10661-023-11172-2

Download citation

Received: 02 January 2023
Accepted: 27 March 2023
Published: 10 April 2023
DOI: https://doi.org/10.1007/s10661-023-11172-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Temporal trend evaluation in monitoring programs with high spatial resolution and low temporal resolution using geographically weighted regression models

Abstract

Similar content being viewed by others

Spatial–temporal modellization of the $$\hbox {NO}_{2}$$ concentration data through geostatistical tools

High resolution spatiotemporal modeling of long term anthropogenic nutrient discharge in China

PatagoniaMet: A multi-source hydrometeorological dataset for Western Patagonia

Introduction

Methods

Geographically weighted regression models

Data

Data pre-processing

Basic pre-processing

Detection of series with high variation and outliers

Assessing the geographical heterogeneity of temporal changes

Station-wise centering of data

Model validation

Combining datasets

Nonlinear temporal trends

Results

Data quality assurance and pre-processing

Trends in TOC

Model validation

Combining data from different monitoring programs

Nonlinear trends in TOC

Discussion

GWR as a method for trend evaluation for datasets with a sparse temporal resolution

Pre-processing and model validation

Combining datasets with a low and high temporal resolution

Nonlinear temporal trends

Including other explanatory variables

Large-scale drivers of brownification

Conclusions

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 1277 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation