Brought to you by:
Letter The following article is Open access

A new approach for determining optimal placement of PM2.5 air quality sensors: case study for the contiguous United States

, , and

Published 28 February 2022 © 2022 The Author(s). Published by IOP Publishing Ltd
, , Citation Makoto M Kelp et al 2022 Environ. Res. Lett. 17 034034 DOI 10.1088/1748-9326/ac548f

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1748-9326/17/3/034034

Abstract

Considerable financial resources are allocated for measuring ambient air pollution in the United States, yet the locations for these monitoring sites may not be optimized to capture the full extent of current pollution variability. Prior research on best sensor placement for monitoring fine particulate matter (PM2.5) pollution is scarce: most studies do not span areas larger than a medium-sized city or examine timescales longer than 1 week. Here we present a pilot study using multiresolution dynamic mode decomposition (mrDMD) to identify the optimal placement of PM2.5 sensors from 2000 to 2016 over the contiguous United States. This novel approach incorporates the variation of PM2.5 on timescales ranging from 1 d to over a decade to capture air pollution variability. We find that the mrDMD algorithm identifies more high-priority sensor locations in the western United States than those expected along the eastern coast, where a large number of Environmental Protection Agency (EPA) PM2.5 monitors currently reside. Specifically, 53% of mrDMD optimized sensor locations are west of the 100th meridian, compared to only 32% in the current EPA network. The mrDMD sensor locations can capture PM2.5 from wildfires and high pollution events, with particularly high skill in the west. These results suggest significant gaps in the current EPA monitoring network in the San Joaquin Valley in California, northern California, and in the Pacific Northwest (Idaho, and Eastern Washington and Oregon). Our framework diagnoses where to place air quality sensors so that they can best monitor smoke from wildfires. Our framework may also be applied to urban areas for equitable placement of PM2.5 monitors.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Air pollution consisting of fine particulate matter (PM2.5) presents a major environmental risk to public health, causing 4.1 million premature deaths worldwide in 2019 (Murray et al 2020). Given the financial cost and limited resources allocated for measuring ambient air pollution, however, optimizing sensor placement is a central mathematical challenge. Monitoring PM2.5 is especially difficult as there are many outdoor sources including direct emissions from wildland and agricultural fires (Johnston et al 2012, Cusworth et al 2018), incomplete fuel combustion from vehicles and industrial processes (Bond et al 2007), and secondary chemical formation from gas-phase precursors of anthropogenic (McDuffie et al 2021) or biogenic (Goldstein et al 2009) origin. Furthermore, the formation and transport of PM2.5 are both sensitive to meteorological conditions from the local to synoptic scales (Tai et al 2012), including, for example, convective events and horizontal winds that can disperse pollution as well as precipitation that can immediately remove particles. Here we provide an optimization framework that takes into account the multiscale variability of PM2.5 to determine the placement of a minimum number of air pollution sensors for maximal spatiotemporal coverage.

In the United States, monitoring of air pollution has historically relied on the Environmental Protection Agency (EPA) network of sampling sites (US EPA 2013). These monitoring locations are used to assess local and regional attainment of the National Ambient Air Quality Standard (NAAQS), to analyze air pollution impacts on public health, and to validate satellite measurements and air quality models. However, these sites are not equally distributed across the United States and may not adequately sample the full range of concentrations (Di et al 2019). A greater number of sites are in the eastern United States, along the western coast, and in urban areas, with relatively fewer sites in mountainous regions and rural areas. Due in part to the high cost of maintenance, new monitoring sites are added infrequently (US EPA 2020).

Improvements in low-cost sensor technologies coupled with the rise of citizen science air quality monitoring offers potential for greater spatial coverage of pollution observations, but even so these sensor locations may not provide adequate coverage. Many low-cost sensors can report measurements publicly in real-time (Snyder et al 2013, Kumar et al 2015, Parmar et al 2017). These sensors, deployed by citizen science and crowdsourced organizations such as Purple Air (Barkjohn et al 2021), represent a core component of an internet of things network for monitoring air pollution in cities (Dhingra et al 2019, Toma et al 2019). However, this promising democratization of air pollution monitoring may inadequately capture the spatial distribution and variability of PM2.5. Crowdsourced datasets often rely on volunteers who are responsible for installation and upkeep of each sensor, resulting in deployment in predominantly white areas characterized by higher incomes and levels of education relative to the US census tracts with EPA monitors (deSouza and Kinney 2021). Furthermore, areas with a greater density of low-cost sensors report lower annual-average PM2.5 concentrations than the EPA monitors in all states expect California (deSouza and Kinney 2021). To be sure, such citizen science efforts are well-intentioned, but they exacerbate disparities in the spatial coverage of PM2.5 monitors and limit the pursuit of environmental justice (Sorensen et al 2019).

Few studies have considered both spatial features and temporal dynamics of air pollution in determining the optimal placement of outdoor PM2.5 sensors. While strategies based on computational fluid dynamics can locate optimal placement of indoor sensors (Löhner and Camelli 2005, Waeytens and Sadr 2018), such approaches are infeasible over large atmospheric domains. Mukherjee et al (2020) developed an optimal sensor placement algorithm for mobile monitoring of PM2.5 and SO2 in Atlanta, Georgia, but such an algorithm can be applied for just a short time period (hours to a day). For a case study in Cambridge, UK, Sun et al (2019) created an algorithm that considers areas with PM2.5 hotspots such as high-traffic roads and locations such as hospitals and elderly care homes where more vulnerable populations reside. However, this framework does not guarantee capturing representative PM2.5 concentrations across a broader region. Here we build on prior studies that determined the optimal placement of sensors using modal decomposition techniques in the domains of ocean monitoring (Yildirim et al 2009), fluid dynamics (Bai et al 2017), and neuroscience (Brunton et al 2016). Application of such techniques is yet nascent for air pollution, which is characterized by its own unique multiscale behavior. To our knowledge, there exist no studies of optimal air pollution sensor placement in the United States that span regions larger than an urban area or timescales longer than 1 week.

In this pilot study, we demonstrate a data-driven approach that determines the optimal placement of sensors to capture PM2.5 concentrations and variability across the contiguous United States on timescales ranging from days to over a decade. We employ multiresolution dynamic mode decomposition (mrDMD), which recursively decomposes a dataset into low-rank spatial modes and their temporal Fourier dynamics (Kutz et al 2016, Manohar et al 2019). This algorithm allows us to create a library of modes that not only captures PM2.5 concentrations spatially and temporally on short (daily) and long-term (years to decade) timescales, but also incorporates information from significant transient phenomena, such as wildfires and temperature inversions, that would otherwise be discarded or averaged out using similar data reduction techniques. We emphasize that the current distribution of over 2000 EPA monitors is needed to keep track of adherence to the NAAQS. However, the modeling approach and results presented here may be used to locate gaps in the EPA sensor network and inform locations of future PM2.5 monitors.

2. Data and methods

2.1. PM2.5 dataset

We use a dataset consisting of daily PM2.5 concentrations for the contiguous United States for the period January 2000 to December 2016. The dataset was produced through a data fusion method involving ensemble machine learning that combines surface monitoring measurements, satellite aerosol optical depth, land-use data, and chemical transport model results, among other variables (Di et al 2019, 2021). The high spatial and temporal resolution of this PM2.5 dataset reveals spatial variability across the United States, including a stark east–west gradient, with the eastern United States experiencing relatively higher PM2.5 concentration than the west. Hotspots in the dataset reveal the wide diversity of PM2.5 sources, such as emissions from power plants in the Midwest, organic aerosol produced from biogenic volatile organic compounds in the southeast, and wildfires and associated smoke in the west. We coarsen the dataset's 1 km $ \times $ 1 km resolution to 10 km $ \times \,$ 10 km to yield a spatial grid of 871 $ \times \,$ 413 cells. Missing values and erroneous observations are removed from the dataset by applying a mask to the spatial grid.

2.2. Dynamic mode decomposition (DMD)

To isolate the underlying multiscale variability of PM2.5, we use a DMD modeling approach. The DMD method provides a spatiotemporal decomposition of data into a set of dynamic modes derived from snapshots or measurements of a given system in time. DMD is similar to principal components analysis (PCA) as they are both dimensionality reduction algorithms that take advantage of underlying low-rank features found in large datasets. The resulting modes of these methods reveal the dominant spatial patterns that account for variation in the dataset. However, unlike PCA, which does not model temporal dynamics of time-series data explicitly, DMD computes a set of modes with an exponential temporal dependence whose imaginary components model an oscillating time frequency characterizing the modal dynamics. Importantly, DMD can accommodate heterogenous spatial and temporal sampling. For noisy datasets, DMD can be ensembled in order to produce uncertainty metrics (Sashidhar and Kutz 2021). Many variants of DMD exist (Tissot et al 2014, Noack et al 2015, Askham and Kutz 2017, Li et al 2017, Sashidhar and Kutz 2021), but here we follow Manohar et al (2018), as described in the supplement section 1 (available online at stacks.iop.org/ERL/17/034034/mmedia). We apply the DMD method to the PM2.5 dataset to construct a library of modes that reveals the dominant spatiotemporal patterns in the dataset.

2.3. Multiresolution dynamic mode decomposition (mrDMD)

Recently, a multiresolution extension to the DMD algorithm has been proposed (Kutz et al 2016, Manohar et al 2019), which allows for the incorporation of significant transient phenomenon into the DMD library. Much like a multiresolution analysis using wavelets (Kutz 2013), the mrDMD algorithm recursively removes low frequency, or slowly varying, signals to allow separation of background (i.e. average PM2.5 concentrations) from foreground (i.e. high pollution episodes) in the PM2.5 dataset on varying timescales specified by the user. The mrDMD algorithm is more precise in capturing spatiotemporal variability than methods based on singular value decomposition such as DMD and PCA (Manohar et al 2019).

The number of snapshots M from our dataset are chosen so that the DMD modes provide a full rank approximation of the dynamics observed. Thus, M should be representative of all high- and low-frequency content of the dataset. The DMD algorithm is then applied recursively on the dataset. The slowest m1 modes are removed in the initial pass, and remaining snapshots are divided into M/2 segments and are fed into the DMD algorithm again. The slowest m2 modes are removed, and this process is continued until a termination condition is reached. The mrDMD separates the DMD approximate solution in the first pass as follows:

Equation (1)

where ${b_k}\left( 0 \right)$ is the initial amplitude, $\xi $ is the spatial coordinates of each mode, $\psi _k^{\left( 1 \right)}$ represents the DMD modes (eigenvectors) obtained from the complete snapshot matrix, and $\exp \left( {{\omega _k}t} \right)$ are the corresponding eigenvalues. After the first level of decomposition, the fast modes are again decomposed into two matrices:

Equation (2)

At this level of decomposition, the first matrix $X_{M/2}^{\left( 1 \right)}$ represents slower modes and $X_{M/2}^{\left( 2 \right)}$ represents faster modes, which undergo decomposition again. This process is repeated recursively by removing slower modes in each iteration. At each level of decomposition, the separation of the foreground mode from the background is deemed significant if its eigenvalues exceed a user-set tolerance. Here, we set this tolerance to a standard value of 1 $ \times $ 10−2.

A formal expansion of the mrDMD theory and modeling approach may be found elsewhere (Kutz et al 2016, Manohar et al 2019).

The mrDMD algorithm operates by decreasing the time domain by a factor of two at each successive decomposition level. We apply mrDMD to training windows starting at 11.4 years (M= 4096 d), followed by 13 decomposition levels so that the shortest frequency is daily. This approach thus yields a long-term mode characterizing the average PM2.5 concentrations over 11.4 years, with potential identification of transient pollution events spanning timescales from 5.7 years (2048 d) to 1 d. To analyze all PM2.5 data from 2000 to 2016 employing these decomposition levels, we divide the dataset into two time-windows: January 2000 to March 2011 (first 4096 d) and September 2005 to December 2016 (last 4096 d).

2.4. Optimal sensor placement

We use the matrix libraries containing all DMD and mrDMD modes as tailored basis sets ${\psi _r} \in \,{\mathbb{R}^{n \times r}}$ to optimize for sensor placement. We identify the optimal sensor locations by employing QR pivoting to our DMD and mrDMD basis sets (Heck et al 1998, Manohar et al 2018). QR pivoting is a 'greedy' selection algorithm that is computationally efficient for finding near-optimal sensor locations. Greedy approaches are often favored over other optimization techniques as the true optimal solution often involves a combinatorially intractable optimization.

QR column pivoting identifies rows in the modal library ${\psi _r}$ with the highest two-norm, which corresponds to locations with the largest PM2.5 modal frequencies and therefore greatest variability. The reduced matrix QR factorization with column pivoting decomposes a matrix $A \in \,{\mathbb{R}^{m \times n}}$ into a unitary matrix $Q$, an upper-triangular matrix $R$, and a column permutation matrix ${C^{\text{T}}}$ such that $A{C^{\text{T}}} = { }QR$. Thus, the QR factorization with column pivoting yields $r$ point sensors (pivots) that best sample the $r$ tailored basis modes ${\psi _r}$:

Equation (3)

That is, each QR pivot identifies those spatial locations in the modal library that exhibit the most variability and where sensor placement captures significant pollution episodes above background concentrations.

A formal expansion of the sparse sensor placement approach may be found elsewhere (Kutz et al 2016, Manohar et al 2018, 2019, Brunton and Kutz 2019).

In the supplement, we reconstruct the full field of PM2.5 concentrations across the contiguous United States using measurements from only those sensor locations diagnosed as optimal in the step above. These reconstructions serve as validation of the DMD and mrDMD libraries and the ability of these algorithms to select sensor locations with high information content.

3. Results and discussion

Here we present the mrDMD results and analysis. The modal library of DMD is less interpretable than that of mrDMD, and we present the DMD results and analysis in the supplement.

3.1. mrDMD modes describing PM2.5 variability

The mrDMD modal library reveals that most significant air pollution episodes occur on timescales between 1 week and 1 month, with many of these events arising from wildfires. We diagnose these wildfire modes by visual inspection based on regions with known fire activity and minimal industrial PM2.5 sources. Figure 1 presents the resulting mrDMD modal maps in the time-frequency domain. Figure 1(a) corresponds to the first 11.4 years of the time period (January 2000 to March 2011) and figure 1(b) corresponds to the last 11.4 years (September 2005 to December 2016). The background mode in both training windows corresponds to the mean PM2.5 concentration distribution for that window. Both background modes display a sharp east–west gradient with most of the eastern United States, except the Appalachian Mountains and some remote areas of Maine, having relatively higher PM2.5 concentration than the west. This pattern agrees with the observed annual mean spatial distribution of PM2.5 (Tai et al 2010, Di et al 2019). The background mode for the second half of the training period exhibits smaller modal amplitudes in hotspot regions than the first half. This trend is consistent with Di et al who reported that PM2.5 concentrations decreased noticeably after 2008 due to a combination of the economic recession and emission controls on coal-fired power plants. The 2000–2008 annual average PM2.5 concentration across the contiguous United States was 7.8 μg m−3; the 2009–2016 annual average was 6.2 μg m−3.

Figure 1.

Figure 1. Maps of mrDMD PM2.5 modes from 2000 to 2016. The mrDMD time windows are for January 2000 to March 2011 (A) and September 2005 to December 2016 (B). The left axis expresses the decomposition level and its related time frequency, such that the bottom row corresponds to the average background mode of PM2.5 over 4096 d (∼136.5 months), with each successively higher row corresponding to pollution episodes lasting half the number of days of the row below. Colored boxes indicate a mrDMD mode that exhibits significant variability above the background mode; otherwise, the boxes are left blank. No significant pollution modes less than 1 week are found and thus these rows are removed from the figure. Modal maps of the background averages and examples of significant pollution episodes are shown in the margins. Arrows point to the time periods of the corresponding mrDMD modal maps. All mrDMD modes are on the same amplitude scale shown to the left of the figure.

Standard image High-resolution image

The mrDMD method also identifies relatively short-term air pollution episodes on timescales of less than a year. Most episodes occur on timescales between 1 week and 1 month. However, we find an especially long-lived mode over Los Angeles and the San Joaquin Valley, California, lasting for 256 d from September 2007 to May 2008. This mode may correspond to intense winter PM2.5 due to temperature inversions trapping emissions in the southern California basin. From 2007 to 2008, the San Joaquin Valley reported significantly higher PM2.5 concentrations than are typical, with 66 d above the 24 h 35 μg m−3 standard in 2008 (US EPA 2021). The 2007 Murphy Complex Fire and other wildfires in Idaho are captured as a significant mode occurring on a timescale of 64 d from June to August. Although the lifetime of PM2.5 in the atmosphere is ∼1 week and is sensitive to meteorology such as rainout, no pollution modes occur on timescales less than a week. We attribute the absence of modes on this timescale to the eigenvalue tolerance used in the mrDMD algorithm. If we tighten the tolerance, then more significant modes would be found at daily timescales, but such a tolerance would result in higher-frequency (i.e. noisy) and less-interpretable modal maps, making it more difficult to discern prolonged pollution events. Overall, we detect 72 significant pollution modes during the first time window and 77 during the second time window. We find 27 wildfire modes (38% of all modes) in the first window and 46 wildfire modes (60% of all modes) in the second window.

Based on figure 1, we conclude that the mrDMD framework can distinguish high pollution episodes from the mean PM2.5 concentrations through time. The mrDMD framework provides a background mode that agrees with Di et al (2019), but also contains modal snapshots of events that stand out from this background.

3.2. Optimal sensor locations

By applying QR pivots to our mrDMD modes, we can identify a set of 1369 optimal sensor locations across the contiguous United States, a number of roughly the same magnitude as the number of EPA sensors. Our method assumes that air pollution spatiotemporal patterns and concentration variability from 2000 to 2016 are representative of future conditions. We argue that this assumption is reasonable, as discussed below. Figure 2 shows that the mrDMD sensor distribution is more clustered in California and across the west, along the Gulf Coast, and in the Industrial Midwest and Northeast, relative to that of the EPA. The mrDMD sensors thus reflect those regions characterized by higher PM2.5 variability, with few sensors in the central United States. These results suggest that the west is a driver of PM2.5 variability in the United States due to periodic wildfires, distributed urban sources isolated by rugged terrain, and reduced meteorological mixing. For example, the mrDMD algorithm yields a cluster of sensors in Los Angeles and Salt Lake City as these urban areas are basins surrounded by mountains where the meteorological conditions favor the production and trapping of PM2.5 pollution. Additionally, we present a sparse distribution of sensors that can capture the overall spatiotemporal variability of PM2.5 in the supplement.

Figure 2.

Figure 2. PM2.5 sensor locations. Distribution of sensor locations in the EPA monitoring network compared to those identified as optimal by the mrDMD algorithm. The EPA sites are shown at original scale while mrDMD locations are projected on a 10 km $ \times \,$ 10 km grid. Small gray patches in the Southwest and Pacific Northwest reflect grid cells with missing or corrupted data that were discarded from this analysis.

Standard image High-resolution image

The mrDMD method identifies significantly more sensors in the San Joaquin Valley and in northern California, and the Pacific Northwest than the current EPA sensor network (figure 3). In particular, the San Joaquin Valley dominates the modal signatures of the mrDMD library (figure 1) and is characteristic of high PM2.5 variability due to large-scale agricultural activities, smoke from wildfires, and frequent temperature inversions (Wolyn and McKee 1989, Wei et al 2013, Rémy et al 2017). The greater density of mrDMD sensors in northern California and the Pacific Northwest (especially in Idaho and the eastern parts of Oregon and Washington) can be attributed to their sensitivity to smoke PM2.5 from wildfires. Indeed, we find that wildfires constitute 60% (46 out of 77) of significant pollution modes in the mrDMD library from September 2005 to December 2016. Also, mrDMD identifies locations along the Gulf Coast (i.e. the eastern shoreline of Texas) that are not covered by the EPA network (figure S1).

Figure 3.

Figure 3. PM2.5 sensor locations in the west. Same as figure 2 but zoomed in on the western United States.

Standard image High-resolution image

Our results suggest that the EPA monitoring network does not optimally capture the spatial patterns and variability of PM2.5 pollution in the contiguous United States today. We posit that the current locations of EPA sensors were informed by the legacy of air pollution from the 1980s to 1990s when the eastern United States had a higher regional PM2.5 background with large variability in PM2.5 concentrations arising from interactions between meteorology and emissions from coal-fired power plants, diesel emissions, and industrial activities (Bachmann 2007). However, with the passage of the Clean Air Act (and subsequent amendments), stricter air quality regulations have reduced these emissions (Bachmann 2007). Urban air pollution levels in turn declined nationally while at the same time the frequency and magnitude of wildfires in the west have increased significantly (Abatzoglou and Williams 2016). Our mrDMD sensor locations underscores this shift to the west as 53% of sensors (726 out of 1369) are found west of the 100th meridian (Seager et al 2017) compared to 32% (694 out of 2156) for the current network of EPA monitors. We emphasize that the current distribution EPA monitors is needed to keep track of adherence to the NAAQS, especially in densely populated regions. We do not advocate for the removal of existing PM2.5 sensors, but rather for the deployment of more sensors in the west (especially in the San Joaquin Valley and Pacific Northwest) to capture high-pollution variability.

4. Conclusions

We present a pilot study using a data-driven approach to identify the optimal placement of PM2.5 sensors to capture air pollution over the contiguous United States at 10 km $ \times $ 10 km resolution from 2000 to 2016. While earlier studies are limited in spatial and temporal scope, this is the first national-scale study that diagnoses the optimal placement of PM2.5 sensors. Our method employs multiresolution dynamic mode decomposition (mrDMD), an approach that incorporates the variability of PM2.5 on timescales ranging from 1 d to over a decade. We identify 72 significant pollution modes from January 2000 to March 2011 and 77 pollution modes from September 2005 to December 2016, ranging in duration from 8 to 256 d. Smoke from wildfire events comprise 38% and 60% of these modes during these respective time frames. Our results underscore the large impact that wildfires have on PM2.5 concentrations and variability, especially in the west.

The mrDMD modal library identifies 1369 optimal sensor locations in the western United States, with significantly lower density of sensors in the eastern coast. We show that 53% of mrDMD sensor locations are found west of the 100th meridian compared to only 32% for the current network of EPA monitors. We hypothesize that the current network of EPA monitors was designed to capture PM2.5 spatial patterns during the 1980s–1990s when urban air pollution and high background concentrations in the East, coupled with meteorological factors, represented a significant driver of PM2.5 variability. However, with reduced emissions from increased air quality regulations, our results suggest a shift to the west as the main driver of PM2.5 variability across the contiguous United States. This shift can be traced to periodic wildfires and to distributed urban sources isolated by rugged terrain. We suggest that the mrDMD sensors identified here may help capture high PM2.5 episodes not adequately monitored by EPA sensors. We acknowledge, however, that our method assumes that air pollution spatiotemporal patterns and concentration variability from 2000 to 2016 are representative of future conditions. We argue that this assumption is reasonable, as we expect wildfires to continue to be a significant source of PM2.5 and that our optimal sensor locations capture that variability.

More specifically, our mrDMD results recommend adding PM2.5 sensors in the San Joaquin Valley in California, northern California, and in the Pacific Northwest (Idaho and the eastern parts of Oregon and Washington). Agricultural activities, temperature inversions, and fires drive pollution variability in the San Joaquin Valley and wildfire smoke drives variability in northern California and the Pacific Northwest. All three regions have large populations of non-white Hispanic agricultural workers who are disproportionately affected by an underserved PM2.5 monitoring network (Garcia 2007, Chandrasekaran 2021, Marlier et al 2022). The disparity in sensor locations is especially apparent in the San Joaquin Valley where the EPA network of monitors is relatively sparse (figure 3).

Smoke from wildfires represents a significant human health and ecosystem hazard. With the frequency and magnitude of such fires predicted to increase in the future, identifying where to place sensors to track smoke plumes is becoming especially important (McClure and Jaffe 2018, David et al 2021). The mrDMD framework skillfully diagnoses where to place air quality sensors that can best monitor smoke plumes, especially those from large fires. We posit that such a sensor distribution may help stakeholders weigh the benefits and hazards of prescribed fires.

Although we identify optimal PM2.5 sensor locations on a national scale, our framework may also be impactful for regional and local domains. With the proliferation of low-cost sensors and citizen science efforts, the techniques provided here could help inform high utility areas to place sensors. For example, the original 1 km $ \times $ 1 km PM2.5 dataset by Di et al (2021) may be applied to investigate sensor placement at the regional and urban scale. Extensions to the DMD framework could incorporate sensor measurements with different signal-to-noise ratios—i.e. high-cost and low-cost sensors (Clark et al 2020)—or could consider cost-constraining functions that optimize sensor placement based on topography, population density, or metrics related to environmental justice (Clark et al 2019).

Acknowledgments

This work was partly funded by the US Environmental Protection Agency (EPA) Grant 83587201. It has not been formally reviewed by the EPA. The views expressed in this document are solely those of the authors and do not necessarily reflect those of the EPA. We thank Yaguang Wei for technical assistance and John Bachmann for valuable discussion.

Data availability statement

The DMD algorithm, mrDMD algorithm, and 10 km × 10 km PM2.5 dataset from 2000 to 2016 that support the findings of this study are openly available at the following URL/DOI: https://doi.org/10.5281/zenodo.5809461.

Please wait… references are loading.
10.1088/1748-9326/ac548f