Hough-Transform-Based Interpolation Scheme for Generating Accurate Dense Spatial Maps of Air Pollutants from Sparse Sensing

Nebenzal, Asaf; Fishbain, Barak

doi:10.1007/978-3-319-89935-0_5

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 507))

Included in the following conference series:

International Symposium on Environmental Software Systems


Abstract

Air pollution is a significant health risk factor and has many negative effects on the environment. Hence the need to study and assess air quality. Today, air-pollution assessment is mostly based on data acquired from Air Quality Monitoring (AQM) stations. These AQM stations provide continuous measurements and are considered accurate; however, they are expensive to build and operate and are therefore sparsely deployed. To cope with this limitation, the information obtained from those measurements is typically generalized with interpolation methods such as Inverse Distance Weighted (IDW) interpolation or Kriging. Yet the mathematical basis of those schemes dictates that the extreme pollution values are obtained at the measuring points. In addition, they consider neither the location of the pollution source nor any physicochemical characteristics of the pollutant, and hence do not reveal the real spatial air-pollution patterns. This research introduces a new interpolation scheme that breaks the interpolation process into two stages. In the first stage, the source of pollution and its estimated emission rate are inferred through a detection procedure based on the Hough Transform. In the second stage, dense spatial pollution maps are created from the detected source location and emission rate. The method requires a dispersion model to be assumed; any model can be used, however sophisticated. Spatial maps created with simplified dispersion models in a computational simulation show that the suggested interpolation scheme creates more accurate and more physically reasonable maps than the state-of-the-art methods.




1 Introduction

Air pollution is a significant risk factor for multiple health conditions, including eye irritation, breathing difficulties, lung cancer, heart disease and respiratory infections [1]. In addition, air pollution has many negative effects on the environment, such as decreased visibility, acid rain, global warming, climate change, water quality deterioration and ecosystem destruction [2]. Hence the need to study and assess the characteristics, dispersion patterns and behavior of air pollution.

Today, numerous air-pollution studies are based on data acquired from Air Quality Monitoring (AQM) stations [3]. However, AQM stations are typically sparsely distributed, mainly near main roads, industrial plants or highly populated areas [4]. Thus, the AQM network has a limited ability to account for the spatial variability of pollution levels in heterogeneous regions, such as urban areas, which in turn makes exposure assessment a difficult task [5]. To cope with the sparsity of the measurements, the information obtained from them is often generalized with mathematical methods to improve the spatio-temporal coverage. To this end, interpolation schemes are sought.

Interpolation is a mathematical method of constructing a continuous function that attains the measured values (or values close to them) at the measuring points. Environmental interpolation is based on the assumption that data attributes are continuous over space and spatially dependent [6]. Broadly speaking, interpolation methods can be divided into deterministic and geostatistical methods. The former include Inverse Distance Weighted (IDW), Nearest Neighbor (NN) and radial basis functions [6], while the latter include, for example, various types of Kriging [7]. In what follows, we focus on IDW and ordinary Kriging, owing to their frequent use in creating spatial maps.

Many studies in the field of air pollution modeling have used IDW or Kriging to create dense spatial maps of air pollution. IDW, for example, was applied to examine the relationship between low birth weight and air pollution exposure during pregnancy [8]. In that research, IDW interpolation was used to estimate PM10 levels at the expectant mothers' home addresses. Clark et al. [9] examined the effect of early-life exposure to air pollution on the development of childhood asthma, applying IDW interpolation to estimate the average exposure level of an area. Trujillo-Ventura and Ellis [10] introduced a multi-objective optimization of AQM networks, in which a Kriging interpolation scheme was applied to create dense spatial maps. Sarigiannis and Saisana [11] used Kriging interpolation to create pollution maps of CO and O3 as an additional input to their multi-objective optimization scheme, which was based on remote sensing satellites.

IDW and Ordinary Kriging are both well-known and widely used interpolation methods. However, these methods are not appropriate for creating dense spatial maps of air pollution, for two reasons. First, the mathematical basis of those schemes dictates that every interpolated value over the study area is essentially a weighted average of the values at the measurement points; thus, extreme values cannot be obtained anywhere other than at the measuring points. Second, these methods do not consider the location of pollution sources or any physicochemical characteristics of the pollution. Hence, the resulting dense pollution maps fall short of accurately describing the real spatial patterns of pollution. Taking these factors into account in the interpolation process is expected to yield more accurate interpolation methods.
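The weighted-average property can be checked numerically. The following is a minimal sketch, not the implementation used in any of the cited studies: it assumes the common IDW form with weights 1/d^p, and the function name `idw`, the exponent `power=2.0` and the evaluation grid are illustrative choices. The three readings are those of the Fig. 1 example.

```python
import math

def idw(x, y, samples, power=2.0):
    """Inverse Distance Weighted estimate at (x, y).

    samples: list of (xs, ys, value) tuples.
    power:   distance exponent (illustrative default, not taken from the paper).
    """
    num = 0.0
    den = 0.0
    for xs, ys, value in samples:
        d = math.hypot(x - xs, y - ys)
        if d == 0.0:
            return value  # exactly on a sensor: return its reading
        w = 1.0 / d ** power
        num += w * value
        den += w
    return num / den

# Sensor positions and readings of the Fig. 1 example.
samples = [(1.0, 3.0, 33.0), (2.0, 4.0, 29.0), (3.0, 3.0, 30.0)]

# Evaluate on an arbitrary 21 x 21 grid covering [0, 5] x [0, 5].
grid = [idw(0.25 * i, 0.25 * j, samples) for i in range(21) for j in range(21)]

lo = min(v for _, _, v in samples)
hi = max(v for _, _, v in samples)
tol = 1e-9  # allow for floating-point rounding
within_bounds = all(lo - tol <= v <= hi + tol for v in grid)
```

Because every estimate is a convex combination of the readings, `within_bounds` holds wherever the grid is placed, so the map can never peak at an unsampled source location.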

This research introduces a Hough Transform-Based Interpolation (HTBI) method, which generates accurate dense pollution maps by finding the sources' locations and using an air pollution dispersion model. The Hough Transform is a mathematical method, originating in image processing, for detecting geometric shapes such as lines, circles or ellipses [12]. The main idea is to convert the representation of the shape from the Cartesian (x, y) coordinate system to a parametric space in which the feature of interest is best represented. In this research, a feature space that best represents the source location is devised.

The method consists of two phases. First, based on ambient concentrations and an assumed dispersion model, the HTBI detects the sources' emission rates and locations. Then, using this information, the interpolation scheme builds the continuous pollution field. The suggested HTBI scheme places no constraint on the assumed dispersion model. Hence, any dispersion model found in the literature (e.g., [13,14,15]), however sophisticated, can be incorporated into the suggested scheme.

2 Methodology

2.1 Notation

The following notation facilitates the description of the method. Let $ \{ S\} $ be a set of sources of a specific pollutant, with emission rates $ \{ Q\} $. Let A be the specific pollutant's continuous signal generated by $ \{ S\} $, defined over a geographical area $ \Omega $. The sources $ \{ S\} $ are located at $ \{ \gamma \} \subset\Omega $. Let $ \{ a\} $ be a finite set of samples of the signal $ A $, taken at locations $ \{ \omega \} \subset\Omega $. Interpolation aims at estimating A over the entire space $ \Omega $, based on the set of samples $ \{ a\} $. This is achieved here by first finding the sources' locations, $ \{ \gamma \} $. Note that the discussion here is limited to the interpolation of a single pollutant, i.e., the dense map of a specific pollutant is generated from a set of sparse measurements of the same pollutant.

2.2 Interpolation Scheme

Each sample $ a_{i} \in \left\{ a \right\} $ represents a measurement at $ \omega_{i} $. Without loss of generality, we order $ \left\{ S \right\} $ and $ \left\{ Q \right\} $ so that $ Q_{k} $ is the emission rate of source $ S_{k} $. Each $ a_{i} $ is a weighted combination of the contributions from all the sources. Assuming a dispersion model, let $ \vec{M}_{i} $ be the vector whose k-th element, $ M_{i,k} $, is the decay coefficient applied to the emission of source k, $ Q_{k} $, at location i. Sensor i's measurement, $ a_{i} $, is then the sum of all sources' contributions at i and is given by:

$$ a_{i} = \vec{M}_{i} \cdot \overrightarrow {Q}^{T} = \sum\limits_{k} {M_{i,k} Q_{k} } $$

(1)

Consequently, arranging the set {a} as a vector, all sensors' measurements can be represented by the following matrix product:

$$ \vec{a} = M \cdot \overrightarrow {Q}^{T} $$

(2)
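As an illustration of Eq. (2), the sketch below builds a decay matrix for several sources and computes the resulting sensor readings. It assumes the exponential isotropic decay of Eq. (8) as the dispersion model; the function names, the second source, the emission rates and the value `lam=0.1` are hypothetical and chosen only for the example.

```python
import math

def decay_matrix(sensors, sources, lam):
    """M[i][k] = exp(-lam * distance(sensor i, source k)), using the decay of Eq. (8)."""
    return [[math.exp(-lam * math.dist(s, g)) for g in sources] for s in sensors]

def measurements(M, Q):
    """a_i = sum_k M[i][k] * Q[k], i.e. the matrix product of Eq. (2)."""
    return [sum(m_ik * q_k for m_ik, q_k in zip(row, Q)) for row in M]

sensors = [(1.0, 3.0), (2.0, 4.0), (3.0, 3.0)]  # sensor locations (omega)
sources = [(1.0, 1.0), (4.0, 2.0)]              # hypothetical source locations (gamma)
Q = [40.0, 10.0]                                # hypothetical emission rates

M = decay_matrix(sensors, sources, lam=0.1)
a = measurements(M, Q)
```

Each row of `M` belongs to one sensor and each column to one source, so swapping in another dispersion model only changes how the entries of `M` are computed.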

Given [M], we assume that there exists a matrix E, which satisfies:

$$ \overrightarrow {Q} = \left[ E \right] \cdot \vec{a}^{T} $$

(3)

To find Q and $ \gamma $, a search over the entire $ \Omega $ is suggested. To this end, $ \Omega $ is divided into N disjoint catchments. We assume that each catchment, $ C_{n} \subseteq\Omega $, is small enough that the pollution is uniform over it. For each catchment, an estimated emission rate $ \hat{Q}^{i}_{n} $ is calculated from the measurement of a single sample $ a_{i} $, where $ e_{i} $ is a single row of $ E $:

$$ \hat{Q}_{n}^{i} = e_{i} \cdot a_{i} $$

(4)

Thus, $ \hat{Q}_{n}^{i} $ is the estimated emission rate of the single source S, had it been located at $ C_{n} $, based on the single measured sample $ a_{i} $.
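For a single sensor, the per-catchment estimate can be sketched as follows. The sketch assumes the exponential decay model of Eq. (8), for which inverting the model gives a factor of exp(λr); the square grid of catchment centres at integer coordinates, the function name `estimate_map` and the value `lam=0.1` are illustrative assumptions.

```python
import math

def estimate_map(sensor_xy, reading, grid_size, lam):
    """Estimated emission rate for every catchment, from one sensor reading.

    For the decay a = Q * exp(-lam * r), the inverse is Q_hat = a * exp(lam * r),
    where r is the distance between the sensor and the candidate catchment.
    """
    est = {}
    for cx in range(1, grid_size + 1):
        for cy in range(1, grid_size + 1):
            r = math.dist(sensor_xy, (cx, cy))
            est[(cx, cy)] = reading * math.exp(lam * r)
    return est

# Sensor 1 of the Fig. 1 example: reading 33 at catchment (1, 3).
est1 = estimate_map((1, 3), 33.0, grid_size=4, lam=0.1)
```

The estimate equals the reading itself at the sensor's own catchment and grows with distance, since a more distant source must emit more to produce the same reading.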

The same process is applied for all $ C_{n} $ for each of the sensors:

$$ \overrightarrow {\hat{Q}}_{n} = [E] \cdot \vec{a}^{T} $$

(5)

Applying Eq. (5) gives each $ C_{n} $ its own set of estimates, $ \overrightarrow {\hat{Q}}_{n} $, one per sensor. The standard deviation (STD) of these estimates is computed for each catchment, and the catchment with the lowest STD is taken as the approximate location of S. Once the source location, $ \gamma $, is obtained, the emission rate of S is estimated by the average of that catchment's estimates:

$$ \widehat{Q} = \frac{1}{I}\sum\limits_{i = 1}^{I} {\hat{Q}_{n}^{i} } $$

(6)

where I denotes the number of sensors and n is the index of the selected catchment.

Given the estimated emission rate, $ \widehat{Q} $, of the source S, its estimated location, $ \gamma $, and the dispersion model M, we can now estimate the dense pollution map over $ \Omega $:

$$ C_{n} = \overrightarrow {M} \cdot \widehat{Q} $$

(7)
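The map-generation step of Eq. (7) can be sketched as below, again assuming the exponential decay of Eq. (8) and a square grid of catchment centres at integer coordinates. The function name `dense_map` and the values passed to it are hypothetical.

```python
import math

def dense_map(source_xy, q_hat, grid_size, lam):
    """Predicted pollution level at every catchment centre.

    Applies the assumed decay model to the estimated emission rate q_hat
    of a source located at source_xy.
    """
    return {
        (cx, cy): q_hat * math.exp(-lam * math.dist(source_xy, (cx, cy)))
        for cx in range(1, grid_size + 1)
        for cy in range(1, grid_size + 1)
    }

field = dense_map(source_xy=(1, 1), q_hat=40.0, grid_size=4, lam=0.1)
peak = max(field, key=field.get)  # catchment with the highest predicted level
```

Unlike a weighted average of the readings, this map peaks at the estimated source location, whether or not a sensor is placed there.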

The process is illustrated in the simple example of Fig. 1, where three sensors are deployed in a region with one source (see Fig. 1a). While the catchments can assume any geographical region and shape, for the sake of simplicity the region, Ω, is divided into square catchments, forming a square grid. Sensor 1, which measures a pollution level of 33 (i.e., $ a_{1} = 33 $), is located at catchment (1, 3); Sensor 2, located at (2, 4), measures 29; and Sensor 3, at (3, 3), measures 30. Keeping in mind that the source's location, $ \gamma $, is unknown, Fig. 1b demonstrates the execution of Eq. (4): each catchment is assigned the emission rate the source would have if it were located in that catchment, given Sensor 1's measurement and an exponential isotropic decay dispersion model with an extinction coefficient $ \lambda $. That is, for r, the Euclidean distance from the source, the pollution level at each location on the map is given by [16]:

$$ a_{i} = Q \cdot e^{ - \lambda |r|} $$

(8)

If the source were located at (2, 2), the estimated emission rate, $ \widehat{Q} $, based on Sensor 1's measurement, would be 38. If the source were located at (1, 4), then $ \widehat{Q} $, according to Sensor 1, would be 47.3. Figures 1c and d are the estimation maps for Sensor 2 and Sensor 3, respectively, generated in the same fashion as Fig. 1b.

Assuming the dense pollution maps are a collection of isolines, the emission rates estimated from the three sensors should agree at one grid location [17]. To evaluate the agreement, we compute, in each $ C_{n} $, the standard deviation of the three sensors' estimates. The lower the STD, the higher the agreement. This is illustrated in Fig. 1e. The smallest STD is indeed obtained at location (1, 1), where, in this example, the source is located.
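The agreement test described above can be sketched end to end. This is a minimal illustration under stated assumptions rather than the authors' code: it uses the exponential decay of Eq. (8), a 4 x 4 grid of integer catchment centres, a hypothetical `lam=0.1`, and synthetic noise-free readings generated from a known source so that the outcome can be checked. The population standard deviation is used; any consistent STD definition selects the same catchment.

```python
import math
import statistics

def locate_source(sensors, readings, grid_size, lam):
    """Return (catchment, q_hat) for the cell whose per-sensor estimates agree best."""
    best_cell, best_std, best_q = None, math.inf, None
    for cx in range(1, grid_size + 1):
        for cy in range(1, grid_size + 1):
            # One emission estimate per sensor, assuming the source sits at (cx, cy).
            est = [a * math.exp(lam * math.dist(s, (cx, cy)))
                   for s, a in zip(sensors, readings)]
            std = statistics.pstdev(est)
            if std < best_std:
                # Lowest spread so far; the emission estimate is the mean of the votes.
                best_cell, best_std, best_q = (cx, cy), std, statistics.fmean(est)
    return best_cell, best_q

# Synthetic check: readings generated from a known source, without noise.
lam, true_xy, true_q = 0.1, (1, 1), 40.0
sensors = [(1, 3), (2, 4), (3, 3)]
readings = [true_q * math.exp(-lam * math.dist(s, true_xy)) for s in sensors]

cell, q_hat = locate_source(sensors, readings, grid_size=4, lam=lam)
```

In this noise-free setting the search recovers the true catchment and emission rate. With noisy readings the minimum STD is no longer zero, which is why robustness to noise has to be examined separately.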

3 Results and Discussion

3.1 Dispersion Models

As mentioned earlier, M represents the pollution decay function of the dispersion model. The suggested scheme, HTBI, places no constraint on the dispersion model used. It can be any model, as long as it allows the expected pollution level to be computed at any given location on the map, given the emission rate Q and all other meteorological parameters required by the specific model in use. In this research, two models were used: the isotropic decay dispersion model described above [16] (Eq. (8)), and the well-known Gaussian Plume Dispersion (GPD) model [18]:

$$ \begin{aligned} & a_{i} \left( {x,y,z} \right) = \frac{Q}{{2\pi \sigma_{y} \sigma_{z} \bar{u}}}\exp \left( { - \frac{{y^{2} }}{{2\sigma_{y}^{2} }}} \right) \\ & \quad \quad \quad \quad \quad \quad \quad \cdot \left[ {\exp ( - \frac{{\left( {z - H} \right)^{2} }}{{2\sigma_{z}^{2} }}) + \exp ( - \frac{{\left( {z + H} \right)^{2} }}{{2\sigma_{z}^{2} }})} \right] \\ \end{aligned} $$

(9)

where x, y and z are the downwind, crosswind and vertical distances of $ a_{i} $ from the source, respectively; $ \bar{u} $ is the time-averaged wind speed at the height of release H; and σ_y and σ_z are the standard deviations of the crosswind and vertical Gaussian distributions of the pollutant concentration, respectively. The model also assumes full reflection from the ground.
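Eq. (9) translates directly into a short function. The sketch below assumes that σ_y and σ_z have already been evaluated at the downwind distance x, since the text does not specify how they depend on x, and that all inputs use consistent units. The function name and the numerical values in the usage lines are hypothetical.

```python
import math

def gaussian_plume(y, z, Q, u, H, sigma_y, sigma_z):
    """Concentration according to Eq. (9), with full ground reflection.

    y, z:             crosswind and vertical distances from the source
    Q:                emission rate
    u:                time-averaged wind speed at the release height
    H:                height of release
    sigma_y, sigma_z: dispersion parameters, already evaluated at the
                      downwind distance x (supplied by the caller)
    """
    crosswind = math.exp(-y ** 2 / (2.0 * sigma_y ** 2))
    vertical = (math.exp(-(z - H) ** 2 / (2.0 * sigma_z ** 2))
                + math.exp(-(z + H) ** 2 / (2.0 * sigma_z ** 2)))
    return Q / (2.0 * math.pi * sigma_y * sigma_z * u) * crosswind * vertical

# Illustrative values only: ground-level concentration on and off the plume axis.
on_axis = gaussian_plume(y=0.0, z=0.0, Q=1.0, u=4.0, H=120.0, sigma_y=80.0, sigma_z=60.0)
off_axis = gaussian_plume(y=100.0, z=0.0, Q=1.0, u=4.0, H=120.0, sigma_y=80.0, sigma_z=60.0)
```

The second exponential in the vertical term is the mirror-image source that models full reflection from the ground.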

3.2 Computational Simulation

To generate a continuous pollution field, the two dispersion models described above were used. The model specifications are: Q = 8 ton/h; wind speed (GPD model): 4 m/h; wind direction (GPD model): 285°; effective stack height: 120 m.

The continuous fields were sampled by the set of sensors shown in Fig. 2. To simulate real conditions, additive white Gaussian noise at a level of 10% (a Signal to Noise Ratio (SNR) of 10 dB) was added to the sensors' readings. Each sensor thus reports the ambient level at its location, as derived from the dispersion model, with noise added. See Table 1 for the ambient data measured by each sensor for the radial and the GPD models.

Table 1. Ambient data measured by the sensors (units are in µg/m³) for the radial and the GPD dispersion models


Using only the noisy readings obtained from the sensors, {a}, the source’s location is estimated and then the dense pollution maps are created.
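The noisy sampling step can be sketched as follows. The text does not state how the signal power is defined, so the sketch assumes it is the mean squared reading over all sensors; this choice, the function name `add_awgn` and the fixed seed are assumptions made for illustration, and the three readings are taken from the Fig. 1 example rather than from Table 1.

```python
import math
import random

def add_awgn(readings, snr_db, rng):
    """Add zero-mean white Gaussian noise at a given SNR (in dB).

    Assumption: signal power is the mean squared reading over all sensors,
    and noise power = signal power / 10**(snr_db / 10).
    """
    signal_power = sum(a * a for a in readings) / len(readings)
    noise_power = signal_power / 10.0 ** (snr_db / 10.0)
    sigma = math.sqrt(noise_power)
    return [a + rng.gauss(0.0, sigma) for a in readings]

rng = random.Random(0)  # fixed seed so the sketch is reproducible
noisy = add_awgn([33.0, 29.0, 30.0], snr_db=10.0, rng=rng)
```

Passing the random generator explicitly keeps repeated experiments at different noise levels reproducible.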

The results obtained for the radial dispersion model (Eq. (8)) are displayed in Fig. 3. In the true field, the highest ambient pollution level is located at the source and decays exponentially with distance from it. However, both IDW and Kriging create a pollution map in which the maximum pollution level is obtained at the sensor closest to the source's location, and the level decays away from that sensor, even as the distance to the source decreases (Fig. 3(a) and (b), respectively). HTBI, on the other hand, finds the correct source location and then computes an accurate dense pollution map (Fig. 3(c)).

The interpolation results for the GPD model are presented in Fig. 4. As neither IDW nor Kriging considers physicochemical characteristics or atmospheric conditions, the maximum of their dense pollution maps is found at the closest sensor downwind of the source's location (Fig. 4(a) and (b), respectively). Moreover, the created maps show a roughly radial dispersion around this point, which does not reflect the true conditions, owing to the wind. HTBI, as presented in Fig. 4(c), does create a dense spatial map that complies with the Gaussian plume behavior. This is because the HTBI method incorporates the Gaussian model, as it can incorporate any dispersion model.

The suggested algorithm is deterministic in nature, i.e., for the same input, the system produces the same output. Therefore, the uncertainty in the system stems from the uncertainty of the measurements, i.e., measurement noise [19,20,21,22]. The results of Figs. 3 and 4 were obtained at a noise level of 10% (SNR of 10 dB). To evaluate the robustness of the algorithm, different noise levels were tested. The radial model (Eq. (8)) remained stable at noise levels of up to 50% (SNR of 3 dB). The robustness of the Gaussian model (Eq. (9)) depended on the catchment size. For larger catchments (e.g., a cell size of 40 m²), the algorithm remained stable at noise levels of up to 10% (SNR of 10 dB). However, when the spatial resolution was increased to a cell size of 20 m², the HTBI was more sensitive to measurement noise and produced the correct source location and interpolation maps only for noise levels of up to 5% (SNR of 13 dB). For lower SNR values, the algorithm had difficulty locating the source and, consequently, generating the dense pollution maps.
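The percentage and decibel figures quoted for the noise levels agree with each other if each percentage is read as the ratio p of noise power to signal power. That reading is an assumption, since the text does not define the percentage explicitly. Under it:

$$ \mathrm{SNR}_{\mathrm{dB}} = 10\log_{10} \frac{P_{\mathrm{signal}}}{P_{\mathrm{noise}}} = - 10\log_{10} p $$

$$ p = 0.10 \Rightarrow 10\;\mathrm{dB}, \qquad p = 0.50 \Rightarrow \approx 3\;\mathrm{dB}, \qquad p = 0.05 \Rightarrow \approx 13\;\mathrm{dB} $$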

4 Conclusions

IDW and Ordinary Kriging are well-known and commonly used interpolation methods for creating dense spatial maps. However, they consider neither the physicochemical properties of the pollutant nor the source location, and are therefore not accurate for this task. In this research, we introduced the Hough Transform-Based Interpolation (HTBI), a two-phase interpolation scheme that addresses these limitations. In the first phase, the HTBI detects the sources' locations and their estimated emission rates. In the second phase, this information is used to build a dense spatial pollution map. The method incorporates an air-pollution dispersion model into its calculations; this may be any dispersion model found in the literature. Comparing the dense pollution maps created by the HTBI, IDW and Ordinary Kriging shows that the HTBI creates spatial maps that represent the true pollution field better, and it is thus a more accurate and sensible interpolation scheme. However, this work presented a computational simulation of a simple configuration with only one emission source. Applying the method to a real-world situation is challenging: air pollution is emitted from many sources, including industrial zones and transportation (line sources). Hence, HTBI should be adapted to cope with the more complex problem of multi-source detection.

Despite the above, HTBI can be used in its current form, for single-source detection. We can envisage at least two scenarios in which such a configuration applies. The first is when a single source can be identified, for example, when considering SO₂, which is emitted mainly from factories, in a study area that contains only a single industrial zone. The second is a leak whose source needs to be identified. In these cases, HTBI will be able to produce better and more accurate spatial pollution maps than the existing methods.

Ongoing work focuses on implementing HTBI in exactly such scenarios.

Another aspect this work sheds light on is the number of sensors and the way they are distributed over the study area. Clearly, the higher the number of sensors, the easier it is to locate the source. Further research is needed to find the optimal number of sensors for a given area. The parameters that should be considered include the size of the study area, its characteristics (an open area is not the same as a crowded urban area), and the coverage capacity and accuracy of the sensors.

References

1. Heroux, M.E., et al.: Quantifying the health impacts of ambient air pollutants: recommendations of a WHO/Europe project. Int. J. Publ. Health 60(5), 619–627 (2015)
2. Venkatadri, M., Rao, P.S.: A survey on air quality forecasting techniques. Int. J. Comput. Sci. Inf. Technol. 5(1), 103–107 (2014)
3. Özkaynak, H., Baxter, L., Dionisio, K.: Air pollution exposure prediction approaches used in air pollution epidemiology studies. J. Exp. Sci. Environ. Epidemiol. 23, 566 (2013)
4. Goswami, E., Larson, T., Lumley, T.: Spatial characteristics of fine particulate matter: identifying representative monitoring locations in Seattle, Washington. J. Air Waste Manag. Assoc. 52, 324–333 (2002)
5. Rao, et al.: Environmental modeling and methods for estimation of the global health impacts of air pollution. Environ. Model. Assess. 17(6), 613–622 (2012)
6. Akkala, A., Devabhaktuni, V.: Interpolation techniques and associated software for environmental data. Environ. Prog. Sustain. Energy 29(2), 134–141 (2010)
7. Li, J., Heap, A.D.: A Review of Spatial Interpolation Methods for Environmental Scientists (2008)
8. Xu, X., et al.: PM10 air pollution exposure during pregnancy and term low birth weight in Allegheny County, PA, 1994-2000. Int. Arch. Occup. Environ. Health 84(3), 251–257 (2011)
9. Clark, N.A., et al.: Effect of early life exposure to air pollution on development of childhood asthma. Environ. Health Perspect. 118(2), 284–290 (2010)
10. Trujillo-Ventura, A., Ellis, J.H.: Multiobjective air pollution monitoring network design. Atmos. Environ. Part A. Gen. Top. 25(2), 469–479 (1991)
11. Sarigiannis, D.A., Saisana, M.: Multi-objective optimization of air quality monitoring. Environ. Monit. Assess. 136(1–3), 87–99 (2008)
12. Hough, P.: Method and means for recognizing complex patterns. U.S. Patent, vol. 3,069,654 (1962)
13. Hystad, P., et al.: Creating national air pollution models for population exposure assessment in Canada. Environ. Health Perspect. 119(8), 1123–1129 (2011)
14. Zannetti, P.: Air Pollution Modeling: Theories, Computational Methods and Available Software. Springer, New York (1990). https://doi.org/10.1007/978-1-4757-4465-1
15. Tominaga, Y., Stathopoulos, T.: CFD simulation of near-field pollutant dispersion in the urban environment: a review of current modeling techniques. Atmos. Environ. 79, 716–730 (2013)
16. Buhmann, M.: Radial Basis Functions: Theory and Implementations. Cambridge Monographs on Applied and Computational Mathematics, vol. 12, pp. 147–165. Cambridge University Press, Cambridge (2003)
17. Ballard, D.H.: Generalizing the Hough transform to detect arbitrary shapes. Pattern Recogn. 13(2), 111–122 (1981)
18. Ermak, D.: An analytical model for air pollutant transport and deposition from a point source. Atmos. Environ. (1967) 11, 231–237 (1977)
19. Fishbain, B., et al.: An evaluation tool kit of air quality micro-sensing units (2015)
20. Fishbain, B., Moreno-Centeno, E.: Self calibrated wireless distributed environmental sensory networks. Sci. Rep. 6, 24382 (2016)
21. Lerner, U., Yacobi, T., Levy, I., Moltchanov, S.A., Cole-Hunter, T., Fishbain, B.: The effect of ego-motion on environmental monitoring. Sci. Total Environ. 533, 8–16 (2015)
22. Moltchanov, S., Levy, I., Etzion, Y., Lerner, U., Broday, D.M., Fishbain, B.: On the feasibility of measuring urban air pollution by wireless distributed sensor networks. Sci. Total Environ. 502, 537–547 (2015)


Author information

Authors and Affiliations

Department of Mathematics, Technion – Israel Institute of Technology, Haifa, Israel
Asaf Nebenzal
Faculty of Civil and Environmental Engineering, Technion – Israel Institute of Technology, Haifa, Israel
Barak Fishbain


Corresponding author

Correspondence to Barak Fishbain.

Editor information

Editors and Affiliations

Masaryk University, Brno, Czech Republic
Jiří Hřebíček
Environmental Informatics Group, Saarbrücken, Germany
Ralf Denzer
Austrian Institute of Technology GmbH, Seibersdorf, Austria
Gerald Schimak
Masaryk University, Brno, Czech Republic
Tomáš Pitner


Cite this paper

Nebenzal, A., Fishbain, B. (2017). Hough-Transform-Based Interpolation Scheme for Generating Accurate Dense Spatial Maps of Air Pollutants from Sparse Sensing. In: Hřebíček, J., Denzer, R., Schimak, G., Pitner, T. (eds) Environmental Software Systems. Computer Science for Environmental Protection. ISESS 2017. IFIP Advances in Information and Communication Technology, vol 507. Springer, Cham. https://doi.org/10.1007/978-3-319-89935-0_5


DOI: https://doi.org/10.1007/978-3-319-89935-0_5
Published: 25 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-89934-3
Online ISBN: 978-3-319-89935-0
eBook Packages: Computer Science, Computer Science (R0)
