# Hapke-based computational method to enable unmixing of hyperspectral data of common salts

- 376 Downloads

**Part of the following topical collections:**

## Abstract

Environmental scientists are currently assessing the ability of hyper-spectral remote sensing to detect, identify, and analyze natural components, including minerals, rocks, vegetation and soil. This paper discusses the use of a nonlinear reflectance model to distinguish multicomponent particulate mixtures. Analysis of the data presented in this paper shows that, although the identity of the components can often be found from diagnostic wavelengths of absorption bands, the quantitative abundance determination requires knowledge of the complex refractive indices and average particle scattering albedo, phase function and size. The present study developed a method for spectrally unmixing halite and gypsum combinations. Using the known refractive indexes of the components, and with the assistance of Hapke theory and Legendre polynomials, the authors develop a method to find the component particle sizes and mixing coefficients for blends of halite and gypsum. Material factors in the method include phase function parameters, bidirectional reflectance, imaginary index, grain sizes, and iterative polynomial fitting. The obtained Hapke parameters from the best-fit approach were comparable to those reported in the literature. After the optical constants (*n*, the so-called real index of refraction *and k*, the coefficient of the imaginary index of refraction) are derived, and the geometric parameters are determined, single-scattering albedo (or ω) can be calculated and spectral unmixing becomes possible.

## Keywords

Reflectance spectroscopy Halite Gypsum Reflectance parameters Unmixing## Abbreviations

- r′
reflectance at wavelength

- µ
_{o} cosine of the angle of incident light

- µ
cosine of the angle of emitted light

- g
phase angle

- w′
average single scattering albedo

- B(g)
backscatter function

- P(g)
average single particle phase function, and

- H
Chandrasekhar (1960) H-function for isotropic scatters

## Introduction

Interest has grown in hyperspectral imaging and remote sensing for environmental analysis as it is inexpensive and fast and does not harm the environment in comparison to tradition soil analysis methods [1, 2, 3]. The hyper-spectral technique collects light absorbance and transmittance data from materials. The various earth materials differ from each other in their chemical and physical properties, leading to differences in their reflectance and absorption of light at different wavelengths. These differences are the basis for analyzing and classifying these material [4, 5, 6, 7]. Experimental earth material models have been used to better understand their spectral signatures and to answer some related questions. Salt and evaporite minerals are common earth materials that can be investigated for their reflectance parameters [1, 4, 5, 6]. There is much interest in them since they have simple mineralogy yet significant environmental impacts on soils and plants. However, collected spectral data cannot be directly visually interpreted. Spectral pretreatment techniques, such as data normalization, continuum removal, etc., must be applied to smooth spectral graphs.

*A*_{i}: fractional abundance of the *i*th endmember spectrum; *S*_{x}: *x*th end member spectrum; Y: observed spectrum; W: error term for additive noise; S: matrix of end members.

If we have K spectral bands, and we denote the *x*th endmember spectrum as S_{x} and the abundance of the *i*th endmember as A_{i}, the observed spectrum is Y for any pixel, accounting for additive noise (including sensor noise, endmember variability, and other model inadequacies). This model for pixel synthesis is the linear mixing model (LMM).

_{dec}) is 10% and spruce reflectance (R

_{spr}) is 50% and reflectance measured for the pixel (R

_{pix}) is 30. The mixing model for this example will be as:

_{dec}+ A

_{spr}) = 1; one can rearrange, substitute and solve via:

On the other hand, for intimate mixtures, the non-linear mixing approach has been tested and used [9]. The arrangement of components is not in an order because the components comprising the medium are not organized proportionally on the surface. The intimate mixture of materials results when each component is randomly distributed in a homogeneous way. Non-linear mixing is described by Hapke theory.

In Hapke theory, the isotropic multiple scattering approximation (IMSA) is often used to derive the diffuse reflectance of an intimate mixture, and combines two terms: the contribution of singly scattered light is given exactly, while the multiply scattered light is described by an approximate solution to the radiative transfer equation (RTE) for isotopically scattering particles [14]. One solves the RTE in an infinitely thick half-space of dispersed particulate matter. The derivation assumes that the particles are much larger than the wavelength of light, and uses geometrical optics arguments to solve the radiative transfer integral equations. IMSA considers large phase angles, B(g) = 0 and isotropic scattering, p(g). The objective of the present study is to use Hapke parameters from literature and fitting techniques to simulate and unmix spectra of a simple salt or evaporite system. The selected system is gypsum and halite and their mixtures. These salts have been selected because they are very commonly present in the soils of arid and semi-arid regions.

It was predicted that an intimate mixture of powders may be linearized in the single-scattering albedo [15]. For example, various mixtures of olivine, anorthite, enstatite and magnetite were studied [4]. This research [4] estimated the single-scattering albedo from bi-directional reflectance measurements, and converted the estimated mixing coefficients to mass fractions using the density of the endmembers. While other researchers demonstrated this technique for plagioclase-dominated minerals, computing the density from electron microprobe measurements [16]. Similarly, Hapke model was applied as a basis for unmixing of various mineral mixtures [17]. They replaced the measurement of density with further reflectance measurements. Other studies used the real and imaginary part of the optical constant to compute a quantitative abundance estimate [10]. This study provides a quantitative estimate of the abundance of halite and gypsum from spectral reflectance data, using Hapke model.

## Methodology

### Experimental design

In this study, laboratory experiments have been carried out under controlled conditions for the preparation of pure gypsum and halite crusts and their mixtures. Analytical grade compounds of NaCl (halite), and CaSO_{4}·2H_{2}O (gypsum) were used specifically. The weight fraction, grain size, type of mixing and mixing ratios are the main experimental variables.

### Data presentation

Different approaches of data processing were considered. The traditional method of graphing the spectral data was used. This method involves plotting the percent of reflectance against wavelength for the entire spectral region. Another method is the continuum removal, which is of significance in the study of the absorption features [9]. The continuum is the background absorption onto which the absorption features are superimposed. The continuum removal method implies the removal of the absorption features in the spectra, by plotting the intensities or band depths of the absorption features against the associated wavelengths. This technique of spectral reconstruction can isolate the spectral features and set them on a level, so that comparisons can be made [9].

### Unmixing model

_{0}are the cosines of the reflection and incidence angles; g is the phase angle; B(g) is the back-scattering function, which defines the increase in brightness of a rough surface with decreasing phase; P(g) is the single-particle phase function; and the H(μ) is the isotropic scattering function. The main parameter is

*ω*, the single scattering albedo, defined as the probability that the radiation would be scattered by the particle (power scattered to total power absorbed and scattered). The single scattering albedo can be expressed in term of optical constants n, k and the effective grain size 〈D〉 (the average distance traveled by rays that traverse the particle once, without being internally scattered);

*ω*would thus be dependent on the wavelength of radiation (through n and k) and the shape and size of the particles (〈

*D*〉 ≅ 0.9

*D*for spherical particles, and departures from sphericity will decrease 〈

*D*〉 further).

*R(0)*is the surface reflection coefficient for externally incident light:

*S*

_{i}the reflection coefficient for internally scattered light is given by:

Internal bi-hemispherical reflectance is *r*_{i} and *α* is internal absorption coefficient, while \(\lambda\) is the wavelength of the photons.

*H*is Chandrasekhar integral multiple scattering function:

_{0}(0 ≤ B0 ≤ 1) the amplitude of the opposition effect.

*P*(

*g*) is the particle scattering phase function and describes the angular pattern into which the power is scattered. Where g = i − e is the phase angle. This function can be modeled by Legendre polynomials:

*b*(0 ≤ b ≤ 1) characterizes the anisotropy of the scattering lobe: b = 0 isotropic case, b = 1 single direction diffuser and c(0 ≤ c ≤ 1) backscattering fraction, characterizes the main direction of the diffusion, c < 0.5 representing forward scattering, and c > 0.5 representing backward scattering. In an intimate mixture of different minerals, bidirectional reflectance \(r\left( {\mu , \mu_{0} ,g} \right)\) would depend nonlinearly on the abundances of each mineral component. On the other hand, the single-scattering albedo of a mixture of grains

*ω*

_{mix}, is a linear combination of the single-scattering albedos of its individual endmembers, \(\omega_{i}\):

*f*

_{i}is fractional relative cross section of component i:

*m*

_{i}is mass abundance,

*ρ*

_{i}is density,

*D*

_{i}is the grain size of component i in the mixture. Thus, the reflectance spectra can be inverted to determine the mass abundance and grain sizes of the endmembers in the mixture. These equations and associated python code are provided in the Additional files 1, 2 and 3.

The Hapke model can be considered as an optimization problem through which we try to fit the data to a model that depends on a set of parameters. Since there are so many parameters it is practical to use optical and literature data to reduce the indeterminacies (over-fitting). Phase function parameters from measurements of the bi-directional reflectance at several phase angles are often used to determine some geometric parameters. To this end, one must measure the same reference sample in seven or more geometries, varying the incidence and emergent angles. This, however, is time consuming. For gypsum we used related results in Mustard and Pieters [4]. For halite the same geometrics values were assumed. Reducing the uncertainties in these values yields better fits and reduces the uncertainties in the statistical results, but does not significantly change the results for these samples. However, while optimizing the Hapke model, each grain size parameter usually requires separate measurements for gypsum [12] and for halite [18]. The method of Robertson et al. [10] was used, i.e. n was assumed to be known and the reflectance model was inverted to derive the effective grain size D and k. Also, with n values, Kramers–Kronig relations could be used to obtain the real and imaginary index of refraction from bidirectional reflectance measurements, though this requires larger spectra extending to UV and MIR. Since we had one reference sample with unknown grain sizes, D and k were kept free, but the starting k values were taken from the literature for gypsum. This provided starting values for effective grain size. Because of this approach, k values differ slightly from those in the literature, as the values also depend on other factors, e.g. hydration. For halite, k has not been sufficiently well studied in the literature. Halite is problematic, and is unique in that the single scattering albedo and the absorption values place it in a region where uncertainties are large. To determine the k values for halite the same procedure was used, again keeping the effective grain size as a free parameter. The difference is that the grain size of gypsum was taken as a starting parameter, assuming the two samples were prepared in the same way, to plot the results.

### Inversion algorithms

If the optical material parameters n and k, internal scattering s, the porosity S and the phase function parameters b and c are given, the reflectance spectra can be inverted to determine the mass abundance and grain sizes of the endmembers in the mixture. The phase function parameters b and c are determined by taking measurements of bidirectional reflectance at several angles, g [4]. Also, the wavelength-dependent real and imaginary indices of refraction can be obtained from bidirectional reflectance of samples with different grain sizes [13]. There are two general algorithms which were used to extract the mass abundances and the grain sizes of the endmembers in the mixture, from the model and measured reflectance.

The first approach [15, 19, 20, 21] is to find best fitting parameters *m*_{i}, *D*_{i} that minimize the root mean square of the difference between the model and data reflectance. The second method is the probabilistic method [6], that uses a Markov Chain Monte Carlo algorithm and Bayes Theorem to estimate the probability density functions of the model parameters, given the reflectance data and model relationship between parameters. One of the advantages of the probabilistic model is that the detection noise model (which can be non-Gaussian for low count photons per pixel) can be accounted in the calculations.

While the first approach supplies a single set of data for the endmember mass fractions and particles sizes, the probabilistic model gives a range of values and, in principle, can account for non-unique solutions in the model parameters.

## Results and discussion

Polynomial fitting of the smooth background is a common algorithm used in peak fitting software. The idea is to keep the polynomial series low in degree (minimizing the number of parameters-Occam’s razor). Thus, the iterative algorithm is looking for that series that approximate the background satisfactory. This fitting is necessary to get a (qusi) quantitative understanding of the contributions of the components in the mixture to the reflection spectra, assuming linear mixture model is valid, from the size of the band depths. The polynomial fitting was used only to approximate the background smooth component of reflectance, and then subtract it to reveal the absorption features, un-skewed by the background component. Since the background continuum part of reflectance is assumed to be smooth, it can be modeled by a polynomial series. No polynomial fitting was used in the Hapke model. The Hapke algorithm was then used to find the required parameters (n, k, D, ω, *ρ, S*) to simulate the spectra of halite, gypsum and their mixtures. The study also found a favorable comparison between the results of our extracted parameters and those reported in the literature. The input parameters were: incoming angle = 30°, emerging angle = 0°, phase angle g (the angle between the direction of the source to detector) = 30.0°; phase parameters [4] or b = − 0.4, and c = 0.25. We also used B = 0 (g > 15), s = 10^{−17} and S = 1.

The scattering regime of the two-component system is: (1) for gypsum, single scattering albedo ω is between 0.8 < ω < 0.99, and (2) for halite is close to 1 for the entire region 0.95 < ω < 0.99. For gypsum *α*〈*D*〉 is between 0.01 and 0.11 while gypsum is between 0.1 and 0.5. The region with \(\alpha \left\langle D \right\rangle \ll 1\) is the volume scattering region with scattering albedo ω close to 1. The reflectance is dominated by light that has been refracted and transmitted within the volume of the particle. The region of *α*〈*D*〉 < 0.1 is especially susceptible to errors when determining k [19]. This study used the following densities values: *ρ*_{halite} = 2.16 g/cm^{3}, *ρ*_{gypsum} = 2.31 g/cm^{3}

_{gypsum}= 0.758, m

_{halite}= 0.24, grain sizes, D

_{gypsum}= 57 µm, D

_{halite}= 40.08 µm, and

*χ*

^{2}= 0.82 (Fig. 8). By comparison, the simulation results for the second mixing scenario, which involves 50% gypsum and 50% halite, we conducted with the following fitting parameters: m

_{gypsum}= 0.105, m

_{halite}= 0.89, grain sizes D

_{gypsum}= 43 µm, D

_{halite}= 287 µm,

*χ*

^{2}= 0.98. The simulation results for these two scenarios are shown in Fig. 8. A third scenario considered the same percentages (i.e. 50% gypsum and 50% halite mixing ratios). The values employed were m

_{gypsum}= 0.364, m

_{halite}= 0.635, grain sizes, D

_{gypsum}= 339 µm, D

_{halite}= 46.5 µm,

*χ*

^{2}= 1.09. For this scenario there was a higher fitting error, as seen in Fig. 9. The fourth and last scenario considered 25% gypsum and 75% halite mixture. The fitting parameters of the mass fractions are m

_{gypsum}= 0.0016, m

_{halite}= 0.994, grain sizes, D

_{gypsum}= 26 µm (fixed), D

_{halite}= 265 µm,

*χ*

^{2}= 0.478. The simulation results for the last two scenarios are shown in Fig. 9. Apart from the last scenario, the results of simulation can be considered satisfactory—the results of the measured and modeled spectra of the first two scenarios almost coincide.

## Conclusions

The approach reported in this contribution was useful for modeling the mixed spectra of gypsum and halite, after obtaining the optical constants n, k for gypsum and halite, and leaving the grain sizes or their ratio as a parameter for fitting. The main challenge facing spectral modeling is that the single scatted albedo depends nontrivially on many variables, including grain sizes, which impact both of the absorption coefficients, and then the fractional cross sections, i.e. there are at least two other reflectance variables which are linked to grain size. The grain size mainly scales the spectra, but there are additional factors as well e.g. porosity factor, and shape of the grains. Although we have measured the spectra from 350 to 2500 nm, we used only the NIR region 750–2500 nm. Impurities make the model unsuitable in the VIS range. The geometry of the measurement is very important for unmixing, since the phase factor cannot be neglected. The study concludes that reflectivity band contrast decreases and becomes overall smaller as particle size increases. High scattering albedo components have larger influence because of the nonlinear dependence of reflectance on it, especially if they are smaller in size. When the absorption is low the sample must be thick (e.g. for halite 100 µm with ω varying only few percent, the sample must be larger than 1 cm).

## Notes

### Authors’ contributions

All authors contributed to this manuscript. FMH designed and supervised the research project. GA did the software design for extraction and comparison. YN and FA contributed to the experimental design. All authors read and approved the final manuscript.

### Acknowledgements

The authors would like to extend our thanks and appreciation Prof Bruce Hapke, University of Pittsburgh and the unanimous reviewers for their comments and suggestions as well as to UAE Space Agency for funding this research Z01-2016-001.

### Competing interests

The authors declare that they have no competing interests.

### Availability of data and materials

The datasets supporting the conclusions of this article are available in the USGS spectral library https://speclab.cr.usgs.gov/spectral-lib.html. The python code for the fitting is provided in the Additional files 1, 2 and 3.

### Ethics approval and consent to participate

Not applicable.

### Funding

Fares Howari is grateful for support by UAE Space Agency for funding this project.

### Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Supplementary material

## References

- 1.Howari FM, Goodel PC, Miyamoto S (2002) Spectral properties of salt crusts formed on saline soils. Environ Qual 31:1453–1461CrossRefGoogle Scholar
- 2.Wang F, Gao J, Zha Y (2018) Hyperspectral sensing of heavy metals in soil and vegetation: feasibility and challenges. ISPRS J Photogramm Remote Sens 136:73–84CrossRefGoogle Scholar
- 3.Hunt GR (1982) Spectroscopic properties of rocks and minerals. In: Carmichael RS (ed) Handbook of physical properties of rocks, vol 1. CRC Press, Boca Raton, pp 295–385Google Scholar
- 4.Mustard JF, Pieters CM (1989) Photometric phase functions of common geologic minerals and applications to quantitative analysis of mineral mixture reflectance spectra. J Geophys Res Solid Earth 94:13619–13634CrossRefGoogle Scholar
- 5.Araújo SR, Wetterlind J, Demattê JAM, Stenberg B (2014) Improving the prediction performance of a large tropical vis-NIR spectroscopic soil library from Brazil by clustering into smaller subsets or use of data mining calibration techniques. Eur J Soil Sci 65(5):718CrossRefGoogle Scholar
- 6.Lapotre MGA, Ehlmann BL, Minson SE (2017) A probabilistic approach to remote compositional analysis of planetary surfaces. J Geophys Res Planets 122:983–1009CrossRefGoogle Scholar
- 7.Howari FM (2004) Chemical and environmental implications of visible and near-infrared spectral features of salt crusts formed from different brines. Ann Chim 94(4):315–323CrossRefPubMedGoogle Scholar
- 8.Hunt GR, Salisbury JW (1970) Visible and near infrared spectra of minerals and rocks. I. Silicate minerals. Mod Geol 1:283–300Google Scholar
- 9.Clark RN (1999) Spectroscopy of rocks and minerals and principles of spectroscopy. In: Rences AN (ed) Remote sensing for earth sciences: manual of remote sensing, vol 3, 3rd edn. Wiley, Hoboken, pp 3–52Google Scholar
- 10.Robertson K, Milliken R, Li S (2016) Estimating mineral abundances of clay and gypsum mixtures using radiative transfer models applied to visible-near infrared reflectance spectra. Icarus 277:171–186CrossRefGoogle Scholar
- 11.Csillag F, Pasztore L, Biehl LL (1993) Spectral selection for characterization of salinity status of soils. Remote Sens Environ 43:231–242CrossRefGoogle Scholar
- 12.Roush TL, Esposito F, Rossman GR, Colangeli L (2007) Estimated optical constants of gypsum in the regions of weak absorptions: application of scattering theories and comparisons to independent measurements. J Geophys Res Planets. https://doi.org/10.1029/2007JE002920 CrossRefGoogle Scholar
- 13.Sklute EC, Glotch TD, Piatek JL, Woerner WR, Martone AA, Kraner ML (2015) Optical constants of synthetic potassium, sodium, and hydronium jarosite. Am Mineral 100:1110CrossRefGoogle Scholar
- 14.Keshava N (2003) A survey of spectral unmixing algorithms. Lincoln Lab J 14(1):55–78Google Scholar
- 15.Hapke BW (1981) Bidirectional reflectance spectroscopy 1. Theory. J Geophys Res 86(1981):3039–3054CrossRefGoogle Scholar
- 16.Cheek LC, Pieters CM (2014) Reflectance spectroscopy of plagioclase and mafic mineral mixtures: implications for characterizing lunar anorthosites remotely. Am Mineral 99:1871–1892CrossRefGoogle Scholar
- 17.Grumpe A, Mengewein N, Rommel D, Mall U, Wöhler C (2018) Interpreting spectral unmixing coefficients: from spectral weights to mass fractions. Icarus 299:1–14CrossRefGoogle Scholar
- 18.Palik E (1998) Handbook of optical constants of solids, 1st edn. Academic Press, p 999Google Scholar
- 19.Hapke B (2012) Theory of reflectance and emittance spectroscopy, 2nd edn. Cambridge University Press, New YorkGoogle Scholar
- 20.Li S, Milliken RE (2015) Estimating the modal mineralogy of eucrite and diogenite meteorites using visible-near infrared reflectance spectroscopy. Meteorit Planet Sci 50(11):1821–1850CrossRefGoogle Scholar
- 21.Hapke B (2005) Theory of reflectance and emittance spectroscopy. Cambridge University Press, Cambridge, p 472Google Scholar

## Copyright information

**Open Access**This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.