
1 Introduction

A hyperspectral image (HSI) usually consists of hundreds of spectral bands ranging from the visible to the infrared spectrum [1]. Each pixel of an HSI can be represented by a high-dimensional spectral vector. It is this rich spectral information that has not only attracted the attention of the remote sensing community but also aroused great interest in other fields, for instance, military [2], agriculture [3], urban planning, and environmental monitoring [4]. Classification plays a crucial role in these fields. However, HSI contains a large amount of irrelevant or redundant data, which causes a number of issues, including significantly increased computation time and computational complexity as well as degraded classification performance, especially when the training data are limited. A number of classical dimensionality reduction (DR) algorithms have been explored to address these issues.

One classic linear DR method is principal component analysis (PCA) [5], but as an unsupervised method, PCA does not take advantage of class label information. Another classic linear DR method is linear discriminant analysis (LDA) [6]; as a supervised method, it often suffers from the small sample size problem. The biggest disadvantage of these linear methods is their failure to discover the nonlinear structure inherent in HSI.

Since nonlinear techniques have the merit of preserving the geometrical structure of the data manifold, they can overcome the above problems. Laplacian eigenmaps (LE) [7], locally linear embedding (LLE) [8], and other manifold learning algorithms have been successfully applied to DR for HSI. Besides, locality preserving projection (LPP) [9], a linear version of LE, has been introduced. To overcome the tendency of LDA to produce undesirable results when the samples within a class follow multimodal non-Gaussian distributions [10], local Fisher’s discriminant analysis (LFDA) [11], which combines the advantages of LDA and LPP, was introduced. Later, unlike LPP, which uses only one graph to describe the geometry of the samples, the local discriminant embedding (LDE) [12] method was proposed, which uses two graphs to characterize the geometric structure of the samples: an intrinsic graph to characterize the compactness of samples within a class, and a penalty graph to describe the separation between classes. Thus, LDE is more discriminative than LPP. The advantage of LDE is that it keeps the intrinsic neighbor relations of data from the same class while pushing data from different classes away from each other. However, one thing these methods have in common is that the affinity matrix is computed from the K nearest neighborhood, which is sensitive to outlier samples.

To overcome this problem, a graph embedding (GE) framework [13] was proposed. To represent the sparse nature of the samples, sparse graph embedding (SGE) [14] was developed. Later, a sparse graph-based discriminant analysis (SGDA) [15] model was developed by exploiting the class label information, resulting in better performance than SGE. Building on SGDA, sparse and low-rank graph discriminant analysis (SLGDA) [16] was proposed by incorporating local information of the samples. Recently, by taking into account the shape of the spectral curves across bands, a graph-based discriminant analysis with spectral similarity (GDA-SS) [17] method was proposed.

Each pixel of an HSI is a high-dimensional spectral vector that directly records the spectral reflectance of the targets in different bands. Under ideal conditions, the same targets should have the same spectral characteristics. Nevertheless, in the real world HSI is easily influenced by environmental changes (e.g., atmosphere and illumination) and instrument problems (e.g., sensor noise). Moreover, the K nearest neighborhood based on the Euclidean distance, which is usually used to compute the similarity between two vectors, is highly susceptible to interference from outliers. These factors may lead to inaccurate graph construction and poor classification performance. Inspired by the region covariance descriptor in [18] and the superiority of second-order statistics for representing data, a novel modified local discriminant embedding (MLDE) is proposed by constructing neighborhoods in a new spectral feature space instead of the original space. We use variance to characterize the similarity of pixels within the same class and covariance to characterize the separation between pixels of different classes. Considering that covariance matrices are symmetric positive definite and lie on a Riemannian manifold, the Log-Euclidean metric is used to capture the similarity, which works better than the Euclidean distance. The main contributions of this paper are summarized as follows: (a) The combination of variance and covariance brings data points of the same class closer together and increases the separation between data points of different classes, which enhances the classification performance of HSI. (b) Representing the data by variance and covariance attenuates the effects of noise, so the method handles noise in HSI better. (c) The Log-Euclidean metric provides a more accurate similarity evaluation than the Euclidean distance and better expresses the characteristics of the spectral information.

2 Related Work

2.1 Local Discriminant Embedding (LDE)

Assume a hyperspectral dataset with N samples is denoted as \(X=\{x_{i}\}_{i=1}^N\), where each sample lies in an \(\mathbb {R}^{m\times 1}\) feature space and m is the number of bands. The class labels are \(y_{i}\in \{1, 2, \ldots , C\}\), where C is the number of classes.

LDE, which was designed for manifold learning and pattern classification, seeks an optimal projection matrix by considering both the class label information of the data points and the local neighborhood information between data points. Specifically, the LDE algorithm can be described as follows.

Step 1: Construct neighborhood graphs. An intrinsic graph G and a penalty graph \(G^{\prime }\) are constructed from the K nearest neighborhood (KNN) over all the data points.

Step 2: Compute affinity weights. The affinity matrix W of the intrinsic graph G and the affinity matrix \(W^{\prime }\) of the penalty graph \(G^{\prime }\) are computed as follows:

$$\begin{aligned} w_{ij} = {\left\{ \begin{array}{ll} exp(-||x_{i} - x_{j}||^{2} / t) &{} x_{j}\in O(K, x_{i}) \\ &{}\text {or }x_{i}\in O(K, x_{j}) \\ &{}\text {and }y_{i} = y_{j}; \\ 0 &{}\text {otherwise} \end{array}\right. } \end{aligned}$$
(1)

and

$$\begin{aligned} w'_{ij} = {\left\{ \begin{array}{ll} exp(-||x_{i} - x_{j}||^{2} / t) &{} x_{j}\in O(K, x_{i}) \\ &{}\text {or } x_{i}\in O(K, x_{j}) \\ &{}\text {and }y_{i} \not = y_{j}; \\ 0 &{}\text {otherwise} \end{array}\right. } \end{aligned}$$
(2)

where \(O(K,x_{i})\) denotes the K nearest neighbors of data point \(x_{i}\) and t is a kernel width parameter.
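As an illustration only, a minimal NumPy sketch of the two affinity matrices of Eqs. (1) and (2) is given below (not the authors' code; the function name and the dense distance computation are our own choices).

```python
import numpy as np

def lde_affinities(X, y, K=12, t=1.0):
    """Intrinsic (W) and penalty (W') affinities of Eqs. (1)-(2).

    X : (N, m) array of spectral vectors, y : (N,) integer class labels.
    """
    N = X.shape[0]
    # Pairwise squared Euclidean distances between all spectral vectors.
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    # knn[i, j] is True when x_j is among the K nearest neighbours of x_i.
    order = np.argsort(d2, axis=1)
    knn = np.zeros((N, N), dtype=bool)
    rows = np.repeat(np.arange(N), K)
    knn[rows, order[:, 1:K + 1].ravel()] = True       # column 0 is the point itself
    neigh = knn | knn.T                               # x_j in O(K, x_i) or x_i in O(K, x_j)
    same = y[:, None] == y[None, :]
    heat = np.exp(-d2 / t)
    W = np.where(neigh & same, heat, 0.0)             # Eq. (1): same-class neighbours
    W_pen = np.where(neigh & ~same, heat, 0.0)        # Eq. (2): different-class neighbours
    return W, W_pen
```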

The optimization problem of LDE is described as follows:

$$\begin{aligned} \begin{aligned} \mathop {\arg }&\min _P \sum _{i,j}||P^{T}x_{i}-P^{T}x_{j}||^{2}w_{ij}\\ s.t.\;&\sum _{ij}||P^{T}x_{i}-P^{T}x_{j}||^{2}w_{ij}^{'}=1 \end{aligned} \end{aligned}$$
(3)

Step 3: Complete the embedding. The projection matrix P is obtained by solving for the eigenvectors corresponding to the H smallest nonzero eigenvalues of the following generalized eigenvalue problem:

$$\begin{aligned} X(D-W)X^{T}P = \Lambda X(D^{'}-W^{'})X^{T}P \end{aligned}$$
(4)

where \(\Lambda \) is a diagonal eigenvalue matrix, and D and \(D^{'}\) are diagonal matrices with \(D_{ii} = \sum _{j=1}^{N}w_{ij}\) and \(D_{ii}^{'} = \sum _{j=1}^{N}w_{ij}^{'}\).
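A possible way to carry out Step 3 numerically is sketched below, again only as an illustration: it assumes the affinity matrices from the previous sketch, works with column-sample matrices as in Eq. (4), and adds a small ridge to the right-hand matrix (our own addition) so that the generalized eigensolver stays well posed when samples are scarce.

```python
import numpy as np
from scipy.linalg import eigh

def lde_projection(X, W, W_pen, H):
    """Solve Eq. (4) and return the (m, H) projection matrix P."""
    Xc = X.T                                   # columns are samples, as in Eq. (4)
    D = np.diag(W.sum(axis=1))
    D_pen = np.diag(W_pen.sum(axis=1))
    A = Xc @ (D - W) @ Xc.T                    # left-hand side of Eq. (4)
    B = Xc @ (D_pen - W_pen) @ Xc.T            # right-hand side of Eq. (4)
    B = B + 1e-6 * np.trace(B) / B.shape[0] * np.eye(B.shape[0])   # ridge (assumption)
    eigvals, eigvecs = eigh(A, B)              # eigenvalues returned in ascending order
    return eigvecs[:, :H]                      # H smallest eigenvalues -> P
```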

2.2 Region Covariance Descriptor for HSI

As a robust and novel data descriptor, the region covariance descriptor has been successfully and effectively applied to many computer vision problems [19, 20]. Consider HSI data \({\varvec{{X}}}\in \mathbb {R}^{\textit{l}\times \textit{w}\times \textit{m}}\), where \(\textit{m}\) is the number of bands and \(\textit{l}\times \textit{w}\) is the spatial size. A third-order spatial-spectral tensor \(x\in \mathbb {R}^{(2\textit{n}+1)\times (2\textit{n}+1)\times \textit{m}}\) is a small patch of \({\varvec{{X}}}\): its center is a pixel and the remaining entries form that pixel's local neighborhood. Therefore, the pixels of the HSI data \({\varvec{{X}}}\) can be denoted as \(\{x_{i}\}_{i=1}^{\textit{N}}\), where \(x_i\in \mathbb {R}^{(2\textit{n}+1)\times (2\textit{n}+1)\times \textit{m}}\) denotes the patch around the ith pixel and N is the number of pixels [18]. Let \(x_{s}\;(s = 1, 2, \ldots , (2n+1)\times (2n+1))\) be the spectral vectors in the region of interest around the ith pixel. Then, a spectral region covariance descriptor \(C_{i}\) can be obtained by Eq. (5).

$$\begin{aligned} \begin{aligned} C_{i}&= {1\over S-1}\sum _{s=1}^{S} (x_{s}-\mu _{i})(x_{s}-\mu _{i})^{T}\\ \mu _i&= {1\over S}\sum _{s=1}^{S} x_{s} \end{aligned} \end{aligned}$$
(5)

where S is the number of spectral vectors in the region of interest and \(\mu _i\) is the mean vector. Meanwhile, \(C_{i}\) is regarded as the feature of \(x_i\).
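For concreteness, a short sketch of Eq. (5) for one patch is given below (illustrative only; the function name, border handling, and array layout are our assumptions).

```python
import numpy as np

def region_covariance(cube, row, col, n):
    """Eq. (5): covariance descriptor of the (2n+1)x(2n+1) patch around (row, col).

    cube : (l, w, m) hyperspectral array; the patch must lie inside the image.
    """
    patch = cube[row - n:row + n + 1, col - n:col + n + 1, :]
    vecs = patch.reshape(-1, patch.shape[-1])          # S = (2n+1)^2 spectra of length m
    mu = vecs.mean(axis=0)                              # mean spectral vector
    centered = vecs - mu
    C = centered.T @ centered / (vecs.shape[0] - 1)     # (m, m) region covariance
    return C, mu
```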

3 Our Work

3.1 Variance and Covariance for HSI

Inspired by the region covariance descriptor in [18], we introduce the variance and covariance in place of the region covariance descriptor to attenuate the effects of noise, because in this paper the hyperspectral data are used as input in vector form rather than tensor form. Consider a hyperspectral dataset denoted as \(X = \{x_{i}|x_{i1}, x_{i2}, \ldots , x_{im}\}_{i=1}^N\) in an \(\mathbb {R}^{m\times 1}\) feature space, where m is the number of bands. Then, a spectral variance \(C_i\;(i = 1, 2, \ldots , N)\) and a covariance \(C_{ij}\;(i, j = 1, 2, \ldots , N)\) can be obtained by Eq. (6).

$$\begin{aligned} \begin{aligned} C_i&= {1\over m-1}\sum _{k=1}^{m} (x_{ik}-\mu _{i})(x_{ik}-\mu _{i})^{T}\\ \mu _i&= {1\over m}\sum _{k=1}^{m} x_{ik}\\ C_{ij}&= {1\over m-1}\sum _{k=1}^{m} (x_{ik}-\mu _{i})(x_{jk}-\mu _{j})^{T} \end{aligned} \end{aligned}$$
(6)

where \(\mu _i\) is the spectral mean value of \(x_i\). Meanwhile, the variance \(C_i\) is regarded as the feature of \(x_i\), and the covariance \(C_{ij}\) is regarded as the feature between \(x_i\) and \(x_j\).
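Since the pixels are now plain m-dimensional vectors, \(C_i\) and \(C_{ij}\) in Eq. (6) reduce to scalars. A minimal sketch follows (illustrative only; the function names are ours).

```python
import numpy as np

def variance_features(X):
    """Eq. (6): per-pixel variance C_i over the m bands; X is (N, m)."""
    mu = X.mean(axis=1, keepdims=True)
    return np.sum((X - mu) ** 2, axis=1) / (X.shape[1] - 1)

def covariance_features(X):
    """Eq. (6): pairwise covariances C_ij between the band profiles of all pixels."""
    centered = X - X.mean(axis=1, keepdims=True)
    return centered @ centered.T / (X.shape[1] - 1)     # (N, N), diagonal equals C_i
```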

3.2 Modified Local Discriminant Embedding (MLDE)

Because the Euclidean distance is sensitive to noise, and because the data inevitably contain noise created by environmental changes (e.g., atmosphere and illumination) and instrument problems (e.g., sensor), the LDE algorithm may produce an inaccurate graph construction and poor classification performance. In this section, we propose the MLDE algorithm to overcome this problem.

As in LDE, the intrinsic graph G and the penalty graph \(G^{\prime }\) must be constructed first. The difference in MLDE is that we use the variance features \(\{C_{i}\}_{i=1}^N\) and the covariance features \(\{C_{ij}\}_{i,j=1}^N\) obtained by Eq. (6) to construct the intrinsic graph and the penalty graph, denoted as \(G_{var}\) and \(G_{cov}^{\prime }\), respectively. Since the variance and covariance features lie on a Riemannian manifold, the Log-Euclidean metric is a good choice to compute the affinity:

$$\begin{aligned} D_{LE}(C_{i},C_{j}) = |log(C_{i}) - log(C_{j})| \end{aligned}$$
(7)

Then, the affinity matrix \(W_{var}\) of the intrinsic graph \(G_{var}\) and the affinity matrix \(W_{cov}^{\prime }\) of the penalty graph \(G_{cov}^{\prime }\) can be computed as follows:

$$\begin{aligned} w_{var}\;_{ij} = {\left\{ \begin{array}{ll} exp(- D_{LE}(C_{i},C_{j})^{2} / t) &{} C_{j}\in O(K, C_{i}) \\ &{}\text {or } C_{i}\in O(K,C_{j}) \\ &{}\text {and } y_{i} = y_{j}; \\ 0 &{}\text {otherwise} \end{array}\right. } \end{aligned}$$
(8)

and

$$\begin{aligned} w'_{cov}\;_{ij} = {\left\{ \begin{array}{ll} exp(-|log(C_{ij})|^{2} / t) &{} C_{ij}\in O(K,C_{ii}) \\ &{}\text {or } C_{ii}\in O(K,C_{ij}) \\ &{}\text {and } y_{i} \not = y_{j}; \\ 0 &{}\text {otherwise} \end{array}\right. } \end{aligned}$$
(9)

where \(O(K,C_{i})\) denotes the K nearest neighbors of the variance feature \(C_{i}\) and t is a kernel width parameter.
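A minimal sketch of Eqs. (7)–(9) follows; it is not the authors' code. Eq. (8) is implemented as written, while Eq. (9) leaves the neighborhood test and the treatment of non-positive covariances open, so the choices below (KNN decided on the Log-Euclidean variance distances, log of the absolute covariance) are our own assumptions.

```python
import numpy as np

def mlde_affinities(C_var, C_cov, y, K=12, t=1.0):
    """W_var of Eq. (8) and a penalty matrix in the spirit of Eq. (9).

    C_var : (N,) per-pixel variances, C_cov : (N, N) pairwise covariances,
    y : (N,) integer class labels.
    """
    N = C_var.shape[0]
    log_c = np.log(C_var)                                  # variances are positive
    d_le = np.abs(log_c[:, None] - log_c[None, :])         # Eq. (7), scalar case
    order = np.argsort(d_le, axis=1)
    knn = np.zeros((N, N), dtype=bool)
    rows = np.repeat(np.arange(N), K)
    knn[rows, order[:, 1:K + 1].ravel()] = True
    neigh = knn | knn.T
    same = y[:, None] == y[None, :]
    W_var = np.where(neigh & same, np.exp(-d_le ** 2 / t), 0.0)        # Eq. (8)
    pen = np.exp(-np.log(np.abs(C_cov) + 1e-12) ** 2 / t)              # cf. Eq. (9), assumption
    W_cov = np.where(neigh & ~same, pen, 0.0)
    np.fill_diagonal(W_var, 0.0)
    np.fill_diagonal(W_cov, 0.0)
    return W_var, W_cov
```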

The optimization problem of MLDE is described as follows:

$$\begin{aligned} \begin{aligned} J&(P) = \mathop {\arg }\min _P \sum _{i,j}||P^{T}x_{i}-P^{T}x_{j}||^{2}w_{var}\;_{ij}\\&\;s.t.\;\;\sum _{ij}||P^{T}x_{i}-P^{T}x_{j}||^{2}w'_{cov}\;_{ij}=1 \end{aligned} \end{aligned}$$
(10)

Similarly to LDE, the optimization problem (10) can be rewritten as (11) using properties of the trace.

$$\begin{aligned} \begin{aligned} J(P)&= \mathop {\arg }\min _P \sum _{i,j}||P^{T}x_{i}-P^{T}x_{j}||^{2}w_{var}\;_{ij}\\&= \mathop {\arg }\min _P \sum _{i,j}tr\{(P^{T}x_{i}-P^{T}x_{j})(P^{T}x_{i}-P^{T}x_{j})^{T}\}w_{var}\;_{ij}\\&=\mathop {\arg }\min _P\sum _{i,j}tr\{P^{T}(x_{i}-x_{j})(x_{i}-x_{j})^{T}P\}w_{var}\;_{ij} \end{aligned} \end{aligned}$$
(11)

Since \(w_{var}\;_{ij}\) is a scalar and the trace operator is linear, Eq. (11) can be rewritten as (12):

$$\begin{aligned} \begin{aligned} J(P)&= \mathop {\arg }\min _{P}\;tr\{P^{T}\sum _{i,j}((x_{i}-x_{j})w_{var}\;_{ij}(x_{i}-x_{j})^{T})P\}\\&=\mathop {\arg }\min _{P}\;tr\{P^{T}(2XD_{var}X^{T}-2XW_{var}X^{T})P\}\\&= \mathop {\arg }\min _{P}\;2tr\{P^{T}X(D_{var}-W_{var})X^{T}P\} \end{aligned} \end{aligned}$$
(12)

where \(D_{var}\) is a diagonal matrix with \(D_{var}\;_{ii}=\sum _{j=1}^{N}W_{var}\;_{ij}\). Then, the optimization problem (10) can be rewritten as (13):

$$\begin{aligned} \begin{aligned} J&\!(P) = \mathop {\arg }\min _{P}\;2tr\{P^{T}X(D_{var}-W_{var})X^{T}P\}\\&\;s.t.\;\;2tr\{P^{T}X(D_{cov}-W_{cov})X^{T}P\}=1 \end{aligned} \end{aligned}$$
(13)

The projection matrix P can be obtained by solving the eigenvectors corresponding to the H smallest nonzero eigenvalues of the following generalized eigenvalue problem:

$$\begin{aligned} X(D_{var}-W_{var})X^{T}P = \Lambda X(D_{cov}-W_{cov})X^{T}P \end{aligned}$$
(14)

Thus, MLDE for hyperspectral image classification is carried out following the steps in Algorithm 1.

Algorithm 1. MLDE for hyperspectral image classification (variance and covariance features by Eq. (6), Log-Euclidean graph construction by Eqs. (7)–(9), generalized eigenvalue problem (14), projection by P)
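Purely as an illustration of Algorithm 1 under the assumptions stated above, the whole pipeline can be sketched as follows; it reuses the mlde_affinities helper sketched in Sect. 3.2 and the same ridge term as before, both of which are our own choices rather than the paper's specification.

```python
import numpy as np
from scipy.linalg import eigh

def mlde(X, y, K=12, H=27, t=1.0):
    """Return the (m, H) projection P of Eq. (14); X is (N, m), y is (N,)."""
    centered = X - X.mean(axis=1, keepdims=True)
    C_var = np.sum(centered ** 2, axis=1) / (X.shape[1] - 1)      # Eq. (6)
    C_cov = centered @ centered.T / (X.shape[1] - 1)              # Eq. (6)
    W_var, W_cov = mlde_affinities(C_var, C_cov, y, K=K, t=t)     # Eqs. (7)-(9), sketched above
    D_var = np.diag(W_var.sum(axis=1))
    D_cov = np.diag(W_cov.sum(axis=1))
    Xc = X.T                                                      # columns are samples
    A = Xc @ (D_var - W_var) @ Xc.T                               # left-hand side of Eq. (14)
    B = Xc @ (D_cov - W_cov) @ Xc.T                               # right-hand side of Eq. (14)
    B = B + 1e-6 * np.trace(B) / B.shape[0] * np.eye(B.shape[0])  # ridge (assumption)
    _, eigvecs = eigh(A, B)                                       # ascending eigenvalues
    return eigvecs[:, :H]
```

New samples are then projected as \(P^{T}x\) (X @ P in row-vector form) before being fed to the classifier.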

4 Experimental Results and Discussions

In this section, we apply MLDE to two hyperspectral datasets. Firstly, we introduce the experimental datasets. Secondly, we describe how the experimental parameters are chosen. Finally, the classification accuracies and classification maps of the compared algorithms and the MLDE algorithm are presented. The MLDE algorithm is implemented in MATLAB. The results are generated on a personal computer equipped with an Intel Core i7-3370 running at 3.40 GHz and 4 GB of memory.

Table 1. Number of training and testing samples for the University of Pavia dataset
Table 2. Number of training and testing samples for the Salinas dataset

4.1 Experimental Dataset

The first experimental dataset was acquired by the Reflective Optics System Imaging Spectrometer (ROSIS) sensor over the University of Pavia, Italy. The image contains \(610\times 340\) pixels and 115 spectral bands in the wavelength range 0.43–0.86 \(\upmu \)m. In our experiments, 12 noisy spectral bands are removed, so a total of 103 bands is used. The image contains 9 different classes and a total of 42776 ground-truth samples (Table 1).

The second experimental dataset was acquired by the National Aeronautics and Space Administration's Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) sensor over Salinas Valley, California. The image contains \(512 \times 217\) pixels and 204 bands after 20 water-absorption bands are removed. The image contains 16 different classes and a total of 54129 ground-truth samples.

For the University of Pavia dataset and the Salinas dataset, 8% and 5% of the samples in each class are randomly selected as training samples, respectively, and the rest are used as testing samples. More detailed information on the numbers of training and testing samples is summarized in Tables 1 and 2.

4.2 Experiment Parameters

An SVM is used to evaluate the proposed MLDE algorithm. The SVM classifier is implemented with libsvm (RBF kernel, penalty parameter 1000, and sigma searched in {0.01, 0.05, 0.5, 1, 5, 10, 50, 100, 500, 1000}). To demonstrate the benefits of the MLDE algorithm, the experimental results are compared with nine other classical DR algorithms, namely PCA, LDA, LPP, LDE, LFDA, LGDA, SGDA, SLGDA, and GDA-SS.
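For readers who prefer Python, the same classifier setup can be approximated with scikit-learn as sketched below (an assumption: the original experiments use libsvm in MATLAB, and libsvm's sigma and scikit-learn's gamma parameterise the RBF kernel differently, so the grid is carried over only by analogy).

```python
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

sigma_grid = [0.01, 0.05, 0.5, 1, 5, 10, 50, 100, 500, 1000]
# RBF-kernel SVM with penalty parameter C = 1000; gamma is tuned over the grid.
svm = GridSearchCV(SVC(kernel='rbf', C=1000),
                   param_grid={'gamma': sigma_grid}, cv=3)
# Typical usage (X_train_dr / X_test_dr are the dimension-reduced features):
#   svm.fit(X_train_dr, y_train)
#   overall_accuracy = svm.score(X_test_dr, y_test)
```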

Fig. 1. The overall accuracy corresponding to different reduced dimensionalities and different values of K for MLDE on the two hyperspectral datasets

The reduced dimensionality and the neighborhood size K are two important parameters that have a significant influence on the classification performance.

If K is too small, classification accuracy may decrease; if K is too large, the computational complexity grows, more noise is introduced, and the classification performance degrades. To find a good value of K, the even numbers from 2 to 60 are tried, and the reduced dimensionality is searched in the range {2, 7, 12, 15, 20, 25, 27, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75}. For a clearer presentation, only the range 2–30 of K is shown in Fig. 1.
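A minimal sketch of this parameter sweep is shown below (our own loop; it assumes the mlde function and the svm classifier sketched earlier, plus held-out training and test splits).

```python
def sweep_parameters(X_tr, y_tr, X_te, y_te, classifier):
    """Grid-search K and the reduced dimensionality H by overall accuracy."""
    K_grid = range(2, 61, 2)
    H_grid = [2, 7, 12, 15, 20, 25, 27, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75]
    best = (0.0, None, None)
    for K in K_grid:
        for H in H_grid:
            P = mlde(X_tr, y_tr, K=K, H=H)            # hypothetical helper from Sect. 3
            classifier.fit(X_tr @ P, y_tr)
            acc = classifier.score(X_te @ P, y_te)    # overall accuracy
            if acc > best[0]:
                best = (acc, K, H)
    return best
```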

Fig. 2. The overall accuracy corresponding to different reduced dimensionalities for MLDE on the University of Pavia dataset

Figure 1 shows the classification performance of MLDE for different values of K on the two hyperspectral datasets. It can be seen from Fig. 1 that the overall accuracy increases with K when K is relatively small, while it declines with increasing K when K is relatively large. It is also noticeable that the overall accuracy becomes stable and less affected by K when the reduced dimensionality is high. From Fig. 1, the highest overall accuracies are 94.28% and 93.30% on the University of Pavia dataset and the Salinas dataset, obtained with K = 12 and K = 22, respectively.

Thus, K is fixed at 12 and 22 for the two datasets according to Fig. 1. Next, a good value of the reduced dimensionality is searched within the above range, in the same way as for the other algorithms, e.g., LFDA, SGDA, and SLGDA.

Fig. 3. The overall accuracy corresponding to different reduced dimensionalities for MLDE on the Salinas dataset

Fig. 4. The computational time of different methods on the two hyperspectral datasets: (a) the University of Pavia dataset, (b) the Salinas dataset

Figure 2 illustrates the overall accuracy corresponding to the reduced dimensionality H for all the compared algorithms on the University of Pavia dataset. The performance is poor when the reduced dimensionality is low, and it increases and stabilizes as the reduced dimensionality grows. From Fig. 2(a), PCA, LDA, LGDA, SLGDA, and GDA-SS clearly do not achieve better classification performance than MLDE. Although the curves of LPP, LFDA, SGDA, and MLDE cross each other, the highest point, 94.28%, lies on the MLDE curve in Fig. 2(b). Therefore, setting the reduced dimensionality to 27 can be considered a good choice.

Figure 3 also illustrates the overall accuracy corresponding to the reduced dimensionality H for all the compared algorithms, this time on the Salinas dataset. The performance is poor when the reduced dimensionality is low, and it increases and stabilizes as the reduced dimensionality grows. From Fig. 3(b), LPP, LDE, LFDA, and SGDA clearly do not achieve better classification performance than MLDE. Except for the curve of GDA-SS, which intersects the MLDE curve at a few points, no other method exceeds MLDE in Fig. 3(a), and the highest overall accuracy, 93.12%, is found on the MLDE curve. Therefore, setting the reduced dimensionality to 70 can be considered a good choice.

From Fig. 4(a), the computational time of MLDE is 5.276 s, ranking second with only a 0.251 s gap to the fastest method. Because the computational time of SLGDA (1564.8 s) is so large that it would distort the plot, it is not shown in Fig. 4(a). From Fig. 4(b), the computational time of MLDE is 12.879 s, ranking third.

4.3 Experimental Results

Based on our experiments, for the University of Pavia dataset, K is set to 12 and the reduced dimensionality to 27; for the Salinas dataset, K is set to 22 and the reduced dimensionality to 70.

The each class’s accuracy, overall accuracy (OA), average accuracy (AA) and kappa coefficient of two hyperspectral datasets are listed in Tables 3 and 4.

From Table 3, MLDE achieves the best classification performance on class 3, class 7, and class 8. Its OA, AA, and \(\kappa \) are all better than those of the compared methods. In detail, compared with the other methods, the OA of MLDE is higher by 0.44% to 10.89%, the AA by 1% to 17.08%, and the \(\kappa \) by 0.59% to 15.27%. In particular, the accuracy on class 7 is 83.83% while the other methods are mostly below 80%, and the accuracy on class 8 is 91.53% while the other methods are mostly below 90%. Meanwhile, when other methods achieve the best results on a certain class, the results of MLDE are not inferior, for instance, on classes 1, 2, 5, and 9.

Table 3. Classification accuracy (%) for the University of Pavia dataset
Table 4. Classification accuracy (%) for the Salinas dataset

From Table 4, although MLDE only achieves the best classification performance on class 16, its performance on the other classes is also competitive; for example, the classification accuracies on class 1 and class 12 are also good. Moreover, the OA of MLDE is better than that of the compared methods; in detail, it is higher by 3.64% to 7.21%.

Fig. 5. Classification maps of different methods for the University of Pavia dataset: (a) legend; (b) ground truth; (c) PCA: 84.01%; (d) LDA: 84.87%; (e) LPP: 83.39%; (f) LDE: 91.86%; (g) LFDA: 93.59%; (h) LGDA: 89.35%; (i) SGDA: 93.84%; (j) SLGDA: 86.27%; (k) GDA-SS: 90.75%; (l) MLDE: 94.28%

Fig. 6. Classification maps of different methods for the Salinas dataset: (a) ground truth; (b) PCA: 84.93%; (c) LDA: 88.34%; (d) LPP: 87.38%; (e) LDE: 87.15%; (f) LFDA: 87.49%; (g) LGDA: 86.63%; (h) SGDA: 88.21%; (i) SLGDA: 87.86%; (j) GDA-SS: 89.50%; (k) MLDE: 92.14%

Figure 5 illustrates the classification maps produced by these methods on the University of Pavia dataset. In Fig. 5, the number of misclassified points in class 3 (Gravel) and class 8 (Self-Blocking Bricks) for MLDE is significantly smaller than for the other methods, which further confirms the results in Table 3.

Figure 6 illustrates the classification maps produced by these methods on the Salinas dataset. In Fig. 6, the number of misclassified points in class 16 (Vinyard-vertical-trellis) for MLDE is significantly smaller than for the other methods.

5 Conclusion

In this paper, we proposed an MLDE algorithm for HSI that constructs neighborhood graphs in a new spectral feature space instead of the original space. We use variance to characterize the similarity of pixels within the same class and covariance to characterize the separation between pixels of different classes. The combination of variance and covariance brings pixels of the same class closer together and increases the separation between pixels of different classes, which enhances the classification performance of HSI. Representing the data by variance and covariance attenuates the effects of noise, so the method handles noise in HSI better. Considering that covariance is symmetric positive definite and lies on a Riemannian manifold, the MLDE algorithm uses the Log-Euclidean metric to capture the similarity between spectral vectors, which provides a more accurate similarity evaluation than the Euclidean distance and better expresses the characteristics of the spectral information. The experimental results on two hyperspectral datasets demonstrate the effectiveness of the proposed MLDE method.