Keywords

1 Introduction

Palmprint recognition is regarded as a potential biometric recognition technology. It has the advantages of low capture cost and high recognition accuracy, and can be widely applied to employees’ attendance system, building security control, automatic teller machines and ID card [1,2,3,4].

Palmprint recognition mainly includes two key steps, i.e., feature extraction and classifier construction [4]. Generally, feature extraction algorithms have great influence to the recognition accuracy. Due to the characteristics of highly descriptive, low computational cost and well separability, the subspace-based feature extraction methods have been widely researched for palmprint recognition. The main idea of subspace recognition methods is to put the high dimensional matrix projection into a low dimensional space, thereby reducing the processing dimension and obtaining the recognition features. The subspace algorithms maintain the specific characteristic of the palmprint with lower dimensional vectors or matrices. Currently, the main subspace projection methods include principal component analysis (PCA), linear discriminant analysis (LDA), locality preserving projection (LPP) [5] and discriminant locality preserving projections (DLPP) [6], etc. Although PCA and LDA have been widely used in image processing field, they have some obvious disadvantages for recognition. PCA and LDA consider global statistical properties based on the training data, while ignore the local nature of the sample data. For palmprint recognition, as the original sample data form is matrix, LDA and DLPP need to transform image matrix into a long vector form, which breaks the continuity properties of the image data in a certain degree. Also, this increases the time complexity due to the high dimension vector, and generates small sample problems. Therefore, some scholars propose some typical two-dimensional subspace methods, such as two-dimensional principal component analysis (2DPCA) [7], two-dimensional linear discriminant analysis (2DLDA) [8] two-dimensional locality preserving projections (2DLPP) [9] and two-dimensional discriminant locality preserving projections (2DDLPP) based on the comprehensive of sample class information.

However, at the point of the extracted features in a low-dimensional subspace, the greatest disadvantage of all methods mentioned above is that the learned projection axes are linear combination of all the original features. So, it is difficult to physically interpret the extracted features. In order to solve this problem, Zuo et al. [10] proposed sparse principle components analysis (SPCA) that uses the least angle regression [11, 12] and the Elastic Net [13]. Zhao et al. [14] proposed a spectral bounds framework for sparse subspace, and they also proposed sparse PCA and sparse LDA (SLDA). The sparse subspace algorithm has been widely studied. Though SPCA and SLDA can directly operate on the high dimensional vectors, they have two limitations. One limitation is time-consuming when the dimension of the features is very high. Another one is some manifold structural information embedded in the two-dimensional images may be lost. Recently, Lai et al. [15] introduced the idea of sparse into 2DDLPP and proposed a novel and effective image feature extraction method named sparse two-dimensional local discriminant projections (S2DLDP). The S2DLDP method shows well recognition performance for face recognition. This paper studies the application of S2DLDP methods in palmprint recognition.

The rest of the paper is organized as follows: Sect. 2 describes the algorithm theory of S2DLDP. Section 3 presents the experimental results and gives some analysis of them. Section 4 offers our conclusions.

2 Palmprint Feature Extraction Based S2DLDP

Research results have proven that discriminative features play an important role in recognition and classification problems, and sparse-based feature extraction method can achieve better recognition performance [16]. In essence, the S2DLDP algorithm is 2DLDP form of sparse representation. This algorithm uses a direct sparse regression approach instead of the palm image matrix. It not only has high computational efficiency, but also can get more intuitive sparse projection matrix.

2.1 Two-Dimensional Discriminant Locality Preserving Projections (2DDLPP)

The main idea of 2DDLPP is projecting \( m \times n \) image matrix \( A_{i} \) to the \( w \) by a linear transformation \( y_{i} = A_{i} w \), and to get an m-dimensional column vector \( y_{i} \), which is called the projection feature vector of image \( A_{i} \). Given the projection vector \( P_{\varPhi } \), sample array \( B_{1} ,B_{2} , \ldots ,B_{M} \) can be mapped to a set of d dimensional Euclidean space points \( Z_{1} ,Z_{2} , \ldots ,Z_{M} \), where \( Z_{i}^{T} = P_{\varPhi }^{T} B_{i} ,i = 1, \ldots ,M \).

To resolve optimal projection vector \( P_{\varPhi } \), we can turn 2DDLPP into the minimization of following function:

$$ \sum\limits_{i,j = 1}^{M} {\left\| {Z_{i} - Z_{j} } \right\|^{2} } W(i,j) $$
(1)

However, the issue can be transformed into a generalized eigenvalue problem:

$$ P_{\varPhi }^{T} S_{L} P_{\varPhi } = \lambda P_{\varPhi }^{T} S_{L} P_{\varPhi } \Rightarrow S_{L} P_{\varPhi } = \lambda S_{D} P_{\varPhi } $$
(2)

\( S_{D} \), \( S_{W} \) and \( S_{L} \) are respectively defined as:

$$ S_{D} = \sum\limits_{i = 1}^{M} {B_{i} B_{i}^{T} } D_{ii} ,S_{w} = \sum\limits_{i,j = 1}^{M} {B_{i} B_{j}^{T} } D_{ij} ,S_{L} = S_{D} - S_{W} $$

\( S_{L} \) is the 2D Laplacian matrix. The larger value of \( D_{ii} \), the more important of the position \( B_{i} \), and the position of the projection results \( Z_{i} \) is more important. The optimal feature vectors \( P_{{\varPhi_{1} }} , \ldots ,P_{\varPhi d} \) should be the eigenvectors corresponding to a group of the smallest eigenvalues \( \lambda_{1} ,\lambda_{2} , \ldots ,\lambda_{d} \) from the characteristic Eq. (2), and then we can get all features matrix of the sample to complete the feature extraction.

$$ Z_{i}^{T} = \left[ {Z_{i}^{(1)} , \ldots ,Z_{i}^{(d)} } \right]^{T} = P^{T} B_{i} ,i = 1, \ldots ,M $$
(3)

2DDLPP avoids the singular value problem exists in DLPP to obtain the final matrix. Meanwhile, it possesses the characteristics of simple calculation, simple application and keeps the characteristics of nonlinear local retention from DLPP.

2.2 Sparse Two-Dimensional Local Discriminant Projections (S2DLDP)

Optimal projection from local feature extraction method, basing on image matrix, is often obtained by solving the eigenvalue equation. According to supervised learning representative model and unsupervised learning representative model, Lai [15] summarizes 2D projection learning framework as:

$$ X^{T} \left( {L_{{^{b} }} \otimes I_{n1} } \right)X\varphi = \lambda X^{T} \left( {L_{w} \otimes I_{n1} } \right)X\varphi $$
(4)

\( L_{{^{b} }} \) and \( L_{w} \) represent image diagonal weighting matrices, and \( I_{n1} \) is an identity matrix of order n1. The operator \( \otimes \) is kronecker product of the matrices. Note that there are two obvious limitations in 2D projection learning framework. One is the high computational complexity due to the calculations of \( X^{T} \left( {L_{{^{b} }} \otimes I_{n1} } \right)X\varphi \) and \( X^{T} \left( {L_{w} \otimes I_{n1} } \right)X\varphi \). Another is that learned projection axes are the linear combination of all the original features, which is difficult to semantically explain the extracted features. Therefore, the idea of sparsity is introduced.

To solve sparse problems, so far, researchers have put forward a lot of regression method for fitting data, which is representative of the Ridge regression, Lasso regression and Elastic Net regression. In this paper, we use Elastic Net regression to resolve the above equation. The two-dimensional representation of Elastic Net regression method is given:

$$ \varphi = \mathop {\arg \hbox{min} }\limits_{\varphi } \left( {\sum\limits_{i = 1}^{m} {\sum\limits_{h = 1}^{{n_{1} }} {(X_{i} (h,:) \times \varphi - y_{i} )^{2} + a\sum\limits_{j = 1}^{{n_{2} }} {\mathop {\overline{\varphi } }\nolimits_{j}^{2} } + \beta \sum\limits_{j = 1}^{{n_{2} }} {\left| {\overline{\varphi }_{j} } \right|} } } } \right) $$
(5)

Elastic net regression is widely used for feature extraction, which compounds Ridge regression and Lasso regression. It also introduces the \( l_{1} \) norm and \( l_{2} \) norm, and overcomes the limitations of Lasso regression. The selected features of elastic net regression, unaffected by sample restrictions on the number, effectively choose the characteristics of the groups [17,18,19]. In this paper, the S2DLDP algorithm is implemented in this way. After the algorithm introduces sparse limit, Lai [15] draws the following algorithm model framework:

$$ \left\{ {\begin{array}{*{20}l} {X^{T} \left( {L_{{^{b} }} \otimes I_{n1} } \right)X\varphi = \lambda X^{T} \left( {L_{w} \otimes I_{n1} } \right)X\varphi } \hfill \\ {\begin{array}{*{20}c} {subject} & {to} & {Card\left( \varphi \right)} \\ \end{array} \le K} \hfill \\ \end{array} } \right. $$
(6)

Among them, \( Card\left( \varphi \right) \) is the number of non-zero elements contained in vectors \( \varphi \), \( K \) is a positive integer not greater than the dimension of the image matrix.

3 The Experimental Results and Analysis

To test the application efficiency of S2DLDP in palmprint recognition, we perform the algorithm in PolyU palmprint database and give an exactly comparison with several typical subspace methods. PolyU palmprint database is the largest and most widely used palmprint database with the original palmprint image size of 384 * 284 pixels. Our experiment is tested on a subset of the PolyU database. The subset consists of 1000 images form 100 palms. We segment the region of interest (ROI) in these images, and then the image size is 128 * 128 pixels. In the experiment, we select the top four images of each person as the training image, and the remained six images as test set. In recognition phase, the Euclidean distance is employed to calculate the similarity between palmprint features (Fig. 1).

Fig. 1.
figure 1

Typical palmprint ROI images in PolyU database

3.1 The Non-zero Number Test Result

From Eq. 6 we can get that non-zero \( Card(\varphi ) \) in vector has the function of sparse restriction in image. The feature corresponding to the non-zero element is important, and influences the objective function. Based on this conclusion, to some extent, we discovery the features play a key role in the palmprint recognition. So it is not necessary to use the whole features, but only the key features. This has certain guidance in terms of the acquired characteristics. Therefore, studying the number of non-zero elements \( Card(\varphi ) \) under circumstances may achieve better recognition results.

At first, the eigen-palms learned by S2DLDP with \( Card(\varphi ) \) = 1:4:20 are shown in Fig. 2, where the eigen-palms are much clearer. This indicates that non-zero elements automatically form the palm profiles in the metrics of the S2DLDP algorithm. Thus S2DLDP can learn a set of semantic palm contours. This method not only gives a meaningful and intuitive explanation on the learned subspace, but also shows us the more discriminative feature subspace for palmprint recognition.

Fig. 2.
figure 2

Eigen-palm using various \( Card(\varphi ) \)

\( Card(\varphi ) \) = 4:8:40 are taken to experiment, and we calculate the recognition rate under various \( Card(\varphi ) \). Then, we draw recognition rate in different experiments plotted in the same figure for comparison in Fig. 3, where the horizontal axes \( d \), varying from 1 to 50 with one step, denotes the various dimension of features.

Fig. 3.
figure 3

Comparative palmprint recognition rate with various \( Card(\varphi ) \)

In Fig. 3, recognition result gradually deteriorates with the increase of value \( Card(\varphi ) \). To achieve better recognition results, the number of non-zero should take a smaller value, and we set the value with 4. Note that when the number of non-zero elements is more, the effectiveness to enhance identify is not obvious. This experiment indicates that the sparse-palms with a large \( Card(\varphi ) \) are not necessary to obtain higher recognition rates. The above two figures show us an insightful understanding of the appearance-based 2D palm image representation and recognition.

3.2 The Image Size Test Result

After repeated experiments, the S2DLDP algorithm achieves optimal performance with \( Card(\varphi ) = 4 \) and nearest neighbor number k = 1. Therefore, under the above conditions, we evaluate the effectiveness of recognition rate using various sub-image sizes of \( 32\, \times \,32 \), \( 64\, \times \,64 \), \( 128\, \times \,128 \) pixels, respectively. Then, some different eigen-palms are given in Fig. 4 and comparison results are illustrated in Fig. 5, in which we take d = 1: 1: 100 as horizontal axes.

Fig. 4.
figure 4

Comparative eigen-palms based on different sub image size

Fig. 5.
figure 5

Comparative recognition rate result based on different sub-image size

In Fig. 5, the recognition rate under different sub image size shows a trend of growth with the increase of the dimension size image characteristics, and gradually stabilizes. Meanwhile, curve clearly shows that S2DLDP algorithm achieves better recognition rate in dealing with a larger image, and the recognition rate can reach 99.5%. What is more, when image size is \( 32\, \times \,32 \) pixel, recognition rate has no obvious descend, as the change of dimension.

3.3 The Comparative Test Result with LDA, LPP and DLPP

To further illustrate the effectiveness of the algorithm which is used to identify, experiment changes the number of training samples, using the top three and five palmprint images of each person as the training image, the remained images as the test images. For S2DLDP algorithm, the parameters do not change. In order to save computation time, all of the images are resized to 32 * 32 pixels. The comparative test results are given in Figs. 6 and 7 in which we take d = 1: 1: 35 as horizontal axes. The best recognition performance under corresponding feature dimension is shown in Table 1.

Fig. 6.
figure 6

The comparison of recognition rate for 3 training samples

Fig. 7.
figure 7

The comparison result of recognition rate for 5 training samples

Table 1. The optimal recognition rate results with feature dimension of the four methods on the PolyU palmprint database

In Figs. 6 and 7, the recognition rates of S2DLDP are significantly higher than other methods. S2DLDP not only gives an intuitively semantic interpretation of the learned subspace, but also shows that palm profile subspace is more discriminative for palmprint recognition.

4 Conclusions

In this paper, the S2DLDP algorithm is introduced into palmprint recognition, and achieves the optimal recognition rate of 99.5%. The S2DLDP algorithm fully integrates the local separability, sample information and inherent attribute of two-dimensional images. To a certain extent, it reduces the complexity of calculation and improves the recognition rate. This algorithm provides intuitive, semantics, and interpretable palmprint feature space, and the palmprint features extracted have strong distinction. The experimental results show that the sparse projection in the process of feature extraction and dimensionality reduction realizes the function of feature selection, also proves its validity application for palmprint recognition.