Keywords

1 Introduction

Texture is among the most important attributes in computer vision and has been the focus of intensive research throughout the years. In a concise term, we can define texture as an arrangement of sub-patterns, which can be pixels, regions or other visual attributes [1]. Obviously, such definition is quite restrict and does not encompass a great variety of images (for instance, smoke, mammograms, fire, water etc.), which present a persistent stochastic pattern with a cloud-like appearance [2].

Even though texture lacks a formal definition, it is a feature easily understood by the human visual system. Such importance has motivated the development of many techniques for the analysis and recognition of texture patterns, making this a field of intense research [3]. Among the many techniques available, there are those that describe the image texture using second-order statistics [4, 5], spectral analysis (e.g., Fourier and Gabor filters) [6,7,8], local binary patterns [9], gravitational systems [10] and agents walking over the texture pattern [1].

Medical image analysis is a field of intense research, with many approaches being developed over the years. For instance, [11] proposed LBP variants as texture descriptors for medical image analysis, which were evaluated in different medical datasets, such as cell phenotype image classification, neonatal facial images classification of pain states and detection of abnormal smear cells. In [12], a fuzzy clustering algorithm was proposed for brain tumor segmentation. The authors stated that the fuzzy clustering enables many cases of uncertainty to be considered during the segmentation process. Breast cancer risk assessment has been the focus of many studies on texture analysis. In [13], two types of texture features are proposed to assess breast cancer risk: textons based on local pixel intensities and features based on oriented tissue structures. In [14], background intensity independent texture features were proposed for mammogram classification. Another topic of intense research is the identification and segmentation of melanocytic skin lesions. Machine learning techniques were used in [15] to select the parameters of a classification framework of melanocytic lesions. The paper [16] presented an approach using a feature learning scheme and normalized graph cuts for skin lesion image segmentation.

This paper proposes to apply a recent and very discriminative texture analysis method to a relevant medical problem, which consists of classifying pap-smear cells to discover pre-cancerous or cancerous stages in the cervix. Section 2 briefly describes the randomized neural network and how to use it to obtain an image signature from its neuron weights. Section 3 presents the pap-smear database, the other texture analysis methods used for comparison and the classification procedure. Section 4 discusses the obtained results, and, finally, Sect. 5 presents some remarks about this work.

2 Randomized Neural Network and its Texture Signature

A randomized neural network [17,18,19,20] is a recent proposal of neural network that has only two neuron layers and a very fast training procedure. In the hidden layer, the weights of the neurons are randomly determined according to a uniform or Gaussian distribution. These weights can be arranged in a matrix

$$\begin{aligned} W=\left( \begin{array}{ccccc} w_{10} &{} w_{11} &{} \ldots &{} w_{1p} \\ w_{20} &{} w_{21} &{} \ldots &{} w_{2p} \\ \vdots &{} \vdots &{} \vdots &{} \vdots \\ w_{Q0} &{} w_{Q1} &{} \ldots &{} w_{Qp} \\ \end{array} \right) , \end{aligned}$$
(1)

where each line represent the weights of a determined hidden neuron q, p is the number of attributes of an input vector \({{\varvec{x}}}\), and Q is the total of hidden neurons.

Let \(X=\left[ {{{\varvec{x}}}_\mathbf{1}}, {{{\varvec{x}}}_\mathbf{2}}, \dots , {{{\varvec{x}}}_{{\varvec{N}}}}\right] \) and \(D=\left[ {{{\varvec{d}}}_\mathbf{1}}, {{{\varvec{d}}}_\mathbf{2}}, \dots , {{{\varvec{d}}}_{{\varvec{N}}}}\right] \) be matrices representing the input vectors \({{{\varvec{x}}}_{{\varvec{i}}}}\) and their respective labels \({{{\varvec{d}}}_{{\varvec{i}}}}\) (N is the number of feature vectors). Then, after inserting a new first line composed of \(-1\) into X (for bias), we can provide the output of the hidden neurons according to the equation \(Z=\phi (WX)\), where \(\phi (.)\) is a transfer function (in general, logistic or hyperbolic function).

Next, we create a matrix \(Z=\left[ {{{\varvec{z}}}_\mathbf{1}}, {{{\varvec{z}}}_\mathbf{2}}, \dots , {{{\varvec{z}}}_{{\varvec{N}}}}\right] \) representing the output of the hidden neurons for each input feature vector \({{{\varvec{x}}}_{{\varvec{i}}}}\). Again, we insert a new first line composed of \(-1\) into Z (for bias) and the objective is to solve \(D=MZ\), where M represents the weights of the output neurons. The matrix M can be easily obtained after some simple matrix operations, according to the following equation

$$\begin{aligned} M=DZ^{T}(ZZ^{T})^{-1}. \end{aligned}$$
(2)

2.1 Randomized Neural Network Texture Signature

The random neural network texture signature is proposed in the paper [21] and consists of using image pixels as input and label data in order to train a randomized neural network. Next, the weights of the output neuron layer of this trained network are used as the image signature. For this purpose, the image is divided into overlapping windows \(K \times K\) (\(K=\{3,5,7\}\)). For each window, its border pixels are used as input feature vector \({{{\varvec{x}}}_{{\varvec{i}}}}\) and its central pixel is used as the respective scalar label \(d_i\). Thus, we have 8-, 16- and 24-dimensional feature vectors \({{{\varvec{x}}}_{{\varvec{i}}}}\) for the aforementioned window sizes, respectively.

The next step is to determine the values of the matrix W. For this, the paper [21] adopted the Linear Congruent Generator (LCG) [22, 23] to produce pseudorandom values in a uniform distribution. The parameter values for the “seed” and other adjustment parameters are based on the value Q (number of hidden neurons) and p (dimensionality of the input feature vector). All the values of W and each line of the matrix X are normalized to have zero mean and unit variance. Finally, the logistic transfer function is used in all the neurons.

Once these fundamental procedures are determined, it is possible to construct two signatures based on Eq. 2, which becomes a vector \({{\varvec{f}}}=DZ^{T}(ZZ^{T})^{-1}\) because D is also a vector. The first signature considers only one value Q for multiples values K, as follows

$$\begin{aligned} \varvec{\varTheta }(Q)_{K_1, K_2, \ldots , K_n}=\left[ {{\varvec{f}}}_{K_1}, {{\varvec{f}}}_{K_2}, \ldots , {{\varvec{f}}}_{K_n}\right] . \end{aligned}$$
(3)

The second signature, which consists of the concatenation of the previous signature for different values Q, is determined according to the following equation

$$\begin{aligned} \varvec{\varPsi }_{Q_1,\ldots ,Q_m}=\left[ \varvec{\varTheta }(Q_1)_{K_1,\ldots , K_n}, \ldots , \varvec{\varTheta }(Q_m)_{K_1, \ldots , K_n} \right] . \end{aligned}$$
(4)

A detailed description of the randomized neural network based signature can be found in the paper [21].

3 Experiments

3.1 Pap-smear database

The pap-smear database [24] is a collection of 917 cell images extracted from cervices. The images were obtained at the Herlev University Hospital and were classified into 7 groups, which are: normal superficial squamous epithelial (74 cells); normal intermediate squamous epithelial (70 cells); normal columnar epithelial (98 cells); mild squamous non-keratinizing dysplasia (182 cells); abnormal moderate squamous non-keratinizing dysplasia (146 cells); abnormal severe squamous non-keratinizing dysplasia (197 cells); and abnormal squamous cell carcinoma in situ intermediate (150 cells). These cell images can also be classified into two groups: normal cells (242 images) and abnormal cells (675 images). In our experiments, all the images were converted into grayscale. Moreover, we addressed only the 2-class problem, since the 7-class problem is still a challenge for texture analysis methods. Figure 1 shows one sample of each class.

Fig. 1.
figure 1

Cells of the pap-smear database: (a) superficial squamous epithelial, (b) intermediate squamous epithelial, (c) columnar epithelial, (d) mild squamous non-keratinizing dysplasia, (e) moderate squamous non-keratinizing dysplasia, (f) severe squamous non-keratinizing dysplasia, (g) squamous cell carcinoma in situ intermediate [24].

3.2 Classification Procedure

In the randomized neural network texture signature, we used the same parameter values adopted in the paper [21], that is, \(Q=\{19,39\}\), \(K=\{3,5,7\}\) for the second signature (Eq. 4) in order to establish a fair comparison with the other texture methods, in which we used parameter values according to either their respective papers or the common use. At this point, it is important to mention that, even though the paper [21] proposes a strategy to make the method more robust to rotation, we did not use it for two reasons: first, there is no orientation in the pap-smear cells; second, the method is faster without this strategy.

In order to assess the performance of the method, we compare it to other classical and recent texture analysis. They are: Co-occurrence matrices [5], Wavelets descriptors [25, 26], Tourist Walk [27], Discrete Cosine Transform (DCT) [28], Lacunarity 3D [29], Local binary patterns (LBP) [9], Gray Level Difference Matrix (GLDM) [30, 31] and Complex Network Texture Descriptor (CNTD) [32].

For classification, we used the Linear Discriminant Analysis [33], which is a classical statistical classifier that creates hyperplanes among the groups based on the their centroid vectors and the covariance matrix of all the samples. As strategy validation, we adopted the leave-one-out cross-validation, which uses one sample for testing the remainder for training. This process is repeated N times (N is the number of samples), each time with a different sample for testing. The performance measure is the average of the N accuracies.

We also obtained the AUC (Area Under the ROC Curve) [34] to compare it to the highest AUC values obtained in two recent papers [11, 35], which compared several LBP variants applied to the pap-smear database. To assess the randomized neural network signature, we used the same procedures present in these two works: a Linear Support Vector Machine (SVM) as classifier and the 5-fold cross-validation. The paper [11] does not mention the parameter values used, but the paper [35] uses the default parameter values of the LIBSVM [36], which is a public library for SVM. Thus, for a fair comparison, we also used the default parameter values of this library. Moreover, because 5-fold-cross-validation is not a deterministic strategy, we performed 101 validation runs and adopted the median AUC value as the performance measure of the randomized neural network signature.

4 Results and Discussion

Table 1 shows the comparison of the randomized neural network signature with other grayscale texture analysis methods. As one can see, the neural network approach surpasses all the compared methods in terms of accuracy. One disadvantage of the method is its excessively large number of descriptors. However, it is important to notice that its accuracy is \(1.20\%\) superior to the second best method (wavelet descriptors). This percentage represents 11 more images correctly classified by the method, thus corroborating its efficiency and ability to discriminate pap-smear samples, a challenging database in which any improvement is desirable.

Table 1. Comparison of different texture analysis methods applied to the pap-smear database.

Table 2 shows the median AUC obtained by the neural network signature and all the compared approaches, as well as the highest AUC values present in two recent papers. As it is possible to notice, the randomized neural network signature obtained the second best result among all the methods. Although this performance is already impressive, it is important to emphasize that ENS and MAG1 provide the highest AUC values of the papers [11] and [35], respectively. Thus, considering that the paper [11] applied nine LBP variants to the pap-smear database, and the paper [35] performed more than 50 tests on this same dataset, our obtained result acquires an even higher perspective and demonstrates that the randomized neural network descriptors are very discriminative in pap-smear cell images.

Table 2. Comparison of AUC (Area Under the ROC Curve) of different texture analysis methods applied to the pap-smear database

5 Conclusion

This paper presented the application of a very discriminative texture analysis method to the highly relevant medical problem of classifying pap-smear cells. The randomized neural network texture signature obtained a high performance in this problem, surpassing all the compared methods (LDA experiment) and presenting the second best AUC value, which is comparable to the highest results of two recent papers that address the same problem. Thus, it is possible to affirm that the randomized neural network signature is suitable for the pap-smear problem, and, therefore, adds a new tool to the computer vision research focused on the Papanicolaou test.