# Efficient and accurate image alignment using TSK-type neuro-fuzzy network with data-mining-based evolutionary learning algorithm

- 2.7k Downloads

## Abstract

Image alignment is considered a key problem in visual inspection applications. The main concerns for such tasks are fast image alignment with subpixel accuracy. About this, neural network-based approaches are very popular in visual inspection because of their high accuracy and efficiency of aligning images. However, such methods are difficult to identify the structure and parameters of neural network. In this study, a Takagi-Sugeno-Kang-type neuro-fuzzy network (NFN) with data-mining-based evolutionary learning algorithm (DMELA) is proposed. Compared with traditional learning algorithms, DMELA combines the self-organization algorithm (SOA), data-mining selection method (DMSM), and regularized least square (RLS) method to not only determine a suitable number of fuzzy rules, but also automatically tune the parameters of NFN. Experimental results are shown to demonstrate superior performance of the DMELA constructed image alignment system over other typical learning algorithms and existing alignment systems. Such system is useful to develop accurate and efficient image alignment systems.

## Keywords

subpixel accuracy TSK-type neuro-fuzzy network data-mining based evolutionary learning algorithm regularized least square## Abbreviations

- BBs
building blocks

- DCT
discrete cosine transform

- DMELA
Data-mining-based evolutionary learning algorithm

- DMSM
data-mining selection method

- ERS
elite-based reproduction strategy

- FNN
feedforward neural network

- GFSA
global feature selection approach

- ISOMAP
isometric mapping

- MGCSE
multi-groups cooperation based symbiotic evolution

- MGSE
groups symbiotic evolution

- NFN
neuro-fuzzy network

- RLS
regularized least square

- SIFT
scale invariant feature transform

- SNR
signal-to-noise ratio

- SOA
self-organization algorithm

- TNFN
TSK-type neuro-fuzzy network

- TSE
traditional symbiotic evolution

- TSK
Takagi-Sugeno-Kang

- WGOH
weighted gradient orientation histograms.

## 1. Introduction

Accurate and efficient image alignment is widely applied to many industrial applications, such as automatic visual inspection, factory automation, and robotic machine vision. Among them, visual inspection is usually required at finding a geometric transformation to align images. More specifically, the geometric transformation is commonly used as an affine transformation, which is consists of scaling, rotation, and translation, for aligning images. In other words, an affine transformation is considered of great importance in designing image alignment systems. Thus, it raises a challenge to provide an efficient affine transformation. To this end, neural network-based methods have widespread to address this challenge because such methods often feed global features of inspected images into a trained neural network to estimate affine transformation parameters [1, 2, 3, 4]. In other words, neural networks are helpful for designing image alignment systems. Thus, there is a need to develop a neural network-based image alignment system to demonstrate high performance [5]. To this end, the aim of this study is to design a learning algorithm to train a neural network that can estimate affine parameters precisely.

Regarding the aim, this study adopts weighted gradient orientation histograms (WGOH) [6] as an image descriptor, which extracts the features from inspected images, to be the input of the neural network. Such representation technique has been proven a good descriptor in several literatures [7, 8]. After that, we propose a novel learning algorithm to improve the robustness of neural networks. To be more specific, the proposed learning algorithm combines the self-organization algorithm (SOA), data-mining selection method (DMSM), and regularized least square (RLS) method to automatically identify the structure and parameters of the network. Once our learning method is applied, the structure of the network will be variable instead of a fixed one. Moreover, automatic tuning the parameters of the network can get more dynamic search space than a heuristic way. In other words, the structure and parameters of neural networks will become more robustness. The major contribution of this study is that the proposed learning method is helpful to develop efficient image alignment systems by automatically tuning the systems' structure and parameters.

The rest of this article is organized as follows. Section 2 gives a review of related studies. In Section 3, the proposed methodology for automatic aligning industrial images is introduced. The experimental results are presented in Section 4. In Section 5, a conceptual framework for developing image alignment systems is described. The conclusion is attained in the last section.

## 2. Related studies

The problem of precisely aligning images has been well studied in several fields. For a broad introduction to image alignment methods, the related literature has been reviewed on several occasions [9, 10]. To brief survey, prior aligning methods can be classified as feature- and area-based methods [9, 11]. Zitova and Flusser [10] pointed out that area-based methods are preferably applied to the images which have not many details. Moreover, Amintoosi et al. [11] indicated that as the signal-to-noise ratio (SNR) is low, area-based methods produce better results than feature-based methods. In this study, we assume that our proposed image alignment system is developed for industrial inspection tasks such that the captured images usually have less detail. Thus, area-based methods that adopt global descriptors are recommended in this article.

Recently, neural network-based image alignment utilizing global features has been a relatively new research subject. Such methods demonstrated high alignment speed since it only needs to feed the extracted feature vectors into the trained neural network to estimate the transformation parameters. For example, Ethanany et al. [1] presented a feedforward neural network (FNN) to align images through 144 discrete cosine transform (DCT) coefficients as the feature vectors. Their study showed that the FNN demonstrated high tolerance in deformed and noisy images. Moreover, based on FNN research, Wu and Xie [2] utilized low-order Zernike moments to replace DCT to further improve the performance of Ethanany's study, which adopted larger dimension of feature vector to represent an image sufficiently for the un-orthogonality of DCT-based space. As shown in their results, the proposed method can reduce the dimension of feature vector but their alignment results are not satisfied. More recently, Xu and Guo [3] adopted isometric mapping (ISOMAP) to reduce the dimension of feature vector. Their study demonstrated that ISOMAP can drastically reduce the dimension of feature vector to improve the computational efficiency. Nevertheless, the over fitting problem could happen in FNN when a neural network is over learnt for training sets. Thus, the unseen pattern may be hard applied to this over-trained FNN since the network cannot provide the good ability of generalization. Owing to this problem, Xu and Guo [12] used a Bayesian regularization method to improve the capability of generalizing the FNN. They showed some comparative experiments that FNN with regularization indeed performed better than without regularization.

Aforementioned studies indicated that the FNN is helpful to improve the alignment efficiency. However, such methods used steepest descent technique to minimize the error function such that it may reach the local minimal. In addition, it must take a large number of iterations to minimize the error function and several training attempts are needed to provide a robust FNN. In that respect, evolutionary algorithms appear to be better candidates than steepest descent method [13, 14, 15]. Because such learning methods are global and parallel search, they have more chance to converge toward global solution. Therefore, training a neural network utilizing evolutionary algorithms has been an important field.

In this respect, several evolutionary algorithms were proposed [16, 17, 18]. Gomez and Schmidhuber [16] proposed enforced sub-populations using sub-populations of neurons for the fitness evaluation and overall control. The sub-populations that are used to evaluate the solution locally can obtain better performance compared to systems that only use one population for evaluating the solution. Moriarty and Miikkulainen [17] used a symbiotic evolution method to train a neural network. The authors indicated that the symbiotic evolution performed better than traditional genetic algorithms. Recently, Hsu et al. [18] proposed a multi-groups cooperation-based symbiotic evolution (MGCSE) to train a Takagi-Sugeno-Kang (TSK)-type neuro-fuzzy network (TNFN). Their results showed that MGCSE can obtain better performance and convergence than symbiotic evolution. Although MGCSE is a good approach for training a TNFN, it would not be suitable for image alignment tasks. The reason is that the dimension of the input of a neural network is always high and the number of hidden node is not small such that large amount of parameters must be trained. For instance, in the experiments described in this article, the dimension of the input and output of the network is 33 and 4, respectively, and the number of fuzzy rules is 25. Thus, in MGCSE's model, the total number of parameters is 5050 (*r**(2 * *n* + *m**(*n* + 1)), *r* = 25, *n* = 33, *m* = 4). Such a great number would lead the algorithm not only to impossibly converge to optimal solution, but also to estimate bad image alignment results. In addition, MGCSE performed random group combination to construct a network. In spite of such action can sustain diversity, there is no systematic way to identify suitable groups for selecting chromosomes. Thus, it could result in slow rate of convergence.

To this end, this study proposes a TNFN with data-mining-based evolutionary learning algorithm (DMELA) to solve the abovementioned problems. In the first place, DMELA encodes an antecedent part of a TSK-type fuzzy rule into a chromosome and utilizes a RLS to estimate the consequent part of a TSK-type fuzzy rule. Such combination not only reduces the number of parameters that must be trained, but also increases the convergence speed. Later, DMSM is used to explore the association rules that can identify suitable and unsuitable groups for chromosome selection. This action would solve the random group combination problem yielded by MGCSE. Finally, the SOA is utilized to decide suitability of different number of fuzzy rules. Thus, SOA is useful to automatic construct the structure of neuro-fuzzy networks (NFNs). In short, DMELA benefits both structure and parameters learning of a TNFN and it collocates with WGOH descriptor to provide a framework to develop accurate and efficient image alignment systems.

## 3. Methodology

### 3.1. Synthesized training images

where (*x*_{1}, *y*_{1}) indicates the original image coordinate, (*x*_{2}, *y*_{2}) indicates transformed image coordinate, *s* is a scaling factor, (Δ*x*, Δ*y*) is a translation vector, *θ* is a rotation angle, and (*x*_{ c }, *y*_{ c }) is the center of rotation. Thus, the synthesized training images can be generated by applying various combination of translation, rotation, and scaling transformations within a predefined range.

### 3.2. WGOH descriptor

The WGOH has been proven a good descriptor by a global feature selection approach (GFSA), which has been presented in our previous research [7]. Such descriptor was compared with other five global descriptors and results showed that WGOH demonstrated best performance. Therefore, this article adopts WGOH as a descriptor to represent inspected images.

- 1.
For each image, we capture the template window, whose location is at the center of the image, to be a place of extracting features. Within the window, we divide the length and width of the window into four equal parts to form 4 × 4 grids. Each grid is considered a sub-image. Thus, the template window can be split into 4 × 4 sub-images.

- 2.On each pixel of the sub-image (
*I*(*x*,*y*)), the gradient magnitude*m*(*x*,*y*), and orientation*θ*(*x, y*) are computed using pixel difference which the equations can be written as$m\left(x,y\right)=\sqrt{{\left(I\left(x+1,y\right)-I\left(x-1,y\right)\right)}^{2}+{\left(I\left(x,y+1\right)-I\left(x,y-1\right)\right)}^{2}},$(2)$\theta \left(x,y\right)={tan}^{-1}\left(\left(I\left(x,y+1\right)-I\left(x,y-1\right)\right)\u2215\left(I\left(x+1,y\right)-I\left(x-1,y\right)\right)\right).$(3) - 3.
Calculate the 8-bin orientation histograms (each bin cover 45°) within each sub-image which are weighted by the gradient magnitude, and the Gaussian function.

- 4.
Concatenate 8-bin histograms of 16 sub-images into a 128-element feature vector, and normalize it to a unit length. To reduce strong gradient magnitudes, the elements of the feature vector are limited to 0.2, and this vector is normalized again.

### 3.3. Structure of TNFN

In general, three typical types of NFN are the TSK-type, Mamdani-type, and singleton-type. According to [23, 24], the authors have shown that a TNFN can offer better network size and learning accuracy than a Mamdani-type NFN. Thus, for our image alignment task, we only compare the TNFN with the singleton-type NFN in the experimental section to prove that the TNFN outperforms the singleton-type NFN.

*n*and

*j*represent the dimension of the input and the number of the fuzzy rules, respectively.

*n*represents the dimension of the input. It is a five-layer network structure. In the TNFN, the firing strength of a fuzzy rule is calculated by performing the following "AND" operation on the truth values of each variable to its corresponding fuzzy sets by:

where ${u}_{{}^{i}}^{\left(1\right)}={x}_{i}$ and ${u}_{{}^{ij}}^{\left(3\right)}$ are the outputs of first and third layers; *m*_{ ij } and *σ*_{ ij } are the center and the width of the Gaussian membership function of the *j* th term of the *i* th input variable *x*_{ i }, respectively. In this article, the reason of adopting the Gaussian membership function is that it can be a universal approximator of any nonlinear functions on a compact set [23].

where *u*^{(5)} is the output of fifth layer, *w*_{ ij } is the weighting value with *i* th dimension and *j* th rule node, and *M* is the number of a fuzzy rule. Here, the dimension of the output is set to be 4, and they are represented as a scaling factor (*s*), a rotation angle (*θ*), and translation parameters (Δ*x*, Δ*y*), respectively.

### 3.4. Data-mining-based evolutionary learning algorithm

The proposed DMELA aims to improve MGCSE [18]. Unlike MGCSE encoding one fuzzy rule into a chromosome, DMELA only encodes an antecedent part of a fuzzy rule into a chromosome. The consequent part of a fuzzy rule used in DMELA is estimated by an RLS approach. These two operations could not only reduce the number of parameters that must be trained, but also increase the convergence speed. Therefore, details of the coding step and RLS approach are described as follows:

#### (1) Coding step

*m*

_{ ij }and

*σ*

_{ ij }represent a Gaussian membership function with mean and deviation of

*i*th dimension and

*j*th rule node, respectively.

#### (2) RLS approach

Since the coding step only decides an antecedent part of a fuzzy rule, the consequent part is undetermined. In this article, RLS is adopted to estimate the consequent part. For simplicity, we only use two inputs (*x*_{1}, *x*_{2}) and one output (*y*) to represent a two-rule TSK-type neuro-fuzzy system, which is described as follows:

**Rule 1**

**Rule 2**

*A*

_{ ij }and

*B*

_{ ij }are the linguistic parts with respect the input

*i*and

*Rule j*. From Equation 6, the output can be written as:

*u*

_{1}and

*u*

_{2}are the firing strengths of

*Rules 1*and

*2*, respectively, ${\mathit{\xfb}}_{1}={u}_{1}\u2215\left({u}_{1}+{u}_{2}\right)$, and ${\mathit{\xfb}}_{2}={u}_{2}\u2215\left({u}_{1}+{u}_{2}\right)$. Combine Equations 7-9, and we can get the following equation:

*x*

_{1}, and

*x*

_{2}are known values, the only unknown value is the consequent part

*w*

_{ ij }. Suppose a given set of training inputs and desired outputs is ${\left\{x\left(t\right),{y}_{d}\left(t\right)\right\}}_{t=1}^{M}$. Equation 10 can be rewritten as:

*W*. Instead, a least square method is utilized to obtain an approximate solution. Moreover, to get the smooth estimation, the regularization is adopted. To this end, such method is named as RLS approach. Using RLS, the approximation solution is as follows:

where *λ* is a regularization parameter which adjusts the smoothness. Therefore, by getting Equation 14, we complete the estimation the consequent part of fuzzy rules. Such operation can easily be expanded to *n* input, *m* output, and *r* fuzzy rules of a TNFN. To compare with MGCSE, the consequent part used in this article is computed by an RLS approach rather than tuned by an evolutionary procedure. Such action would increase the convergence speed because RLS approach directly calculates the consequent part one time to minimize the errors between real and desire outputs. Nevertheless, evolutionary method tunes the consequent part many times to gradually minimize the errors.

*P*

_{size}denotes that there are

*P*

_{size}groups in a population, and

*M*

_{ k }means that there are

*M*

_{ k }rules used in TNFN construction. Such construction allows variable number of rules in TNFN.

The learning process of DMELA in each group involves seven major operators: initialization, SOA, DMSM, fitness assignment, reproduction strategy, crossover strategy, and mutation strategy. This process stops as the number of generations or the fitness value reaches a predetermined condition. The whole learning process is described below:

#### 3.4.1. Initialization

Before we start to design DMELA, the initial groups of individuals should be generated. The initial groups of DMELA are generated randomly within a fixed range. The following formulations show how to generate the initial chromosomes in each group:

*Chr*

_{ g, c }[

*p*] =

*random*[

*σ*

_{min},

*σ*

_{max}],

*Chr*

_{ g, c }[

*p*]

*= random*[

*m*

_{min},

*m*

_{max}],

where *Chr*_{ g, c } represents *c* th chromosome in the *g* th group, *N*_{C} is the total number of chromosomes in each group, *p* represents the *p* th gene in a *Chr*_{ g, c }, and [*σ*_{min}, *σ*_{max}], [*m*_{min}, *m*_{max}] represent the predefined range to generate the chromosomes.

#### 3.4.2. Self-organization algorithm

*M*

_{ k }rules, into BBs. In addition, in SOA, the minimum and maximum number of rules must be predefined to limit the number of fuzzy rules to a certain bound, i.e., [

*M*

_{min},

*M*

_{max}].

After BBs is defined, we use SOA to determine the suitable selection times of each number of fuzzy rules. The "selection times" indicates how many TNFNs should be produced in one generation. In other words, SOA is used to determine the number of TNFN with *M*_{ k } rules in every generation. After the SOA is carried out, the selection times of the suitable number of fuzzy rules in a TNFN will increase; otherwise, the selection times of the unsuitable ones in a TNFS will decrease. The processing steps of the SOA are described as follows:

**Step 0**. Initialize the probability vectors of the BBs:

**Step 1**. Update the probability vectors of the BBs according to the following equations:

where ${V}_{{M}_{k}}$ is the probability vector in the BBs, λ is a predefined threshold value, *Avg* represents the average fitness value in the whole population, $Best\text{\_}Fitnes{s}_{{M}_{k}}$ represents the best fitness value of TNFN with *M*_{ k } rules, and $fi{t}_{{M}_{k}}$ is the sum of the fitness values of the TNFN with *M*_{ k } rules. In Equation 19, the conditions "$fi{t}_{{M}_{k}}$ ≥ or < *Avg*" would affect the suitability of TNFNs with *M*_{ k } rules to be increased or decreased.

**Step 2**. Determine the selection times of TNFNs with different rules according to the probability vectors of the BBs as follows:

where *Selection_Times* represents the total selection times in each generation and $R{p}_{{M}_{k}}$ represents the selection times of TNFNs with *M*_{ k } rules in one generation.

**Step 3**. In SOA, to prevent suitable selection times from falling into the local optimal solution, we use two different actions to update ${V}_{{M}_{k}}$. Such actions are defined in the following equations:

where *SOATimes* is a predefined value, *Best_Fitness*_{ g } represents the best fitness value of the best combination of chromosomes in the *g* th generation, and *Best_Fitness* represents the best fitness value of the best combination of chromosomes in the current generations. If Equation 27 is satisfied, then it indicates that the suitable selection times may fall into the local optimal solution. At this time, the processing step of SOA should return to Step 0 to initialize the BBs.

#### 3.4.3. The DMSM

After the selection times are determined, DMELA further performs the selection step, which includes the selection of groups and chromosomes. In selection of groups, this article proposes DMSM to determine the suitable groups for chromosomes selection. To prevent the selected groups from falling into the local optimal solution, DMSM uses normal and explore actions to select well-performed groups. The details of the DMSM are discussed below:

**Step 0**. The transactions are built, as in the following equations:

*i*= 1, 2,...,

*M*

_{ k },

*M*

_{ k }=

*M*

_{min},

*M*

_{min+1},...,

*M*

_{max},

*j*= 1, 2,...,

*TransactionNum*, the $Fitnes{s}_{{M}_{k}}$ represents the fitness value of TNFN with

*M*

_{ k }rules,

*ThreadFitnessvalue*is a predefined value,

*TransactionNum*is the total number of transactions,

*Transaction*

_{ j }[

*i*] represents the

*i*th item in the

*j*th transaction, $TF\text{C}RuleSe{t}_{{M}_{k}}\left[i\right]$ represents the

*i*th group in the

*M*

_{ k }groups used for chromosomes selection, and

*Performance Index*=

*g*and

*Performance Index*=

*b*represent the good and bad performances, respectively. Hence, transactions have the form shown in Table 1. As shown in Table 1, the first transaction means that the three-rule TNFN formed by the first, fourth, and eighth groups have "good" performance. In contrast, the second transaction indicates that the four-rule TNFN formed by the second, fourth, seventh, and the tenth groups have "bad" performance.

Transactions in the DMSM

Transaction index | Groups | Performance index |
---|---|---|

1 | 1,4,8 | |

2 | 2,4,7,10 | |

... | ... | ... |

| 1,3,4,6,8,9 | |

**Step 1**. Normal action:

where *i* = 1, 2,..., *M*_{ k }, *M*_{ k } = *M*_{min}, *M*_{min+1},..., *M*_{max}, *Accumulatar* defined in Equation 30 are used to determine which action should be adopted, *GroupIndex*[*i*] represents the selected *i* th group of the *M*_{ k } groups, and *P*_{ size } indicates that there are *P*_{ size } groups in a population in DMELA. If the best fitness value does not improve for a sufficient number of generations (*NormalTimes*), then DMSM selects groups according to explore action.

**Step 2**. Explore action:

If *Accumulator* exceeds the *NormalTimes*, then the current action switches to the explore action. The objective of this action is to adopt the notion of DMSM to explore suitable groups in transactions. The major operations of DMSM include FP-growth performing, association rules generating, and suitable groups selecting. The details of these three operations are presented below.

*In this operation, only good groups, whose performance index showed "*

**i. FP-growth performing***g*" in Table 1, are performed with FP-growth and bad groups are skipped. Thus, frequently occurring groups can be found according to the predefined

*Minimum_Support*, which stands for the minimum fraction of transactions containing the item set. After

*Minimum_Support*is defined, data mining using FP-growth is performed. The FP-growth algorithm can be divided into two parts: FP-tree construction and FP-growth. The sample transactions are shown in Table 2. In this example,

*Minimum_Support*= 3.

Sample transactions

Transaction index | Groups |
---|---|

1 | { |

2 | { |

3 | { |

4 | { |

5 | { |

(1) FP-tree Construction

*Minimum_Support*in transactions. Then, the retrieved frequently occurring groups are arranged in descending order based on their supports. After that, we discard the infrequently occurring groups and sort the remaining groups. Then, the result is shown in Table 3. Thus, the ordered transactions appeared in this table are utilized to construct a FP-tree.

Transactions after discarding the infrequent groups and sorting the remaining groups

Transaction index | Groups | Ordered groups |
---|---|---|

1 | { | { |

2 | { | { |

3 | { | { |

4 | { | { |

5 | { | { |

(2) FP-growth

Frequently occurring groups generated by FP-growth data mining

Suffix group | Cond. group base | Cond. FP-tree | Frequent groups |
---|---|---|---|

B | c:4 | c:4 | cb:4 |

F | cb:3, c:1 | c:4, cb:3 | cf:4, bf:3, cbf:3 |

M | cbf:2, cf:1 | cf:3 | cm:3, fm:3, cfm:3 |

O | cbfm:2, cfm:1 | cfm:3 | co:3, fo:3, mo:3, cfo:3, cmo:3, fmo:3, cfmo:3 |

*Once the frequently occurring groups are found, we can produce association rules from these frequent ones. For the purpose of identifying the association rules with good performance, the frequent groups must combine the groups owing bad performance shown in Table 1 to count the confidence degree. The confidence degree can be computed by the following formula:*

**ii. Association rules generating***P*(

*good|frequent groups*) is the conditional probability,

*frequent groups*∪

*good*or

*bad*means the union of frequent groups and good or bad performance, and

*supp*(

*frequent groups*∪

*good*or

*bad*) stands for the counts of

*frequent groups*with good or bad performance occurring in transactions. Then the rule is valid if

where *minconf* represents the minimal confidence given by user or expert. Hence, we can infer that if a rule satisfies Equation 32, then the frequent groups can be viewed as the suitable groups, otherwise they would be unsuitable groups. For instance, if the confidence of {1,3,6} ≥ {*g*} is bigger than the minimum confidence, then we construct this association rule. This rule indicates that the combination of the first, third, and sixth groups results in "good" performance. After doing so, the frequent groups are conduct to the association rules and generate the *AssociatedGoodPool* which contains all frequent groups satisfied Equation 32.

*After the association rules are identified, DMSM selects groups according to the association rules. The group indexes are selected from the associated good groups as the following equations:*

**iii. Suitable groups selecting**where *q* = 1, 2,..., *AssociatedGoodPoolNum i* = 1, 2,..., *M*_{ k }, *M*_{ k } = *M*_{min}, *M*_{min+1},..., *M*_{max}, *ExploreTimes* are the predefined value that judges to perform the exploring action, *AssociatedGoodPool* represents the sets of good item set that obtain from association rules, *AssociatedGoodPoolNum* presents the total number of sets in *AssociatedGoodPoolNum* and *GoodItemSet*[*i*] presents a good item set that select from *AssociatedGoodPool* randomly. In Equation 33, if *M*_{ k } greater than the size of *GoodItemSet*, then remaining groups are selected by Equation 30.

**Step 3**. If the best fitness value does not improve for a sufficient number of generations (*ExploreTimes*), then DMSM selects groups based on the normal action.

**Step 4**. After the

*M*

_{ k }groups are selected,

*M*

_{ k }chromosomes are selected from

*M*

_{ k }groups as follows:

where *q* = *Random*[1, *N*_{ c }], *i* = 1, 2,..., *k*, *N*_{ c } represents the total number of chromosomes in each group, and *ChromosomeIndec*[*i*] represents the index of a chromosome that is selected from the *i* th group.

#### 3.4.4. Fitness assignment

In this step, the fitness value of an antecedent part of a fuzzy rule (an individual) is calculated by summing up the fitness values of all possible combinations in the chromosomes that are selected from *M*_{ k } groups that are decided by DMSM. The steps in the fitness value assignment are described below:

**Step 1**. Choose *M*_{ k } antecedent part of fuzzy rules with RLS method to construct a TNFN $R{p}_{{M}_{k}}$ times from *M*_{ k } groups with size *N*_{C}. The *M*_{ k } groups are obtained from the DMSM.

**Step 2**. Evaluate every TNFN that is generated from Step 1 to obtain a fitness value.

**Step 3**. Divide the fitness value by *M*_{ k } and accumulate the divided fitness value to the selected antecedent part of fuzzy rules with their fitness value records that were set to zero initially.

**Step 4**. Divide the accumulated fitness value of each chromosome from

*M*

_{ k }groups by the number of times that it has been selected. The average fitness value represents the performance of an antecedent part of a fuzzy rule. In this article, the fitness value is designed according the follow formulation:

where *y*_{ i } and ${\u0233}_{i}$ represent the desired and predicted values of the *i* th output, respectively, $E\left(\stackrel{y}{},\stackrel{\text{\_}}{y}\right)$ is a error function, and *N* represents the number of the training data in each generation.

#### 3.4.5. Reproduction strategy

Reproduction is a procedure of copying individuals according to their fitness value. This study adopted our previous research-elite-based reproduction strategy (ERS) [18] to perform reproduction. In ERS, every chromosome in the best combination of *M*_{ k } groups must be kept by performing reproduction step. In the remaining chromosomes in each group, this study uses the roulette-wheel selection method [26] for this reproduction process. The well-performed chromosomes in the top half of each group [27] proceed to the next generation. The other half is created by executing crossover and mutation operations on chromosomes in the top half of the parent individuals.

#### 3.4.6. Crossover strategy

#### 3.4.7. Mutation strategy

In spite of many new strings the crossover strategy produced, new information to every group at the site of an individual is still not provided by these strings. Mutation can randomly alter the allele of a gene. In this article, uniform mutation [26] is adopted, and the mutated gene is drawn randomly from the domain of the corresponding variable. The advantages of uniform mutation are not only to provide new information for a population but also to preserve diversity [29].

### 3.5. Termination criterion

- (1)
The number of generations reaches a predefined maximal iteration value.

- (2)
Fitness value is greater than a fitness threshold.

### 3.6. Time complexity analysis

*P*

_{size}, the size of sub-population is

*N*

_{ c }, the number of fuzzy rules is

*M*, the number of constructing fuzzy systems in one generation is

*S*(i.e., the

*Selection_Times*defined in Equation 23), the number of the training data is

*N*, and the input dimension of NFN is

*n*. The discussion of the complexity for each stage is as follows:

- (1)
SOA: in this stage, the only computation is to update the probability vectors (Equation 19)

*S*times in one generation. Therefore, the complexity of SOA is*O*(*S*). - (2)
DMSM: The DMSM operation includes normal and explore actions. In the normal action, since this action would be performed

*NormalTimes*(appeared in Equation 30) in the overall learning process, the complexity of this action is*O*(*NormalTimes*). In the explore action, because the FP-growth and association rules mining are performed only in the beginning of this action or when the system falls into local optima. As a result, the effect caused by these two operations on the overall learning efficiency is not crucial. The complexity of these two operators can be skipped. Moreover, since the explore action would be performed (*ExploreTimes*-*NormalTimes*) times in the overall learning process, the complexity of this action is*O*(*ExploreTimes*-*NormalTimes*), where*ExploreTimes*appeared in Equation 33. - (3)
Fitness assignment: according to Equations 6 and 36, the evaluation of fitness one time requires

*NMn*computations. Furthermore, there are*S*evaluation times in one generation. Thus, the complexity of fitness assignment is O(*SNMn*). - (4)
Reproduction: in this stage, the roulette-wheel selection method is chosen to perform reproduction. Since each selection requires

*N*_{ c }steps and*N*_{ c }spins to fill the sub-populations [30], the total computation for a whole population in a generation is ${N}_{c}^{2}{P}_{size}$. Therefore, the complexity of reproduction stage is $O\left({{N}_{c}}^{2}{P}_{size}\right)$. - (5)
Crossover: to consider the selection of parents, the tournament selection is adopted to select parents. Since the tournament selection can be performed in constant time and

*N*_{ c }*P*_{ Size }competitions are required to fill one generation [30], the complexity of the tournament selection is*O*(*N*_{ c }*P*_{ Size }). Moreover, the computation of two-point crossover is constant in one generation. Thus, the complexity of crossover stage is*O*(*N*_{ c }*P*_{ Size }). - (6)
Mutation: because the uniform mutation is adopted and the mutated gene is picked randomly from the chromosome, the mutation operator needs

*N*_{ c }*P*_{ Size }steps to fill overall populations. Hence, the complexity of mutation step is*O*(*N*_{ c }*P*_{ Size }).

In summary, the dominate complexity of the proposed algorithm is the stage of fitness assignment (*O*(*SNMn*))). It indicates that the fitness assignment step would occupy most of the learning time.

### 3.7. Executing procedure

After training a TNFN, the executing phase of the proposed image alignment system merely consists of computing the WGOH descriptor and then feeding it into the DMELA-trained TNFN to get a scaling factor *s*, a rotation angle *θ*, and translation parameters (Δ*x*, Δ*y*). About this, the proposed system is simple and efficient.

## 4. Experimental results

Experimental images preparation

Image type | Image preparation |
---|---|

Synthesized images | 600 images are generated with randomly selected affine parameters within the range described in Table 2 |

Training images | The 70% (420) of synthesized images |

Testing images | The 30% (180) of synthesized images |

Real images | Images are acquired from CCD camera with different pose from the reference image |

The range of affine transformation parameters used in experiments

Affine transformation parameter | The range of affine transformation parameter |
---|---|

Scale | [0.7 1.3] |

Rotation (degrees) | [-30 30] |

Vertical translation (pixels) | [-20 20] |

Horizontal translation (pixels) | [-20 20] |

All the experiments are performed using an Intel Core i7 860 chip with a 2.8 GHz CPU, a 3G memory, and the Matlab 7.5 simulation software.

The experimental results in this section contain four sections. Section 4.1 performs the comparison with different types of NFNs. Comparison with existing learning methods is presented in Section 4.2. In Section 4.3, synthesized images are used to compare the proposed image alignment system with other systems. Section 4.5 uses real images to validate the alignment accuracy of the proposed system.

### 4.1. Comparison with different types of NFN

The initial parameters before training

Parameters | Value | Parameters | Value |
---|---|---|---|

P | 40 | [ | [18, 25] |

Nc | 20 | [ | [-10, 10] |

Selection_Times | 50 | [σ | [3, 15] |

NormalTimes | 10 | [ | RLS determined |

SearchingTimes | 15 | Minimum_Support | TransactionNum/3 |

Crossover rate | 0.6 | Minimum_Confidence | 60% |

Mutation rate | 0.2 | RLS parameter (λ) | 0.003 |

The comparison of the TNFN and the singleton-type NFN

Method | Errors | |||||||
---|---|---|---|---|---|---|---|---|

ErrScale | ErrAngle (degrees) | ErrDx (pixels) | ErrDy (pixels) | |||||

Mean | SD | Mean | SD | Mean | SD | Mean | SD | |

TNFN | 0.0054 | 0.0051 | 0.2106 | 0.1856 | 0.4702 | 0.3578 | 0.4326 | 0.4015 |

Singleton-type NFN | 0.027 | 0.021 | 1.237 | 1.075 | 1.338 | 1.016 | 1.571 | 1.319 |

### 4.2. Comparison with existing learning methods

Two typical evolutionary learning methods TSE [17] and MGCSE [18] are implemented carefully to compare with the proposed DMELA. To explore the number of fuzzy rules for TSE and MGSE, the fuzzy rules are tuned by setting the range of 20-100 in increments of 5. Thus, the results find that 85 and 80 rules are suitable for TSE and MGCSE, respectively.

In this simulation, training and testing images are randomly generated by the way specified in Table 5. Then 33-element feature vectors are obtained by applying WGOH with genetic algorithm-based dimensionality reduction described in [22] to above-generated images. Moreover, before training, the initial parameters of DMELA are given in Table 7.

*M*

_{min},

*M*

_{max}] = [18, 25].

Comparison of running time for various algorithms

Method | Best (s) | Worst (s) | Mean (s) |
---|---|---|---|

DMELA | 212 | 1063 | 623 |

MGCSE | 3078 | 4106 | 3698 |

TSE | 4711 | 8106 | 6565 |

### 4.3. Comparison with existing image alignment systems

To evaluate the proposed system in comparison with other existing systems [3, 12, 19], the implementation of these existing systems are carefully cited in their original article. The comparison in this section consists of the alignment accuracy and robustness. These comparisons are discussed in the following parts.

#### 4.3.1 Alignment accuracy

To compare the alignment accuracy of different systems, the training images, which are used to train neural networks, and the testing images, which are used to check the alignment accuracy, are generated by the way described in Table 5.

Alignment errors in different image alignment systems

Method | Errors | |||||||
---|---|---|---|---|---|---|---|---|

| | | | |||||

| | | | | | | | |

| | | | | | | | |

ISOMAP[3] | 0.0329 | 0.0311 | 2.1193 | 2.0732 | 1.7264 | 1.5673 | 1.8764 | 1.6872 |

KICA [12] | 0.0158 | 0.0161 | 1.4121 | 1.2985 | 0.9612 | 0.8635 | 1.1623 | 1.0541 |

SIFT[19] | 0.0345 | 0.0759 | 0.3561 | 0.7898 | 0.9822 | 1.5789 | 1.9220 | 3.7420 |

#### 4.3.2. Alignment speed

To demonstrate the alignment speed, the execution time required in performing one image alignment task is discussed. In this article, the steps of performing one image alignment task consists of capturing the template window from the input image, computing the feature within the window, and feeding the calculated feature into the trained network to get the affine parameters.

In this experiment, we utilize 240 testing images to perform image alignment tasks. The average execution time of image alignment in the proposed system, Isomap, KICA, and SIFT takes about 30, 330, 65, and 57 ms, respectively. From this result, we infer that the proposed system is efficient and can apply to real-time tasks.

#### 4.3.3. Alignment robustness

### 4.4. Real image alignment case

## 5. A conceptual framework for aligning visual inspection images

## 6. Conclusion

In this article, DMELA is proposed for training a TNFN to perform image alignment tasks. Thus, this study tends to investigate two aims including developing an evolutionary learning algorithm and designing an efficient and accurate image alignment system.

Regarding the first aim, the proposed DMELA combines chromosome encoding and RLS method to determine the antecedent and consequent part of fuzzy rules. Such combination can offer faster convergence and less RMSE in comparison with other evolutionary algorithm. Moreover, this article utilizes a DMSM to select suitable groups and identify unsuitable groups for chromosome selection. Such operation would solve the random group selection problem yielded by TSE algorithm. Finally, an SOA is adopted to evaluate suitability of different number of fuzzy rules such that the automatic structure construction of a NFN is feasible.

Regarding the second aim, by integrating a WGOH descriptor with a DMELA-trained TNFN to form an image alignment system could estimate affine parameters accurately. The evidence can be found in experimental results on both synthesized and real images. The results show that the proposed alignment system can reach a subpixel accuracy, real-time speed, and high noise robustness level. Consequently, this finding is helpful to develop efficient and accurate image alignment systems.

In spite of the proposed system demonstrating good performance, there still have some limitations. More specifically, the searching range of image alignment is not large enough. Such case would limit the alignment performance. Thus, future study should be taken into account the coarse to fine image alignment to enlarge the searching range. Moreover, the image alignment accuracy in the case of low SNR is not high enough. There is a need to improve the WGOH descriptor to suppress noise.

## Notes

### Acknowledgements

The authors gratefully acknowledge the reviewers for their valuable comments and suggestions.

## Supplementary material

## References

- 1.Elhanany I, Sheinfeld M, Beckl A, Kadmon Y, Tal N, Tirosh D:
**Robust image registration based on feedforward neural networks.**In*Proceedings of IEEE International Conference on System, Man and Cybernetics*.*Volume 2*. Nashville, USA; 2000:1507-1511.Google Scholar - 2.Wu J, Xie J:
**Zernike moment-based image registration scheme utilizing feedforward neural networks.**In*Proceedings of the 5th World Congress on Intelligent Control and Automation*.*Volume 5*. Hangzhou, P.R. China; 2004:4046-4048.Google Scholar - 3.Xu AB, Guo P:
**Isomap and neural networks based image registration scheme.***Lecture Notes in Computer Science*2006,**3972:**486-491. 10.1007/11760023_71CrossRefGoogle Scholar - 4.Abche AB, Yaacoub F, Maalouf A, Karam E:
**Image registration based on neural network and Fourier transform.**In*Proceedings of the 28th IEEE EMBS annual international conference*. New York, USA; 2006:803-4806.Google Scholar - 5.Sarnel H, Senol Y, Sagirlibas D:
**Accurate and robust image registration based on radial basis neural networks.**In*IEEE International Symposium on Computer and Information Sciences*. Istanbul, Turkey; 2008:1-5.Google Scholar - 6.Hofmeister M, Liebsch M, Zell A:
**Visual self-localization for small mobile robots with weighted gradient orientation histograms.**In*40th International Symposium on Robotics (ISR)*. Barcelona, Spain; 2009:87-91.Google Scholar - 7.Hsu CY, Hsu YC, Lin SF:
**A hybrid learning neural network based image alignment system using global feature selection approach.***Adv Comput Sci Eng*2011,**6**(2):129-157.MATHGoogle Scholar - 8.Hofmeister M, Vorst P, Zell A:
**A comparison of efficient global image features for localizing small mobile robots.**In*Proceedings of ISR/ROBOTIK*. Munich, Germany; 2010:143-150.Google Scholar - 9.Brown LG:
**A survey of image registration techniques.***ACM Comput Surv*1992,**24**(4):325-376. 10.1145/146370.146374CrossRefGoogle Scholar - 10.Zitova B, Flusser J:
**Image registration methods: a survey.***Image Vis Comput*2003,**21**(11):977-1000. 10.1016/S0262-8856(03)00137-9CrossRefGoogle Scholar - 11.Amintoosi M, Fathy M, Mozayani N:
**Precise image registration with structural similarity error measurement applied to superresolution.***EURASIP J Adv Signal Process*2009, 7. Article ID 305479Google Scholar - 12.Xu AB, Guo P:
**Image registration with regularized neural network.***Lecture Notes in Computer Science*2006,**4233:**286-293. 10.1007/11893257_32CrossRefGoogle Scholar - 13.Juang CF:
**A TSK-type recurrent fuzzy network for dynamic systems processing by neural network and genetic algorithms.***IEEE Trans Fuzzy Syst*2002,**10**(2):155-170. 10.1109/91.995118MathSciNetCrossRefGoogle Scholar - 14.Hsu YC, Lin SF:
**Reinforcement group cooperation based symbiotic evolution for recurrent wavelet-based neuro-fuzzy systems.***Neurocomputing*2009,**72:**2418-2432. 10.1016/j.neucom.2008.12.027CrossRefGoogle Scholar - 15.Li M, Wang Z:
**A hybrid coevolutionary algorithm for designing fuzzy classifiers.***Inf Sci*2009,**179**(12):1970-1983. 10.1016/j.ins.2009.01.045CrossRefGoogle Scholar - 16.Gomez F, Schmidhuber J:
**Co-evolving recurrent neurons learn deep memory POMDPs.**In*Proceeding of Conference on Genetic and Evolutionary Computation*. Washington, DC, USA; 2005:491-498.Google Scholar - 17.Moriarty DE, Miikkulainen R:
**Efficient reinforcement learning through symbiotic evolution.***Mach Learn*1996,**22:**11-32.Google Scholar - 18.Hsu YC, Lin SF, Cheng YC:
**Multi groups cooperation based symbiotic evolution for TSK-type neuro-fuzzy systems design.***Expert Syst Appl*2010,**37**(7):5320-5330. 10.1016/j.eswa.2010.01.003CrossRefGoogle Scholar - 19.Lowe D:
**Distinctive image features from scale-invariant keypoints.***Int J Comput Vis*2004,**60**(2):91-110.CrossRefGoogle Scholar - 20.Bradley DM, Patel R, Vandapel N, Thayer SM:
**Real-time image-based topological localization in large outdoor environments.**In*Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)*. Edmonton, Canada; 2005:3670-3677.Google Scholar - 21.Zamalloa M, Rodrigues-Fuentes LJ, Penagarikano M, Bordel G, Uribe JP:
**Feature dimensionality reduction through genetic algorithms for faster speaker recognition.**In*16th European Signal Processing Conference*. Lausanne, Switzerland; 2008.Google Scholar - 22.Neshatian K, Zhang M:
**Dimensionality reduction in face detection: a genetic programming approach.**In*24th International Conference Image and Vision Computing*. Wellington, New Zealand; 2009:391-396.Google Scholar - 23.Juang CF, Lin C-T:
**An on-line self-constructing neural fuzzy inference network and its applications.***IEEE Trans Fuzzy Syst*1998,**6**(1):12-32. 10.1109/91.660805CrossRefGoogle Scholar - 24.Sugeno M, Tanaka K:
**Successive identification of a fuzzy model and its applications to prediction of a complex system.***Fuzzy Sets Syst*1991,**42**(3):315-334. 10.1016/0165-0114(91)90110-CMathSciNetCrossRefMATHGoogle Scholar - 25.Takagi T, Sugeno M:
**Fuzzy identification of systems and its applications to modeling and control.***IEEE Trans Syst Man Cybern*1985,**15**(1):116-132.CrossRefMATHGoogle Scholar - 26.Cordon O, Herrera F, Hoffmann F, Magdalena L:
*Genetic Fuzzy Systems Evolutionary Tuning and Learning of Fuzzy Knowledge Bases, Advances in Fuzzy Systems-Applications and Theory*.*Volume 19*. World Scientific Publishing, NJ; 2001.CrossRefMATHGoogle Scholar - 27.Juang CF, Lin JY, Lin CT:
**Genetic reinforcement learning through symbiotic evolution for fuzzy controller design.***IEEE Trans Syst Man Cybern B*2000,**30**(2):290-302. 10.1109/3477.836377MathSciNetCrossRefGoogle Scholar - 28.Cox E:
*Fuzzy Modeling and Genetic Algorithms for Data Mining and Exploration*. 1st edition. Morgan Kaufman Publications, San Francisco; 2005.MATHGoogle Scholar - 29.Dempsey I:
**Constant generation for the financial domain using grammatical evolution.**In*Genetic and Evolutionary Computation Conference Workshop Program*. Washington, DC, USA; 2005:350-353.Google Scholar - 30.Goldberg DE, Deb K:
**A comparative analysis of selection schemes used in genetic algorithms.**In*Foundations of Genetic Algorithms*.*Volume 1*. San Mateo, CA, USA; 1991:69-93.Google Scholar

## Copyright information

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.