Keywords

1 Introduction

Similarly as in many other engineering disciplines, design of high-frequency systems (e.g., microwave [1], antenna [2], photonic [3], etc.) is heavily based on computer simulation tools. While in some cases, analytical representations (e.g., array factor models for antenna array radiation patterns [4], or coupling-matrix-based models for microwave filters [5]) or equivalent network models (e.g., for transmission-line-based circuits [6]) are available, the ultimate level of accuracy and generality can be achieved by full-wave electromagnetic (EM) simulation [7]. As a matter of fact, EM analysis is mandatory for design verification of vast majority of contemporary high-frequency components and devices, which is due to their geometrical complexity. The latter is a consequence of more and more stringent performance requirements imposed on the components, demands for multi-band operation [8], additional functionalities (e.g., band notches in wideband antennas [9]), and, most importantly, compact size [10,11,12,13]. It is especially the miniaturization requirement that results in the development of topologically complex layouts exhibiting considerable EM cross-coupling effects, impossible to be adequately accounted for using analytical or equivalent circuit models. Examples include microwave components exploiting a slow-wave phenomenon [14] or defected ground structures [15], as well as compact antennas incorporating stepped-impedance feeds [16] or modified radiators [17].

Whenever electromagnetic simulation is involved, some variation of parametric optimization is necessary in order to obtain the best possible performance of the component at hand. Application of conventional numerical optimization methods, either local [18] or global [19], faces considerable challenges which come from several sources: (i) a high cost of individual EM analysis, (ii) a necessity of simultaneous handling of multiple performance figures and constraints, and (iii) a large number of simulations required to converge to the optimized design. The latter is partially due to the fact that geometries of modern high frequency structures are typically described by many parameters [20, 21]. All of these factors result in a considerable computational cost of the optimization process, often prohibitive. It should be mentioned here that a commonly used workaround these issues is an interactive design that combines engineering experience and parameter sweeping (one, maximum two parameters at a time). This is a laborious process yet incorporation of the expert knowledge permitted, in many cases, finding a satisfactory design relatively quickly. However, the aforementioned increase of component complexity, has made this approach questionable, especially in the context of controlling multiple objectives, handling constraints, and operating in highly-dimensional parameter spaces.

Meanwhile, several methods have been developed and applied to reduce the cost of numerical optimization. These include utilization of adjoint sensitivities (in the context of local design [22]), machine learning methods (for global optimization [23]), as well as various surrogate-assisted techniques [24,25,26]. The last group of approaches comes in many variations and involves both data-driven surrogates (e.g., kriging [27], Gaussian process regression [28], or polynomial response surfaces [29]), and physics-based ones. In the latter case, the surrogate is constructed from the underlying low-fidelity model (space mapping [30], response correction techniques [31], feature-based optimization [32]). For the sake of computational efficiency, the low-fidelity models should be cheap (e.g., analytical or equivalent network ones). However, in many situations, e.g., antennas or various classes of miniaturized microwave components, the only reliable option for low-fidelity models are those obtained from coarse-discretization EM simulations [33]. Here, the overhead related to optimizing the surrogate model (often carried out using conventional algorithms) determines the overall design expense.

As discussed above, reducing the number of EM simulations required by an optimization routine is important for both direct and surrogate-assisted design procedures. In this paper, a low-cost trust-region-based gradient search algorithm is proposed for the optimization of high-frequency structures. The foundation of the method is an appropriate management of the system response sensitivity (here, Jacobian) updates. This management scheme involves a few mechanisms, including an omission of (finite-differentiation-based) sensitivity updates for variables that exhibit small (relative) relocation in the directions of the corresponding coordinate system axes as well as a selective application of a rank-one Broyden updating formula. The Broyden-based updates are performed for parameters whose corresponding basis vectors are sufficiently well aligned with the direction of the latest design relocation. For the sake of demonstration, two high-frequency structures are considered: a miniaturized coupler implemented using compact microstrip resonant cells (CMRCs), and an ultra-wideband antenna operating in 3.1 GHz to 10.6 GHz frequency range. In both cases, application of the proposed algorithm leads to a significant reduction of the number of EM simulations necessary for identifying the optimized design. The average computational speedup is as high as 60% as compared to the reference algorithm. The improvement of computational efficiency is achieved with only minor degradation of the design quality.

2 Reduced-Cost Design Optimization Through Jacobian Update Management

In this section, the proposed procedure for expedited design optimization of high-frequency structures is formulated and explained. Here, local optimization is considered with the trust-region gradient search utilizing numerical derivatives used as a reference algorithm. The section starts by formulating the high-frequency design optimization problem, followed by a brief description of the reference algorithm, the proposed sensitivity updating scheme, as well as the complete optimization framework.

2.1 High-Frequency Design Problem Formulation

The computational model of the high-frequency structure of interest will be denoted as R(x), where x stands for the adjustable variables of the problem (typically, geometry parameters). R represents the model responses which are typically frequency characteristics such as scattering parameters (e.g., reflection S11, transmission S21, etc.), gain, efficiency, radiation pattern, power split ration, and so on. The computational model is assumed to be evaluated using a full-wave electromagnetic (EM) analysis. Normally, computational cost of EM simulation is considerable, ranging from a few dozens of seconds to a few hours, depending on the structure complexity, its electrical size (i.e., physical dimensions as compared to the guided wavelength at the operating frequency), as well as other components that need to be included in the model (e.g., SMA connectors, housing, installation fixtures, etc.).

The design problem can be formulated as

$$ \varvec{x}^{*} = \arg \mathop {\hbox{min} }\limits_{\varvec{x}} U\left( {\varvec{R}(\varvec{x})} \right) $$
(1)

where x* is the optimum design to be found and U is a scalar objective function encoding the performance specifications imposed on the structure at hand.

Given a large variety of high-frequency structures and the figures of interest involved, it is obvious that the objective function is very much problem dependent. For the sake of illustration we will discuss two types of objective functions, pertinent to particular illustration cases considered in Sect. 3.

In many situations, especially related to the design of antenna structures but also certain other components such as impedance transformers or microwave filters, it is important to improve the in-band matching (which is equivalent to minimization of the reflection response S11 or reducing the return loss). In this case, the objective function may be defined as

$$ U(\varvec{R}(\varvec{x})) = \mathop {\hbox{max} }\limits_{f \in F} |S_{11} (\varvec{x},f)| $$
(2)

where \( \left| {S_{11} \left( {\varvec{x},f} \right)} \right| \) stands for the reflection as a function of optimization variables x and frequency f, with F being the frequency range of interest (e.g., 3.1 GHz to 10.6 GHz in case of UWB antennas). The problem (1), (2) is thus formulated in a minimax sense.

The objective function (2) addresses a single performance figure. However, in majority of practical cases, there is a need to control several figures. Representative examples are microwave couplers, where a typical design problem requires maximization of the bandwidth BW (usually, symmetric with respect to the operating frequency f0), obtaining the required (e.g., equal) power split \( d_{S} = \, \left| {S_{21} } \right| \, {-} \, \left| {S_{31} } \right| \) at f0, as well as the allocation of the matching and isolation characteristics (|S11| and |S41|) minima close to f0. In this case, assuming implicit constraint handling, we may define U as

$$ U(\varvec{R}(\varvec{x})) = - BW(\varvec{x}) + \sigma_{1} d_{S} (\varvec{x})^{2} + \sigma_{2} (f_{{\hbox{min} .S_{11} }} (\varvec{x}) - f_{0} )^{2} + \sigma_{3} (f_{{\hbox{min} .S_{41} }} (\varvec{x}) - f_{0} )^{2} $$
(3)

where fmin.S11 and fmin.S41 are the frequencies corresponding to |S11| and |S41| minima, respectively, and σk are penalty coefficients. A penalty function approach as in (3) is a convenient way of handling expensive constraints, especially when dealing with EM simulation models [34, 35].

2.2 Trust-Region Gradient Search

The reference optimization algorithm for this paper is a conventional trust-region (TR)-based gradient-search procedure [18]. The TR algorithm yields a series of approximations x(i), i = 0, 1, …, to the optimum design x*, by solving sub-problems

$$ \varvec{x}^{(i + 1)} = \arg \mathop {\hbox{min} }\limits_{{\varvec{x};\; - \varvec{d}^{(i)} \le \varvec{x} - \varvec{x}^{(i)} \le \varvec{d}^{(i)} }} U(\varvec{L}_{{}}^{(i)} (\varvec{x})) $$
(4)

In (4), \( \varvec{L}^{\left( i \right)} \left( \varvec{x} \right) \, = \varvec{R}\left( {\varvec{x}^{(i)} } \right) \, + \varvec{J}_{\varvec{R}} \left( {\varvec{x}^{(i)} } \right) \cdot \left( {\varvec{x}{-}\varvec{x}^{(i)} } \right) \) is a first-order Taylor expansion of the computational model R at the current iteration point x(i). The Jacobian JR can be evaluated using adjoint sensitivities if available; however, in vast majority of practical cases of high-frequency structure design it is estimated through finite differentiation (FD). The reason is that the adjoint technology is not supported (with rare exceptions) by commercial EM simulation packages. FD incurs the cost of additional n EM analyses (n being dimensionality of the design space) per algorithm iteration. Obviously, reducing this overhead would lead to computational savings, which is the main subject of this paper.

It should also be noticed that the trust region size in (4) does not take a traditional form of \( \left| {\left| {\varvec{x}{-}\varvec{x}^{(i)} } \right|} \right| \le \delta^{\left( i \right)} \) (Euclidean norm with scalar TR radius), but it is an interval defined through the size vector d(i) (adjusted using the standard rules [18]). The inequalities \( {-}\varvec{d}^{(i)} \le \varvec{x}{-}\varvec{x}^{(i)} \le \varvec{d}^{(i)} \) are understood component-wise. This type of the TR region allows—by making the initial size vector d(0) proportional to the design space sizes—for ensuring a similar treatment of variables with significantly different ranges, a situation common in the antenna or microwave design. For example, the lengths of the transmission line components are typically in the range of millimeters or tens of millimeters, whereas the line widths or spacings between them may be small fractions of millimeters.

2.3 Reduced-Cost TR Algorithm Using Adaptive Jacobian Updates

The computational cost of the design optimization involving the conventional TR algorithm is primarily determined by the cost of Jacobian estimation performed using the finite differentiation. Here, a reduced-cost TR algorithm using adaptive Jacobian updates is introduced which allows us to notably decrease the number of EM simulations necessary to obtain the optimal design. The computational savings result from the reduction of the overall number of FD calculations. The algorithm combines two independent procedures for suppressing Jacobian updates: an accelerated update procedure (AUP) and Broyden update procedure (BUP). The former is based on a relative relocation of the design variable vector between iterations, and the optimization run history. The latter adopts the Broyden updating formula for the selected design variables, depending on the alignment between the most recent design relocation and the coordinate system axes. When used separately, each procedure allows for achieving considerable computational savings, however, at the expense of a slight degradation of the design quality, as shown in Sect. 3. Furthermore, a combination of both procedures further expedites the optimization process while ensuring satisfactory design quality. The essential stages of the proposed reduced-cost TR algorithm are shown in the form of a flow diagram in Fig. 1.

Fig. 1.
figure 1

Flow diagram of the proposed algorithm using adaptive sensitivity updating scheme. The following notation is used: Γ – selection matrix containing information about the Jacobian updates, d(i+1) TR region size in the (i + 1)th iteration, ρ – gain ratio (deciding about iteration acceptance and d(i+1) update), ε – algorithm termination threshold.

The information about either performing (1) or omitting (0) FD calculation of the system response Jacobian JR is stored in a selection matrix Γ. The complete description of the procedure of creating the matrix Γ is provided in Sect. 2.4, along with the account of the two component procedures: accelerated update procedure and Broyden update procedure. Here, the major outline of the modified TR algorithm is given. The selection matrix Γ is initialized as a column vector, n × 1, with all its entries set to ones: γk.1 = 1, k = 1, …, n. This implies that in the first iteration, the initial estimate of the entire Jacobian JR is obtained with FD. Upon each successful iteration, the matrix is extended by an additional column, utilized to govern the Jacobian update in the upcoming iteration (as depicted in Fig. 2).

Fig. 2.
figure 2

Flow diagram of the selection matrix update procedure: (A) accelerated update procedure, (B) Broyden update procedure, (C) selection matrix Γ update based on AUP and BUP, D) Jacobian update based on Γ. The following notation is used: (i) AUP parameters: α (i+1)k – decision factor for kth geometry parameter and (i + 1)th iteration; η – history count, αhigh, αlow – threshold values, Α – AUP decision vector; (ii) BUP parameters: β (i+1)k – decision factor for kth geometry parameter and (i + 1)th iteration, βmin – alignment threshold value, Β – BUP decision vector.

After the Jacobian update is completed, a candidate design xtmp is found by solving (4), with the objective function U described by (2) or (3). Then, a gain ratio \( \rho = (U(\varvec{R}\left( {\varvec{x}_{tmp} } \right){-}U(\varvec{R}\left( {\varvec{x}^{(i)} } \right)/\left( {\varvec{L}^{(i)} \left( {\varvec{x}_{tmp} } \right) {-}\varvec{L}^{(i)} \left( {\varvec{x}^{(i)} } \right)} \right) \) is calculated, and the TR region size d(i+1) for the next (i + 1)th iteration is adjusted. If the gain ratio is positive, the candidate step is accepted and the next iteration is executed, unless the termination criterion is satisfied, i.e., \( \left| {\left| {\varvec{d}^{(i + 1)} } \right|} \right| < \varepsilon \,\,{\text{and}}\,\,\left| {\left| {\varvec{x}^{(i + 1)} {-}\varvec{x}^{(i)} } \right|} \right| < \varepsilon \), where ε is the algorithm termination threshold.

2.4 Selection Matrix Updating Procedure

In this section, a procedure for updating the selection matrix Γ is described in detail. It combines two separate routines that control the Jacobian update: the accelerated (AUP) and the Broyden (BUP) update procedure. The former pinpoints the parameters that exhibit small relative design changes between iterations, therefore implying that the calculation of the respective part of the Jacobian can be omitted.

Whereas, the latter identifies the parameters for which the corresponding basis vectors are sufficiently well aligned with the most current design relocation vector. For these parameters, calculation of the respective Jacobian columns will be executed using the Broyden formula instead of FD. The outcomes of both procedures are then combined to create the selection matrix. The two procedures, along with the resulting Jacobian update procedure are presented in the form of a flow diagram in Fig. 2.

Accelerated Update Procedure.

The accelerated procedure, depicted in Panel A of Fig. 2, accommodates the changes of geometry parameters observed throughout the optimization course. These are quantified by calculating a relative design change of the kth parameter, k = 1, …, n, w.r.t. the TR region size in the ith iteration

$$ \alpha_{k}^{{\left( {i + 1} \right)}} = {{\left| {x_{k}^{{\left( {i + 1} \right)}} - x_{k}^{\left( i \right)} } \right|} \mathord{\left/ {\vphantom {{\left| {x_{k}^{{\left( {i + 1} \right)}} - x_{k}^{\left( i \right)} } \right|} {d_{k}^{\left( i \right)} }}} \right. \kern-0pt} {d_{k}^{\left( i \right)} }},\quad \quad k = 1, \ldots ,n, $$
(5)

where x (i)k , x (i+1)k and d (i)k refer to the kth components of vectors x(i), x(i+1), and d(i), respectively. Let us also denote by Jk the kth column of the Jacobian JR. The decision factors α (i+1)k are utilized to determine whether Jk, pertaining to the kth parameter of the structure at hand, is to be calculated using FD in the next, (i + 1)th, iteration. Additionally, the optimization run history is inspected, in order to ensure that the part of the Jacobian Jk is computed at least once every few iterations.

The update history is examined in the course of the last N iterations (N being the algorithm control parameter), and a history count η is defined as a total number of iterations, among the last N iterations, in which the Jacobian update using FD was carried out. Both the decision factors α (i+1)k and the history count η are then translated into a binary AUP decision vector A that stores the information about the update for the upcoming iteration. Let us denote by Ai+1.k the kth entry of the vector A referring to the (i + 1)th iteration. If the entry of the vector A equals zero, i.e., Ai+1.k = 0, AUP indicates that, for the kth parameter of the structure at hand, the respective Jacobian column Jk does not have to be calculated with FD in the next iteration; otherwise, i.e., if Ai+1.k = 1, FD is suggested.

As shown in Fig. 2, at the beginning of AUP, a decision factor α (i+1)k (see (5)) and history count η are calculated for all parameters. Then, the entries of the decision vector A are established depending on whether the iteration was successful (ρ > 0) or not (ρ ≤ 0). If the iteration was accepted, Ai+1.k is set to 0 in the two following cases:

  1. 1.

    For all parameters, if the TR size in the next iteration \( \left| {\left| {\varvec{d}^{(i + 1)} } \right|} \right| \) is small, below a user-specified threshold dthr,

  2. 2.

    For the selected parameters: if the TR region size in the next iteration \( \left| {\left| {\varvec{d}^{(i + 1)} } \right|} \right| \) exceeds the threshold dthr, and if, for a given parameter k, the Jacobian column Jk was: (i) calculated with FD at least once in the last N iterations and (ii) the factor α (i+1)k is below the user-specified threshold αhigh.

In the case of the rejected iteration, (i.e., ρ ≤ 0), as a rule, the decision vector established in the previous iteration, and stored in the selection matrix Γ, is not altered. However, if \( \left| {\left| {\varvec{d}^{(i + 1)} } \right|} \right| \) is below the threshold dthr, Ai+1.k is set to 0, when simultaneously the two following conditions are met: (i) α ik  < αlow and (ii) Jk was calculated with FD at least once in the last N iterations. The user defined thresholds fulfill the condition: \( 0 \, < \alpha_{\text{low}} < \alpha_{\text{high}} < \, 1 \), in order to ensure more frequent updates, if the iteration was unsuccessful.

Broyden Update Procedure.

Here, the Broyden update procedure, shown in Panel B of Fig. 2, is described in detail. During BUP, the parameters for which the alignment between the most recent design relocation and the coordinate system axes is satisfactory, are selected. Then, the corresponding columns of the circuit response Jacobian JR, are calculated using the Broyden formula (BF) instead of FD. Here, we adopt a rank-one Broyden update [36]

$$ \varvec{J}_{\varvec{R}}^{(i + 1)} = \varvec{J}_{\varvec{R}}^{(i)} + \frac{{\left( {\varvec{f}^{(i + 1)} - \varvec{J}_{\varvec{R}}^{(i)} \cdot \varvec{h}^{(i + 1)} } \right) \cdot \varvec{h}^{(i + 1)T} }}{{\varvec{h}^{(i + 1)T} \varvec{h}^{(i + 1)} }},\quad \quad i = 0,1, \ldots $$
(6)

where the following notation is used: \( \varvec{f}^{{\left( {i + 1} \right)}} = \varvec{R}\left( {\varvec{x}^{(i + 1)} } \right) {-}\varvec{R}\left( {\varvec{x}^{(i)} } \right) \), and \( \varvec{h}^{{\left( {i + 1} \right)}} = \varvec{x}^{(i + 1)} {-}\varvec{x}^{(i)} \). Note that (6) only updates the Jacobian in the one-dimensional subspace spanned by h(i) and at least n executions of (6) are necessary to obtain JR information within the entire space. Consequently, in higher-dimensional spaces, unsatisfactory results are usually obtained when the Jacobian is calculated solely using (6). Here, JR updating involves both FD and BF.

Let us denote by e(k) the standard basis vectors, i.e., \( \varvec{e}^{\left( k \right)} = \, \left[ {\begin{array}{*{20}c} 0 & \ldots & 0 & 1 & 0 & \ldots & 0 \\ \end{array} } \right]^{T} \) with 1 on the kth position. In the BUP procedure, for each parameter k, the values of the alignment factors \( \beta_{k}^{{\left( {i + 1} \right)}} = \, \left| {\varvec{h}^{{\left( {i + 1} \right)T}} \varvec{e}^{(k)} } \right|/\left| {\left| {\varvec{h}^{(i + 1)} } \right|} \right| \) are calculated. The factors β (i+1)k act as a quantification measure of the alignment between the current design relocation h(i+1) and the respective basis vectors e(k). Note that \( 0 \le \beta_{k}^{(i + 1)} \le 1 \) (β (i+1)k  = 0 and β (i+1)k  = 1, if h(i+1) and e(k) are orthogonal and co-linear, respectively). For a given parameter k, the respective entry of the BUP decision vector Bi+1.k is set to 0, if the alignment β (i+1)k is better than a user-defined alignment acceptance threshold βmin, otherwise Bi+1.k = 1; βmin is a control parameter for BUP: the higher βmin, the more stringent the condition for using BF gets, which likely leads to lower computational savings but higher quality of the obtained solution.

Jacobian Update.

Upon performing AUP and BUP, the selection matrix Γ is altered and the Jacobian update is performed accordingly. Γ is extended by adding the (i + 1)th column after each successful iteration. The added column is computed from both AUP and BUP in the following manner (see Panel C of Fig. 2). For a given index k, the entry γi+1,k is set to 0, if the Bi+1.k equals 0 (disregarding the value of Ai+1.k). In that case, the respective Jacobian column Jk is estimated using BF. Otherwise, if Bi+1.k = 1 and Ai+1.k = 0, then γi+1,k is set to –1 which indicates that the Jacobian from the previous iteration is to be used, i.e., \( \varvec{J}_{k} \left( {\varvec{x}^{(i + 1)} } \right) = \varvec{J}_{k} \left( {\varvec{x}^{(i)} } \right) \). Finally, if \( B_{i + 1.k} = A_{i + 1.k} = 1 \), Jk is obtained using FD. Thus, FD is performed exclusively if both routines conclude it is mandatory. Consequently, substantial computational savings are secured, as it is confirmed by the results presented in Sect. 3. At the same time, the observed degradation of the solution quality is practically acceptable. The algorithmic flow of the Jacobian update procedure is shown in Panel D of Fig. 2.

3 Numerical Results and Benchmarking

Here, two high-frequency structures are considered as a benchmark set: an equal-split rat-race coupler (RRC) composed of compact microstrip resonant cells (CMRCs) [37] shown in Fig. 3(a), and a wideband antenna [38] (Fig. 3(b)). The first structure (RRC) is implemented on Taconic RF-35 substrate (h = 0.762 mm, εr = 3.5, tanδ = 0.0018). The RRC geometry is described by a parameter vector \( \varvec{x} = \left[ {\begin{array}{*{20}c} {w_{1} } & {l_{1} } & {w_{2} } & {l_{2} } & {w_{3} } \\ \end{array} } \right] ^{T} \). Relative dimensions are \( l_{3} = 19w_{1} + 18w_{2} + w_{3} {-}l_{1} \), \( l_{4} = \, 5w_{1} + 6w_{2} + l_{2} + w_{3} \), \( l_{5} = \, 3w_{1} + \, 4w_{2} \) and \( w_{4} = 9w_{1} + \, 8w_{2} \) (all in mm). The coupler is supposed to operate at f0 = 1 GHz and it has been optimized for maximum bandwidth (defined at –20 dB level of matching and isolation), and symmetric around f0.

The antenna [37] is also implemented on RF-35 substrate, and it has the following independent geometry parameters x = [L0 dR R rrel dL dw Lg L1 R1 dr crel]T. The antenna has been optimized for minimum reflection within the UWB frequency range (3.1 GHz to 10.6 GHz). The computational model is implemented in CST Microwave Studio and evaluated using its transient solver.

Fig. 3.
figure 3

High-frequency structures used for verification of the proposed algorithm: (a) CMRC-based miniaturized microstrip rat-race coupler [37], and (b) wideband antenna [38].

For both structures, ten algorithm runs have been executed with random initial designs. The results for the proposed algorithm are presented in Table 1 and compared to the results obtained with the reference TR algorithm. The following acceptance threshold values were used φ0 = 0, 0.025, 0.05, 0.1, 0.2 and 0.3. The higher the threshold value, the lower the savings are, which is due to the more stringent condition for applying the Broyden update formula (and FD performed more frequently). The results for φ0 = 0 (Broyden-only Jacobian updates) are enclosed in order to demonstrate that this version does not yield acceptable design quality. The frequency characteristics before and after optimization have been shown in Fig. 4. The proposed combined algorithm delivers considerable reduction of the number of EM simulations needed to find the optimized solution even for the highest value of the threshold φ0 = 0.3 (around 48% for the coupler and around 57% for the antenna. The presented results also reveal an important advantage of the proposed algorithm, namely that the solution quality is stable, virtually independent of the acceptance threshold value apart from when φ = 0. This applies to both the objective function value (bandwidth BW in the case of the coupler and maximum in-band reflection S11 for the antenna) and the standard deviation of the respective objective functions. The latter is used as a measure of the results repeatability.

Table 1. Performance statistics of the proposed algorithm
Fig. 4.
figure 4

Reflection characteristics of the considered high-frequency components for the representative algorithm runs: (a) compact RRC, (b) UWB antenna: initial and optimized design are marked gray and black, respectively. The vertical line in (a) indicates the required operating frequency f0, whereas the horizontal line in (b) the design specifications.

The most suitable threshold value for the RRC is φ0 = 0.05: it delivers the best bandwidth and the smallest value of the bandwidth standard deviation, as well as cost savings of around 53%. As far as the UWB antenna is concerned, the value φ0 = 0.025 seems to be most appropriate: it delivers savings as high as 72%, while both the design quality and its repeatability are degraded only slightly.

In order to provide more in-depth analysis of the results, a supplementary Table 2 is provided, in which the results obtained with the sole usage of Broyden update procedure (without the accelerated procedure) are shown. The results presented in Table 2 exhibit more pronounced dependence of the design quality on the acceptance threshold value when compared to the combined (AUP and BUP) algorithm (see Table 1). As φ0 increases, the better the quality of the solution is, as expressed by the objective function value and its standard deviation across ten algorithm runs. At the same time, the cost savings drop significantly (to as low as 4% for the RRC and to around 20% for the antenna). Thus, the introduction of the second, AUP procedure allowed for higher savings delivering similar design quality.

Table 2. Performance statistics of the algorithm utilizing only BUP

4 Conclusions

In the paper, a procedure for expedited design optimization of high-frequency structures has been proposed. Our methodology accelerates a standard trust-region gradient-based algorithm with numerical derivatives by development and incorporation of an adaptive scheme for updating the Jacobians of the system under design. The scheme combines the two main mechanisms, selective Broyden updates (governed by measuring the alignment between the latest direction of the design relocation and the coordinate system basis vectors), as well as relative design changes with respect to the trust region size in respective directions. The latter allows for discriminating variables for which the response sensitivities are stable across the design space, which is often the case of antennas and microwave components. As demonstrated using a wideband antenna and a miniaturized microstrip coupler, the proposed algorithm allows for considerable computational savings without compromising the design quality. Various design trade-offs controlled by user-defined algorithm parameters have been investigated as well. The algorithm can be used not only to expedite direct optimization of high-fidelity EM models, but also to speed up surrogate-assisted procedures involving variable-fidelity simulations. The future work will be focused on incorporating the proposed methodology into surrogate-based optimization frameworks.