1 Introduction

Reducing wheel wear has been a topic of concern since railway vehicles emerge. Most directly, slow wheel wear can improve wheel-track system performances, extend wheel re-profiling mileage, and reduce maintenance costs [1,2,3]. At present, the works on wheel wear reduction are mainly based on five aspects: (1) wheel–rail tribology [4,5,6,7,8], (2) wheel/rail profile optimization [9], (3) vehicle/track design [10,11,12,13], (4) active control of vehicle suspensions [14,15,16,17], and (5) track layout setting [18, 19]. This article focuses on wheel profile optimization.

1.1 Existing methods

Since the advent of railway vehicles, the wheel–rail relationship has been a hot topic. The related research is mainly divided into two categories: (1) the wheel–rail contact geometry and (2) the wheel–rail interaction forces and their impact on vehicle-track systems. Both categories of research are based on specific wheel and rail profiles, and therefore, the design and optimization of wheel profiles have always been a topic of interest to scholars. In this section, we briefly review some of the wheel profile optimization methods proposed in the past two decades, which can be used to reduce wheel wear. These methods fall into two groups: single-objective and multi-objective optimization methods.

1.1.1 Single-objective optimization methods

Classical single-objective optimization methods include (I) target contact angle method [20], (II) target RRD (rolling radius difference) method [21,22,23,24], (III) target conicity method [25], and (IV) target normal gap method [26], etc.

(I) Target contact angle method

Shen et al. [20] proposed a method for designing the wheel profile using the inverse method of contact angle curve and applied it to the profile design of independent wheels and the wheels of low-floor vehicles. In this method, five hypotheses are introduced: (1) the wheel and rail are rigid, (2) the influence of wheelset roll on the contact angle is ignored, (3) the shape of the rail is convex, (4) the shapes of the left and right wheels and rails are symmetrical, and (5) the flange thickness and height, and the wheel width remain unchanged.

In Fig. 1, the wheel and rail profiles are given as \(Z_{\text{w}} (Y_{\text{w}} )\) and \(Z_{\text{r}} (Y_{\text{r}} )\), respectively. When the wheelset lateral displacement is \(y_{\text{s}}\), the coordinates of the contact point on the wheel and rail are \((y_{\text{w}} ,z_{\text{w}} )\) and \((y_{\text{r}} ,z_{\text{r}} )\), respectively, where \(z_{\text{r}} = Z_{\text{r}} (y_{\text{r}} )\) and \(z_{\text{w}} = Z_{\text{w}} (y_{\text{w}} )\). Then, there is

$$Z_{\text{w}} = Z_{0} + \int_{{y_{1} }}^{{y_{2} }} {\tan \alpha \, \text{d}} Y_{\text{w}},$$
(1)

where \(Z_{0}\) is the ordinate of the contact point on the wheel when \(y_{\text{s}} = 0\); \(\alpha\) is the contact angle, and \(\tan \alpha \text{ = d}Z_{\text{w}} /\text{d}Y_{\text{w}} = \text{d}Z_{\text{r}} /\text{d}Y_{\text{r}}\). Equation (1) means that the wheel profile can be expressed by \(\alpha (Y_{\text{w}} )\).

Fig. 1
figure 1

Sketch of contact points between the wheel and rail

The shape of the wheel profile designed with this method is not limited to the combination of arcs and straights. However, the effect of the wheelset roll angle is not taken into account in the contact angle.

(II) Target RRD method

Shevtsov et al. [21] proposed a method for the optimal design of a wheel profile based on RRD, which was subsequently used in many references.

As plotted in Fig. 2, the curve of the wheel profile is represented by discrete points, where solid points represent fixed points, and hollow points represent movable points. The entire profile is obtained by using a curve fitting method (such as the piecewise cubic Hermite interpolating polynomial [21], and the B-spline [22]) to fit these points. The abscissas \(y_{i}\, (i = 1,2, \ldots ,n)\) of the hollow points are unchanged, but their ordinates \(z_{i}\, (i = 1,2, \ldots ,n)\) are movable, and these ordinates are defined as design variables. Then an objective function [Eq. (2)] is introduced:

Fig. 2
figure 2

Fixed and movable points on wheel profile

$$F_{0} (x) = \frac{{\sum\nolimits_{i = 1}^{k} {\left( {\Delta r_{{y_{i} }}^{\text{tar}} (x) - \Delta r_{{y_{i} }}^{\text{calc}} (x)} \right)}^{2} }}{{\sum\nolimits_{i = 1}^{k} {\left( {\Delta r_{{y_{i} }}^{\text{tar}} (x)} \right)^{2} } }} \to \hbox{min},$$
(2)

where \(\Delta r_{{y_{i} }}^{\text{tar}} (x)\) is the target RRD function, \(\Delta r_{{y_{i} }}^{\text{calc}} (x)\) is the calculated RRD function for the design profile. To ensure the stability of the vehicle and the monotonicity of the tread region of generated wheel profiles, some constraints to limit the equivalent conicity and slope are required (see Ref. [21]).

This method firstly introduces a medium that reflects vehicle dynamics and wheel–rail contact mechanics: the RRD function, through which the optimization of the wheel profile is transformed into the optimization of the RRD function. Due to this transformation, the multi-objective problem of the optimal design of the wheel profile is transformed into a single-objective problem, which reduces the difficulty of solving the optimization problem. The key and difficulty of this method lie in the selection of target RRD function since an appropriate target RRD function is usually based on expertise and repeated attempts. In Ref. [21, 22], three target RRD functions were proposed. A similar method called the target \(Y - \Delta r\) method was presented in Ref. [27].

(III) Target conicity method

Polach [25] proposed a method for designing the wheel tread profile to achieve a wide contact spreading and target conicity level, i.e., target conicity method. This method is similar to the target contact angle method [20]. Firstly, five hypotheses are introduced: (1) the wheel and rail are rigid, (2) the wheelset roll angle is ignored, (3) the shape of the rail is convex, (4) the shapes of the left and right wheels and rails are symmetrical, and (5) the contact between wheel and rail is represented by a contact point.

In this method, the design of wheel tread is based on a specified rail profile. As shown in Fig. 1, Yw and \(Z_{\text{w}}\) can be expressed as a function of the wheelset lateral displacement ys, i.e., \(Y_{\text{w}} (s)\) and \(Z_{\text{w}} (s)\), respectively. To achieve continuous spreading, lateral distributions of the contact points on the rail profile are assumed proportional to the contact point distribution on the wheel profile. The wheel contact points can be transformed to the contact points on the rail by \(Y_{\text{r}} (y_{\text{s}} ) = Y_{\text{w}} (y_{\text{s}} ) + k_{{y}} y_{\text{s}} + Y_{0}\), where \(k_{y}\) is a coefficient related to RRD and can be used to adjust the target equivalent conicity level and \(Y_{0}\) is the abscissa of the contact point on the wheel when ys = 0. Then, the wheel profile can be further derived:

$$Z_{\text{w}} = Z_{0} + \int_{{y_{1} }}^{{y_{2} }} {Z_{\text{w}}^{{\prime }} } \text{d}Y_{\text{w}},$$
(3)

where \(Z_{\text{w}}^{{\prime }} (Y_{\text{w}} )\left| {y_{\text{s}} } \right. = Z_{\text{r}}^{{\prime }} (Y_{\text{r}} )\left| {y_{\text{s}} } \right. = \tan \alpha\), as plotted in Fig. 3. Finally, the equivalent conicity \(\lambda \approx {{\alpha_{0} R_{\text{w}} } \mathord{\left/ {\vphantom {{\alpha_{0} R_{\text{w}} } {(R_{\text{w}} - R_{\text{r}} )}}} \right. \kern-0pt} {(R_{\text{w}} - R_{\text{r}} )}}\) is proportional to RRD, which can be calculated by subtracting the vertical coordinates of contact points on the left and right wheels \(\Delta r(y_{\text{s}} ) = (Z_{\text{rL}} - Z_{\text{wL}} ) - (Z_{\text{rR}} - Z_{\text{wR}} )\), where subscripts \({\text{L}}\) and \({\text{R}}\) represent left and right, and \(\alpha_{0}\) is the contact angle between wheel and rail profiles in the nominal position. The selection of the contact point distribution and the proportionality coefficient \(k_{y}\) is limited by the wheel tread width and the rail width before the flange root contacts the rail gauge corner.

Fig. 3
figure 3

Sketch of wheel/rail contact geometry

As described in Ref. [25], the application environment of the wheel tread designed by this method is subject to some restrictions. This method is suitable for vehicles running primarily on straight lines and/or with heavy traction forces, leading to a rapid change of wheel tread shape after the turning. Since the wheel profile designed by this method is for a specific rail profile, it has higher requirements for the running line. The designed wheel profile can only be used when the shapes of rails vary little.

(IV) Target normal gap method

The vertical clearance (normal gap) between the wheel and the rail around the contact point is an important indicator for evaluating the compatibility of wheel/rail profiles, and a small normal gap can improve the “conformity” of the wheel and rail and reduce the contact stress, thereby reducing wheel wear. Based on this consideration, Cui et al. [26] proposed a wheel profile optimization method for reducing wheel wear based on target normal gap. In this method, three hypotheses are introduced: (1) the elastoplastic deformation of the wheel and rail is ignored; (2) the effect of wheelset yaw motion on the contact patch is ignored; (3) the flange thickness and height and wheel width remain is unchanged.

The first step of this method is to laterally move the wheelset, and the wheelset lateral distance is defined as \(Y_{j}\), where \(j\) represents the \(j\) th contact point. Then the non-Hertzian contact method is used to solve the wheel–rail contact patch size of the contact point \(C_{j}\), and \(c_{1}\) and \(c_{2}\) define the boundary of the calculated region, the values of which are set to be slightly larger than the corresponding semi-axis of the contact patch. As shown in Fig. 4, the normal gap function is defined as \(D_{j} = \sum\nolimits_{i = 1}^{m} {d_{ji} /m}\), where \(d_{ji}\) is the normal clearance at the ith point and m is the number of discrete points in the region around the contact point \(C_{j}\). Like the target RRD method [21], the ordinates zj (j = 1, 2,…, n) of the movable points (see Fig. 2) are defined as design variables, which means that \(D_{j}\) can be represented by \(D_{j} = D_{j} (Y_{j} ,z_{1} ,z_{2} , \ldots ,z_{n} )\) since \(d_{ji}\) is determined by the wheel profile function \(f(Y_{j} ,z_{1} ,z_{2} , \ldots ,z_{n} )\). Finally, the objective function is introduced as

$$S = \frac{{\sum\nolimits_{j = 1}^{K - 1} {c[w_{j} D_{j} (Y_{j} ,z_{1} ,z_{2} , \ldots ,z_{n} ) + w_{j + 1} D_{j + 1} Y_{j + 1} ,z_{1} ,z_{2} , \ldots ,,z_{n} )]} }}{2} \to \hbox{min},$$
(4)

where \(K\) is the number of points in the normal gap curve, \(c = \left| {y_{j} - y_{j - 1} } \right|\), and \(w_{j}\) is a weighting factor. To ensure the safety of the vehicle and the monotonicity of the tread region of the generated wheel profile, some constraints to limit the equivalent conicity and slope are required [26].

Fig. 4
figure 4

Normal gap between the wheel and the rail

In Ref. [26], this method was used to optimize the LMa wheel profile that matches the CHN60 rail. Simulation experiments showed that the curve-negotiation performance and wheel–rail contact stress level of the optimized profile are better than those of the traditional LMa profile. However, the effect of the wheelset yaw on the contact patch is not considered in this method, while some studies have shown that the wheelset yaw has a significant influence on the contact patch and wheel wear [28].

1.1.2 Multi-objective optimization methods

Compared with the single-objective optimization methods listed, the multi-objective optimization method has multiple objective (target) functions, and there is usually an interdependent relationship between the objective functions, even completely opposite or contradictory. The multi-objective optimization algorithm is to find a compromise in these relationships, thus multiple targets can be improved rather than one target improves and the other target deteriorates.

In the optimization of wheel profiles, the multi-objective optimization method is usually based on multi-body dynamics simulation (MBS). With the MBS technique, the dynamic responses corresponding to different wheel profiles can be obtained, and these wheel profiles are considered as the input of the optimization algorithm, and the dynamic responses are the output of the optimization algorithm. The output of the optimization algorithm often contains not only optimization targets, but also constraints. The frequently used outputs include wear index, RCF (rolling contact fatigue) index, safety-related index, stability-related index, noise index, comfort level, etc. It should be noted that when the number of the optimization target is 1, the multi-objective optimization problem is transformed into a single-objective optimization problem, which means that the multi-objective optimization method is also applicable to the single-objective optimization problem, an example is shown in Ref. [9]. In terms of the optimization algorithms, this section introduces two categories: (I) bio-inspired optimization algorithm and (II) response surface technique.

(I) Bio-inspired optimization algorithm

The most common bio-inspired algorithm used to optimize wheel profiles is GA (genetic algorithm) [29]. For instance, Persson et al. [30] used GA to build the relationship between different wheel profiles and various dynamic parameters including ride comfort, lateral track-shifting force, maximum derailment coefficient, wear index, maximum contact stress, in which, 110 and 121 wheel profiles were generated for a soft bogie vehicle and a stiff bogie vehicle, respectively. Finally, a new wheel profile (P8) was obtained. Novales et al. [31] used GA to generate every new wheel profile through a semi-random combination of the fourth derivative (obtained in an incremental manner) of two wheel profiles of the last generation. In the formulation of optimization targets, a formula \(I_{\text{T}} = d_{\text{f}} + 4T\gamma + I_{{\text{stress}}}\) that can transform a multi-objective optimization problem into a single-objective optimization problem was introduced, where \(d_{\text{f}}\) is the derailment coefficient, \(T\gamma\) is the wear index, and \(I_{{\text{stress}}}\) is the wheel–rail contact stress. The formula can also be assigned with appropriate weights to consider factors such as track payout. Choi et al. [32] fitted 10 points, including five fixed points and five movable points (the same idea as in Fig. 2) by using a piecewise cubic Hermite interpolating polynomial function to generate new wheel profiles, and the corresponding wear index, surface fatigue index, maximum derailment coefficient, maximum lateral force, maximum overturning coefficient were calculated by the VAMPIRE software. Among them, the wear index and the surface fatigue index were designed as optimization targets, while the maximum derailment coefficient, the maximum lateral force, the maximum vertical force, and the maximum overturning coefficient were considered as boundary constraints. After that, the relationship between these generated profiles and wear index and surface fatigue index was established through NSGA-II (non-dominated sorting genetic algorithm, an improved GA). The final results showed that the optimized profile found by NSGA-II could reduce wear and fatigue, and yielded good performance in terms of derailment and lateral force.

Another classical bio-inspired algorithm that has been used in wheel profile optimization is the PSO algorithm [33]. For instance, based on the cubic NURBS (non-uniform rational b-spline) method, Lin et al. [34] applied PSO to design an LM thin flange wheel profile, in which, the wear index and wheel–rail lateral force were designed as objective targets; the ordinate range, the tread monotonicity, the concave-convex properties, the flange thickness, and the derailment coefficient were designed as boundary constraints. However, the application of the thin flange wheel profile designed in Ref. [34] may involve modifications to existing designing or operating standards [35], thereby greatly reducing its versatility and engineering applicability. Cui et al. [36] used PSO to design a new wheel profile for a CRH1 train, in which, 14 movable points were designed as variables (see Fig. 2), and the generated wheel profile was fitted by using cubic spline function; a weighted function \(f(z) = 10000w_{1} f_{1} + w_{2} f_{2} + 100w_{3} f_{3}\) was designed as the objective function, which considered the angle of attack f1, the maximum lateral force \(f_{2}\), and the carbody acceleration \(f_{3}\), \(w_{i}\, (i = 1,2,3)\) is the weighted factor; the tread monotonicity, the derailment coefficient, and the load reduction rate were designed as the constraints. Like Ref. [31], the study [36] also transformed the multi-objective optimization problem into a single-objective optimization problem.

Besides, there are also some other bio-inspired optimization algorithms that can be used in wheel profile optimization. For instance, Firlik et al. [37] applied CMA-ES (covariance matrix adaptation evolution strategy, a genetic approach) to design a wheel profile for a low-floor tram from the city of Poznań, Poland, in which, the wheel profile was divided into five parts (a cubic spline curve, an arc of a circle, a straight line and two fillets), and the spline curve was designed as the optimization region; the wear index, derailment coefficient, and contact area corresponding to over 50 000 wheel profiles were calculated. With the help of CMA-ES, an optimal wheel profile (FP7) was found.

(II) Response surface technique

The basic idea of the response surface technique [38] is to locally adopt a low-order polynomial (quadratic or linear, this polynomial is often called a regression model) to fit a response surface that can reasonably reflect the real response (in wheel profile optimization, the target, and the constraints are the output responses). The optimization problem is solved by maximizing or minimizing the response surface. This method effectively reduces the amount of calculation through reasonable experimental design (such as sampling strategies). For instance, Ye et al. [9] used a small number of samples to build the relationship between wheel profiles and worn volume using KSM [39]. Through the established KSM, an optimized wheel profile to reduce wheel wear could be found. A similar study was presented in Ref. [40].

The biggest advantage of the methods based on bio-inspired algorithm and response surface technique is that they get rid of the establishment of a clear medium that reflects vehicle dynamics and wheel–rail contact mechanics, and directly find the relationship between the wheel profile and the optimization target through iterative calculation and/or regression calculation. In addition, these multi-objective optimization methods can also be applied to single-objective optimization problems, while the target-based techniques previously listed are inapplicable to multi-objective optimization problems. More importantly, by introducing multi-body dynamics models, these methods can accurately take vehicle-track system factors, including wheelset yaw and roll, track layout, etc., into account in the design of the wheel profile. However, it should be noted that these methods based on the bio-inspired algorithms are often computationally expensive since they involve a large number of iterative calculations, which means that a large number of MBSs are required.

1.2 Outlook

In addition to the aforementioned literature, there are also many other studies on wheel profile optimization [41,42,43,44,45]. Overall, the following outlook can be concluded:

  1. (1)

    In terms of the design variables in multi-objective optimization methods, how to numerically represent the wheel profile is an important issue, such as arcs, fillets, straights, and splines. An expression that can well describe the profile of the wheel can not only reduce the amount of calculation but also the interface with MBS software is not error-prone.

  2. (2)

    The selection of the objective function needs to be determined according to the train object. Freight trains are characterized by heavy axle load, and their running routes often contain many small-radius curves (such as the German Blankenburg–Rübeland railway line). The wheels are prone to fatigue, and the flange wear is often severe. The optimization targets should be more focused on safety and wheel damage. Passenger trains are characterized by fast speeds and people being transported, so their optimization targets should be more focused on safety, stability, and comfort. Of course, boundary constraints are essential.

  3. (3)

    With the expansion of urbanization, more and more railway vehicles shuttle on fixed lines, such as metro, light rail, and tram [9]. In addition, due to some unique geographic or economic reasons, some vehicles typically operate on a designated line. For instance, some CRH1A and CRH380A trains only run on the China Hainan Roundabout Railway Line because of the unique island geography of Hainan province. For these vehicles shuttling on fixed lines, a specific wheel profile that takes into account the specific track layout is a good alternative.

  4. (4)

    Bio-inspired algorithms require a large number of iterative calculations, and response surface techniques have regression capabilities. How to combine the two methods to quickly and reliably complete the task of optimizing wheel profiles, we believe this is a promising direction.

1.3 Contribution and structure of this work

The main work of this paper is summarized as follows:

  1. (1)

    In Sect. 1.1, we briefly review the methods concerning wheel profile optimization over the past two decades.

  2. (2)

    For the design of wheel profiles, we propose a comparably conservative rotary-scaling fine-tuning (RSFT) method to fine-tune the traditional wheel profiles, it is also an improvement of our previous work [9]. This method introduces two design variables and an empirical formula to generate new wheel profiles that meet the design standards. Besides, it avoids smoothing problems and has good compatibility with MBS software. More importantly, the new wheel profile obtained by fine-tuning the traditional profile is more proper to engineering applications.

  3. (3)

    For the optimization method, we propose a KSM–PSO-based multi-objective optimization method. This method combines the iterative computing power of the bio-inspired algorithm with the regression capability of the response surface technique and can quickly and reliably complete the task of optimizing wheel profiles.

  4. (4)

    We propose an RSFT–KSM–PSO method to optimize wheel profiles of railway vehicles shuttling on special lines, where a BOMBARDIER TRAXX locomotive serving on the German Blankenburg–Rübeland railway line is taken as a case to show how to implement the strategy.

  5. (5)

    We propose two wear-resistant wheel profiles for the TRAXX locomotive serving on the Blankenburg–Rübeland railway line to reduce the severe flange wear we observed, and our preliminary results show that the two profiles satisfy the requirements specified in Standard UIC 518 [46], EN 14363 [47], EN 15313 [35], etc.

Section 2 of this paper introduces the RSFT method, where the profile S1002 is taken as an example to show how to generate new wheel profiles. In Sect. 3 we simulate the BOMBARDIER TRAXX locomotive and the Blankenburg–Rübeland railway line in SIMPACK. In Sect. 4, we propose a KSM–PSO-based multi-objective optimization method for optimizing wheel profiles. In Sect. 5 we present and analyze the simulation results. In Sect. 6 the improved wheel profiles are tested according to Standard EN 14363. This paper ends with a brief discussion and conclusions.

2 RSFT method for generating wheel profiles

An important step in wheel profile optimization is to generate new wheel profiles. As shown in Fig. 5, the current approaches for directly generating wheel profiles can be concluded into three main categories: (a) the approach based on fitting discrete points (FDP-approach), (b) the element combination approach, and (c) the scaling approach.

Fig. 5
figure 5

Wheel profile generation approach

  1. (a)

    Approach based on fitting discrete points: The wheel profile is firstly represented by discrete points (Fig. 5a). The points in the non-optimization region are fixed, while the points in the optimization region are movable within the constraints (usually the ordinate is movable). Then, these discrete points are connected by a curve fitting method to generate new wheel profiles. Some examples are given in Refs. [21,22,23,24, 26, 27, 32, 34, 37].

  2. (b)

    Element combination approach: As plotted in Fig. 5b, the wheel profile is divided into several elements (sections), such as splines, arcs, and straights. Then, two (or more) elements act as the optimization region, and the parameters (e.g., the curvature of the circular arch, the length of the straight) of these elements are adjusted within constraints to generate a new wheel profile. A curve fitting method (e.g., B-splines, Beziers, NURBS) is generally required to ensure the smoothness of the generated wheel profile. An example is given in Ref. [44].

  3. (c)

    Scaling approach: A scale factor \(\alpha\) is introduced to enlarge or lessen the profile as a whole, as plotted in Fig. 5c. As mentioned in Ref. [9], the shortcoming of this approach is obvious, since a simple scale factor cannot take into account multiple indicators, such as safety-related index, stability-related index, wear index.

The first two approaches are promising, but they involve many design variables, which means that they are computationally expensive since a large number of iterative calculations and MBSs are required. More importantly, the setting of the curve is a complicated issue, which needs to consider wear, RCF, safety, stability, noise, comfort, service life, maintenance costs, etc. Since the performance shown in the face of these factors needs to be tested in long-term actual operation, a completely new wheel profile is difficult to be promoted for practical applications.

Alternatively, this paper introduces a comparably conservative RSFT method to fine-tune the traditional wheel profiles for improving their engineering applicability. Since the S1002 profile is a classic and widely used wheel profile in Europe and even worldwide [48], and it is also machined on the wheelsets of the introduced locomotive. It is, therefore, introduced as an example to show the theory and technique route of the RSFT method. The MATLAB code is disclosed in “Appendix 1” section. The specific steps, as shown in Fig. 6, are as follows:

Fig. 6
figure 6

RSFT method for wheel profile generation (an example of \(\alpha_{1} = 0.9\), and \(\alpha_{2} = 1.02\))

Step 1 Rotation

The tread base point (nominal circle contact point) is set as the origin of the profile coordinate, and the lateral coordinate and vertical coordinate are set as \(oy\)-axis and \(oz\)-axis, respectively. The original S1002 is defined as a function of \(y\), i.e., \(z(y)\), and the point on the curve is written as \((y,z)\). A transformation matrix \(\varvec{T}_{1}\) is introduced to rotate the curve until the vertex \(A(y_{\theta } ,z_{\hbox{max} } )\) of the curve coincides with the \(oy\)-axis and the point A after being rotated is written as \(A_{1}\). The point \((y,z)\) after being rotated is written as \((y_{1} ,z_{1} )\).

$$\left[ {\begin{array}{*{20}c} {y_{1} } \\ {z_{1} } \\ \end{array} } \right] = \varvec{T}_{1} \left[ {\begin{array}{*{20}c} y \\ z \\ \end{array} } \right]\text{, }\;\text{and}\;\text{ }\varvec{T}_{1} = \left[ {\begin{array}{*{20}c} {\cos \theta } & {\sin \theta } \\ { - \sin \theta } & {\cos \theta } \\ \end{array} } \right],$$
(5)

where \(\theta = \arctan ({{z_{\hbox{max} } } \mathord{\left/ {\vphantom {{z_{\hbox{max} } } {y_{\theta } }}} \right. \kern-0pt} {y_{\theta } }})\) is the rotation angle, \(z_{\hbox{max} }\) and \(y_{\theta }\) are the ordinate and abscissa of point \(A\), respectively.

Step 2 Scaling

The curve \(z(y)\) after being rotated is written as \(z_{1} (y)\), and a correction coefficient \(\alpha_{1}\) is introduced to scale the \(z\)-coordinate of the points on the curve:

$$\left[ {\begin{array}{*{20}c} {y_{2} } \\ {z_{2} } \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {y_{1} } \\ {\alpha_{1} z_{1} } \\ \end{array} } \right].$$
(6)

The selection of the value of \(\alpha_{1}\) needs to consider the following three issues: (1) A small change of \(\alpha_{1}\) significantly affects the equivalent conicity, and further affects the curve-negotiation performance and critical speed of the vehicle [49]. Besides, when the value of \(\alpha_{1}\) increases to 1.0561 (for the S1002 profile), the slope of the tread region is not strictly monotonic, which will worsen the self-centering ability of the wheelset. (2) Increasing or decreasing the value of \(\alpha_{1}\) affects the flange thickness \(S_{\text{d}}\) flange slope quota qR, affecting the evaluation indicators (\(S_{\text{d}}\) and \(q_{\text{R}}\) in Fig. 7) specified in Standard EN 15313 [35] to a certain extent. (3) With reference to the North American transit system [50], the flange angle \(\alpha_{\text{f}}\) is considered to vary within 60°–75°. Based on the above considerations, the value of \(\alpha_{1}\) is selected in the range of \(0.95 \le \alpha_{1} \le 1.05\).

Fig. 7
figure 7

Three parameters for evaluating wheel wear as well as their limits for a 1250-mm-diameter-wheel specified in EN15313

Step 3 Re-rotation

A transformation matrix \(\varvec{T}_{2} \varvec{ = T}_{1}^{\text{T}}\), which is used to rotate the curve \(z_{2} (y)\) back, is introduced to generate the curve \(z_{3} (y)\), and the generated points \((y_{3} ,z_{3} )\) of \(z_{3} (y)\), corresponding to \((y,z)\) of \(z(y)\), is expressed as

$$\left[ {\begin{array}{*{20}c} {y_{3} } \\ {z_{3} } \\ \end{array} } \right] = \varvec{T}_{2} \left[ {\begin{array}{*{20}c} {y_{2} } \\ {z_{2} } \\ \end{array} } \right]\text{, }\;\text{and}\;\text{ }\varvec{T}_{2} = \left[ {\begin{array}{*{20}c} {\cos \theta } & { - \sin \theta } \\ {\sin \theta } & {\cos \theta } \\ \end{array} } \right].$$
(7)

Step 4 Empirical formula

It can be seen from Fig. 6 (Step 3) that the adjusted profile z3(y) changes significantly in the inner section (left half axis) including the flange region. This section is the focus of optimization, as wear in this section is usually the most severe for vehicles running on curved tracks. However, the outer section (right half axis) also significantly changes. If one takes into account factors such as self-centering and critical speed, this section should be changed as little as possible, but not changing this section will cause the curve to be not smooth, deteriorating the running performance of railway vehicles. To solve this problem, an important empirical formula \({{E_x}}\) is introduced to modify the curve, and the generated curve is written as z4(y)

$$\left[ {\begin{array}{*{20}c} {y_{4} } \\ {z_{4} } \\ \end{array} } \right] = \varvec{T}_{2} \left[ {\begin{array}{*{20}c} {y_{2} } \\ {z_{2} - {{E_x}}} \\ \end{array} } \right],$$
(8)

where Ex is obtained through our extensive simulation experiments, expressed as

$${{E_x}} = \sin \left( {\frac{\uppi }{2}\frac{{y_{3} }}{{\left| {y_{\theta } } \right|}}} \right)\cdot\left( {z_{3} - z} \right),$$
(9)

\(y_{\theta }\) is the abscissa corresponding to the vertex \(A\) of the S1002 profile, and z is the ordinate of the initial point on the original S1002 profile (\(z(y)\)). As shown in Fig. 6 (Step 4), the use of this formula produces the following three merits: (1) the inner section of the wheel profile can be adjusted to a large extent, while the outer section remains relatively stable, which can greatly reduce flange wear and ensure the self-centering ability of wheelsets; (2) the smoothness of the entire profile is guaranteed without introducing a fitting function; and (3) the flange height remains constant and does not affect the evaluation index \(S_{\text{h}}\) specified in Standard EN 15313 [35].

Step 5 Rescaling

Another correction factor \(\alpha_{2}\) to scale the y-coordinates of these points \((y_{4} ,z_{4} )\) on the curve \(z_{4} (y)\) is introduced to fine-tune the flange thickness, and the final curve is obtained as

$$\left[ {\begin{array}{*{20}c} {y_{5} } \\ {z_{5} } \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {\alpha_{2} y_{4} } \\ {z_{4} } \\ \end{array} } \right].$$
(10)

It should be noted that a too-large or too-small value of \(\alpha_{2}\) may accelerate the other two indicators \(S_{\text{d}}\) and \(q_{\text{R}}\) specified in Standard EN 15313 [35] to reach the threshold for determining grinding, which will cause the wheelset to be scrapped in advance. Therefore, according to our experience, the range of \(\alpha_{2}\) is set as \(0.98 \le \alpha_{2} \le 1.02\). The determined design range of \(\alpha_{1}\) and \(\alpha_{2}\) ensures that the flange angle \(\alpha_{\text{f}}\) changes within 60°–75°.

The proposed RSFT method only introduces two design variables, which greatly reduces the amount of calculation. Besides, the application of the empirical function can not only achieve the optimization of the flange region but also ensure the original ability of the wheelset when the vehicle runs on straight tracks. More importantly, based on the existing wheel profile, this method does not introduce a curve fitting method, avoids the complex curve design problem, and is easily generalized to engineering applications. However, it should be noted that this method relies heavily on the baseline model (the original wheel profile), and the superiority of the optimized profile depends to some extent on the baseline model.

3 Locomotive-railway line dynamics model

A TRAXX F140 AC locomotive from BOMBARDIER serving on the German Blankenburg–Rübeland railway line is taken as a case to show how to use the RSFT method to obtain an optimal wheel profile for wear reduction while meeting the requirements of operational safety.

3.1 BOMBARDIER TRAXX locomotive

The MBS model of the TRAXX locomotive simulated in our work consists of three sub-structures, one for the carbody and two for the bogies, where each bogie consists of one bogie frame, two motors, two motor hanging arms, two driven wheelsets, four axleboxes, and four axlebox arms. Like most locomotives, the TRAXX locomotive has two stages of suspensions, i.e., the primary suspension and the secondary suspension. The corresponding suspension elements are established as shown in Fig. 8a. Finally, the MBS model of the locomotive is made up of 35 rigid bodies, with a total of 130 degrees of freedom (DOFs). The final model simulated in SIMPACK is shown in Fig. 8b. Some data of interest are listed in Table 1. The unlisted data are confidential and the authors have no rights to disclose them.

Fig. 8
figure 8

Modeled elements of the suspension system (a) and the MBS model of the BOMBARDIER TRAXX locomotive (b)

Table 1 Related information of the locomotive-railway line model used in SIMPACK

3.2 Blankenburg–Rübeland railway line

The introduced TRAXX locomotive operated almost exclusively on the German Blankenburg–Rübeland railway line. This line was built starting in about 1880 in Blankenberge in the Harz Mountains, connecting the companies (such as Hüttenwerke and Kalkbranntwerke) there to the railway network [51]. On the one hand, bulk goods must be transported away. On the other hand, raw materials such as coal and lime are supplied. Currently, most of the vehicles on this line mainly serve the lime project of the Fels-Werke GmbH in Harz. The corresponding transport route starts in Blankenberge and leads to Michaelstein before continuing to Rübeland. The return trip takes place in the reverse order, forming a cycle of the Rübelandbahn, with a total length of approximately 31.46 km. The satellite map of the line is shown in Fig. 9a.

Fig. 9
figure 9

Rübeland–Blankenburg railway line: a the satellite map, and b the plan view of the line simulated in SIMPACK [9]

The curvature, superelevation, and rail cant of the railway line are simulated according to the existing route plan [52, 53], and the track gauge \(S_{\text{w}}\) is approximately calculated by Eq. (11). Finally, the plan view of the line simulated in SIMPACK is shown in Fig. 9b. By comparing Fig. 9a with Fig. 9b, it can be known that the simulated line and the actual line are almost identical. The vehicle speed was measured by the authors’ technique team [51]. For detailed technical information concerning the Rübeland–Blankenburg railway line, including the track layout parameters, as well as the vehicle speed, see our previous work [9].

$$S_{\text{w}} = \left\{ {\begin{array}{*{20}l} {1.435\;\text{m,}\;\text{when }r_{{\text{arc}}} = + \infty, \text{ }} \hfill \\ {\frac{100}{{r_{{\text{arc}}} }} \times 0.02\;\text{m} + 1.435\;\text{m},\;\text{when }r_{{\text{arc}}} \ne + \infty }, \hfill \\ \end{array} } \right.$$
(11)

where \(r_{{\text{arc}}}\) is the arc radius.

4 KSM–PSO-based wheel profile optimization for wear reduction

4.1 Optimization problem statement

As can be seen from Fig. 9, the Blankenburg–Rübeland railway line has many tight curves, some of which even have a radius of only 180 m (see Ref. [9]), which results in severe wheel flange wear of vehicles shuttling on this line. In fact, this phenomenon has been confirmed in our previous project [51]. In this work, we aim to optimize the wheel profile of the TRAXX locomotives exclusively operating on this line to reduce wheel wear. Not only can it extend the service life of the wheelset, and it can also mitigate track deterioration.

4.1.1 Design variables and their limits

The correction factors \(\alpha_{1}\) and \(\alpha_{2}\) described in Sect. 3 are designed as the input variables of the optimization model, in which, \(\alpha_{1}\) and \(\alpha_{2}\) range in 0.95 ≤ \(\alpha_{1}\) ≤ 1.05 and 0.98 ≤ \(\alpha_{2}\) ≤ 1.02, respectively.

4.1.2 Optimization targets

The currently available wheel and/or rail wear models can be classified into two categories [54]: (1) the Archard-based models and (2) the \(T\gamma\)-based models. In the \(T\gamma\)-based models, such as the BRR model [55], the USFD model [56] and the Zobory model [57], the first step is to calculate the dissipated energy (wear number) in the wheel–rail contact patch. This is based on the assumption that there is a direct relationship between material loss and energy dissipation. In this paper, the energy dissipation is expressed as the product of the tangential force (\(T\)) and the creepage (\(\gamma\)), i.e., \(T\gamma\):

$$T\gamma = F_{x} v_{x} + F_{y} v_{y} + M_{z} \varphi_{z},$$
(12)

where \(F_{x}\) and \(F_{y}\) are longitudinal and lateral creep forces, respectively; \(v_{x}\) and \(v_{y}\) are longitudinal and lateral creepages, respectively; \(M_{z}\) and \(\varphi_{z}\) are the creep torque and spin creepage, respectively. Note that in SIMPACK 2020X the wheel–rail contact algorithm does not calculate the creep torque. Therefore, the wear number introduced here is the scalar product of the tangential forces and the related creepages.

The \(T\gamma\) value of the first wheelset of the BOMBARDIER TRAXX locomotive driving from Blankenburg to Rübeland (approximate 15.73 km) is considered as the optimization target since the wheel–rail interaction force of the first wheelset is generally greater than that of the other wheelsets. Finally, the optimization targets are expressed as

$$\left\{ {\begin{array}{*{20}c} {{\text{Target 1: }}w_{\text{t}} { = }\int_{t_{\text{s}}}^{t_{\text{e}}} {(T\gamma_{l.t} (t) + } \, T\gamma_{{{\text{r}} . {\text{t}}}} (t))\text{d}t} \\ {{\text{Target 2: }}w_{\text{f}} { = }\int_{t_{\text{s}}}^{t_{\text{e}}} {(T\gamma_{l.f} (t) + } \, T\gamma_{{{\text{r}} . {\text{f}}}} (t))\text{d}t} \\ \end{array} } \right.,$$
(13)

where \(w_{\text{t}}\) and \(w_{\text{f}}\) are the total wear number of the tread region and flange region, respectively; ts and te are the start and end simulation time, respectively; \(T\gamma_{{{\text{l}} . {\text{t}}}} (t)\) and \(T\gamma_{{{\text{r}} . {\text{t}}}} (t)\) are the \(T\gamma\) value of tread region of the left and right wheel of the first wheelset, respectively; and \(T\gamma_{{{\text{l}} . {\text{f}}}} (t)\) and \(T\gamma_{{{\text{r}} . {\text{f}}}} (t)\) are the \(T\gamma\) values of flange region of the left and right wheel of the first wheelset, respectively. With reference to Standard EN 13715 [58], in wear number calculation we consider the H2-C1* section and C1*-I section as the flange region and tread region, respectively, as shown in Fig. 10, where we define the coordinate of C1* as (− 26, zP), and zP is determined by the RSFD method introduced in Sect. 2.

Fig. 10
figure 10

Definition of the flange region and tread region (see EN 13715)

4.1.3 Constraints

To guarantee the operational safety and stability, referring to Standard UIC 518 [46] and Ref. [32], the wheel–rail vertical force \(Q\) and Nadal’s coefficient (derailment coefficient) \(f_{\text{d}}\) of all wheels, as well as the lateral force \(Y\) and overturning coefficient \(\eta\) of the vehicle system, among the whole line are designed as constraints.

Any high lateral force from the vehicle-track system, including wheel–rail lateral force and axlebox lateral force, is detrimental to the structural safety of the wheel and rail. Anyone of wheel–rail lateral force and axlebox lateral force (\(Y\)) must satisfy

$$Y\left| {\hbox{max} } \right. \le \frac{{P_{0} }}{3} + 10\,({\text{kN}}),$$
(14)

where \(P_{0}\) is a static axle load.

A high vertical wheel–rail force may lead to severe wear and/or RCF on the track. The vertical wheel–rail force (\(Q\)) must satisfy

$$Q\left| {\hbox{max} } \right. \le 90 + Q_{0}\, ({\text{kN}}),$$
(15)

where \(Q_{0}\) is the static load on a wheel.

The derailment coefficient (fd) is used to evaluate the safety of railway vehicles, and its permissible limit is

$$f_{\text{d}} = \frac{Y}{Q} \le 0.8.$$
(16)

The overturning coefficient (η) is an important index to evaluate the safety of railway vehicles, and its permissible limit is

$$\eta = \frac{{\sum\nolimits_{\text{bogie}} {Q_{{i{\text{l}}}} - } \sum\nolimits_{\text{bogie}} {Q_{{i{\text{r}}}} } }}{{\sum\nolimits_{\text{bogie}} {Q_{{i{\text{l}}}} + } \sum\nolimits_{\text{bogie}} {Q_{{i{\text{r}}}} } }} \le 1,$$
(17)

where \(Q_{{i{\text{l}}}}\) and \(Q_{{i{\text{r}}}}\) are the vertical wheel–rail force on the left and right side of the ith wheelset, respectively.

Finally, the optimization problem is transformed into

$$\left\{ {\begin{array}{*{20}l} {{\text{Minimize: }}w_{\text{t}} \, } \hfill \\ {{\text{Minimize: }}w_{\text{f}} } \hfill \\ {{\text{Design varaible: }}0.95 \le \alpha_{1} \le 1.05 \, } \hfill \\ {\begin{array}{*{20}l} {{\text{Design varaible: }}0.98 \le \alpha_{2} \le 1.02} \hfill \\ {\begin{array}{*{20}l} {{\text{Subject to: }}Y \le 92.04\;{\text{kN}}} \hfill \\ {{\text{Subject to: }}Q \le 213.06\;{\text{kN}}} \hfill \\ {{\text{Subject to: }}f_{\text{d}} \le 0.8} \hfill \\ {{\text{Subject to: }}\eta \le 1} \hfill \\ \end{array} } \hfill \\ \end{array} } \hfill \\ \end{array} } \right..$$
(18)

4.2 KSM-based objective functions and constraint functions

For discrete datasets, parameter optimization is based on a large number of samples. Specifically, for the wheel profile optimization problem that relies on MBSs, the dynamic indexes corresponding to a large number of different profiles need to be evaluated, which means that a lot of repeated modeling and simulations are required. For example, in Ref. [37], to obtain a wheel profile that takes into account the wear index, the derailment coefficient, and the wheel–rail contact patch area, over 50 000 wheel profiles were evaluated. Such a huge number of simulations (or evaluations) that the railway industry is reluctant to do many deterministic analyses. Moreover, in our work, each simulation takes about three hours (computing facilities: software: SIMPACK 2020X; hardware: Intel Core i7-4790K, 4.00 GHz.), which is on the premise that the simulation can run smoothly. The calculation amount is so large that it is unrealistic to perform a large number of simulations. Therefore, it is of interest to introduce a high-efficiency and reliable method to reduce the simulation amount.

KSM [39] is an interpolation technique based on statistical theory. It uses a small number of samples that meet a specific sampling strategy to construct a simplified mathematical model that approximates the original complex model. The essence of KSM is to perform regression or interpolation on the discrete dataset using an approximation function to predict the unknown region. This technique is introduced in our work to model the relationship between the correction factors (\(\alpha_{1}\) and \(\alpha_{2}\)) and the wheel–rail wear number and constraints \(w_{\text{t}}\), \({\text{w}}_{\text{f}}\), \(Y\), \(Q\), \(f_{\text{d}}\), and \(\eta\).

In the establishment of KSM, a group of \(\left\{ {\alpha_{1}^{(i)} ,\alpha_{2}^{(i)} ,w_{\text{t}}^{(i)} ,{\text{w}}_{\text{f}}^{(i)} ,Y^{(i)} ,Q^{(i)} ,f_{\text{d}}^{(i)} ,\eta^{(i)} } \right\}\) is defined as the \(i\) th sample, \(i = 1,2, \ldots ,n\), and n is the number of samples. The \(i\) th input design point is defined as

$$\varvec{\alpha}^{(i)} = \left[ {\alpha_{1}^{(i)} ,\alpha_{2}^{(i)} } \right],i = 1,2, \ldots ,n,$$
(19)

where \(\alpha_{1}^{(i)}\) and \(\alpha_{2}^{(i)}\) are obtained by Latin hypercube sampling (LHS) strategy [59], and the corresponding output response point is defined as

$$\varvec{r}^{(i)} = \left[ {w_{\text{t}}^{(i)} ,{\text{w}}_{\text{f}}^{(i)} ,Y^{(i)} ,Q^{(i)} ,f_{\text{d}}^{(i)} ,\eta^{(i)} } \right],$$
(20)

where \(Y^{(i)}\), \(Q^{(i)}\), and \(f_{\text{d}}^{(i)}\) can be directly obtained in the SIMPACK post-processing file, while \(w_{\text{t}}^{(i)}\) and \(w_{\text{f}}^{(i)}\), and \(\eta^{(i)}\) are calculated by Eq. (13) and Eq. (17), respectively. Therefore, the input dataset A and the output dataset \(\varvec{R}\) of KSM are, respectively, expressed as

$$\varvec{A} = \left[ {\begin{array}{*{20}c} {\varvec{\alpha}^{(1)} } \\ {\varvec{\alpha}^{(2)} } \\ \vdots \\ {\varvec{\alpha}^{(n)} } \\ \end{array} } \right],\,\varvec{R} = \left[ {\begin{array}{*{20}c} {\varvec{r}^{(1)} } \\ {\varvec{r}^{(2)} } \\ \vdots \\ {\varvec{r}^{(n)} } \\ \end{array} } \right].$$
(21)

Finally, the KSM-based objective functions and constraint functions are expressed as

$$\hat{r}(\varvec{\alpha}) = f(\varvec{\alpha})^{\text{T}}\varvec{\beta}+ z(\varvec{\alpha}),$$
(22)

where \(\hat{r}(\varvec{\alpha})\) is the predicted \(j\) th element of the vector \(\varvec{r}\), j = 1, 2,…, 6; \(f(\varvec{\alpha})\) is a determined regression model, in our work, a quadratic polynomial regression model is applied; \(\varvec{\beta}\) is a vector of unknown coefficients of the regression model; \(z(\varvec{\alpha})\) a random process, in our work, a random Gaussian distribution is applied. Specifically, \(\hat{r}_{1} (\varvec{\alpha})\) and \(\hat{r}_{2} (\varvec{\alpha})\) are the objective functions, or written as \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{w}_{\text{t}} (\varvec{\alpha})\) and \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{w}_{\text{f}} (\varvec{\alpha})\); \(\hat{r}_{3} (\varvec{\alpha})\), \(\hat{r}_{4} (\varvec{\alpha})\), \(\hat{r}_{5} (\varvec{\alpha})\), and \(\hat{r}_{6} (\varvec{\alpha})\) are the constraint functions, or written as \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{Y} (\varvec{\alpha})\), \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{Q} (\varvec{\alpha})\), \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{f}_{\text{d}} (\varvec{\alpha})\), and \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\eta } (\varvec{\alpha})\).

For a detailed description of KSM can be found in Ref. [39] or our previous work [60].

4.3 PSO-based optimization

After constructing the objective functions and constraint functions based on KSM, the optimal wheel profile for reducing wheel wear can be automatically sought by iterative optimization algorithms. Here PSO [33] is introduced, which mainly simulates the group foraging behavior of birds. In this method, the position of each particle represents a potential solution. The speed and direction of each particle are determined by its velocity vector, and the particle is randomly initialized to find the optimal solution by an iterative search. In each iteration, the velocity \(\varvec{v}\) and position \(\varvec{x}\) of the particles are updated by Eqs. (23)–(25)

$$v_{ij} (t + 1) = \omega_{t + 1} v_{ij} (t) + c_{1} r_{1j} (t)\left( {p_{ij} - x_{ij} (t)} \right) + c_{2} r_{2j} (t)\left( {p_{gj} - x_{ij} (t)} \right),$$
(23)
$$x_{ij} (t + 1) = x_{ij} (t) + v_{ij} (t + 1),$$
(24)
$$\omega_{t + 1} = f_{\text{damp}} \omega_{t},$$
(25)

where t is the number of iterations, and is set as 100; \(i = 1,2, \ldots ,N_{\text{p}}\), and \(N_{\text{p}}\) is the number of particles in the population, and is set as 100; \(j = 1,2, \ldots ,N\), and \(N\) is the dimension of the solution space; \(\omega_{t}\) is the weight, the initial value is set 1; \(f_{\text{damp}}\) is the damping coefficient to reduce \(\omega_{t}\) after each iteration, and is set as 0.8; \(c_{1}\) and \(c_{2}\) are acceleration coefficients, both set as 2; \(r_{1j}\) and \(r_{2j}\) are random numbers that are uniformly distributed in 0–1; \(p_{ij}\) and \(p_{gj}\) represent the best positions found by particle and swarm, respectively. For a detailed description concerning PSO see Ref. [33].

5 Simulation

5.1 Technique route of the RSFT–KSM–PSO method

The RSFT–KSM–PSO-based wheel profile optimization method, as shown in Fig. 11, includes the following six steps:

Fig. 11
figure 11

Steps of the RSFT–KSM–PSO method for wheel wear reduction

Step 1:

Select the correction factors \(\alpha_{1}\) and \(\alpha_{2}\) by the LHS method and construct the design variable matrix \(\varvec{A}\)

Step 2:

Generate wheel profiles corresponding to \(\varvec{A}\) according to the RSFT method introduced in Sect. 2

Step 3:

Build the locomotive-railway line dynamics model in SIMPACK and run the simulation corresponding to each group of \(\varvec{\alpha}= \left[ {\alpha_{1} ,\alpha_{2} } \right]\), and obtain \(\left[ {w_{\text{t}} ,{\text{w}}_{\text{f}} ,Y,Q,f_{\text{d}} ,\eta } \right]\)

Step 4:

Generate the required results including targets \(\left[ {w_{\text{t}} ,{\text{w}}_{\text{f}} } \right]\) and constraints \(\left[ {Y,Q,f_{\text{d}} ,\eta } \right]\) and construct the output response matrix \(\varvec{R}\) described in Sect. 4.1. The simulated results are listed in “Appendix 2” section

Step 5:

Build the relationship between \(\varvec{A}\) and \(\varvec{R}\) described in Sect. 4.2 by a response surface technique (the KSM technique)

Step 6:

Search for the optimal combination of \(\alpha_{1}\) and \(\alpha_{2}\) for reducing wheel wear using PSO described in Sect. 4.3

5.2 Single-objective optimization

In order to visualize the whole process of the technique route described in Sect. 5.1, we first introduce the single-objective optimization problem. A total of 45 groups of design variables (correction factors) are selected, i.e., n = 45 in Eq. (19), as shown in Fig. 12a. Then, simulations corresponding to these correction factors are performed. The total wear number \(w = w_{\text{t}} { + }w_{\text{f}}\) is set as the optimization target, i.e., \(\varvec{r} = \left[ {Y,Q,\, f_{\text{d}} ,\eta } \right]\) (Eq. (20)), and the relationship between \(\varvec{A}\) and \(\varvec{R}\) is established. The wear number responses \(\varvec{w}\) corresponding to the correction factors \(\varvec{A}\) are shown in Fig. 12b.

Fig. 12
figure 12

Input sample variables (a), output sample responses (b), the corresponding KSM model (c), and the optimal result solved by PSO (d)

After bridging \(\varvec{A}\) and \(\varvec{R}\), we introduce a kind of surface response technique, KSM, to reflect the relationship between correction factors and total wear number, as shown in Fig. 12c. To ensure the correctness of the KSM–PSO method, the accuracy of KSM should be first guaranteed, where five groups of \(\varvec{\alpha}\) are randomly selected to run simulations. Table 2 compares the results calculated by the established KSM and the ones simulated in SIMPACK. All the errors are less than 4%, which shows that the established KSM is accurate enough and can be considered as the objective function of PSO.

Table 2 Comparison between the results calculated by KSM and the results simulated in SIMPACK

Finally, the optimal combination of \(\alpha_{1}\) and \(\alpha_{2}\) for reducing wheel wear is found by PSO, as shown in Fig. 12d. It can be seen that when the value of \(\varvec{\alpha}\) = [1.036, 1.0067], the total wear number is the smallest, which is \(7.2394 \times 10^{3}\) kN and the corresponding wheel profile is called S1002-S. The comparison between the S1002 and S1002-S profiles is shown in Fig. 13. It can be seen that the inner section of the wheel, including the flange region, changes obviously, while the outer section remains almost unchanged. In order to prove the superiority of the S1002-S profile, the related results of the S1002 profile calculated by KSM are introduced for comparison, as listed in Table 3. It can be known that when using the S1002-S profile, the total wear number is reduced by 30%, while other parameters related to stability and safety are hardly affected.

Fig. 13
figure 13

Comparison between the S1002 and S1002-S profiles

Table 3 Comparison of results calculated by KSM

Two points need to be explained here:

  1. (1)

    It should be noted that the S1002-S profile found by PSO may not be the optimal one that minimizes the total wear number, as this process is disturbed by the errors of the established KSM. As can be seen from the color map of Fig. 12d, when the value of \(\alpha\) within the red rectangle, all the corresponding wheel profiles can greatly reduce the total wheel number. Therefore, from the perspective of reducing the total wear number, all the wheel profiles derived from the red rectangle can be considered as an alternative.

  2. (2)

    From Fig. 12d, it can be seen that changing \(\alpha_{1}\) significantly affects the wear number, but changing \(\alpha_{2}\) has a relatively small effect on the wear number, which means that the value of \(\alpha_{2}\) can be selected within the range of \(0.98 \le \alpha_{2} \le 1.02\). From the maintenance point of view, to ensure that the flange thickness (Sd shown in Fig. 7) of the adjusted profile does not change, it is recommended to determine the value of \(\alpha_{2}\) according to the value of \(\alpha_{1}\), i.e., by changing \(\alpha_{2}\) to compensate for the increase or decrease in the flange thickness caused by the change of \(\alpha_{1}\). The optimal relationship between \(\alpha_{1}\) and \(\alpha_{2}\) will be studied in our future work.

5.3 Multiple objective optimization

To study the optimal relationship between the tread wear and the flange wear, we introduce a multi-objective optimization strategy. The tread wear number \(w_{\text{t}}\) and flange wear number \(w_{\text{f}}\) are set as the optimization targets, i.e., \(\varvec{r} = \left[ {w_{\text{t}} ,{\text{w}}_{\text{f}} ,Y,Q,f_{\text{d}} ,\eta } \right]\) [Eq. (20)]. According to the steps described in Sect. 5.1, 100 Pareto solutions are obtained, as shown in Fig. 14. These Pareto solutions reflect a nearly linear relationship between the tread wear number and the flange wear number, i.e., reducing the flange wear number will increase the tread wear number.

Fig. 14
figure 14

Multi-objective optimization results by KSM–PSO

Generally, the flange wear has the greatest impact on the safety of railway vehicles, and the resulted grinding depth is often deeper. Therefore, flange wear should be avoided as much as possible. The S1002-S profile obtained through the single-objective optimization in Sect. 5.2 is the profile that causes the least flange wear number. Therefore, in the multi-objective optimization, we select a compromise between tread wear number and flange wear number (the red point in Fig. 14, where \(\varvec{\alpha}\) = [1.0195, 1.0151]) as a case for analysis, and the corresponding wheel profile is called S1002-M.

The comparison between the S1002 and S1002-M profiles, as well as the dynamic parameters calculated by KSM, are shown in Fig. 15 and listed in Table 4, respectively. It can be seen that when the S1002-M profile is used, a proper sacrifice of the tread wear number can greatly reduce the flange wear number, and the total wear number is also reduced. In the example in this section, the tread wear number has been increased by 16%, while the flange wear number and the total wear number is reduced by 65% and 21%, respectively. In addition, the results listed in Table 4 show that the dynamic performance is not sacrificed.

Fig. 15
figure 15

Comparison between the S1002 and S1002-M profiles

Table 4 Comparison of results calculated by KSM

5.4 Long-term wear comparison between the S1002, S1002-S, and S1002-M profiles

Through the above RSFT–KSM–PSO method, it can be known that the S1002-S and S1002-M wheel profiles are advantageous in terms of short-term wear numbers for the BOMBARDIER TRAXX locomotive serving on the German Blankenburg–Rübeland railway line. To further prove that the improved wheel profiles are still superior to the standard S1002 profile in the long-term wear process, a wear distribution calculation method that integrates the Hertzian normal contact model [61], FaStrip tangential contact model [62], and USFD wear model [56] (i.e., Hertzian-FaStrip-USFD [9, 63]) is introduced to calculate the wheel wear distribution of the TRAXX locomotive after driving 1573 km. The theory of the Hertzian-FaStrip-USFD method and the selection of parameters can be found in our previous work [9].

Figure 16 shows the average wear distribution of the eight wheels of the locomotive under different wheel profiles. From this figure, the following observations are found:

Fig. 16
figure 16

Comparison of wear distribution of the S1002, S1002-S, and S1002-M profiles

  1. (1)

    When using the S1002 wheel profile, the wear depth of the flange is very large, and the wear distribution is extremely uneven.

  2. (2)

    When using the S1002-S wheel profile, the wear depth of the flange is greatly reduced, while the wear depth of the tread is obviously increased.

  3. (3)

    When using the S1002-M wheel profile, the wear depth of the tread decreases slightly. More importantly, the wear depth of the flange decreases significantly and the wear distribution is more uniform.

The above observations show that wheel wear can be reduced when using the improved wheel profiles (S1002-S and S1002-M). Particularly, flange wear can be significantly reduced. To account for this, Fig. 17 presents the RRD, equivalent conicity, and contact distribution of the S1002, S1002-S and S1002-M profiles when the track gauge is 1.435 m with a rail cant of 1/40. It can be seen that compared with the S1002 profile, the S1002-S and S1002-M profiles have better curve-negotiation capabilities since the flange contact generates larger lateral displacement on the curved track, while the optimized profiles show similar performances on the tangent track due to little changes of the tread section. Similar conclusions and explanations can be found in Ref. [23, 24, 32].

Fig. 17
figure 17

RRD (a), equivalent conicity (b), and contact distribution (ce) of the S1002, S1002-S, and S1002-M profiles

The long-time wear simulation proves that the improved wheel profiles are advantageous in terms of wear reduction, which further proves the correctness of the RSFT–KSM–PSO method, and meanwhile illustrates that this method is promising for practical engineering.

6 Quasi-static and hunting stability tests

Before putting into use, the improved wheel profile must satisfy the requirements specified in the acceptance standard EN 14363 [47]. Due to the huge workload, in this work, we only present the quasi-static safety against derailment on a twisted track and the hunting stability on a straight track.

6.1 Quasi-static safety against derailment on twisted track

We carry out the quasi-static test in accordance with Method 1 specified in EN 14363. The vehicle must negotiate a 150-m-radius twist track without the wheel lift of the outer leading wheel exceeding 5 mm. For this purpose, we built a twisted track with a gauge of 1460 mm. The track layout, including radius (R) and superelevation (\(H\)) distributions, is shown in Fig. 18a, b, the track consists of the following sections:

Fig. 18
figure 18

Track layout of radius (a) and superelevation (b) according to EN 14363 and comparison of wheel lift (c) and derailment quotient \({Y \mathord{\left/ {\vphantom {Y Q}} \right. \kern-0pt} Q}\) (d) between the S1002, S1002-S and S1002-M profiles

  • a straight line of 11 m,

  • a 100-m-long transition curve with the end radius of 150 m,

  • a 430-m-long full arch with a radius of 150 m,

  • and a straight line of 30 m.

The twist g is determined by the wheelbase distance and the pivot distance between bogies, where we used a constant twist of \(g_{0} = 3{{^{0} } \mathord{\left/ {\vphantom {{^{0} } {_{00} }}} \right. \kern-0pt} {_{00} }}\). The twist deficiency is balanced by the additional “gaskets” added under the primary- or secondary springs. EN 14363 [47] provides detailed information concerning this section, and a simple acceptance case can be found in our previous work [64].

The evaluation criterium is the maximum value \(\Delta Z_{\hbox{max} }\) of the wheel lift of the curve-outer wheel of the leading wheelset. The limit value is \(\Delta Z_{\hbox{max} } < \Delta Z_{\lim } = 5\;{\text{mm}}\). As shown in Fig. 18c, although the two wear-resistant wheel profiles have a slight increase in wheel lift on the twisted curved track, they are well below the limit value. In addition, the \({Y \mathord{\left/ {\vphantom {Y Q}} \right. \kern-0pt} Q}\) ratio is introduced as an auxiliary evaluation index. Figure 18d shows that the \({Y \mathord{\left/ {\vphantom {Y Q}} \right. \kern-0pt} Q}\) ratio of the two wear-resistant wheel profiles is almost the same as that of the S1002 profile, and all are below 1.2.

6.2 Hunting stability

We carry out the hunting stability assessment by estimating the nonlinear critical speed using the deceleration method [65] with an acceleration of − 0.3 \({{\text{m}} \mathord{\left/ {\vphantom {{\text{m}} {{\text{s}}^{2} }}} \right. \kern-0pt} {{\text{s}}^{2} }}\). The simulation runs in Fig. 19 show that the critical speed of profile S1002 is about 335 km/h, while that of the two wear-resistant wheel profiles is slightly lower, about 314 km/h for S1002-S and 326 km/h for S1002-M, respectively. However, for freight vehicles the slight reduction is acceptable since the critical speed of 314 km/h is usually considerably higher than the actual running speed. For instance, the maximum speed of the TRAXX locomotive on the Blankenburgenburg–Rübeland railway line usually does not exceed 60 km/h.

Fig. 19
figure 19

Simulation runs of S1002, S1002-S and S1002-M through the deceleration method

7 Conclusion and discussion

Currently, the multi-objective wheel profile optimization methods mainly include three sub-modules: (1) the formulation of design variables to generate wheel profile curves, (2) the dynamic simulation to obtain the relationship between the design variables and the dynamic performance, and (3) the selection of the optimization algorithm to find the optimal wheel profile. In this article, the work on these three modules is briefly summarized as follows:

  1. (1)

    In terms of the first module, we propose a comparably conservative RSFT method to fine-tune the traditional wheel profiles. Compared with the approach based on fitting discrete points and the element combination approach, the RSFT method introduces only two design variables, which greatly reduces the amount of calculation. Besides, the empirical function applied in this method can not only achieve the optimization of the flange region but also ensure the original ability of the wheelset when the vehicle runs on a straight line. More importantly, based on the existing wheel profile, this method does not introduce a curve fitting method, avoids the complex curve design problem, and is easily generalized to engineering applications.

  2. (2)

    In terms of the second module, for the BOMBARDIER TRAXX locomotives serving on the German Blankenburgenburg–Rübeland railway line, we build a KSM-based function to reflect the relationship between the wheel profile designed by the RSFD method and the wheel–rail wear number.

  3. (3)

    In terms of the third module, we propose a KSM–PSO-based optimization method. This method combines the iterative computing power of the bio-inspired algorithm with the regression capability of the response surface technique and can quickly and reliably complete the task of optimizing wheel profiles.

  4. (4)

    With the RSFT–KSM–PSO method, we present two wear-resistant wheel profiles for the TRAXX locomotives serving on the German Blankenburgenburg–Rübeland railway line, namely S1002-S and S1002-M. The S1002-S profile minimizes the total wear number by 30%, while the S1002-M profile makes the wear distribution more uniform through a proper sacrifice of the tread wear number, and the total wear number is reduced by 21%. Our preliminary results show that the two wear-resistant wheel profiles meet the requirements specified in standards UIC 518, EN 14363, EN 15313, etc.

The article ends with the following three points: (1) The coupler forces on freight locomotives of a long heavy-haul train are very large, which could greatly affect the locomotive dynamics and wheel wear. However, we have not considered the coupler forces in this paper. (2) It should be noted that the RSFT method relies heavily on the baseline model (the original wheel profile), and the superiority of the optimized profile depends to some extent on the baseline model. (3) We are studying the performance of this method in metros, and the relevant information will be reported in our future work.

8 Replication of results

The unlisted data about the locomotive and the railway line are confidential and the authors have no rights to disclose them. Readers interested in the MATLAB code and the internal reports are encouraged to contact the corresponding authors by E-mail. The RSFT algorithm and the original simulated results from SIMPACK are disclosed in “Appendices 1 and 2” section, respectively.