The bilevel programming problem (abbreviated: BPP) is a mathematical program in two groups of variables x and θ, in which x = x°(θ) is required to be an optimal solution of another program. Specifically, the BPP can be formulated in terms of two ordered objective functions φ and Ψ as follows:

(1)   min_θ φ(x, θ)   subject to   g_j(x, θ) ≤ 0,   j ∈ Q,

where x = x°(θ) is an optimal solution of the program

(2)   min_x Ψ(x, θ)   subject to   f_i(x, θ) ≤ 0,   i ∈ P.

Here the functions φ, Ψ, f_i, g_j : R^n × R^m → R, i ∈ P, j ∈ Q, are assumed to be continuous; x ∈ R^n, θ ∈ R^m; P and Q are finite index sets. Program (1) is often called the upper (first level, outer, leader's) problem; then (2) is the lower (second level, inner, follower's) problem. Many mathematical programs, such as minimax problems, linear integer, bilinear and quadratic programs, can be stated as special cases of bilevel programs. In view of the so-called Reduction Ansatz, developed in [18], [44], semi-infinite programs can also be considered as special cases of bilevel programs; for stability and deformations of these see, e.g., [20], [21]. Problems appearing in such seemingly unrelated areas as best approximation and data envelopment analysis can likewise be viewed as bilevel programs. In the former, one is often interested in finding a least-norm solution in the set of all best approximate solutions, while, in the latter, one wants to rank, or decrease the number of, efficient decision making units by a ‘post-optimality analysis’. For the history of bilevel programs, reviews of numerical methods and applications, and especially for connections with von Stackelberg games of market economy, see, e.g., [14], [22], [30], [39]. In this contribution we focus only on optimality conditions and duality.

Basic Difficulties

The study of bilevel programming problems requires some familiarity with point-to-set topology; see, e.g., [1], [2], [6], [15]. Since the lower level optimal solution mapping x° : θ ↦ x°(θ) is a point-to-set mapping (rather than a vector function), the optimal value function of the BPP may be discontinuous. This is illustrated by the following example:

EXAMPLE 1

Consider the bilevel program with the upper level objective φ(x, θ) = −x_1/θ, the lower level objective Ψ(x, θ) = −x_1 − x_2, and the lower level feasible set determined by x_1 + θx_2 ≤ 1, x_1 ≥ 0, x_2 ≥ 0. The lower level optimal solutions x = x°(θ) form the segment {x : x_1 + x_2 = 1, x_1 ≥ 0, x_2 ≥ 0} for θ = 1, and the singleton {[0, 1/θ]} for 0 < θ < 1. The corresponding upper level optimal solutions, i.e., the BPP optimal solutions, are the points [1, 0] and [0, 1/θ], respectively. Hence the optimal value of the BPP equals 0 for 0 < θ < 1 and jumps to −1 as θ assumes the value 1.
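The jump in Example 1 can be verified numerically. The sketch below (an illustration added here, not part of the original example) enumerates the vertices of the lower level polyhedron and applies the optimistic rule that the leader picks the best point among the lower level optima.

```python
# Numerical check of Example 1 under the optimistic bilevel rule.
# Lower level: min -x1 - x2  s.t.  x1 + theta*x2 <= 1, x1 >= 0, x2 >= 0;
# its optima lie among the vertices of the feasible polyhedron.

def lower_level_optima(theta, tol=1e-9):
    """Set of lower level optimal vertices for a fixed theta in (0, 1]."""
    vertices = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0 / theta)]
    psi = lambda x: -x[0] - x[1]
    best = min(psi(v) for v in vertices)
    return [v for v in vertices if psi(v) <= best + tol]

def bpp_value(theta):
    """Optimistic upper level value: best phi over the lower level optima."""
    phi = lambda x: -x[0] / theta
    return min(phi(x) for x in lower_level_optima(theta))

assert bpp_value(0.9) == 0.0    # unique lower optimum [0, 1/theta]
assert bpp_value(1.0) == -1.0   # leader picks [1, 0] from the optimal segment
```

For 0 < θ < 1 the lower level optimum is unique and the value stays at 0; at θ = 1 the whole segment becomes optimal and the value drops to −1.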

Note that the lower level feasible set mapping, in Example 1, is lower semicontinuous (open) at θ = 1. Hence we conclude that discontinuity of the optimal value can occur even if the lower level model is stable.

The fact that the set of optimal solutions is generally discontinuous, even in a stable situation, is well known in linear programming. It may manifest itself in chaotic behavior of the optimal solutions, but not of the optimal value, when the program is solved repeatedly by computer with small perturbations of the data; see Nondifferentiable optimization: Parametric programming. This topological loss of continuity is generally unrelated to conditioning, which describes the numerical sensitivity of the solutions to roundoff errors. In particular, a linear program with an ill-conditioned coefficient matrix can be stable.

Another difficulty results from the fact that the optimal solutions mapping x° : θ ↦ x°(θ) is not generally closed. Hence a BPP may not have an optimal solution even if the feasible set of the lower program is compact:

EXAMPLE 2

Consider the bilinear BPP:

where x = x°(θ) solves

Here the optimal solutions mapping is the function x°(θ) = 0 for θ > 0, together with x°(0) = 1. The feasible set of the lower level problem is the unit square in the (θ, x)-plane, while the feasible set of the BPP is a disconnected, noncompact set consisting of the segment {(θ, 0) : 0 < θ ≤ 1} and the point [0, 1]. Since the origin is not a feasible point, the BPP does not have an optimal solution. Note that the function x°(θ) is not continuous here because the lower level feasible set mapping is not lower semicontinuous at the origin, i.e., the lower level problem is unstable.

Optimality

A popular approach to the study of optimality in BPP is to reduce the program to a one-level program. This can be done as follows: denote the optimal value of the lower level program (2) by Ψ°(θ) and introduce the new function f°(x, θ) = Ψ(x, θ) − Ψ°(θ). Since f° is nonnegative on the lower level feasible set, requiring x to be lower level optimal amounts to the constraint f°(x, θ) ≤ 0, and the BPP can be reformulated as

(3)   min φ(x, θ)   subject to   f°(x, θ) ≤ 0,   f_i(x, θ) ≤ 0, i ∈ P,   g_j(x, θ) ≤ 0, j ∈ Q.
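On Example 1 the ingredients of the reformulation (3) can be written down explicitly. In the sketch below (an added illustration, with the closed form Ψ°(θ) = −1/θ for 0 < θ ≤ 1 computed by hand) the value-function constraint f°(x, θ) ≤ 0 carves the lower level optimal set out of the lower level feasible set.

```python
# Value-function reformulation (3) on Example 1 (illustrative sketch).
# Lower level: min -x1 - x2  s.t.  x1 + theta*x2 <= 1, x >= 0,
# with optimal value psi0(theta) = -1/theta for 0 < theta <= 1.

TOL = 1e-9

def psi(x, theta):
    return -x[0] - x[1]

def psi0(theta):
    return -1.0 / theta  # closed form of the lower level optimal value

def lower_feasible(x, theta):
    return x[0] + theta * x[1] <= 1 + TOL and x[0] >= 0 and x[1] >= 0

def reformulated_feasible(x, theta):
    """Feasibility in (3): the lower level constraints plus the value-function
    constraint f0(x, theta) = psi(x, theta) - psi0(theta) <= 0."""
    return lower_feasible(x, theta) and psi(x, theta) - psi0(theta) <= TOL

theta = 0.5
assert reformulated_feasible((0.0, 2.0), theta)       # the lower level optimum
assert not reformulated_feasible((1.0, 0.0), theta)   # feasible but suboptimal
```

The second assertion shows how the constraint f° ≤ 0 excludes lower level feasible points that are not lower level optimal.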

Difficulties with this formulation generally include discontinuity of the leading constraint f° and the lack of classical constraint qualifications. The latter can be handled in the convex case using the results on optimality conditions from, e.g., [5], [15], [47]. One of the first attempts to formulate optimality conditions for bilevel programming problems, using (3), was made in [2]; however, counterexamples to these conditions were given in [4], [12], [17]; see also [10]. The one-level approach leads, under assumptions that guarantee Lipschitz continuity of the optimal value function, to necessary conditions of the Fritz John type. Under a partial calmness condition, and a constraint qualification for the lower level problem, one obtains conditions of the Karush-Kuhn-Tucker type. The concept of partial calmness is equivalent to ‘exact penalization’; it is satisfied, in particular, for the minimax problem and when the lower level problem is linear. This approach in a nonsmooth framework is used in, e.g., [11] and [46]. The relationship between the BPP and an associated exact penalty function was also explored in [7] to derive other types of necessary and sufficient optimality conditions. Other approaches to optimality conditions that use nonsmooth analysis include [13], [19], [32].

Another approach to reducing the BPP to a single-level program is to replace the lower level problem by an optimality condition; this is usually done in formulations of numerical methods, see, e.g., [42]. There are also approaches that use the specific geometry of BPP: one of these applies properties of the steepest descent directions to BPP and yields a necessary condition for optimality, see [33]. Adaptations of the well-known first and second order optimality conditions of mathematical programming to BPP appeared in [40]. Checking local optimality for linear BPP is NP-hard; see [41]. Examples of linear BPPs with an exponential number of local minima can be generated by a technique proposed in [9].
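Replacing the inner problem by its optimality condition can be made concrete on a small smooth instance (constructed here for illustration; it does not come from the cited references): for a strongly convex inner problem with box constraints, the KKT system reduces to a projection, and the BPP collapses to a one-level program in θ.

```python
# Hypothetical smooth instance (illustration only): lower level
#   min_x (x - theta)**2   s.t.  0 <= x <= 1,
# whose KKT system is solved by the projection x = clamp(theta, 0, 1).

def lower_kkt_solution(theta):
    """Unique KKT point (here also the global minimum) of the inner problem."""
    return min(1.0, max(0.0, theta))

# One-level reformulation: minimize the (made-up) upper objective
# phi(x, theta) = (x - 0.75)**2 with x eliminated via the KKT solution.
thetas = [i / 1000 for i in range(-1000, 2001)]
t_star = min(thetas, key=lambda t: (lower_kkt_solution(t) - 0.75) ** 2)
assert lower_kkt_solution(t_star) == 0.75
```

Because the inner solution is unique here, the single-level substitution is exact; when the inner problem has multiple KKT points or multiple optima, this substitution is the delicate step the text warns about.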

Many authors have studied links between two-objective and bilevel programming, looking for conditions that guarantee that an optimal solution of a given BPP is Pareto optimal for both the upper and lower level objective functions, and vice versa; see, e.g., [28], [29], [30], [37]. The idea is to find an optimal solution of the BPP by solving a bi-objective program. It was shown in [43] that an optimal solution of a linear BPP may not be a Pareto optimum for the objective function of the outer program paired with the optimal value function of the lower program, contrary to a claim made in [38]; the authors of [43] also give a sufficient condition for the implication to hold. If an optimal solution exists in the linear BPP case with a compact feasible set at the lower level, then at least one optimal solution is attained at a vertex of this set; see [3].

Necessary conditions for optimality can also be stated using marginal value formulas for optimal value functions. However, such formulas must not rely on a usual constraint qualification if they are to be applied to the formulation (3). One such formula in parametric convex programming is given in [48] and, under slightly different assumptions, in [49]. In the latter, it is used in the context of data envelopment analysis to rank efficiently administered university libraries by their radii of rigidity. Existence of optimal solutions is studied in [16], [23], [24]; the constraints in [24] are defined by an implicit variational problem. Both existence and stability of solutions and of approximate solutions are studied in [27]. Optimality conditions are important for checking optimality, for formulating duality theories, and for numerical methods.

Parametric Approach To Optimality

A parametric approach to characterizing global and local optimal solutions in convex BPP can be described as follows: denote, for every fixed θ, the optimal value of (3) by φ°(θ). Also, denote the feasible set in the x variable by F(θ) = {x : f_i(x, θ) ≤ 0, i ∈ R} and the feasible set in the θ variable by F. A parametric formulation of the BPP is

(4)   min φ°(θ)   subject to   θ ∈ F.

Here we optimize the optimal value of the outer problem over the feasible set in the variable θ, considered as a ‘parameter’. A problem of the form (4) is a basic problem of parametric programming; cf. Nondifferentiable optimization: Parametric programming. It has been extensively studied in the literature from both the theoretical and the numerical side. In particular, various optimality conditions have been formulated for it, e.g., in the context of input optimization; see [48].

The key observation in the parametric approach is that, under the assumption that the feasible set of the lower program is compact, every θ∗ that globally solves the parametric program (4), with a corresponding optimal solution x∗ of the program (3), yields a global optimal solution of the bilevel program, and vice versa. However, even under the compactness assumption, both sets of global solutions can be empty (as demonstrated by Example 2).

A necessary and sufficient condition for global optimality in convex BPP can be given over a ‘region of cooperation’ in terms of the existence of a saddle point; see [15]. Given a candidate θ∗ for global optimality and the sets of optimal solutions at the lower level {x°(θ)}, θ ∈ F, denote by K(θ∗) the region in the θ-space where the minimal index set of active constraints R=(θ) = {i ∈ R : x ∈ {x°(θ)} ⇒ f_i(x, θ) = 0} does not strictly increase, i.e., K(θ∗) = {θ ∈ F : R=(θ) ⊂ R=(θ∗)}. The region of cooperation at θ∗ is then defined as the set {(θ, x) : θ ∈ K(θ∗), x ∈ F(θ)}.

One can characterize global optimality on the entire feasible set for linear BPP, and also for convex BPP provided that the constraints are ‘LFS functions’; see, e.g., [35], [48]. These functions form a large class of convex functions that includes all linear and polyhedral functions. Characterizations of global optimality are simplified under the so-called sandwich condition, a two-sided global inclusion involving the set of optimal solutions of the inner program; see, e.g., [15].
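Under the optimistic convention, formulation (4) also suggests a brute-force scheme, sketched below on a made-up smooth instance (not an algorithm from the cited literature): evaluate φ°(θ) by an inner minimization, then minimize the resulting value function over θ.

```python
# Brute-force sketch of the parametric formulation (4) on a made-up instance:
#   lower level:  min_x (x - theta)**2,  0 <= x <= 1,
#   upper value:  phi(x, theta) = x + theta**2.

def x0(theta):
    """Grid stand-in for the lower level optimal solution x°(theta)."""
    xs = [i / 200 for i in range(201)]  # grid on [0, 1]
    return min(xs, key=lambda x: (x - theta) ** 2)

def phi0(theta):
    """Optimal value of the inner-eliminated problem for fixed theta."""
    return x0(theta) + theta ** 2

thetas = [i / 200 for i in range(-200, 201)]  # grid on [-1, 1]
t_star = min(thetas, key=phi0)
assert t_star == 0.0 and x0(t_star) == 0.0
```

The scheme is only reliable when x°(θ) is single-valued and continuous; Examples 1 and 3 show what can happen on a grid when it is not.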
Characterizations of locally optimal parameters θ∗ for convex (4) require lower semicontinuity of the optimal solutions mapping x°. The results apply to the convex BPP under the additional assumption that the corresponding optimal solution x∗ ∈ {x°(θ∗)} is unique; see, e.g., [15]. The uniqueness assumption in the characterization of local optimality cannot be replaced by the requirement that the set {x°(θ∗)} be compact. The following example illustrates a situation where a local optimum of the BPP cannot be recovered by the parametric approach.

EXAMPLE 3

Consider the program min φ(x, θ) = xθ², where x solves the lower level problem with the objective Ψ(x, θ) ≡ 0, subject to −1 ≤ x ≤ 1, −1 ≤ θ ≤ 1. Here x∗ = 1, θ∗ = 0 is a local minimum of the bilevel program. But φ°(θ) = −θ², so θ∗ = 0 is not its local minimum; in fact, it is an isolated global maximum!
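The failure in Example 3 is easy to confirm numerically (an added check): since Ψ ≡ 0, every x ∈ [−1, 1] is lower level optimal, so φ°(θ) = min over x ∈ [−1, 1] of xθ² = −θ², which is maximized, not minimized, at θ∗ = 0.

```python
# Example 3: phi(x, theta) = x * theta**2 with Psi identically 0,
# so the whole interval [-1, 1] is lower level optimal and
# phi0(theta) = min over x in [-1, 1] of x * theta**2 = -theta**2.

def phi0(theta):
    xs = [i / 100 for i in range(-100, 101)]  # grid on [-1, 1]
    return min(x * theta ** 2 for x in xs)

assert phi0(0.5) == -0.25  # attained at x = -1
# theta* = 0 is a global maximum of phi0, not a local minimum:
assert all(phi0(t) <= phi0(0.0) for t in (-0.3, -0.1, 0.1, 0.3))
```

The local minimum (x∗, θ∗) = (1, 0) of the bilevel program is invisible to φ°, because φ° always evaluates the worst-for-the-leader point x = −1.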

Duality

Duality theories for bilevel programming problems can be formulated by adjusting the duality theories of mathematical programming (see, e.g., [34]) to the single-objective model (3). Let us outline how this works using a parametric approach; we follow the ideas from [15]. Instead of a single ‘dual’ one obtains a collection of several ‘subduals’, each closely related to the original (primal) program. The number of these subduals is the cardinality of the set

Π = {Ω ⊂ R : Ω = R=(θ) for some θ ∈ F}.
First, with each Ω ∈ Π, one associates the feasible subregion SΩ = {θ ∈ F : R=(θ) = Ω}, the Lagrangian LΩ(x, θ; u) = φ(x, θ) + Σ_{i ∈ R\Ω} u_i f_i(x, θ), and the point-to-set mapping FΩ : F → R^n defined by FΩ(θ) = {x : f_i(x, θ) ≤ 0, i ∈ Ω}. The corresponding subdual function is

ΦΩ(u) = inf {LΩ(x, θ; u(θ)) : θ ∈ SΩ, x ∈ FΩ(θ)},
and the subdual (D, Ω) is defined as

(5)   max ΦΩ(u)   subject to   u ∈ [SΩ → R+^card(R\Ω)].

Here u belongs to the set of all nonnegative vector functions defined on SΩ. The duality results stated for partly convex programs in, e.g., [47] can be reformulated for the outer convex model and hence for the BPP. In particular, if, for some Ω ∈ Π, some u∗ ∈ [SΩ → R+^card(R\Ω)], and an optimal solution x∗ of the inner program for some fixed θ∗ ∈ SΩ, one has ΦΩ(u∗) = φ(x∗, θ∗), then u∗ solves the subdual (5) and θ∗ solves (4) on SΩ.

If optimization of the optimal value function in (4) is performed from some fixed ‘initial’ θ, but using only parameter paths that preserve continuity of the optimal solutions mapping of the lower problem, then we speak of stable BPP. In the convex case this approach guarantees that the optimal solutions mapping of the BPP is closed and that the optimal value function is continuous, thus removing the two basic difficulties mentioned above. However, the optimal solutions now depend on the initial choice of the parameter and on the particular class of stable paths used. Stable parametric programming has been studied in [48]; stable BPP is mentioned (but not studied) in [15]; see also [36].

See also: Bilevel programming: Introduction, history and overview; Bilevel linear programming; Bilevel linear programming: Complexity, equivalence to minmax, concave programs; Bilevel programming; Bilevel programming: Algorithms; Bilevel programming: Global optimization; Bilevel programming in management; Bilevel programming: Applications in engineering; Bilevel optimization: Feasibility test and flexibility index; Bilevel programming: Implicit function approach; Bilevel programming: Applications; Stochastic bilevel programs; Bilevel fractional programming; Multilevel optimization in mechanics; Multilevel methods for optimal design.