1 Introduction

Parental refusal to vaccinate their child has become a growing concern for public health administrators and pediatricians. For example, in a recent national survey of pediatricians, 54% had encountered parents over a 12-month period that refused to vaccinate their child, citing safety concerns as their top reason [14]. In another survey, 70% of pediatricians had encountered a parent in the 12 months preceding the survey that refused at least one immunization for their child [12]. As these studies indicate, the safety, necessity, and effectiveness of vaccines are a concern for many parents.

In the last decade, vaccine manufacturers have begun developing a growing number of combination vaccines, which protect against several diseases with a single injection [13, 18]. Such combination vaccines are an important solution to alleviating vaccine injection overcrowding and schedule complexity [8, 13, 31] within the current United States Recommended Childhood Immunization Schedule [4]. However, using combination vaccines increases the risk of extraimmunization, which means that a child receives vaccine antigens for a given disease beyond what is prescribed in the United States Recommended Childhood Immunization Schedule. For example, injecting a child with a diphtheria, tetanus, pertussis, hepatitis B, and polio combination vaccine at age 4 months would provide extraimmunization for hepatitis B, since no dose of this vaccine is required at that age (see Fig. 1). Such extraimmunization poses biological safety risks and amplifies parental concerns on vaccine safety and effectiveness. Biologically, extraimmunization of some vaccines increases the risk of adverse side effects; such is the case with diphtheria and tetanus vaccines [8]. Furthermore, several people challenge the safety and effectiveness of vaccinating children and particularly object to the use of combination vaccines, since they believe that injecting a child with multiple antigens simultaneously overwhelms the infant immune system, and hence, extraimmunization due to combination vaccines only exacerbates these fears [10, 13]. Moreover, the economic toll of extraimmunization is also significant. For example, the annual societal cost burden of providing one extra dose of vaccine for each child born in the United States is no less than $28 million, which assumes a birth rate of 11,100 births per day (see [26]) and a vaccine cost of $7, where the vaccine cost estimates the Federal contract purchase price of the least expensive pediatric vaccine (see [3]).

Fig. 1
figure 1

United States 2006 recommended childhood immunization schedule through age 6 (excluding recommendations for selected populations)

In addition to extraimmunization, combination vaccines are creating a combinatorial explosion of vaccine alternatives and choices for the public health community, especially as the Recommended Childhood Immunization Schedule continues to evolve, with new diseases emerging and/or new vaccines being developed. For example, four time periods and five diseases (with Rotavirus and Meningococcal added in 2007) have been added to the Recommended Childhood Immunization Schedule since 1995 [5, 9, 11]. This combinatorial explosion of vaccine alternatives and choices has motivated the use of operations research techniques in addressing pediatric immunization formulary design problems. Most of the research to date addresses the economics of pediatric vaccine formulary design, combination vaccine pricing, and vaccine wastage [21, 23]. Weniger et al. [32] report the results of a pilot study that uses operations research methods to assess the economic value of vaccine formularies (i.e., the set of vaccines inventoried by an immunization clinic or pediatrician). In this pilot study, a portion of the Recommended Childhood Immunization Schedule was modeled as an integer program (IP). The objective of this IP was to aide decision-makers in determining the vaccine formulary that minimized the cost to fully immunize a child. Jacobson et al. [26] present a more rigorous presentation of this pilot study and demonstrate how the model selects different vaccine formularies depending on the desired economic criteria. Sewell et al. [30] embed the IP from the pilot study into a bisection algorithm [2] to “reverse engineer” the maximum inclusion prices (the maximum price at which a vaccine remains part of the optimal vaccine formulary) of four combination vaccines not yet (at that time) licensed in the United States. Sewell and Jacobson [29] present a rigorous description of this study, including the complete IP model. This study shows how operations research can provide beneficial economic analysis to the pharmaceutical companies that develop and manufacture vaccines (see [20, 25] for additional applications of this bisection algorithm). Jacobson and Sewell [22] extend the bisection/IP algorithm approach by including it with Monte Carlo simulation, thereby determining a probability distribution for the price of the four potential combination vaccines. Lastly, Hall et al. (submitted) formulate and analyze a general discrete optimization model that determines the set of vaccines (i.e., a vaccine formulary) that should be used in a clinical environment to satisfy any given childhood immunization schedule at minimum cost, while Hall [16] formulates and analyzes a general discrete optimization model that determines the set of vaccines that should be used in a clinical environment to satisfy any given childhood immunization schedule that minimizes the amount of extraimmunization.

This paper addresses the safety of combination vaccines by further examining the issue of extraimmunization as an extension of the results in Hall [16]. Specifically, this paper presents a general model that maximizes the effectiveness of a given pediatric vaccine formulary while also prohibiting extraimmunization for any given childhood immunization schedule. The paper is organized as follows. Section 2 presents the model terminology and the formulation of the discrete optimization problem. Section 3 presents a description and analysis of an exact dynamic programming algorithm and a randomized heuristic for solving the discrete optimization problem. Section 4 presents a computational comparison of this dynamic programming algorithm and randomized heuristic with an integer programming branch and bound algorithm, while Section 5 provides concluding comments and directions for future research.

2 Model terminology and formulation

The model terminology and the formulation of a discrete optimization problem used to maximize the effectiveness of a given pediatric vaccine formulary while prohibiting extraimmunization for any given childhood immunization schedule is presented. Some extensions of the discrete optimization problem are also described.

The discrete optimization model assumes a given childhood immunization schedule and vaccine formulary. To precisely describe a childhood immunization schedule, several set and parameter definitions are needed. Let

  • T = {1,2,...,τ} be the set of time periods for the childhood immunization schedule,

  • D = {1,2,...,δ} be the set of diseases requiring immunization,

  • D E  ⊆ D, with \(\left| {D_E } \right| = \delta _E \) be the set of diseases where extraimmunization is permitted,

  • D NE  = D\D E , with \(\left| {D_{NE} } \right| = \delta _{NE} \) be the set of diseases where extraimmunization is not permitted,

  • V = {1,2,...,υ} be the vaccine formulary (i.e., set of vaccines available to be administered to immunize against the δ diseases),

  • n d Z + be the number of doses of a vaccine that must be administered for immunization against disease dD

  • I vd  = 1 if vaccine v   V immunizes against disease dD, 0 otherwise, which is a set of binary parameters that indicate which vaccines immunize against which diseases,

  • P djt  = 1 if in time period tT, a vaccine may be administered to satisfy the jth dose, j = 1,2,...,n d , requirement for disease dD, 0 otherwise, which is a set of binary parameters that indicate the set of time periods in which a particular dose of a vaccine may be administered to immunize against a disease, and, finally,

  • R dt  = 1 if in time period tT, no dose of a vaccine may be administered to immunize against disease dD NE , 0 otherwise, which is a set of binary parameters that indicate the set of time periods in which no dose of a vaccine may be administered to immunize against a disease where extraimmunization is not permitted.

These sets and parameters completely describe a childhood immunization schedule along with the vaccine formulary. For example, the United States Recommended Childhood Immunization Schedule (see Fig. 1) outlines the vaccinations required to protect a child against several (currently thirteen) infectious diseases that pose a risk to children living in the United States [4]. This schedule includes the number of required doses of each vaccine and the recommended age for each dose (D1 = Dose 1, D2 = Dose 2, etc.). For example, the disease d = hepatitis B requires three doses of vaccine (i.e., n d  = 3), where the second dose (D2) may be administered at age 1 or 2 months. Therefore, for disease d = hepatitis B, dose j = 2, P djt  = 1(0) for time periods t = 2,3(1,4,5,6,7,8,9,10), and, assuming hepatitis B ∈ D NE , R dt  = 1(0) for time periods t = 4,9,10(1,2,3,5,6,7,8).

Unless otherwise stated, the phrase “childhood immunization schedule” refers to an arbitrary general immunization schedule, whereas the phrase “Recommended Childhood Immunization Schedule” refers to the immunization schedule in Fig. 1. Assume that all diseases dD have sequentially ordered and mutually exclusive doses (i.e., all the time periods during which dose j can be administered occur prior to all the time periods during which dose j + 1 can be administered, j = 1,2,...,n d  − 1). Furthermore, assume that the time periods during which dose j can be administered are consecutive. Define the valency, denoted by Val(v), as the number of diseases covered by vaccine vV, and hence, \({\text{Val}}\left( v \right){\text{ }} = \Sigma _{d \in D} I_{vd} \). Combination vaccines are often referred to as multivalent vaccines, or simply multivalents, because Val(v) ≥ 2 when vV is a combination vaccine. Furthermore, vaccine vV, where Val(v) = 1, 2, 3, 4, 5, or 6 is often referred to as a monovalent, bivalent, trivalent, tetravalent, pentavalent, or hexavalent vaccine, respectively. In practice, the dose parameters, n d , depend on biological constraints and are determined by the recommendations and guidelines set forth by the Centers for Disease Control and Prevention [7]. Note that schedule parameters P djt specify the time periods when vaccination is “permitted” (or useful) for disease dD, while the schedule parameters R dt specify the time periods when vaccination is “restricted” for disease dD NE . For example, assuming disease d = hepatitis B D NE , Fig. 1 implies P djt  = 1(0) for some dose j and time periods t = 1,2,3,5,6,7,8(4,9,10) and R dt  = 1(0) for time periods t = 4,9,10(1,2,3,5,6,7,8). The set D NE is the set of diseases where extraimmunization is restricted based on biological and/or philosophical constraints, and hence, may change for each child, on a case-by-case basis. At present, diphtheria, tetanus, and pertussis are the only diseases for which extraimmunization is biologically prohibited [8]; however, for any parent/guardian questioning the safety, necessity, and effectiveness of all vaccines, it is reasonable to assume that extraimmunization is prohibited for all diseases dD.

To present the discrete optimization model formulation, define the binary decision variables:

  • X tv  = 1 if vaccine vV is administered in time period tT, 0 otherwise,

  • π dj  = 1 if the jth dose requirement for disease dD is satisfied, 0 otherwise.

Therefore, for a given childhood immunization schedule, the discrete optimization model that maximizes the effectiveness of a given pediatric vaccine formulary while prohibiting extraimmunization is the following integer program

IP

$${\text{Maximize }}\sum\nolimits_{d \in D} {\sum\nolimits_{j = 1}^{n_d } {\pi _{dj} } } $$
(O)

Subject to

$$\begin{array}{*{20}c} {\sum\nolimits_{{\text{t}} \in T} {\sum\nolimits_{{\text{v}} \in V} {P_{djt} X_{tv} I_{vd} \geqslant \pi _{dj} } } }&{{\text{for all }}d \in D_E ,j = 1,2, \ldots ,n_d ,} \\ \end{array} $$
(1)
$$\begin{array}{*{20}c} {{{\sum\nolimits_{{\text{t}} \in T} {{\sum\nolimits_{{\text{v}} \in V} {P_{{djt}} X_{{tv}} I_{{vd}} = \pi _{{dj}} } }} }}} & {{{\text{for all }}d \in D_{{NE}} ,j = 1,2, \ldots ,n_{d} ,}} \\ \end{array} $$
(2)
$$\begin{array}{*{20}c} {\sum\nolimits_{t \in T} {\sum\nolimits_{v \in V} {R_{dt} X_{tv} I_{vd} = 0} } }&{{\text{for all }}d \in D_{NE} ,} \\ \end{array} $$
(3)
$$\begin{array}{*{20}c} {X_{tv} \in \left\{ {0,1} \right\}}&{{\text{for all }}t \in T,v \in V,} \\ \end{array} $$
(4)
$$\begin{array}{*{20}c} {\pi _{dj} \in \left\{ {0,1} \right\}}&{{\text{for all }}d \in D,j = 1,2. \ldots ,n_d .} \\ \end{array} $$
(5)

Therefore, the jth dose requirement for disease dD is satisfied if and only if \(\sum _{t \in T} \sum _{v \in V} P_{djt} X_{tv} I_{vd} \geqslant 1\) for dD E , j=1,2,...,n d and \(\sum _{t \in T} \sum _{v \in V} P_{djt} X_{tv} I_{vd} = 1\) for dD NE , j = 1,2,...,n d . Note that the objective function (O) of IP is maximized when π dj  = 1 for all diseases dD and doses j = 1,2,...,n d , and hence, if the value of the optimal solution equals \(\Sigma _{d = 1}^\delta n_d \), then the jth dose requirement for every disease dD is satisfied and no extra vaccine doses were administered for any disease dD NE . Therefore, IP maximizes the number of doses that may be administered while prohibiting extraimmunization. Constraint (1) ensures that for each disease dD E , if dose j = 1,2,...,n d is satisfied, then at least one vaccine that provides immunization for disease dD E is administered in some time period when dose j = 1,2,...,n d may be administered. Constraint (2) ensures that for each disease dD NE , if dose j = 1,2,...,n d is satisfied, then exactly one vaccine that provides immunization for disease dD NE is administered in some time period when dose j = 1,2,...,n d may be administered. Constraint (3) is for each disease dD NE and ensures that the number of doses administered in time periods when vaccination is restricted equals zero. Lastly, Constraints (4) and (5) are the binary constraints on the respective decision variables. Note that there are additional special restrictions unique to certain vaccines and manufacturers that are not modeled by IP; Hall [16] describes some of these restrictions (see [5]) and how they may be modeled. Furthermore, note that in IP if, for a given disease dD, dose j is satisfied without satisfying vaccine dose j − 1, then dose j effectively becomes dose j − 1. This is reasonable since, in practice, if a child misses vaccine doses early in the Recommended Childhood Immunization Schedule then the child continues to receive vaccine doses as recommended. For example, if doses 2 and 3 for Hepatitis B are satisfied in IP (i.e., \(\pi _{{\text{HepatitisB,}}2} = \pi _{{\text{HepatitisB,3}}} = 1\)), but not dose 1, then Hepatitis B would effectively have two of the three dose series satisfied.

Hall [16] formulates and analyzes a discrete optimization model that determines the set of vaccines (i.e., a vaccine formulary) that should be used in a clinical environment to satisfy any given childhood immunization schedule that minimizes the amount of extraimmunization. Therefore, the discrete optimization model in Hall [16] emphasizes satisfying the childhood immunization schedule first while trying to minimize extraimmunization whereas IP emphasizes prohibiting extraimmunization first while trying to satisfy the childhood immunization schedule as much as possible. Furthermore, the discrete optimization model in Hall [16] is not a binary integer program (BIP); however, all the decision variables in IP are binary, which offers theoretical advantages (see [27]). For example, IP shares a similar structure to the Max-Satisfiability Problem [17] or, when δ NE  = δ, to the Set-Partitioning Problem [27], both of which have been well-studied. In fact, IP can be shown to be NP-hard using Satisfiability or Set-Partitioning [15]. See Hall [16] for a complete analysis of the computational complexity and for special cases when IP is solvable in polynomial time.

Assigning weights to the objective function decision variables in IP offers a useful practical extension. Each vaccine has a known immunogenicity (the ability of the vaccine to immunize against a disease). For example, the immunogenicity of a Hepatitis B vaccine is >95%, which means that at least 95% of those children receiving the three dose series develop a protective antibody response against the disease [6]. However, for some vaccines, the immunogenicity increases with each dose. For example, for a Hepatitis B vaccine, the immunogenicity is 30–50% after the first dose, 75% after the second, and >95% after the third dose [6]. Therefore, if each objective function decision variable π dj in IP is weighted by immunogenicity, then IP maximizes the total immunogenicity of the vaccine formulary while prohibiting extraimmunization. More specifically, define Immdj as the immunogenicity for disease dD after dose j = 1,2,...,n d is administered, and let ω dj  = 1 − Immdj be the corresponding objective function coefficient for decision variable π dj . These objective function coefficients place more emphasis on the earlier vaccine dose requirements for each disease. For example, for disease dD, suppose Immd1 = .4 (after dose j = 1), Immd2 = .75 (after dose j = 2), and Immd3 = 0.95 (after dose j = 3), and hence, ω d 1 = .6, ω d 2 = .25, and ω d 3 = 0.05. Therefore, satisfying dose j = 1 for disease dD is much more attractive to the IP than satisfying dose j = 3 for disease dD, since IP is a maximization problem. Furthermore, these objective function coefficients assume that the earlier doses in the childhood immunization schedule are the most important (i.e., it is more important that a child receives the first dose of some vaccine that immunizes against disease dD (e.g., Hepatitis A) than to receive the fourth dose of some vaccine that immunizes against some other disease d’D (e.g., Polio)). This extension of IP assumes equivalent immunogenicities for each vaccine vV that immunizes against disease dD. Weighting the decision variables accordingly would require the additional constraint \(\pi _{d1} \geqslant \pi _{d2} \geqslant \ldots \geqslant \pi _{dn_d } \) for all diseases dD in the formulation of IP, which requires, for a given disease dD, that a vaccine for dose i must be administered before a vaccine for dose j may be administered for all doses i < j. The additional constraint \(\pi _{d1} \geqslant \pi _{d2} \geqslant \ldots \geqslant \pi _{dn_d } \) for all diseases dD is very restrictive; however, maximizing immunogenicity in a less restrictive manner will require that the constraints in the IP model be modified.

An additional extension to the model IP is to define a lower bound l d for each disease dD, where l d is a lower bound on the number of doses for disease dD that must be satisfied. Therefore, the additional constraint \(\sum\nolimits_{j = 1}^{n_d } {\pi _{dj} } \geqslant l_d \) for all diseases dD would enforce a lower bound on effectiveness while also prohibiting extraimmunization. Nevertheless, the inclusion of these constraints would likely yield an infeasible model for unreasonable values of l d . Theoretically, it is easy to construct an instance of IP that is infeasible for any lower bound greater than zero for a given disease dD (or subset of diseases); however, for all practical instances of IP, the given vaccine formulary will likely allow some doses to be satisfied for all diseases d   D, since combination vaccines are designed to immunize against diseases that share common dose requirements in the Recommended Childhood Immunization Schedule (e.g., compare pneumococcus and Haemophilus influenzae type b in Fig. 1).

3 A Dynamic programming algorithm and randomized rounding heuristic

This section presents a dynamic programming (DP) algorithm that finds an optimal solution for IP. The randomized rounding heuristic for IP is also presented. The randomized rounding heuristic is shown to be a randomized approximation algorithm, which by definition, executes in polynomial time and provides an approximation bound on the expected value of the heuristic solution [17].

3.1 Dynamic programming algorithm

The stages of the DP correspond to the time periods in T. A state, say S t, in stage tT is a vector of length δ, where \(S_d^t \) is the number of doses desired to be given for disease d   D through stage tT. Let m dt (M dt ) be the minimum (maximum) number of doses scheduled for disease dD through stage tT. For example, for the first disease in the Recommend Childhood Immunization Schedule (see Fig. 1), m 1 = (1, 1, 2, 2, 2, 2, 2, 3, 3, 3) and M 1 = (1, 2, 2, 2, 3, 3, 3, 3, 3, 3). The desired number of doses \(S_d^t \) should satisfy \(m_{dt} \leqslant S_d^t \leqslant M_{dt} \), and hence, the set of states for stage tT is \(\Omega _t = \left\{ {S^t \in Z^\delta :m_{dt} \leqslant S_d^t \leqslant M_{dt} {\text{ for all }}d \in D} \right\},\) where Z denotes the set of integers. For example, for the first four diseases in Fig. 1, the set of possible states in stage 5 are (2, 3, 3, 2), (2, 3, 3, 3), (3, 3, 3, 2), (3, 3, 3, 3).

\(S_d^t \) is the number of doses desired to be given for disease dD through stage tT. However, it may not be possible to administer all of the desired doses in a state due to the restriction on extraimmunization. Let Z(S t) be the maximum number of desired doses in S t that can be administered through stage tT without extraimmunization for any disease dD NE .

Moving from state S t− 1 in stage t −1 to state S t in stage t corresponds to administering vaccines in time period tT. If vaccination is permitted, it is desired to administer a vaccine for each disease dD such that \(S_d^t >S_d^{t - 1} \). Note that \(S_d^t \leqslant S_d^{t - 1} + 1\) because the doses for disease dD are mutually exclusive. Furthermore, note that it is infeasible to move from state S t− 1 in stage t –1 to state S t in stage t if \(S_d^t <S_d^{t - 1} \) for any disease dD, since it is impossible to reduce the number of doses that have been given for disease dD. Therefore, state S t− 1 in stage t −1 is connected to state S t in stage t if and only if b = S tS t−1 is a binary vector, and hence, for a given stage tT, it is desired to administer vaccines for the set of diseases \(D' = \left\{ {d \in D:{\mathbf{b}}_d = {\mathbf{S}}_d^t - {\mathbf{S}}_d^{t - 1} = 1} \right\}\), where b d is the dth component of the binary vector b. The set of vaccines that may be administered for the diseases in D’ is V’ = {vV: I vd  = 1 for some dD’ and I vd  = 0 for all dD NE \D’}. Once D’ and V’ are known, the problem is to find a subset of V’ that protects against as many diseases in D’ as possible without incurring extraimmunization for any disease dD NE . This problem has the same structure as IP, except that it contains only a single time period, and hence, it is called the One Period Problem (OPP(b)).

Define g(b) to be the optimal value of OPP(b) and recall that Z(S t) is the maximum number of desired doses in S t that can be given without extraimmunization. If Z(S t) is obtained by moving from S t− 1, then \(Z\left( {S^t } \right) = Z\left( {S^{t - 1} } \right) + g\left( b \right)\), where \(b = S^t - S^{t - 1} \). Since it is desired to maximize Z(S t) over all states from the previous stage that are connected to S t, the dynamic programming recurrence relation is

$$Z\left( {{\mathbf{S}}^t } \right) = \mathop {\max }\limits_{S^{t - 1} \in \Omega _{t - 1} :b = S^t = S^{t - 1} {\text{ is binary}}} Z\left( {{\mathbf{S}}^{t - 1} } \right) + g\left( {\mathbf{b}} \right).$$

The final stage contains a single state, namely S τ = (n 1, n 2,...,n δ), thus the optimal value is Z(S τ).

Theorem 1

The DP finds an optimal solution for IP.

Proof

This result follows by demonstrating that the optimal solution to IP is embedded in the state space of the DP, which is shown by demonstrating that the state space for stage t is sufficiently connected to the state space in stage t− 1 (t = 2,3,...,τ) and that Z(S τ) equals the optimal value of IP. See Appendix 1 for a detailed proof. □

Note that OPP(b) must be solved each time the recurrence relation is used in the DP; however, for a given stage tT, the binary vector b, which denotes the set of diseases for which a vaccine dose is desired, may have already been encountered in a previous stage, and hence, there is no need to resolve OPP(b) if the solution has been stored and is available. A “branch and remember” algorithm was used in the computational experiments reported in Section 4 (see the Appendix 1 for the pseudo-code for this algorithm.)

The DP algorithm offers several theoretically and computationally advantages. First, the DP algorithm is efficient in practice with the current Recommended Childhood Immunization Schedule, since this schedule yields a reasonable state/decision space, and the OPP(b) instances in each stage tT are small. Second, the structure of the DP algorithm is ideal for solving partial (or incomplete) childhood immunization schedules that arise when children have been partially immunized and then re-enter the healthcare system to complete their immunization schedules (where the set of vaccines used to partially immunize the children are feasible initial solutions for the DP algorithm; this problem is termed the schedule completion problem. Note that if this set of vaccines is not feasible, then the recommended catch-up schedule would need to be used, which would require a completely new model formulation and analysis; see [5] for details of this catch-up schedule). Lastly, the structure of the DP algorithm makes it easier to capture restrictions that are schedule-specific by imposing such restrictions on each OPP(b) instance (see [5, 16]).

3.2 Randomized rounding heuristic

The randomized rounding heuristic uses the solution from an LP relaxation to construct a feasible binary solution for IP. This technique has been applied successfully for other discrete optimization problems that share a common structure with IP (e.g., [17, 28]). Relaxing the binary constraints for the decision variables in IP yields the LP relaxation, LP, with objective function (O), constraints (13), and real decision variables 0 ≤ X tv  ≤ 1 for all tT, vV and 0 ≤ π dj  ≤ 1 for all dD, j = 1,2,...,n d . Denote the optimal objective function values for IP and LP, as z IP and z LP , respectively, where z LP z IP . Let \(X_{LPtv}^ * \) denote the optimal decision vector for LP, with \(X_{LPtv}^ * \), tT, vV, and \(\pi _{dj}^* \), dD, j = 1,2,...,n d , denoting the optimal decision variable values for LP. After solving LP, the randomized rounding heuristic assigns binary decision variable X tv  = 1(0) with probability \(X_{LPtv}^ * \left( {1 - X_{LPtv}^ * } \right)\) for each time period tT and vaccine vV. Note that constraint (3) in LP implicitly eliminates (i.e., sets to zero) all decision variables \(X_{LPtv}^ * \) such that R dt  = I vd  = 1 for dD NE , and hence, the corresponding binary variable X tv  = 0. This binary variable assignment is then used to determine the number of satisfied doses. The randomized rounding heuristic is now formally given in pseudo-code form.

Randomized rounding heuristic for IP

  1. Step 1.

    Solve LP

  2. Step 2.

    For i = 1,2,...,K (where K is the user specified number of replications)

    1. a.

      X tv ← 0 for all tT and vV

    2. b.

      π dj ← 0 for all dD and j = 1,2,...,n d

    3. c.

      For all tT and vV

      1. i.

        Draw a random number RAND ~ U(0,1)

      2. ii.

        If \(X_{LP_{tv} }^* \geqslant RAND\), then X tv ←1

    4. d.

      For all dD NE

      1. i.

        For all j = 1,2,...,n d

        1. 1.

          \(\hat \pi _{dj} \leftarrow \sum _{t \in T} \sum _{v \in V} P_{djt} \;X_{tv} I_{vd} \)

        2. 2.

          If \(\hat \pi _{dj} >1\), then set X tv ← 0 such that X tv  = 1 and P djt  = I vd  = 1 for the \(\left( {\widehat \pi _{dj} = 1} \right)\) variables with corresponding smallest fractional values \(X_{LPtv}^ * \)

    5. e.

      For all dD

      1. i.

        For all j = 1,2,...,n d

        1. 1.

          Compute \(\hat \pi _{dj} \leftarrow \sum _{t \in T} \sum _{v \in V} P_{djt} \;X_{tv} I_{vd} \)

          1. a.

            If dD NE and \(\hat \pi _{dj} = 1\), then π dj  ← 1

          2. b.

            If dD E and \(\hat \pi _{dj} >1\), then π dj  ← 1

    6. f.

      Compute \({\text{SatisfiedDoses}}\left( {\text{i}} \right) = \sum\nolimits_{d \in D} {\sum\nolimits_{j = 1}^{n_d } {\pi _{dj} } } \) for replication i

  3. Step 3.

    Return \(\mathop {\max }\limits_i {\text{Satisfied}}\,{\text{Doses}}\left( i \right)\)

The randomized rounding heuristic executes in O(T LP  + K(τ 2 νδ)) time, where T LP is the time required to solve LP. Steps 2.d and 2.e ensure that the solution returned by the randomized rounding heuristic is feasible. Given that linear programming is solvable in polynomial time (assuming an appropriate algorithm; see [1]), it then follows that the randomized rounding heuristic executes in polynomial time. Theorem 2 shows that the expected value of the solution returned by the randomized rounding heuristic is guaranteed to be no worse than ξ ×z IP , where \(\xi = \left( {1 - p_{max} } \right)^{\alpha - 1} \) for \({\text{ }}p_{{\text{max}}} = \mathop {\max }\limits_{t \in T,v \in V:0 <X_{LP_{tv} }^* <1} \left\{ {X_{LP_{tv} }^* } \right\}\) and \(\alpha = \mathop {\max }\limits_{d \in D} \;\left( {\Sigma _{v \in V} I_{vd} } \right)\left( {\mathop {\max }\limits_{j = 1,2,...,n_d } \Sigma _{t \in T} P_{djt} } \right)\), which is the maximum number of non-zero columns in any row of the constraint matrix for IP.

Theorem 2

The Randomized Rounding heuristic is a randomized ξ-approximation algorithm for IP.

Proof

See the Proof of Theorem 2 in Appendix 1 for a detailed proof. □

Theorem 2 shows that, in the worst case, the randomized rounding heuristic could return a solution that is arbitrarily close to zero; however, the computational results in Section 4 show that the randomized rounding heuristic performs well in practice for several randomly generated IP instances.

4 Computational results

This section reports computational results comparing the randomized rounding heuristic for IP with the DP algorithm and an IP branch and bound (IP B&B) algorithm that both optimally solve IP. The randomized rounding heuristic and the DP and IP B&B algorithms were executed on three sets of test problems to demonstrate their computational effectiveness and limitations. The first test problem is the 2006 Recommended Childhood Immunization Schedule. The second set of test problems are randomly generated based on hypothetical near-term future childhood immunization schedules, while the third set of test problems are larger, randomly generated childhood immunization schedules executed with several different vaccine sets. The size of these randomly generated childhood immunization schedules assume that future Recommended Childhood Immunization Schedules will expand to include more diseases and time periods, and hence, will require a larger number of both monovalent and combination vaccines [9, 11, 18].

The solution quality effectiveness measure for the randomized rounding heuristic and the DP and IP B&B algorithms is ζ, which is the value of the objective function Z divided by the total number of required doses, and hence, \({{\zeta = Z} \mathord{\left/ {\vphantom {{\zeta = Z} {\Sigma _{d = 1}^\delta n_d }}} \right. \kern-\nulldelimiterspace} {\Sigma _{d = 1}^\delta n_d }}\). Therefore, ζ represents the percentage of dose requirements satisfied by the randomized rounding heuristic or the DP and IP B&B algorithms. The execution time (in CPU seconds) is also reported for the heuristic and each exact algorithm, which is the efficiency effectiveness measure. The randomized rounding heuristic and the DP and IP B&B algorithms were coded and executed in MATLABv7.0 on a 2.4 GHz Pentium IV. The IP B&B algorithm used the existing binary solver in the MATLAB optimization toolbox.

The first test problem is the 2006 Recommended Childhood Immunization Schedule (see Fig. 1). Therefore, D = {1 = Hepatitis B, 2 = Diphtheria–Tetanus–Pertussis, 3 = Haemophilus influenzae type b, 4 = Polio, 5 = Measles–Mumps–Rubella, 6 = Varicella, 7 = Pneumococcus, 8 = Influenza, 9 = Hepatitis A} with dose vector n = (3, 5, 4, 4, 2, 1, 4, 1, 2), since diphtheria, tetanus, and pertussis are considered one disease and measles, mumps, and rubella are also considered one disease, and T = {1,2,...,10}. The schedule parameters P djt , Q dt , and R dt , for diseases dD, dose j = 1,2,...,n d , and time periods tT are all obtained from Fig. 1. For example, for disease d = 1 = Hepatitis B and dose j = 2, P djt  = 1(0) for time periods t = 2,3(1,4,5,6,7,8,9,10). Two different vaccine formularies, V 1 and V 2, are evaluated on two different sets of diseases that restrict extraimmunization, D NE 1 and D NE 2. The vaccine formularies are V 1 = {1 = {1}, 2 = {2}, 3 = {3}, 4 = {4}, 5 = {5}, 6 = {6}, 7 = {7}, 8 = {8}, 9 = {9}, 10 = {2,3}, 11 = {1,3}, 12 = {1,2,4}, 13 = {5,6}} and V 2 = {1 = {1}, 2 = {2,3,4}, 3 = {1,9}, 4 = {4}, 5 = {5}, 6 = {6}, 7 = {7}, 8 = {8}, 9 = {9} 11 = {1,3}, 12 = {1,2,4}, 13 = {5,6}, 14 = {1,2,3,4}}, where V 1 represents a set of pediatric vaccines currently licensed for use in the United States and V 2 represents a formulary of pediatric vaccines with fewer monovalent vaccines and more combination vaccines, some of which are not yet licensed for use in the United States, but are projected to be in the future. The parameters I vd are indicated within the sets V 1 and V 2, respectively. For example, vaccine 1 V 1 is the monovalent vaccine for disease 1 (Hepatits B), vaccine 12 V 1 is the combination vaccine Pediarix® that immunizes against diseases 1 (Hepatitis B), 2 (Diphtheria-Tetanus-Pertussis), and 4 (Polio), and vaccine 2 V 2 is the combination vaccine Pentacel® that immunizes against diseases 2 (Diphtheria-Tetanus-Pertussis), 3 (Haemophilus influenzae type b), and 4 (Polio), which has been submitted for licensing in the United States. The disease sets are D NE 1 = {1,2,3,4} and D NE 2 = D = {1,2,3,4,5,6,7,8,9}. Table 1 reports the solution quality and execution time (in CPU seconds) for the heuristic and the exact algorithms, for each vaccine set and disease set combination. Furthermore, K = 10 in the randomized rounding heuristic for each vaccine set and disease set combination.

Table 1 Computational results for 2006 recommend childhood immunization schedule

IP is solvable in polynomial time when all the vaccines in V are monovalent vaccines or when there exists a corresponding monovalent vaccine for every disease in D, and hence, the results for the solution quality and execution time reported in Table 1 are not surprising, given that most diseases have a corresponding monovalent vaccine, particularly in vaccine set V 1 [16]. Moreover, the randomized rounding heuristic returned the optimal solution for each case, but, in most cases, took more time to execute than the exact algorithms since the random rounding was replicated K = 10 times. However, as the next set of test problems will illustrate, this is unlikely to occur for future Recommended Childhood Immunization Schedules, as the schedule expands and more combination vaccines gain FDA approval.

The second set of test problems considers hypothetical near-term future childhood immunization schedules. The randomized rounding heuristic and the DP and IP B&B algorithms were executed on 30 randomly generated childhood immunization schedules with 15 time periods, 11 diseases, and a vaccine formulary of 30 vaccines. Therefore, each random childhood immunization schedule reflects a gradual expansion in the sets D (from 9 to 11 diseases) and T (from 10 to 15 time periods) and a modest increase in the number of available vaccines, particularly, combination vaccines. In each random childhood immunization schedule, 1 ≤ n d  ≤ 5 for all diseases dD, 1 ≤ Val(v) ≤ 6 for all vaccines vV, and P djt  = 1 for at most three time periods tT for every disease dD and dose j = 1,2,...,n d . For each randomly generated childhood immunization schedule, the randomized rounding heuristic and both exact algorithms were executed three times, where in execution 1, 2, and 3, d NE  = 4, 8, and 11, respectively. Furthermore, in each execution, the randomized rounding heuristic was replicated for K = 100, 300, and 500. Table 2 reports the solution quality and execution time (in CPU seconds) averaged over the 30 random childhood immunization schedules for each value of d NE (and K for the randomized rounding heuristic). An additional measure λ, which indicates the number of childhood immunization schedules that the respective heuristic or algorithm found the optimal solution, is also reported. For the IP B&B algorithm, the optimal solution was not always found because the algorithm exceeded the default execution time limit (2 h) or default iteration limit (107). Therefore, the statistics reported in Table 2 for the IP B&B algorithm are averaged over the λ random childhood immunization schedules for which the algorithm found the optimal solution.

Table 2 Computational results for future childhood immunization schedule

The data reported in Table 2 show that the randomized rounding heuristic found better solutions when δ NE  << δ, which is consistent with Theorem 2. Recall from the proof of Theorem 2 that it can be shown that the bound on the expected solution value returned by the randomized rounding heuristic improves when D NE  = ∅ (i.e., δ NE  = 0). Moreover, the randomized rounding heuristic, on average, was always more efficient than the IP B&B algorithm. Furthermore, the IP B&B algorithm’s ability to find an optimal solution in a ‘reasonable’ amount of time and memory was sensitive to the value of d NE . Moreover, the randomized rounding heuristic was more efficient than the DP algorithm for δ NE  = 8 and 11, provided K = 100. The DP algorithm was both efficient (outperforming the randomized rounding heuristic for K = 300 and 500 and the IP B&B algorithm in all cases) and effective (found an optimal solution in each execution).

The observed difference in execution time between the randomized rounding heuristic or DP algorithm and the IP B&B algorithm reported in Table 2 could become problematic for larger childhood immunization schedules and/or for practical uses. For example, a webpage used to find a vaccine formulary for a given childhood immunization schedule that prohibits extraimmunization would require an algorithm to execute in real-time, since most web users would terminate a web application that required several seconds or minutes to execute. Moreover, the difference in execution time between the randomized rounding heuristic or DP algorithm and the IP B&B algorithm will provide a more efficient analysis of larger childhood immunization schedules that may involve Monte Carlo simulation (see [22]), where either of these may require the solution of hundreds of thousands of integer programming instances. Furthermore, the childhood immunization schedule may need to be solved for each child, on a case-by-case basis, depending on the desired diseases in the set D NE , and hence, efficient algorithms are needed so as to provide, in real-time, practical value for the public health community.

The third set of test problems considers larger randomly generated childhood immunization schedules that demonstrate the effect of combination vaccines and further demonstrate how a childhood immunization schedule’s size affects the efficiency and solution quality of the randomized rounding heuristic and the exact algorithms. The randomized rounding heuristic and the exact algorithms were executed on 30 randomly generated childhood immunization schedules with 20 time periods, 13 diseases, and a vaccine formulary of 50 vaccines, where δ NE  = 10, 1≤ n d  ≤ 5 for all diseases dD, and P djt  = 1 for at most four time periods tT for every disease dD and dose j = 1,2,...,n d . For each randomly generated childhood immunization schedule, the randomized rounding heuristic and both exact algorithms were executed six times, where for execution i = 1,2,...,6, Val(v) ≤ i for all vaccines vV. Table 3 reports the solution quality and execution time (in CPU seconds) averaged across all 30 randomly generated childhood immunization schedules as well as the measure λ for each valency. The overall average and standard deviation across all vaccine sets is also reported. Note that the IP B&B algorithm found the optimal solution for 151 of the 180 instances (each random childhood immunization schedule was executed six times), but exceeded the maximum number of iterations allowed for the remaining 29 instances (three instances for Val(v) ≤ 4, 11 instances for Val(v) ≤ 5, and 15 instances for Val(v) ≤ 6). The statistics reported in Table 3 for the IP B&B algorithm are for the 151 instances for which the IP B&B algorithm found the optimal solution while the execution time in parenthesis is the average over all instances. Furthermore, K = 300 in each execution of the randomized rounding heuristic.

Table 3 Computational Results for the Effect of Combination Vaccines

The data reported in Table 3 demonstrate how the size of the childhood immunization schedule and the complexity of the vaccine set impact the execution time and solution quality of the randomized rounding heuristic and the exact algorithms. For example, in most cases, the execution time required to execute the heuristic and exact algorithms steadily increased as the valency (i.e., complexity) of the vaccine formulary increased. As expected, the solution quality of the randomized rounding heuristic deteriorated as the valency of the vaccine formulary increased, since as the valency of the vaccine formulary increases, the number of fractional variables in LP should increase (since each decision variable is present in more constraints), which results in a larger gap between the LP optimal solution value and the IP optimal solution value. Therefore, since the randomized rounding heuristic solution is derived from the LP optimal solution, the gap between the randomized rounding heuristic solution value and IP optimal solution value also increases. Theorem 2 offers theoretical support for this claim, since p max is likely to be larger as the valency of the vaccine formulary increases. However, the execution time of the randomized rounding heuristic was insensitive to the valency of the vaccine formulary versus the exact algorithms, which were more sensitive to the valency of the vaccine formulary. Furthermore, the randomized rounding heuristic was more efficient than both exact algorithms; however, the DP algorithm was twice as fast as the IP B&B algorithm on average for the instances that the IP B&B algorithm was able to optimally solve and was nearly 500 times as fast as the IP B&B algorithm when averaged across all 180 instances. Moreover, the DP algorithm effectively solved all 180 instances to optimality.

5 Conclusion and research extensions

This paper reports results on applying operations research techniques to further examine the issue of extraimmunization in pediatric immunization by extending the results in Hall [16]. There are several articles in the literature that report research that applies operations research techniques to pediatric immunization problems (e.g., [2022, 2426]). This paper moves in a new direction by applying discrete optimization techniques to address the issue of pediatric vaccine extraimmunization. A discrete optimization model that seeks to maximize the effectiveness of a vaccine formulary while prohibiting extraimmunization for a given childhood immunization schedule was formulated in Section 2. As more combination vaccines come to market and the Recommended Childhood Immunization Schedule becomes more complex to include more diseases and cover more time periods, this discrete optimization model will capture the combinatorial explosion of alternatives for public health policy-makers and administrators, vaccine manufacturers, pediatricians, and parents/guardians by identifying vaccine formularies and schedules that safely use combination vaccines, which will help address safety concerns in pediatric immunization, reduce costs, and reduce vaccine wastage associated with extraimmunization.

An exact DP algorithm as well as a randomized rounding heuristic were presented in Section 3 and compared computationally to an IP B&B algorithm in Section 4. These results showed that for larger randomly generated childhood immunization schedules, the randomized rounding heuristic was more efficient than the IP B&B algorithm and provided a reasonable solution quality that, on average, was within 5% of the optimal solution value. Furthermore, the randomized rounding heuristic was able to find a feasible solution in seconds for all random childhood immunization schedules, whereas the IP B&B algorithm often exceeded default time and memory limits when \(\mathop {\max }\limits_{v \in V} Val\left( v \right) \geqslant 4\). Moreover, the results showed the DP algorithm to be significantly more efficient and effective than the IP B&B algorithm. Therefore, for practical purposes, the results suggest that the DP algorithm is the algorithm of choice, since it is an efficient exact algorithm. However, with the significant expansion of the Recommended Childhood Immunization Schedule over the past decade (see [5, 9]) and with numerous new pediatric vaccines being considered (see [18]), the randomized rounding heuristic will allow more efficient analysis of larger childhood immunization schedules and practical analysis involving Monte Carlo simulation or finding an optimal vaccine formulary for each child on a case-by-case basis, which will require the solution of several unique integer programming instances. Advances in biotechnology and differing perspectives on the safety of combination vaccines will only heighten the possibility for considering each child on a case-by-case basis (see [19]).

Work is in progress to determine the trade-off of satisfying a given childhood immunization schedule while minimizing extraimmunization versus strictly prohibiting extraimmunization and maximizing the number of vaccine doses satisfied, since both of these alternatives incur different costs. The model presented in this paper will allow such trade-offs to be analyzed. Moreover, further examining the model extensions (theoretically and empirically) presented in Section 2 is an area of future research since these extensions offer practical value but may also significantly alter the structure of the model, which would require entirely different solution techniques. Finally, work is in progress to further extend the results reported by determining new heuristics for IP that either empirically provide better solutions and/or improve the existing approximation bound in Section 3.