# Correction to: Trading Off Worst and Expected Cost in Decision Tree Problems

## 1 Correction to: Algorithmica (2017) 79:886–908 https://doi.org/10.1007/s00453-016-0211-2

**Abstract** This erratum fixes a technical problem in the paper published in Algorithmica, Volume 79, Number 3, November 2017, pp. 886–908. Theorem 1 of that paper gives upper bounds on both the worst testing cost and the expected testing cost of the decision tree built by Algorithm 1. Although the statement is correct, the proof presented in the paper is flawed: it relies on the analysis of a nonlinear program (NLP) given by Eqs. (5)–(9), which is not convex, contrary to what is claimed in Appendix A.2. In this erratum we present a correct proof of Theorem 1. Instead of analyzing the NLP, we analyze a related linear program.

## 2 Notation and Algorithm 1

To keep this erratum self-contained, we recall some notation and the description of Algorithm 1 introduced in [1].

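For a decision tree *D* and an object *s*, we define *cost*(*D*, *s*) as the sum of the tests’ cost on the path from the root of *D* to the leaf associated with object *s*. Then, the *worst testing cost* and the *expected testing cost* of *D* are, respectively, defined as

\[cost_W(D) = \max_{s \in S} cost(D,s) \quad \text{and} \quad cost_E(D) = \sum_{s \in S} cost(D,s)\, p(s),\]

where \(p(s)\) denotes the probability of object *s*.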
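The two cost measures can be illustrated on a toy tree. The following is a minimal sketch, not code from [1]: the `Node` class, the function names, the tree and the probabilities are all assumptions made for illustration. It computes *cost*(*D*, *s*) for every object and then takes the maximum and the probability-weighted sum.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """Hypothetical tree node: an internal node carries a test, a leaf an object."""
    test_cost: int = 0                      # cost of the test at this node (0 at a leaf)
    children: List["Node"] = field(default_factory=list)
    obj: Optional[str] = None               # object associated with a leaf


def leaf_costs(node: Node, cost_so_far: int = 0) -> Dict[str, int]:
    """Return cost(D, s) for every object s in the subtree rooted at `node`."""
    if node.obj is not None:
        return {node.obj: cost_so_far}
    costs: Dict[str, int] = {}
    for child in node.children:
        # Every test on the root-to-leaf path contributes its cost.
        costs.update(leaf_costs(child, cost_so_far + node.test_cost))
    return costs


def worst_testing_cost(tree: Node) -> int:
    """Largest cost(D, s) over all objects s."""
    return max(leaf_costs(tree).values())


def expected_testing_cost(tree: Node, prob: Dict[str, float]) -> float:
    """Sum of cost(D, s) weighted by the probability of s."""
    return sum(prob[s] * c for s, c in leaf_costs(tree).items())


# Toy instance (illustrative values only).
tree = Node(test_cost=2, children=[
    Node(obj="a"),
    Node(test_cost=3, children=[Node(obj="b"), Node(obj="c")]),
])
prob = {"a": 0.5, "b": 0.25, "c": 0.25}

print(leaf_costs(tree))                   # {'a': 2, 'b': 5, 'c': 5}
print(worst_testing_cost(tree))           # 5
print(expected_testing_cost(tree, prob))  # 3.5
```

In this toy tree the frequent object `a` is identified cheaply, so the expected testing cost (3.5) is well below the worst testing cost (5).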

Given a parameter *j* and two decision trees \(D_E\) and \(D_W\) for instance *I*, the procedure CombineTrees(\(D_E\), \(D_W\), *j*) (see Algorithm 1) constructs a new decision tree \(D^j\) for *I* whose worst testing cost exceeds that of \(D_W\) by at most *j*, i.e., \(cost_W(D^j) \le j + cost_W(D_W).\) Our algorithm uses the definition of a *j-replaceable node*, by which we mean a node *v* in *D* such that the total cost of the tests on the path from the root of *D* to *v* (including *v*) is larger than *j* and the cost of the path from the root of *D* to the parent of *v* is smaller than or equal to *j*. The procedure Trade-Off repeatedly uses CombineTrees to create several decision trees (one of these trees being \(D_W\)) with increasing worst testing cost and chooses the one with the best expected testing cost. We will show that in this way it guarantees the best possible trade-off.
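The definition of a *j*-replaceable node can be made concrete with a short sketch. This is not the CombineTrees procedure of [1]; it only locates the *j*-replaceable nodes of a tree. The `Node` class, the node names and the toy tree are hypothetical, and the root is treated as having an empty parent path of cost 0, which is an assumption of this sketch.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical tree node; leaves have test_cost 0 and no children."""
    name: str
    test_cost: int = 0
    children: List["Node"] = field(default_factory=list)


def replaceable_nodes(node: Node, j: int, cost_to_parent: int = 0) -> List[str]:
    """Names of the j-replaceable nodes in the subtree rooted at `node`.

    `cost_to_parent` is the total test cost from the root to the parent of
    `node` (assumed 0 for the root). The recursion only descends below nodes
    whose path cost is <= j, so `cost_to_parent <= j` always holds here.
    """
    cost_to_node = cost_to_parent + node.test_cost
    if cost_to_node > j:
        # Path to node (inclusive) exceeds j, path to its parent does not.
        return [node.name]
    found: List[str] = []
    for child in node.children:
        found.extend(replaceable_nodes(child, j, cost_to_node))
    return found


# Toy tree: path costs are t1 = 2, t2 = 5, t3 = 6 (illustrative values only).
tree = Node("t1", 2, [
    Node("a"),
    Node("t2", 3, [
        Node("b"),
        Node("t3", 1, [Node("c"), Node("d")]),
    ]),
])

for j in (0, 2, 5, 6):
    print(j, replaceable_nodes(tree, j))
# 0 ['t1']
# 2 ['t2']
# 5 ['t3']
# 6 []
```

No *j*-replaceable node is a descendant of another, since the search stops at the first node on each path whose cumulative cost exceeds *j*.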

### Proposition 1

The decision tree \(D^j\) returned by CombineTrees has worst testing cost at most \(j+cost_W(D_W)\).

## 3 Correct Analysis of Algorithm 1

Now we analyze the decision tree \(D = D^{j^*}\) output by Trade-Off(\(D_E, D_W, C\)), where *C* is an integer parameter. Notice that *D* is the decision tree with minimum expected testing cost among the decision trees \(D^0,D^{1},D^{2},\ldots ,D^{C}\), where \(D^j\) is the decision tree returned by CombineTrees(\(D_E,D_W,j\)). It follows from the previous proposition that \(cost_W(D) \le C+ cost_W(D_W).\)
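The selection step described above can be sketched as follows. This is only an outline of the outer loop under stated assumptions: `combine_trees` and `expected_cost` are hypothetical callables supplied by the caller (standing in for CombineTrees and \(cost_E\)), and the numbers in the usage example are made up, not results from [1].

```python
from typing import Any, Callable, Tuple


def trade_off(
    d_e: Any,
    d_w: Any,
    C: int,
    combine_trees: Callable[[Any, Any, int], Any],
    expected_cost: Callable[[Any], float],
) -> Tuple[int, Any]:
    """Return (j*, D^{j*}) with minimum expected cost among D^0, ..., D^C.

    Ties are broken in favour of the smallest j, which is a choice of this
    sketch rather than something specified in the text.
    """
    candidates = [combine_trees(d_e, d_w, j) for j in range(C + 1)]
    best_j = min(range(C + 1), key=lambda j: expected_cost(candidates[j]))
    return best_j, candidates[best_j]


# Usage with stand-ins: each "tree" D^j is represented by the integer j,
# and its expected cost is looked up in a table of invented values.
toy_expected = {0: 9.0, 1: 7.5, 2: 6.0, 3: 6.0}
j_star, chosen = trade_off(
    "D_E", "D_W", 3,
    combine_trees=lambda d_e, d_w, j: j,
    expected_cost=lambda tree: toy_expected[tree],
)
print(j_star)  # 2
```

Passing the two procedures as parameters keeps the sketch independent of any particular tree representation.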

The analysis of the expected testing cost of *D* is more involved. In order to simplify the notation we will let \(W = cost_W(D_W).\) We also assume for simplicity in the following that test costs are integers. Given a decision tree \(D'\) and an object/leaf \(s \in S\) with \(cost(D',s) = \kappa \) we will say that *s* has cost \(\kappa \) in \(D'\).

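For \(i = 1,\ldots ,C\), let \(p_i\) be the sum of the probabilities of the objects with cost *i* in \(D_E\) and \(p_{C+1}\) be the sum of the probabilities of the objects with cost larger than *C* in \(D_E\). Clearly:

\[cost_E(D_E) \ge \sum_{i=1}^{C+1} i \cdot p_i .\]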

For each *j*, the objects with cost larger than *j* in \(D_E\) have cost at most \(j+W\) in \(D^j\). Moreover, for a probability distribution \(\mathbf{q}=(q_1,\ldots ,q_{C+1})\), let

### Lemma 1

### Proof

Thus, by setting \(C = \lfloor \rho W \rfloor \) we get the following theorem.
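The effect of this choice of *C* on the worst testing cost can be checked numerically. The sketch below is an illustration only, with hypothetical function names and arbitrary sample values of \(\rho \) and *W*; it uses exact rational arithmetic to confirm that \(C + W\) never exceeds \((1+\rho )W\) when \(C = \lfloor \rho W \rfloor \).

```python
from fractions import Fraction
from math import floor


def choose_C(rho: Fraction, W: int) -> int:
    """C = floor(rho * W), computed exactly with rationals."""
    return floor(rho * W)


def worst_cost_bound_holds(rho: Fraction, W: int) -> bool:
    """Check C + W <= (1 + rho) * W for C = floor(rho * W)."""
    return choose_C(rho, W) + W <= (1 + rho) * W


# Arbitrary sample values (not taken from the paper).
samples = [(Fraction(1, 2), 7), (Fraction(1), 4), (Fraction(1, 10), 5)]
for rho, W in samples:
    print(rho, W, choose_C(rho, W), worst_cost_bound_holds(rho, W))
# 1/2 7 3 True
# 1 4 4 True
# 1/10 5 0 True
```

When \(\rho W < 1\) the choice gives \(C = 0\), so Trade-Off only considers \(D^0\).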

### Theorem 1

Fix an instance *I* of the decision tree optimization problem and let \(D_E\) be a decision tree such that \(cost_E(D_E) = OPT_E(I).\) For every \(\rho > 0\) there exists a decision tree *D* such that

### Proof

The decision tree *D* output by Trade-Off(\(D_E, D_W, C\)) has worst testing cost at most \(C+W \le (1+\rho )W\) and expected testing cost smaller than

## Reference

- 1. Saettler, A., Laber, E., Cicalese, F.: Trading off worst and expected cost in decision tree problems. Algorithmica **79**(3), 886–908 (2017)