Reduced cost-based variable fixing in two-stage stochastic programming

Crainic, Teodor G.; Maggioni, Francesca; Perboli, Guido; Rei, Walter

doi:10.1007/s10479-018-2942-8

Reduced cost-based variable fixing in two-stage stochastic programming

S.I. : Stochastic Modeling and Optimization, in memory of András Prékopa
Published: 19 June 2018

(2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Annals of Operations Research Aims and scope Submit manuscript

Reduced cost-based variable fixing in two-stage stochastic programming

Download PDF

Teodor G. Crainic^1,2,
Francesca Maggioni ORCID: orcid.org/0000-0003-3968-1934³,
Guido Perboli^2,4 &
…
Walter Rei^1,2

487 Accesses
12 Citations
1 Altmetric
Explore all metrics

Abstract

The explicit consideration of uncertainty is essential in addressing most planning and operation issues encountered in the management of complex systems. Unfortunately, the resulting stochastic programming formulations, integer ones in particular, are generally hard to solve when applied to realistically-sized instances. A common approach is to consider the simpler deterministic version of the formulation, even if it is well known that the solution quality could be arbitrarily bad. In this paper, we aim to identify meaningful information, which can be extracted from the solution of the deterministic problem, in order to reduce the size of the stochastic one. Focusing on two-stage formulations, we show how and under which conditions the reduced costs associated to the variables in the deterministic formulation can be used as an indicator for excluding/retaining decision variables in the stochastic model. We introduce a new measure, the Loss of Reduced Costs-based Variable Fixing (LRCVF), computed as the difference between the optimal values of the stochastic problem and its reduced version obtained by fixing a certain number of variables. We relate the LRCVF with existing measures and show how to select the set of variables to fix. We then illustrate the interest of the proposed LRCVF and related heuristic procedure, in terms of computational time reduction and accuracy in finding the optimal solution, by applying them to a wide range of problems from the literature.

On a conservative partition refinement (CPR) method for a class of two-stage stochastic programming problems

Article 20 January 2022

Pareto Adaptive Robust Optimality via a Fourier–Motzkin Elimination lens

Article 30 June 2023

Decision-dependent probabilities in stochastic programs with recourse

Article Open access 11 August 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The explicit consideration of uncertainty is essential in addressing most management problems, particularly for the planning and operations of complex systems in transportation, logistics, finance, marketing, energy, health care, production, to name but a few important areas (Prékopa 1995; Kall and Wallace 1994; Gaivoronski 2005; Birge and Louveaux 2011; King and Wallace 2012).

Two-stage stochastic programs offer a classical modelling framework for those problems, strategic and tactical planning formulations, see in particular Birge and Louveaux (2011). In such programs, the first stage groups all decisions to be implemented before the realization of the random variables representing the stochastic parameters of the problem. In the second stage, all random information becomes known and a set recourse actions are taken to adjust the decisions made in the previous stage. The two-stage stochastic model then optimizes (without loss of generality, we use minimization in the following) a total system cost combining the cost of the first stage decisions, plus the expected cost of the recourse over all possible realizations of the random variables (the developments in this paper may be extended to the multistage case, but for simplicity of presentation we focus on the two-stage case).

Stochastic programs, in particular stochastic integer ones, are known to be generally very difficult, if not close to impossible, to address for realistically-sized instances. A formulation approximating the original stochastic model is then often used. This approximation generally takes the form of a deterministic formulation, such as the expected-value problem, obtained by replacing the random parameters with their expected values (or with other single-point forecast) or the extensive form of the equivalent deterministic problem obtained through sampling of a finite number of scenarios (Birge and Louveaux 2011). Due to its generally very large dimensions produced by the scenario approximation, the latter is generally not much easier to address than the stochastic one, particularly for formulations involving integer-valued decision variables. As for the former, it is known that the use of single-point forecasts can lead to finding arbitrarily bad solutions when compared to the optimal solution of the stochastic program, e.g., Lium et al. (2009).

But what insights can be derived from an optimal expected-value solution even if its single-point forecast defines an inaccurate estimator of the stochastic parameters of the considered problem? Specifically, two important questions then arise: (1) what can be inferred about the optimal stochastic solution from this optimal deterministic solution even when it is not of high quality and (2) can we use this information to reduce the computational effort of the stochastic program without affecting the stochastic solution quality?

We proceed in two steps: the first aims to achieve a deeper understanding of the relation between the expected-value and the stochastic solutions. What could be identified as “inherited" from the former to the latter? Can we identify a subset of variables with zero value in the deterministic solution to fix at zero in the stochastic formulation in order to guide the search toward the optimal stochastic solution? In the affirmative, are the reduced costs of the optimal solution of the (continuous relaxation of, in the case of integer formulations) deterministic problem a good estimation of bad/good variables to include into the stochastic solution? Can we infer a general trend from the several cases considered or is the behavior of the deterministic solution problem dependent?

To achieve these goals, we introduce the Loss of Reduced Costs-based Variable Fixing LRCVF, a measure of the badness/goodness of deterministic solutions based on the information offered by the reduced costs of the solution of the (continuous relaxation of the) deterministic formulation. We relate LRCVF to other measures present in the literature, the Value of the Stochastic Solution (VSS) (Birge 1982) and the Loss Using the Skeleton Solution (LUSS) (Maggioni and Wallace 2012). We then show experimentally that the LRCVF helps to identify the “good” variables that the stochastic solution should inherit from the expected-value deterministic solution and thus, provides better insights into what defines the structure of the solution to the stochastic programming model than VSS and LUSS.

We analyze, in the second step, the general trends observed during the Step 1 of the experimental campaign. This analysis aims to identify how the reduced costs associated to the non basic variables in the expected-value deterministic solution can be used to guide the selection of the variables to exclude from the stochastic formulation, making it solvable for larger instances, while preserving the quality of the final solution. The skeleton of a first heuristic procedure implementing the hints provided by this analysis is then experimentally evaluated on a large set of problems from the literature, including some large-sized stochastic Traveling Salesman Problem (TSP) instances (Ahmed et al. 2015). The results illustrate the performance and interest of LRCVF and the heuristic idea.

To sum up, the main contributions of this paper are to:

1.
Provide a more comprehensive understanding of the structure of the optimal solution of two-stage stochastic problems and its links to the optimal solution of the expected-value corresponding deterministic version (its linear relaxation for integer formulations);
2.
Define LRCVF, a new measure of goodness/badness of the deterministic solution with respect to the stochastic formulation;
3.
Show, using LRCVF, how the reduced costs in the deterministic solution lead, under certain conditions, to the identification of the variables to retain/exclude in the stochastic solution;
4.
Show, by means of an extensive experimental campaign, the interest of the proposed LRCVF, and how the reduced-costs rules may yield a heuristic effective in terms of computational time reduction and accurate in the approximation of the optimal solution.
5.
Define new and more realistic standard benchmark for Stochastic Programming. It should be noted that our experimental campaign was conducted using the instances available in the SIPLIB library. In addition, numerical tests were also conducted using a set of larger stochastic programming problems that represent more realistic settings. These additional instances have been added to the SIPLIB library to complement the overall benchmark set available to the stochastic programming community.

The paper is organized as follows. The problem statement and literature review are presented in Sect. 2, while Sect. 3 defines the LRCVF measure. The experimental plan is described in Sect. 4, including how we use LRCVF and the problems and formulations considered in the experimentation. Numerical results are presented and analyzed in the same section. We sum up the highlights and general trends observed from this experiments in Sect. 5. Given the trends identified, we derive and algorithmic procedure based on LRCVF and we test it on a wide set of highly combinatorial instances taken from the literature. We conclude in Sect. 6.

2 Literature review and problem statement

We focus our brief literature review on the characterization of the solutions of deterministic versions of stochastic formulations in relation to the solutions to the latter. A main concern is the identification of structures that might migrate from the deterministic solution to the stochastic one.

As already mentioned, stochastic programs, in particular integer ones, are generally very difficult to address for realistically-sized instances. Bounding techniques are therefore quite useful in practice, and several approaches and bounds on the optimal objective-function value have thus been proposed.

The standard measure of the expected gain from solving a stochastic model rather than its deterministic counterpart is given by the Value of the Stochastic Solution (VSS) (Birge 1982; Maggioni and Wallace 2012; Escudero et al. 2007), computed by comparing the solution values of the stochastic and expected-value deterministic variants of the problem. A high VSS indicates that stochastic programming models are necessary despite the computational efforts involved.

Other approaches (e.g., Frauendorfer 1988; Hausch and Ziemba 1983; Huang et al. 1977a, b) generalize the Edmundson–Madansky inequality (Madansky 1960) for upper bounding and Jensen’s inequality (Jensen 1906) for lower bounding. Bounds have been proposed in Birge (1985) and Rosa and Takriti (1999) by aggregating constraints and variables in the extensive-form, while bounds based on the barycentric approximation scheme are investigated in Kuhn (2005). Bounds for convex multistage stochastic programs have been extensively elaborated in Kuhn (2008) by means of an integrated stage-aggregation and space-discretization. Other bounds for multistage linear programs have been analyzed in Maggioni et al. (2014a) by means of measures of information, measures of quality of the expected value solution, and rolling horizon measures. Maggioni and Pflug (2016) also provides bounds and approximations for multistage convex problems with concave risk functionals as objective. Maggioni et al. (2016) proposed a bounding approach, extending that of Birge (1982), Maggioni et al. (2014a) and Sandıkçı et al. (2013), which works for multistage stochastic mixed integer linear programs. The latter considers an alternative way of forming sub-problems and merging their results, with the significant advantage of dividing a given problem into independent sub-problems, which may take advantage of parallel-machine architectures. Worst-case analysis of approximated solutions in a stochastic setting has been performed in Bertazzi and Maggioni (2015) for a capacitated traveling salesmen location problem and in Bertazzi and Maggioni (2017) for a fixed charge transportation problem.

The main drawback of all these methodologies is that they measure, in different ways, the quality of the approximating solution in terms of objective-function values, but they do not provide any information on the structure of the stochastic solution. An open research question is then the following: can we learn from an approximating formulation solution, irrespective of its quality, measured in terms of objective function value?

It is well known that, in general, the expected-value solution can behave very badly in a stochastic environment. The structural differences between the two solutions within the context of particular combinatorial optimization problems have been studied in Lium et al. (2009), Thapalia et al. (2011, 2012a, b), Wang et al. (2016), observing both the general bad behavior of the expected value solution and hinting that some structures from the deterministic solution find their way into the stochastic one. An approach proposed in the literature to assess the value of a given solution is to approximate its relative gap to the optimum value of the stochastic problem. For example, a Monte Carlo sampling-based procedure was proposed in Mak et al. (1999) and Bayraksan and Morton (2006). Escudero et al. (2007) proposed to use the expected value solution in a multistage setting by solving subsets of scenarios and testing the obtained solution in a dynamic way.

However, from all these experiments, it is still generally not clear where the badness of the expected value solution comes from: is it because the wrong variables are fixed at non-zero levels or because they have been assigned wrong values?

An attempt to answer this question has been proposed in Maggioni and Wallace (2012). Starting from the solution of the expected value problem, it assesses whether (1) the deterministic model produced the right non-zero variables, but possibly was off on the values of the basic variables; and (2) the deterministic solution is upgradable to become good (if not optimal) in the stochastic setting. The resulting measures, called Loss Using the Skeleton Solution (LUSS) and the Loss of Upgrading the Deterministic Solution (LUDS) in Maggioni and Wallace (2012) (see Maggioni et al. 2014a, for the extension to the multistage setting), are obtained by restricting the values of the first stage variables based on the solution of the expected-value problem. LUSS is obtained by fixing at zero (or at the lower bound) the first stage variables which are at zero (or at the lower bound) in the expected value solution (i.e., for linear programs, the non basic variables), solving the stochastic program, and contrasting it to the solution of the original stochastic model. LUDS is measured by first solving a restricted stochastic model obtained by fixing the lower bound of all variables to their corresponding values in the expected value solution, and contrasting it to the solution of the original stochastic model. Unfortunately, this approach leads to suboptimal solutions, in particular when large combinatorial stochastic problems must be solved. We compare in our experimental-results section the performance of LRCVF, the new measure we propose, to that of LUSS and LUDS.

Notice also that, approaches were proposed in the literature on deterministic combinatorial optimization to fix to zero the largest part of the non basic variables in the continuous relaxation of the problem in order to reduce the computational time (Angelelli et al. 2010; Perboli et al. 2011). Then, to identify the appropriate core set of non basic variables to be included in the restricted problem, the search is performed starting from the ones with the smallest reduced cost (Perboli et al. 2011).

One may conclude from this brief review of previous work that a systematic way to identify the structure of the stochastic solution out of the expected-value deterministic one is still missing. The goal of this paper is to fill this gap providing a tool to analyze and compare the expected value solution with respect to the stochastic one. In the next section, we introduce the concepts and a procedural way to compute the Reduced Costs-based Variable Fixing (RCVF) and the Loss of Reduced Costs-based Variable Fixing (LRCVF). LRCVF will provide the means to investigate, even in the case of a large VSS, what can be inherited from the structure of the expected value solution in its stochastic counterpart, by taking into account the information on reduced costs associated to the variables at zero (or lower bound) in the expected value solution.

3 The value of variable fixing

We first define the standard notation used in this paper, and then move to introduce RCVF and LRCVF.

3.1 Notation and definitions

The following mathematical model represents a general formulation of a stochastic program in which a decision maker needs to determine x in order to minimize (expected) costs or outcomes (Kall and Wallace 1994; Birge and Louveaux 2011):

$$\begin{aligned} \min _{x\in X}E_{{\varvec{\xi }}}z\left( x,{\varvec{\xi }}\right) =\min _{x\in X} \left\{ f_1(x) + E_{{\varvec{\xi }}}\left[ h_2\left( x,{\varvec{\xi }}\right) \right] \right\} , \end{aligned}$$

(1)

where x is a first-stage decision vector restricted to the set $X\subseteq \mathbb {R}^{n}_+$, with $\mathbb {R}^{n}_+$ is the set of non negative real vectors of dimension n, and $E_{{\varvec{\xi }}}$ stands for the expectation with respect to a random vector ${\varvec{\xi }}$, defined on some probability space $(\varOmega ,\mathscr {A},p)$ with support $\varOmega $ and given probability distribution p on the $\sigma $-algebra $\mathscr {A}$. The function $h_2$ is the value function of another optimization problem defined as

$$\begin{aligned} h_2\left( x,\xi \right) =\min _{y\in Y(x,\xi )} f_2\left( y;x,\xi \right) , \end{aligned}$$

(2)

which is used to reflect the costs associated with adapting to information revealed through a realization $\xi $ of the random vector ${\varvec{\xi }}$. The term $E_{{\varvec{\xi }}}\left[ h_2\left( x,{\varvec{\xi }}\right) \right] $ in (1) is referred to as the recourse function. We make the assumption in this paper that functions $f_1$ and $f_2$ are linear in their unknowns. The solution $x^{*}$ obtained by solving problem (1), is called the here and now solution and

$$\begin{aligned} RP= E_{{\varvec{\xi }}}z(x^{*}, {\varvec{\xi }}), \end{aligned}$$

(3)

is the optimal value of the associated objective function.

A simpler approach is to consider the Expected Value Problem, where the decision maker replaces all random variables by their expected values and solves a deterministic program:

$$\begin{aligned} EV =\min _{x\in X} z(x,\bar{\xi }), \end{aligned}$$

(4)

where $\bar{\xi }=E({\varvec{\xi }})$. Let $\bar{x}(\bar{\xi })$ be an optimal solution to (4), called the Expected Value Solution and let EEV be the expected cost when using the solution $\bar{x}(\bar{\xi })$:

$$\begin{aligned} EEV=E_{{\varvec{\xi }}}\left( z\left( \bar{x}(\bar{\xi }),{\varvec{\xi }}\right) \right) . \end{aligned}$$

(5)

The Value of the Stochastic Solution is then defined as

$$\begin{aligned} VSS=EEV-RP, \end{aligned}$$

(6)

measuring the expected increase in value when solving the simpler deterministic model rather than its stochastic version. Relations and bounds on EV, EEV and RP can be found for instance in Birge (1982) and Birge and Louveaux (2011).

Let $\mathcal {J}=\{1,\dots ,J\}$ be the set of indices for which the components of the expected value solution $\bar{x}(\bar{\xi })$ are at zero or at their lower bound (non basic variables). Then let $\hat{x}$ be the solution of:

$$\begin{aligned}&\min \nolimits _{x \in X} \ E_{{\varvec{\xi }}}z\left( x,{\varvec{\xi }}\right) \nonumber \\&\quad \hbox {s.t.}\quad x_j=\bar{x}_{j}(\bar{\xi }),\ j\in \mathcal {J}. \end{aligned}$$

(7)

We then compute the Expected Skeleton Solution Value

$$\begin{aligned} ESSV=E_{{\varvec{\xi }}}\left( z\left( \hat{x}, {\varvec{\xi }}\right) \right) , \end{aligned}$$

(8)

and we compare it with RP by means of the Loss Using Skeleton Solution

$$\begin{aligned} LUSS=ESSV - RP. \end{aligned}$$

(9)

A LUSS close to zero means that the variables chosen by the expected value solution are the correct ones but their values may be off. We have:

$$\begin{aligned} RP\le ESSV\le EEV, \end{aligned}$$

(10)

and consequently,

$$\begin{aligned} VSS\ge LUSS \ge 0. \end{aligned}$$

(11)

Notice that the case $LUSS=0$ corresponds to the perfect skeleton solution in which the condition $x_j=\bar{x}_{j}(\bar{\xi }),\ j\in \mathcal {J}$, is satisfied by the stochastic solution $x^{*}$ even without being enforced by a constraint (i.e., $\hat{x}=x^{*}$); on the other hand, if there exists $j\in \mathcal {J}$ such that $x_j^{*}\ne \bar{x}_{j}(\bar{\xi })$ in any optimal stochastic solutions $x^{*}$, then $0<LUSS<VSS$. Finally, one observes $LUSS=VSS$, if the $\hat{x}=\bar{x}(\bar{\xi })$.

3.2 Defining the LRCVF

We now define RCVF and LRCVF, together with a procedural way to compute them.

Let $\mathscr {R}=\{r_1,\dots ,r_j,\dots ,r_J\}$ be the set of reduced costs, with respect to the recourse function, of the components $\bar{x}_{j}(\bar{\xi }),\ j\in \mathcal {J}$, of the expected-value solution $\bar{x}(\bar{\xi })$ at zero or at their lower bound (i.e., non basic variables). We recall that a reduced cost is the amount by which an objective function coefficient would have to improve (increase, for maximization problems and decrease for minimization ones) before it would be possible for the corresponding variable to assume a positive value in the optimal solution and become a basis variable. Since the reduced costs of all basis variables (also the ones at the related upper bounds) are zero, they will be not fixed. In the following, we make the assumption that in the case of a problem with first stage integer variables, we compute the reduced costs on the continuous relaxation.

Let $r^{max}=\max _{j\in \mathcal {J}} \{r_j: r_j\in \mathscr {R}\}$ and $r^{min}=\min _{j\in \mathcal {J}} \{r_j: r_j\in \mathscr {R}\}$ be respectively the maximum and the minimum of the reduced costs of the variables $\bar{x}_{j}(\bar{\xi }) ,\ j\in \mathcal {J}$. We divide the difference $r^{max} - r^{min}$ into N classes $\mathscr {R}_1,\dots ,\mathscr {R}_N$ of constant width $\frac{r^{max} - r^{min}}{N}$ such that the p-class is defined as follows

$$\begin{aligned} \mathscr {R}_p =\left\{ r_j\!:\! r^{min} + (p-1)\! \cdot \!\frac{(r^{max} - r^{min})}{N}\le r_j \le r^{min} + p \cdot \frac{(r^{max} - r^{min})}{N}\! \right\} , \end{aligned}$$

(12)

with $p=1,\dots ,N$. Let $\mathcal {J}_p$ be the set of indices associated to the variables $\bar{x}_{j}(\bar{\xi })$ with reduced costs $r_j\in \mathscr {R}_p$. Then let $\tilde{x}_p$ be the solution of

$$\begin{aligned}&\min \nolimits _{x \in X} \ E_{{\varvec{\xi }}}z\left( x,{\varvec{\xi }}\right) \nonumber \\&\quad \hbox {s.t.}\quad x_j=\bar{x}_{j}(\bar{\xi }),\ j\in \mathcal {J}_p,\dots ,\mathcal {J}_N, \end{aligned}$$

(13)

where we fix at zero or lower bounds only the variables with indices belonging to the last p classes $\mathcal {J}_p,\dots ,\mathcal {J}_N $, i.e., with the highest reduced costs.

We then compute the Reduced Costs-based Variables Fixing

$$\begin{aligned} RCVF(p,N)=E_{{\varvec{\xi }}}\left( z\left( \tilde{x}_p, {\varvec{\xi }}\right) \right) ,\quad p=1,\dots ,N, \end{aligned}$$

(14)

and we compare it with RP by means of the Loss of Reduced Costs-based Variable Fixing

$$\begin{aligned} LRCVF(p,N)=RCVF(p,N) - RP ,\quad p=1,\dots ,N. \end{aligned}$$

(15)

Notice that $RCVF(1,N)=ESSV$ and consequently $LRCVF(1,N)=LUSS$.

Furthermore, considering that both RCVF and LRCVF are defined on the basis of restricting only a subset of the N classes that partition the non basic variables according to their respective values, these bounds RCVF(p, N), $p=1,\dots ,N$ can be improved (as is clearly stated in the two propositions that will follow). Also, as will be described in the subsequent section of this paper, by varying the values of parameters p and N, a systematic search can be performed to both assess the quality of the obtained bounds and inferring what the actual restriction to be applied on the overall stochastic model should be. We now prove that the following inequalities hold true:

Proposition 3.1

For a fixed $N\in \mathbb {N}\backslash \left\{ 0,1\right\} $ (where $\mathbb {N}$ is the set of natural numbers),

$$\begin{aligned} LRCVF(p,N)\ge LRCVF(p+1,N) , \quad p=1,\dots ,N-1. \end{aligned}$$

(16)

Proof

Any feasible solution of problem RCVF(p, N) is also a solution of problem $RCVF(p+1,N)$, since the former is more restricted than the latter, and so, the relation (16) holds true. If $LRCVF(p,N) =\infty $, the inequality is automatically satisfied. $\square $

Proposition 3.2

For a given $N\in \mathbb {N}\backslash \left\{ 0\right\} $ and a fixed $p\in \mathbb {N}\backslash \left\{ 0\right\} $ such that $p=1,\dots ,N$,

$$\begin{aligned} LRCVF(p,N+1)\ge LRCVF(p,N). \end{aligned}$$

(17)

Proof

If $p=1$ then $LRCVF(p,N+1)=LRCVF(p,N)=LUSS$. Furthermore, any feasible solution of problem $RCVF(p,N+1)$ is also a solution of problem RCVF(p, N), since the former is more restricted than the latter, and so, the relation (17) holds true. If $LRCVF(p,N+1) =\infty $, the inequality is automatically satisfied. $\square $

The two previous properties can be generalized in the following corollary:

Corollary 3.1

For given $N_1,N_2\in \mathbb {N}\backslash \left\{ 0\right\} $ and $p_1,p_2\in \mathbb {N}\backslash \left\{ 0\right\} $, with $p_1=1,\dots ,N_1$, $p_2=1,\dots ,N_2$ and such that $\frac{p_1}{N_1}\le \frac{p_2}{N_2}$

$$\begin{aligned} LRCVF(p_1,N_1)\ge LRCVF(p_2,N_2). \end{aligned}$$

(18)

Proof

If $p_1=p_2=1$ then $ LRCVF(p_1,N_1)= LRCVF(p_2,N_2)=LUSS$. Furthermore, if $\frac{p_1}{N_1}\le \frac{p_2}{N_2}$ then the number of variables at zero with highest reduced cost to be fixed is respectively $\frac{N_1 -p_1}{N_1}|\mathscr {R}|\ge \frac{N_2 -p_2}{N_2}|\mathscr {R}|$. Consequently $RCVF(p_1,N_1)$ is more restricted than $RCVF(p_2,N_2)$, and the relation (18) holds true. $\square $

Notice that, variables are unbounded in the minimization problem setting considered (1). One might, however, consider problem settings where the variables have limited upper bounds. In these cases, non basic variables might be at zero (or at their lower bound values) with positive reduced cost or at their upper bounds with negative reduced costs (Ahuja et al. 1993). The variable fixing procedure we propose implicitly considers this case, as non basic variables at their upper bounds correspond, due to their negative reduced costs, to the sets $\mathscr {R}_p$ with the lowest reduced cost values. Therefore, such variables are unlikely to be fixed to 0 by the procedure.

LRCVF measures how much we lose in terms of solution quality when we consider the reduced costs-based variable fixing solution. But how can one use it in order to analyze and derive the structure of the stochastic solution? How should we choose the number of classes N and p? We answer these questions in the following sections, by presenting a procedure using LRCVF and applying it to a wide set of problems from the literature.

4 Experimental plan and results

This section describes the experimental plan and the instance sets considered. Our goal is to assess the validity of LRCVF for extracting information about the skeleton of the stochastic solution from the reduced costs of the expected-value solution (or its linear relaxation for integer formulations). We therefore performed an experimental analysis to explore the behavior of RCVF and LRCVF, compared to LUSS, while varying the values of p and N, according to three axes:

Computational effort What number of variables can we fix in order to drastically reduce the effort of the stochastic solution computation?
Feasibility What are the effects of fixing a subset of the variables from the expected-value solution with regards to the feasibility of the stochastic model?
Optimality How to use the LRCVF to find an optimal or near optimal stochastic solution?

We used instances corresponding to stochastic optimization models related to three real-case applications: a single-sink transportation problem, a power generation scheduling case, and a supply transportation problem. All numerical experiments were conducted on a 64-bit machine with 12 GB of RAM and a Intel Core i7-3520M CPU 2.90 GHz processor, using CPLEX 12.5 as MIP solver.

Section 4.1 presents our methodology for computing the two measures, including a proposed approach to set up the number of classes N and the class parameter p of LRCVF(p, N). Section 4.2 gives a short description of the test instances, while computational results are discussed in Sect. 4.3.

4.1 Computing RCVF and LRCVF

We computed VSS, LUSS and LRCVF(p, N) for each instance set. The optimal solutions of the stochastic formulations were either taken from the literature, when available, or computed, otherwise. We now briefly describe the procedure we developed, which can be applied and extended to any stochastic programming problem.

Recall that parameter N defines the number of classes, or sets, in which the non basic variables of $\bar{x}(\bar{\xi })$ are grouped, and that these sets provide a characterization of the variables with respect to their reduced costs. Thus, the higher the value of N, the closer the reduced-cost values of the variables included in each set. We therefore start by considering a rough characterization given by three classes, $N=3$, where the non basic variables of $\bar{x}(\bar{\xi })$ are included in a high, low or medium-range reduced-cost set. Finally, the size of the “supply transportation” problem allowed us to test other values of N ($N=3, 10, 50, 100$) and to analyze the sensitivity of the results when N increases.

For a given value N, our objective while generating sets $\mathscr {R}_1,\dots ,\mathscr {R}_N$ and the partition of the variables $\mathcal {J}_1, \dots , \mathcal {J}_N$, is to identify which non basic variables of $\bar{x}(\bar{\xi })$ should be fixed in the stochastic model to produce an optimal, or near-optimal, solution. To do so, the parameter p is first fixed to its upper limit (i.e., $p=N$) to compute LRCVF(N, N). Parameter p is then iteratively decreased by a value of one as long as the following condition is verified: $LRCVF(p,N) = LRCVF(p-1,N)$. In fact, from Property 3.1, we have that, for a fixed N, LRCVF(p, N) can only increase when p decreases.

4.2 Test instances

The instances used in this experimental phase are taken from the literature:

Power generation scheduling based on an economic scheduling model formulated in Williams (2013) and Garver (1962) as a deterministic mixed integer program and extended in Maggioni and Wallace (2012) as a stochastic optimization problem; Power generation scheduling involves the selection of generating units to be put into operation and the allocation of the stochastic power demand among the units over a set of time periods;
Supply transportation problem inspired by a real case of gypsum replenishment in Italy, provided by the primary Italian cement producer. The logistics system is organized as follows: 24 suppliers, each of them having several plants located all around Italy, are used to satisfy the demand for gypsum of 15 cement factories belonging to the same company; the demands for gypsum at the 15 cement factories are considered stochastic; See Maggioni et al. (2017) for more details.

In order to ensure the fluidity of the paper, the problem descriptions and the two-stage models as reported in the literature are included in Appendix A, while their corresponding numerical data are summarized in Appendix B. Notice that Appendices A and B, include also the description and numerical results of a Single-sink transportation problem, inspired by a real case of clinker replenishment, provided by the largest Italian cement producer located in Sicily (Maggioni et al. 2009).

4.3 Numerical results

We now present and analyze the results obtained by applying the LRCVF(p, N) measure to the problems described above. We followed the procedure described in Sect. 4.1, computing each time VSS, LUSS and LRCVF(p, N), $p=1,\dots ,N$. Detailed solutions of the different instances for the first three test problems may be found at: http://www.francescamaggioni.it/index.php?id=lrcvf.

4.3.1 The power generation problem

The power generation problem (PGP) (Appendix A.2) selects power units of type 1 or 2 to operate and allocates the power demand among the selected units. We run the model for 10 different instances with demand randomly generated in the interval $[d^{min},d^{max}]$, where $d^{min}=33$ and $d^{max}=687$ are respectively the minimum and maximum demand observed in the historical data. The number of scenarios is 20. Summary statistics of the adjusted problem derived for our test case are reported in Table 1. Columns 3–4–5–6 display the total number of variables and the total number of integer variables, respectively. Notice that presolve eliminates 68 constraints and 2 variables.

Table 1 Summary statistics for the PGP

Reduced cost-based variable fixing in two-stage stochastic programming

Abstract

Similar content being viewed by others

On a conservative partition refinement (CPR) method for a class of two-stage stochastic programming problems

Pareto Adaptive Robust Optimality via a Fourier–Motzkin Elimination lens

Decision-dependent probabilities in stochastic programs with recourse

1 Introduction

2 Literature review and problem statement

3 The value of variable fixing

3.1 Notation and definitions

3.2 Defining the LRCVF

Proposition 3.1

Proof

Proposition 3.2

Proof

Corollary 3.1

Proof

4 Experimental plan and results

4.1 Computing RCVF and LRCVF

4.2 Test instances

4.3 Numerical results

4.3.1 The power generation problem

4.3.2 The supply transportation problem

5 General trends and skeleton of a heuristic procedure

5.1 Toward an algorithmic procedure for stochastic programming

5.2 SIPLIB instances

5.3 SIPLIB computational results

5.4 General trends

6 Conclusions and future directions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendices

Appendix

Test problem description

1.1 A single-sink transportation problem

1.2 Power generation scheduling

1.3 Supply transportation problem

Numerical data

1.1 A single-sink transportation problem

1.2 Power generation scheduling

1.3 Supply transportation problem

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation