A second-order cone programming problem has the form
subject to the constraints
f, x, b,
beq, lb, and ub are
vectors, and A and Aeq are matrices. For each
i, the matrix
Asc(i), the vectors
bsc(i) and
dsc(i), and the
scalar γ(i) are in a second-order cone
constraint that you create using secondordercone
.
In other words, the problem has a linear objective function and linear constraints, as well as a set of second-order cone constraints of the form .
coneprog
AlgorithmThe coneprog
solver uses the algorithm described in Andersen, Roos, and
Terlaky [1]. This method is
an interior-point algorithm similar to the Interior-Point linprog Algorithm.
The algorithm starts by placing the problem in standard form. The algorithm adds nonnegative slack variables so that the problem has the form
subject to the constraints
The solver expands the sizes of the linear coefficient vector f and linear constraint matrix A to account for the slack variables.
The region K is the cross product of Lorentz cones Equation 1 and the nonnegative orthant. To convert each convex cone
to a Lorentz cone Equation 1, create a column vector of variables t1, t2, …, tn+1:
Here, the number of variables n for each cone i is the number of rows in Asc(i). By its definition, the variable vector t satisfies the inequality
(1) |
Equation 1 is the definition of a Lorentz cone in (n+1) variables. The variables t appear in the problem in place of the variables x in the convex region K.
Internally, the algorithm also uses a rotated Lorentz cone in the reformulation of cone constraints, but this topic does not address that case. For details, see Andersen, Roos, and Terlaky [1].
When adding slack variables, the algorithm negates variables, as needed, and adds appropriate constants so that:
Variables with only one bound have a lower bound of zero.
Variables with two bounds have a lower bound of zero and, using a slack variable, have no upper bound.
Variables without bounds are placed in a Lorentz cone with a slack variable as the constrained variable. This slack variable is not part of any other expression, objective or constraint.
The dual cone is
The dual problem is
such that
for some
A dual optimal solution is a point (y,s) that satisfies the dual constraints and maximizes the dual objective.
To handle potentially infeasible or unbounded problems, the algorithm adds two more variables τ and κ and formulates the problem as homogeneous (equal to zero) and self-dual.
(2) |
along with the constraints
(3) |
Here, is the cone K adjoined with the nonnegative real line, which is the space for (x;τ). Similarly is the cone adjoined with the nonnegative real line, which is the space for (s;κ). In this formulation, the following lemma shows that τ is the scaling for feasible solutions, and κ is the indicator of an infeasible problem.
Lemma ([1] Lemma 2.1)
Let (x, τ, y, s, κ) be a feasible solution of Equation 2 along with the constraints in Equation 3.
xTs + τκ = 0.
If τ > 0, then (x, y, s)/τ is a primal-dual optimal solution of the standard form second-order cone problem.
If κ > 0, then at least one of these strict inequalities holds:
bTy > 0
fTx < 0.
If the first inequality holds, then the standard form, primal second-order cone problem is infeasible. If the second inequality holds, then the standard form, dual second-order cone problem is infeasible.
In summary, for feasible problems, the variable τ scales the solution between the original standard form problem and the homogeneous self-dual problem. For infeasible problems, the final iterate (x, y, s, τ, κ) provides a certificate of infeasibility for the original standard form problem.
The start point for the iterations is the feasible point:
x = 1 for each nonnegative variable, 1 for the first variable in each Lorentz cone, and 0 otherwise.
y = 0.
s = (1,0,…,0) for each cone, 1 for each nonnegative variable.
τ = 1.
κ = 1.
The algorithm attempts to follow the central path, which is the parameterized solution to the following equations for γ decreasing from 1 toward 0.
(4) |
Each variable with a 0 subscript indicates the start point of the variable.
The variables X and S are arrow head matrices formed from the x and s vectors, respectively. For a vector x = [x1,x2,…,xn], the arrow head matrix X has the definition
By its definition, X is symmetric.
The variable e is the vector with a 1 in each cone coordinate corresponding to the x1 Lorentz cone coordinate.
The variable μ0 has the definition
where k is the number of nonzero elements in x0.
The central path begins at the start point and ends at an optimal solution to the homogeneous self-dual problem.
Andersen, Roos, and Terlaky [1] show in Lemma 3.1 that the complementarity condition xTs = 0, where x and s are in a product of Lorentz cones L, is equivalent to the condition
for every cone i. Here Xi = mat(xi), xi is the variable associated with the Lorentz cone i, Si = mat(si), and ei is the unit vector [1,0,0,…,0] of the appropriate dimension. This discussion shows that the central path satisfies the complementarity condition at its end point.
To obtain points near the central path as the parameter γ decreases from 1 toward 0, the algorithm uses Newton's method. The variables to find are labeled (x, τ, y, s, κ). Let dx represent the search direction for the x variables, and so on. Then the Newton step solves the following linear system, derived from Equation 4.
The algorithm obtains its next point by taking a step in the d direction.
for some step .
For both numerical stability and accelerated convergence, the algorithm scales the step according to a suggestion in Nesterov and Todd [5]. Also, the algorithm corrects the step according to a variant of Mehrotra's predictor-corrector [4]. (For further details, see Andersen, Roos, and Terlaky [1].)
At each iteration k, the algorithm computes three relative convergence measures:
Primal infeasibility
Dual infeasibility
Gap infeasibility
You can view these three statistics at the command line by specifying iterative display.
options = optimoptions('coneprog','Display','iter');
All three should approach zero when the problem is feasible and the solver converges. For a feasible problem, the variable κk approaches zero, and the variable τk approaches a positive constant.
One stopping condition is somewhat related to the gap infeasibility. The stopping condition is when the following optimality measure decreases below the optimality tolerance.
This statistic measures the precision of the objective value.
The solver also stops and declares the problem to be infeasible under the
following conditions. The three relative infeasibility measures are less than
c = ConstraintTolerance
, and
If bTyk > 0, then the solver declares that the primal problem is infeasible. If fTxk < 0, then the solver declares that the dual problem is infeasible.
The algorithm also stops when
and
In this case, coneprog
reports that the problem is
numerically unstable (exit flag -10
).
The remaining stopping condition occurs when at least one infeasibility
measure is greater than ConstraintTolerance
and the computed
step size is too small. In this case, coneprog
reports that
the search direction became too small and no further progress could be made
(exit flag -7
).
[1] Andersen, E. D., C. Roos, and T. Terlaky. On implementing a primal-dual interior-point method for conic quadratic optimization. Math. Program., Ser. B 95, pp. 249–277 (2003). https://doi.org/10.1007/s10107-002-0349-3
[2] Ben-Tal, Aharon, and Arkadi Nemirovski. Convex Optimization in Engineering: Modeling, Analysis, Algorithms. (1998). Available at http://wwwtmp.st.ewi.tudelft.nl/roos/courses/wi485/tud00r.pdf.
[3] Luo, Zhi-Quan, Jos F. Sturm, and Shuzhong Zhang. Duality and Self-Duality for Conic Convex Programming. (1996). Available at https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.48.6432
[4] Mehrotra, Sanjay. “On the Implementation of a Primal-Dual Interior Point Method.” SIAM Journal on Optimization 2, no. 4 (November 1992): 575–601. https://doi.org/10.1137/0802028.
[5] Nesterov, Yu. E., and M. J. Todd. “Self-Scaled Barriers and Interior-Point Methods for Convex Programming.” Mathematics of Operations Research 22, no. 1 (February 1997): 1–42. https://doi.org/10.1287/moor.22.1.1.