Abstract
In this paper we consider the use of extended formulations in LP-based algorithms for mixed integer conic quadratic programming (MICQP). Extended formulations have been used by Vielma et al. (INFORMS J Comput 20: 438–450, 2008) and Hijazi et al. (Comput Optim Appl 52: 537–558, 2012) to construct algorithms for MICQP that can provide a significant computational advantage. The first approach is based on an extended or lifted polyhedral relaxation of the Lorentz cone by Ben-Tal and Nemirovski (Math Oper Res 26(2): 193–205 2001) that is extremely economical, but whose approximation quality cannot be iteratively improved. The second is based on a lifted polyhedral relaxation of the euclidean ball that can be constructed using techniques introduced by Tawarmalani and Sahinidis (Math Programm 103(2): 225–249, 2005). This relaxation is less economical, but its approximation quality can be iteratively improved. Unfortunately, while the approach of Vielma, Ahmed and Nemhauser is applicable for general MICQP problems, the approach of Hijazi, Bonami and Ouorou can only be used for MICQP problems with convex quadratic constraints. In this paper we show how a homogenization procedure can be combined with the technique by Tawarmalani and Sahinidis to adapt the extended formulation used by Hijazi, Bonami and Ouorou to a class of conic mixed integer programming problems that include general MICQP problems. We then compare the effectiveness of this new extended formulation against traditional and extended formulation-based algorithms for MICQP. We find that this new formulation can be used to improve various LP-based algorithms. In particular, the formulation provides an easy-to-implement procedure that, in our benchmarks, significantly improved the performance of commercial MICQP solvers.
Similar content being viewed by others
Notes
Or more precisely, \({\mathcal {N}}:=\left\{ N_{i,k}\,:\, i\in \{1,\ldots ,\lfloor r_k/2 \rfloor \},\,k\in \left\{ 0,\ldots ,K-1\right\} \right\} \) where \(N_{i,k}={\widehat{N}}^2_s\) for each i, k.
After removing redundant variables through the equalities of the formulation.
For instance, one can check that \(\left\{ y\in {\mathbb {R}}^2\,:\, {{\mathrm{Proj}}}_{\left( 1,y\right) }\left( {\widehat{N}}^2_s\right) \right\} \) is a regular \(2^s\) polygon.
Note that the complete description of each rotated two-dimensional conic quadratic constraints is \(y_j^2 \le w_j y_0\) and \(0\le y_0\).
References
Julia interface for the CPLEX optimization software. https://github.com/JuliaOpt/CPLEX.jl
JuMP Modeling language for Mathematical Programming. https://github.com/JuliaOpt/JuMP.jl
Abhishek, K., Leyffer, S., Linderoth, J.: FilMINT: an outer approximation-based solver for convex mixed-integer nonlinear programs. INFORMS J. Comput. 22(4), 555–567 (2010)
Ball, K.M.: An elementary introduction to modern convex geometry. In: Levy, S. (ed.) Flavors of Geometry, Mathematical Sciences Research Institute Publications, vol. 31, pp. 1–58. Cambridge University Press, Cambridge (1997)
Ben-Tal, A., Nemirovski, A.: Lectures on Modern Convex Optimization: Analysis, Algorithms, And Engineering Applications, vol. 2. SIAM, Philadelphia (2001)
Ben-Tal, A., Nemirovski, A.: On polyhedral approximations of the second-order cone. Math. Oper. Res. 26(2), 193–205 (2001)
Bertsimas, D., Shioda, R.: Algorithm for cardinality-constrained quadratic optimization. Comput. Optim. Appl. 43(1), 1–22 (2009)
Bienstock, D.: Computational study of a family of mixed-integer quadratic programming problems. Math. Program. 74(2), 121–140 (1996)
Bonami, P., Biegler, L.T., Conn, A.R., Cornuéjols, G., Grossmann, I.E., Laird, C.D., Lee, J., Lodi, A., Margot, F., Sawaya, N., et al.: An algorithmic framework for convex mixed integer nonlinear programs. Discrete Optim. 5(2), 186–204 (2008)
Bonami, P., Kilinç, M., Linderoth, J.: Algorithms and software for convex mixed integer nonlinear programs. In: Mixed Integer Nonlinear Programming, pp. 1–39. Springer, Berlin (2012)
Borchers, B., Mitchell, J.E.: An improved branch and bound algorithm for mixed integer nonlinear programs. Comput. Oper. Res. 21(4), 359–367 (1994)
Ceria, S., Soares, J.: Convex programming for disjunctive convex optimization. Math. Programm. 86, 595–614 (1999)
Ceria, S., Stubbs, R.A.: Incorporating estimation errors into portfolio selection: robust portfolio construction. J. Asset Manag. 7(2), 109–127 (2006)
Chang, T.J., Meade, N., Beasley, J.E., Sharaiha, Y.M.: Heuristics for cardinality constrained portfolio optimisation. Comput. Oper. Res. 27, 1271–1302 (2000)
Dolan, E.D., Moré, J.J.: Benchmarking optimization software with performance profiles. Math. Program. 91(2), 201–213 (2002)
Dong, H., Linderoth, J.: On valid inequalities for quadratic programming with continuous variables and binary indicators. In: Goemans, M.X., Correa J.R. (eds.) Integer Programming and Combinatorial Optimization—16th International Conference, IPCO 2013, Valparaíso, Chile, March 18–20, 2013. Proceedings, Lecture Notes in Computer Science, vol. 7801, pp. 169–180. Springer, Berlin (2013)
Duran, M.A., Grossmann, I.E.: An outer-approximation algorithm for a class of mixed-integer nonlinear programs. Math. Program. 36(3), 307–339 (1986)
Fletcher, R., Leyffer, S.: Solving mixed integer nonlinear programs by outer approximation. Math. Program. 66(3), 327–349 (1994)
Frangioni, A., Gentile, C.: Perspective cuts for a class of convex 0–1 mixed integer programs. Math. Programm. 106, 225–236 (2006)
Frangioni, A., Gentile, C.: SDP diagonalizations and perspective cuts for a class of nonseparable MIQP. Oper. Res. Lett. 35, 181–185 (2007)
Frangioni, A., Gentile, C.: A computational comparison of reformulations of the perspective relaxation: Socp vs. cutting planes. Oper. Res. Lett. 37, 206–210 (2009)
Frangioni, A., Gentile, C., Grande, E., Pacifici, A.: Projected perspective reformulations with applications in design problems. Oper. Res. 59, 1225–1232 (2011)
Geoffrion, A.: Generalized benders decomposition. J. Optim. Theory Appl. 10(4), 237–260 (1972)
Glineur, F.: Computational experiments with a linear approximation of second order cone optimization. Image Technical Report 0001, Service de Mathématique et de Recherche Opérationnelle, Faculté Polytechnique de Mons, Mons, Belgium (2000)
Grossmann, I.E., Lee, S.: Generalized convex disjunctive programming: nonlinear convex hull relaxation. Comput. Optim. Appl. 26, 83–100 (2003)
Günlük, O., Linderoth, J.: Perspective relaxation of mixed integer nonlinear programs with indicator variables. In: Lodi, A., Panconesi, A., Rinaldi G. (eds.) Integer Programming and Combinatorial Optimization, 13th International Conference, IPCO 2008, Bertinoro, Italy, May 26–28, 2008, Proceedings, Lecture Notes in Computer Science, vol. 5035, pp. 1–16. Springer, Berlin (2008)
Günlük, O., Linderoth, J.: Perspective reformulations of mixed integer nonlinear programs with indicator variables. Math. Program. 124, 183–205 (2010)
Günlük, O., Linderoth, J.: Perspective reformulation and applications. In: Lee, J., Leyffer, S. (eds.) Mixed Integer Nonlinear Programming, The IMA Volumes in Mathematics and its Applications, vol. 154, pp. 61–89. Springer, New York (2012)
Gupta, O.K., Ravindran, A.: Branch and bound experiments in convex nonlinear integer programming. Manag. Sci. 31(12), 1533–1546 (1985)
Gurobi Optimization: The Gurobi Optimizer. http://www.gurobi.com
Hijazi, H., Bonami, P., Cornuéjols, G., Ouorou, A.: Mixed-integer nonlinear programs featuring on/off constraints. Comput. Optim. Appl. 52, 537–558 (2012)
Hijazi, H., Bonami, P., Ouorou, A.: An outer-inner approximation for separable mixed-integer nonlinear programs. INFORMS J. Comput. 26(1), 31–44 (2013)
IBM Corp.: User’s Manual for CPLEX. IBM Corp (2014)
IBM ILOG: CPLEX High-performance mathematical programming engine. http://www.ibm.com/software/integration/optimization/cplex/
Jeon, H., Linderoth, J., Miller, A.: Quadratic cone cutting surfaces for quadratic programs with on-off constraints. Optim. Online (2015). http://www.optimization-online.org/DB_HTML/2015/01/4746.html
Leyffer, S.: Integrating SQP and branch-and-bound for mixed integer nonlinear programming. Comput. Optim. Appl. 18, 295–309 (2001)
Lobo, M.S., Fazel, M., Boyd, S.: Portfolio optimization with linear and fixed transaction costs. Ann. Oper. Res. 152(1), 341–365 (2007)
Lobo, M.S., Vandenberghe, L., Boyd, S.: Application of second-order cone programming. Linear Algebra Appl. 284, 193–228 (1998)
Lubin, M., Dunning, I.: Computing in operations research using julia. arXiv preprint (2013). arXiv:1312.1431
Maringer, D., Kellerer, H.: Optimization of cardinality constrained portfolios with a hybrid local search algorithm. OR Spectrum 25(4), 481–495 (2003)
Quesada, I., Grossmann, I.: An LP/NLP based branch and bound algorithm for convex MINLP optimization problems. Comput. Chem. Eng. 16, 937–947 (1992)
Stubbs, R.A.: Branch-and-cut methods for mixed 0–1 convex programming. Ph.D. thesis (1996)
Stubbs, R.A., Mehrotra, S.: A branch-and-cut method for 0–1 mixed convex programming. Math. Program. 86(3), 515–532 (1999)
Tawarmalani, M., Sahinidis, N.V.: A polyhedral branch-and-cut approach to global optimization. Math. Programm. 103(2), 225–249 (2005)
Vielma, J.P., Ahmed, S., Nemhauser, G.: A lifted linear programming branch-and-bound algorithm for mixed integer conic quadratic programs. INFORMS J. Comput. 20, 438–450 (2008)
Westerlund, T., Pettersson, F.: An extended cutting plane method for solving convex minlp problems. Comput. Chem. Eng. 19, S131–S136 (1995)
Westerlund, T., Pettersson, F., Grossmann, I.: Optimization of pump configurations as a minlp problem. Comput. Chem. Eng. 18(9), 845–858 (1994)
Wolfram Research Inc.: Mathematica, Version 10.0. Wolfram Research, Inc., Champaign, Illinois (2014)
Acknowledgements
We thank the review team for their thoughtful and constructive comments, which improved the exposition of the paper. Support for Juan Pablo Vielma was provided by the National Science Foundation under grant CMMI-1351619, support for Miles Lubin was provided by the DOE Computational Science Graduate Fellowship under grant DE-FG02-97ER25308 and support for Joey Huchette was provided by the National Science Foundation Graduate Research Fellowship under grant 1122374.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Proof of Proposition 4
To prove Proposition 4 we begin with the following lemma, which gives a characterization of the homogenization of a convex set described by nonlinear constraints.
Lemma 1
Let \(f:{\mathbb {R}}^d\rightarrow {\mathbb {R}}\) be a closed convex function such that \(\lim _{\left||x\right||_2\rightarrow \infty } \frac{f(x)}{\left||x\right||_2}=\infty \) so that the closure of the perspective function of f is given by
If \({\mathbf {C}}:=\left\{ y\in {\mathbb {R}}^d\,:\, f(y)\le 1\right\} \), then
Proof
Let \({\mathbf {D}}:=\left\{ \left( y_0,y\right) \in {\mathbb {R}}^{d+1}\,:\, \left( cl{\tilde{f}}\right) (y_0,y)\le y_0\right\} \). We have \(\left\{ 1\right\} \times {\mathbf {C}}\subseteq {\mathbf {D}}\) and \({\mathbf {D}}\) is a convex cone because \(\left( cl{\tilde{f}}\right) (y_0,y)\) is a homogeneous function, so \({{\mathrm{cone}}}\left( \left\{ 1\right\} \times {\mathbf {C}}\right) \subseteq {\mathbf {D}}\).
For the reverse inclusion, let \(\left( y_0,y\right) \in {\mathbf {D}}\). If \(y_0=0\) then \(y=0\) and hence \(\left( y_0,y\right) \in {{\mathrm{cone}}}\left( \left\{ 1\right\} \times {\mathbf {C}}\right) \). If \(y_0>0\) then \(\left( y_0,y\right) /y_0\in \left\{ 1\right\} \times {\mathbf {C}}\) and hence \(\left( y_0,y\right) \in {{\mathrm{cone}}}\left( \left\{ 1\right\} \times {\mathbf {C}}\right) \). \(\square \)
The final ingredient for the proof of Proposition 4 is the following lemma that shows how to translate the polyhedral approximation for univariate functions to their homogenization.
Lemma 2
Let \(f:{\mathbb {R}}\rightarrow {\mathbb {R}}\) be a closed convex differentiable function such that \(\lim _{x\rightarrow \infty } \frac{f(x)}{\left| x\right| }=\infty \). Then
Furthermore, \(\left( y_0,y,w\right) \in {{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \) if and only if either \(y_0=y=0\le w\) or if \(y_0> 0\) and
for \(\gamma \left( y_0,y\right) \) defined in Corollary 2.
Proof
First note that \({\mathbf {D}}:=\big \{\left( y_0,y,w\right) \in {\mathbb {R}}^3\,:\, \left( f\left( \gamma \right) -\gamma f'\left( \gamma \right) \right) y_0 + f'\left( \gamma \right) y \le w \quad \forall \gamma \in {\mathbb {R}}\big \}\) is a closed convex cone, \({{\mathrm{epi}}}\left( cl{\tilde{f}}\right) ={\overline{{{\mathrm{cone}}}}}\left( \left\{ 1\right\} \times {{\mathrm{epi}}}\left( f\right) \right) \) and
Then \(\left\{ 1\right\} \times {{\mathrm{epi}}}\left( f\right) \subseteq {\mathbf {D}}\) implies \({{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \subseteq {\mathbf {D}}\).
For the reverse inclusion, let \(\left( y_0,y,w\right) \in {\mathbf {D}}\). We first show that \(y_0\ge 0\), by assuming \(y_0<0\) and reaching a contradiction. For this we consider cases \(y\ne 0\) and \(y=0\) separately. For both cases note that \(\lim _{x\rightarrow \infty } \frac{f(x)}{\left| x\right| }=\infty \) implies that \(\lim _{x\rightarrow +\infty } f'(x)=+\infty \) and \(\lim _{x\rightarrow -\infty } f'(x)=-\infty \).
For case \(y\ne 0\), note that if \(y_0<0\), then \(y_0 f(0)\le y_0\left( f\left( \gamma \right) -\gamma f'\left( \gamma \right) \right) \) for all \(\gamma \in {\mathbb {R}}\) by convexity of f. Hence, if \(\left( y_0,y,w\right) \in {\mathbf {D}}\) and \(y_0<0\) then
for all \(\gamma \in {\mathbb {R}}\). Taking limit for \(\gamma \rightarrow \infty \) when \(y>0\) and for \(\gamma \rightarrow -\infty \) when \(y<0\) in (25) we arrive at a contradiction with \(w<\infty \).
For case \(y=0\), note that by convexity of f and the mean value theorem we have that \(f\left( \gamma \right) -\gamma f'\left( \gamma \right) \le f\left( \gamma _0\right) - f'\left( \gamma \right) \gamma _0\) for any \(0<\gamma _0<\gamma \). Then, \(\lim _{\gamma \rightarrow \infty } f\left( \gamma \right) -\gamma f'\left( \gamma \right) =-\infty \). Now, if \(\left( y_0,y,w\right) \in {\mathbf {D}}\) and \(y=0\) then
for all \(\gamma \in {\mathbb {R}}\). Then, if \(y_0<0\), by taking limit for \(\gamma \rightarrow \infty \) in (26) we again arrive at a contradiction with \(w<\infty \).
We now show that any \(\left( y_0,y,w\right) \in {\mathbf {D}}\) with \(y_0\ge 0\) also belongs to \({{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \). We divide the proof into cases \(y_0=0\) and \(y_0>0\).
For case, \(y_0=0\), note that \(\left( y_0,y,w\right) \in {\mathbf {D}}\) implies \(f'\left( \gamma \right) y \le w\) for all \(\gamma \in {\mathbb {R}}\). Taking limit for \(\gamma \rightarrow \infty \) when \(y>0 \) and for \(\gamma \rightarrow -\infty \) when \(y<0\), we conclude that \(y=0\) and \(w\ge 0\). Hence, \(\left( y_0,y,w\right) \in {{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \).
For case, \(y_0>0\) we can check that \(\left( y,w\right) /y_0\in {{\mathrm{epi}}}\left( f\right) \) and then \(\left( y_0,y,w\right) /y_0\in \left\{ 1\right\} \times {{\mathrm{epi}}}\left( f\right) \). Hence, \(\left( y_0,y,w\right) \in {{\mathrm{cone}}}\left( \left\{ 1\right\} \times {{\mathrm{epi}}}\left( f\right) \right) \subseteq {{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \).
For the final statement, it suffices to prove that \(\left( y_0,y,w\right) \in {{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \) if \(y_0>0\) and \(\left( y_0,y,w\right) \) satisfies (24). For this note that, if the last two conditions hold, then \(\left( y,w\right) /y_0\in {{\mathrm{epi}}}\left( f\right) \), which we have already shown implies \(\left( y_0,y,w\right) \in {{\mathrm{epi}}}\left( cl{\tilde{f}}\right) \).
\(\square \)
Combining these lemmas we obtain the following straightforward proof of Proposition 4.
Proof
(of Proposition 4) Let \(f(y)=\sum _{j=1}^d f_j(y_j)\). Then (11) follows by noting that by Lemma 1 we have \({{\mathrm{cone}}}\left( \left\{ 1\right\} \times {\mathbf {C}}\right) =\left\{ \left( y_0,y\right) \in {\mathbb {R}}^{d+1}\,:\, \left( cl{\tilde{f}}\right) (y_0,y)\le y_0\right\} \) and by the definition of perspective function we have \(\left( cl{\tilde{f}}\right) (y_0,y)=\sum _{j=1}^d \left( cl\tilde{f_j}\right) (y_0,y_j)\). All other statements are directly from Lemma 2 and from Proposition 3 by fixing \(y_0=1\). \(\square \)
Appendix B: Additional graphs and tables
1.1 Appendix B.1: Summary statistics for solve times
Tables 1, 2, 3, 4, 5, 6, 7 and 8 show some summary statistics for the solve times for the different algorithms and instances. These statistics include the minimum, average and maximum solve times, together with their standard deviation. The tables also include the number of each solver was the fastest (wins) and the number of times a solver has a solution time that was within 1 and 10 % of the fastest solver (1 and 10 % win).
1.2 B. 2 Summary statistics for solution quality
Tables 9, 10, 11, 12, 13, 14, 15 and 16 present summary statistics for the maximum violation of constraints (1c) by the optimal solution returned by the solver. This value is calculated as
where \({\bar{x}}\) is the optimal solution.
1.3 B.3 Additional graphs
Figure 6 compares the performance of the three dynamic lifted polyhedral relaxations in their nonlinear reformulation version (CPLEXSepLP, GurobiSepLP, CPLEXTowerLP, GurobiTowerLP, CPLEXTowerSepLP, GurobiTowerSepLP). We also include as a reference the NLP-based algorithms CPLEXCP and GurobiCP (Tables 14, 15, 16).
LiftedLP Our final set of charts compare the branch- and cut-based LiftedLP algorithms (LiftedLP, CPLEXSepLazy and GurobiSepLazy) with the separable dynamic lifted polyhedral relaxation in its nonlinear reformulation version (CPLEXSepLP and GurobiSepLP). To minimize the complexity of the graphics we divide the comparison into two parts. In Fig. 7 we compare the branch-based algorithm with the separable reformulations and in Fig. 8 we compare the cut-based algorithms with the separable reformulations.
From Fig. 7 we see that the separable reformulations provide an advantage over the branch-based LiftedLP algorithm for all instances. However, this advantage becomes smaller as the problems sizes increase and the LiftedLP algorithm can provide an advantage in some instances (e.g Table 7 shows the algorithms is the fastest in 10 % of the robust instances for \(n=200\)). Figure 8 shows a similar behavior for the cut-based algorithms.
Rights and permissions
About this article
Cite this article
Vielma, J.P., Dunning, I., Huchette, J. et al. Extended formulations in mixed integer conic quadratic programming. Math. Prog. Comp. 9, 369–418 (2017). https://doi.org/10.1007/s12532-016-0113-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12532-016-0113-y