
Convex optimization - Homework 3

1. Second-order methods for the dual problem

1. Consider the LASSO (Least Absolute Shrinkage and Selection Operator) problem

minimize (1/2)||Xw − y||_2^2 + λ||w||_1

in the variable w ∈ R^d, with X ∈ R^{n×d}, y ∈ R^n and λ > 0 the regularization
parameter.

We can rewrite the problem as

minimize (1/2)||z − y||_2^2 + λ||w||_1
subject to z = Xw

The Lagrangian of this problem is

L(w, z, v) = (1/2)||z − y||_2^2 + λ||w||_1 + v^T (Xw − z).

As the Lagrangian is separable in w and z,

inf_{w,z} L(w, z, v) = inf_w (λ||w||_1 + v^T X w) + inf_z ((1/2)||z − y||_2^2 − v^T z)

For the first part:

(1/2)||z − y||_2^2 − v^T z = (1/2) z^T z − (y + v)^T z + (1/2) y^T y

argmin_z ((1/2) z^T z − (y + v)^T z + (1/2) y^T y) = y + v

inf_z ((1/2)||z − y||_2^2 − v^T z) = −(1/2)||v||_2^2 − y^T v

And for the second part:

inf_w (λ||w||_1 + (X^T v)^T w) = Σ_{i=1}^d inf_{w_i} (λ|w_i| + (X^T v)_i w_i)

inf_{w_i} (λ|w_i| + (X^T v)_i w_i) = 0 if |(X^T v)_i| ≤ λ, −∞ otherwise

inf_w (λ||w||_1 + (X^T v)^T w) = 0 if ||X^T v||_∞ ≤ λ, −∞ otherwise
We deduce that:

inf_{w,z} L(w, z, v) = −(1/2)||v||_2^2 − y^T v when ||X^T v||_∞ ≤ λ

Thus, the dual problem is

maximize −(1/2)||v||_2^2 − y^T v
subject to ||X^T v||_∞ ≤ λ

We can rewrite this problem as

minimize v^T Q v + p^T v
subject to Av ≤ b

in the variable v ∈ R^n, with Q = (1/2) I_n, p = y, A = (X, −X)^T and b = λ1_{2d},
where 1_{2d} ∈ R^{2d} is the all-ones vector.
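As a concrete illustration, here is a minimal sketch of this reformulation in Python (the function name and interface are illustrative, not part of the assignment):

```python
import numpy as np

def dual_qp_data(X, y, lam):
    """Build (Q, p, A, b) for the dual QP:  minimize v^T Q v + p^T v  s.t.  A v <= b."""
    n, d = X.shape
    Q = 0.5 * np.eye(n)          # Q = (1/2) I_n
    p = y                        # p = y
    A = np.vstack([X.T, -X.T])   # encodes ||X^T v||_inf <= lam as two-sided inequalities
    b = lam * np.ones(2 * d)     # b = lam * 1_{2d}
    return Q, p, A, b
```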

3. We can take v_0 = 0, which satisfies the initial condition: it is strictly feasible since ||X^T v_0||_∞ = 0 < λ.

We notice that the larger µ is, the more the objective function decreases at each
iteration; accordingly, the number of iterations decreases as µ grows.

Figure 1: The objective function vs the number of iterations for different µ

Figure 2: The gap in a semilog scale

Thus, an appropriate choice is µ = 50.
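For reference, here is a minimal sketch of a log-barrier method for this QP. The outer update t ← µt matches the role of µ above; the fixed centering budget and the purely feasibility-based backtracking are simplifications of this sketch, not details taken from the assignment:

```python
import numpy as np

def barrier_method(Q, p, A, b, v0, mu=50.0, t0=1.0, tol=1e-6):
    """Log-barrier method for  minimize v^T Q v + p^T v  s.t.  A v <= b."""
    m = A.shape[0]
    v, t = v0.astype(float).copy(), t0
    while m / t > tol:                          # m/t bounds the duality gap
        for _ in range(50):                     # damped Newton centering steps
            s = b - A @ v                       # slacks, kept strictly positive
            grad = t * (2 * Q @ v + p) + A.T @ (1.0 / s)
            hess = 2 * t * Q + A.T @ ((1.0 / s**2)[:, None] * A)
            dv = np.linalg.solve(hess, -grad)
            if -grad @ dv / 2 < 1e-9:           # Newton decrement small: centered
                break
            step = 1.0                          # backtrack until strictly feasible
            while np.min(b - A @ (v + step * dv)) <= 0:
                step *= 0.5
            v = v + step * dv
        t *= mu                                 # increase t by the factor mu
    return v
```

With the data from dual_qp_data, the initial point v_0 = np.zeros(n) is strictly feasible since b = λ1 > 0.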

2. First-order methods for the primal problem

1. The function f(w) = (1/2)||Xw − y||_2^2 + λ||w||_1 is not differentiable on the set
{w ∈ R^d | ∃i ∈ {1, ..., d}, w_i = 0}; everywhere else,

∇f(w) = X^T X w − X^T y + λ(sgn(w_i))_{1≤i≤d}.

By setting g(w) = X^T X w − X^T y + λ(sgn(w_i))_{1≤i≤d}, with sgn(0) = 0, we have

∀z ∈ R^d, f(z) − f(w) − g(w)^T (z − w) ≥ 0,

so we can take g as a subgradient of f.
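A minimal sketch of this objective and subgradient (the function names are illustrative):

```python
import numpy as np

def lasso_obj(w, X, y, lam):
    """f(w) = (1/2) ||Xw - y||_2^2 + lam * ||w||_1"""
    r = X @ w - y
    return 0.5 * (r @ r) + lam * np.abs(w).sum()

def lasso_subgrad(w, X, y, lam):
    """g(w) = X^T X w - X^T y + lam * sgn(w), with sgn(0) = 0 (np.sign(0) is 0)."""
    return X.T @ (X @ w - y) + lam * np.sign(w)
```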

Figure 3: The objective function vs the number of iterations for different strategies

Figure 4: The gap in a semilog scale for different strategies

• Strategy 1: Constant step size α_k = h

• Strategy 2: Constant step length α_k = h/||g_k||_2

• Strategy 3: Square summable but not summable α_k = h/k

• Strategy 4: Nonsummable diminishing α_k = h/√k
We can see that the fourth strategy is the fastest, while the first and the second are
really slow. However, if we continue to iterate long enough, we notice that the first
two strategies are more precise, even though they do not converge.
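A sketch of the subgradient method under these four rules, reusing the lasso_obj and lasso_subgrad helpers above (keeping the best iterate seen so far is standard practice here, since the objective is not monotone along subgradient steps):

```python
import numpy as np

def subgradient_method(X, y, lam, strategy=4, h=1e-3, iters=1000):
    """Subgradient method for the LASSO primal with the four step-size rules."""
    w = np.zeros(X.shape[1])
    w_best, f_best = w.copy(), lasso_obj(w, X, y, lam)
    for k in range(1, iters + 1):
        g = lasso_subgrad(w, X, y, lam)
        if strategy == 1:
            alpha = h                          # constant step size
        elif strategy == 2:
            alpha = h / np.linalg.norm(g)      # constant step length
        elif strategy == 3:
            alpha = h / k                      # square summable but not summable
        else:
            alpha = h / np.sqrt(k)             # nonsummable diminishing
        w = w - alpha * g
        f = lasso_obj(w, X, y, lam)
        if f < f_best:                         # track the best iterate seen so far
            w_best, f_best = w.copy(), f
    return w_best, f_best
```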
2. The function has the form f(w) = (1/2) w^T X^T X w − y^T X w + (1/2) y^T y + λ||w||_1.
If we write the function componentwise, we get:

f(w) = (1/2) Σ_{i=1}^d Σ_{j=1}^d (X^T X)_{ij} w_i w_j − Σ_{i=1}^d (X^T y)_i w_i + (1/2) y^T y + λ Σ_{i=1}^d |w_i|.

Since X^T X is symmetric, a more convenient form is:

f(w) = (1/2) Σ_{i=1}^d (X^T X)_{ii} w_i^2 + Σ_{i=1}^d Σ_{j=1}^{i−1} (X^T X)_{ij} w_i w_j − Σ_{i=1}^d (X^T y)_i w_i + (1/2) y^T y + λ Σ_{i=1}^d |w_i|.

Here, for each i ∈ {1, ..., d}, if we fix the w_j with j ≠ i, then f_i(w_i) = f(w) takes
the form

f_i(w_i) = α_i w_i^2 + β_i w_i + γ_i |w_i| + δ_i

with α_i = (1/2)(X^T X)_{ii}, β_i = Σ_{j≠i} (X^T X)_{ij} w_j − (X^T y)_i, γ_i = λ and
δ_i = (1/2) Σ_{j≠i} Σ_{k≠i} (X^T X)_{kj} w_k w_j − Σ_{j≠i} (X^T y)_j w_j + (1/2) y^T y + λ Σ_{j≠i} |w_j|.

We can say that α_i, γ_i > 0. Thus, the function f_i(w_i) = α_i w_i^2 + β_i w_i + γ_i |w_i| + δ_i
is convex and tends to +∞ at ±∞. Moreover:

f_i'(w_i) = 2α_i w_i + β_i + γ_i if w_i > 0
f_i'(w_i) = 2α_i w_i + β_i − γ_i if w_i < 0

(the function is not differentiable at 0).

We can thus deduce that:

argmin_{w_i} f_i(w_i) = 0 if |β_i| ≤ γ_i
argmin_{w_i} f_i(w_i) = (−β_i − γ_i)/(2α_i) if β_i < −γ_i
argmin_{w_i} f_i(w_i) = (−β_i + γ_i)/(2α_i) if β_i > γ_i

We can thus set at each iteration w_i^{(k+1)} = argmin_{w_i} f_i(w_i) and w_j^{(k+1)} = w_j^{(k)}
for j ≠ i, cycling over i.
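A minimal sketch of this cyclic coordinate descent (the function name is illustrative; it assumes the columns of X are nonzero, so that α_i > 0):

```python
import numpy as np

def lasso_coordinate_descent(X, y, lam, n_cycles=250):
    """Cyclic coordinate descent using the closed-form coordinate minimizer above."""
    d = X.shape[1]
    G = X.T @ X                  # Gram matrix X^T X
    c = X.T @ y                  # X^T y
    w = np.zeros(d)
    for _ in range(n_cycles):
        for i in range(d):
            alpha = 0.5 * G[i, i]
            beta = G[i] @ w - G[i, i] * w[i] - c[i]   # sum_{j!=i} (X^T X)_{ij} w_j - (X^T y)_i
            gamma = lam
            if abs(beta) <= gamma:
                w[i] = 0.0
            elif beta < -gamma:
                w[i] = (-beta - gamma) / (2 * alpha)
            else:
                w[i] = (-beta + gamma) / (2 * alpha)
    return w
```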

We notice that the coordinate descent method is faster than the sub-gradient
method. Indeed, while the sub-gradient method does not converge but oscillates
around p*, the coordinate descent method converges in a certain number of steps
(approximately 250 to reach a gap ε = 10^−3).

Figure 5: The objective function vs the number of iterations for the coordinate descent

Figure 6: The gap in a semilog scale for the coordinate descent

Figure 7: Comparison of the gap for the 2 methods
