SlideShare uma empresa Scribd logo
1 de 34
Baixar para ler offline
MTH702 Optimization
Nonlinear Optimization
Derivative Free Optimization
Zero-Order Oracle
Motivation
we saw that for general function, finding  solution is extremely hard
Q: what is we restrict to convex problems only?
2/24
Motivation
we saw that for general function, finding  solution is extremely hard
Q: what is we restrict to convex problems only?
we analyzed uniform grid method
Q: what could we do better if minx f(x) would be a convex problem?
2/24
Brainstorming...
Assumption
We assume that f : Rd → R is
convex and L Lipschitz continuous,
|f(x) − f(y)| ≤ Lkx − yk ∀x, y
source: towardsdatascience.com
Q: How would you define optimization algorithm
that can use only zero-order oracle?
3/24
Brainstorming...
source: towardsdatascience.com
4/24
Direct-search methods
Directional direct-search methods
1: pick initial solution x0 ∈ Rd
2: choose step-size η ∈ R+
3: pick set of directions D ⊂ Rd (e.g., D = {±ei, i ∈ [d] := {1, 2, . . . , d}})
4: for k = 0, . . . , do
5: if ∃dj ∈ D such that f(xk + ηdj) produces sufficient improvement then
6: xk+1 := xk + ηdj
7: (optional) increase η = ηγu with γu  1
8: else
9: adjust η = ηγd with γd ∈ (0, 1)
10: xk+1 = xk
11: end if
12: end for
6/24
Directional direct-search methods
source: towardsdatascience.com
7/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
8/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
9/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
10/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
11/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
12/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
13/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
14/24
Directional direct-search methods
https://en.wikipedia.org/wiki/File:Direct_search_BROYDEN.gif
15/24
Model-based methods
Polynomial Models
Q: could we pick n points {xi}n
i=1, eval-
uate the function values {P(xi)}n
i=1 and
fit a polynomial m(x) that will interpo-
late the points?
Q: How many points would we need for
the example here?
17/24
Polynomial Models - Simplified Setting
pick a monomial basis φ(x) = [1, x1, . . . , xd, x2
1, . . . , x2
d]T
polynomial model m(x) =
P
i αiφi(x) for some αis
Q: How could we find the αis?
18/24
Polynomial Models - Simplified Setting
pick a monomial basis φ(x) = [1, x1, . . . , xd, x2
1, . . . , x2
d]T
polynomial model m(x) =
P
i αiφi(x) for some αis
Q: How could we find the αis?
Using n points {(xi, yi = f(xi)}n
i=1 we want α to satisfy m(xi) = yi ∀i
18/24
Polynomial Models - Simplified Setting
pick a monomial basis φ(x) = [1, x1, . . . , xd, x2
1, . . . , x2
d]T
polynomial model m(x) =
P
i αiφi(x) for some αis
Q: How could we find the αis?
Using n points {(xi, yi = f(xi)}n
i=1 we want α to satisfy m(xi) = yi ∀i
α1φ1(x1) + α2φ2(x1) + . . . αnφn(x1) = y1 (1)
.
.
. =
.
.
. (2)
α1φ1(xi) + α2φ2(xi) + . . . αnφn(xi) = yi (3)
.
.
. =
.
.
. (4)
α1φ1(xn) + α2φ2(xn) + . . . αnφn(xn) = yn (5)
18/24
Polynomial Models - Simplified Setting
pick a monomial basis φ(x) = [1, x1, . . . , xd, x2
1, . . . , x2
d]T
polynomial model m(x) =
P
i αiφi(x) for some αis
Q: How could we find the αis?
Using n points {(xi, yi = f(xi)}n
i=1 we want α to satisfy m(xi) = yi ∀i
α1φ1(x1) + α2φ2(x1) + . . . αnφn(x1) = y1 (1)
.
.
. =
.
.
. (2)
α1φ1(xi) + α2φ2(xi) + . . . αnφn(xi) = yi (3)
.
.
. =
.
.
. (4)
α1φ1(xn) + α2φ2(xn) + . . . αnφn(xn) = yn (5)
[φ(x1), . . . , φ(xn)]T
α = [y1, . . . , yn]T
18/24
Polynomial Models - Simplified Setting
pick a monomial basis φ(x) = [1, x1, . . . , xd, x2
1, . . . , x2
d]T
polynomial model m(x) =
P
i αiφi(x) for some αis
Q: How could we find the αis?
Using n points {(xi, yi = f(xi)}n
i=1 we want α to satisfy m(xi) = yi ∀i
α1φ1(x1) + α2φ2(x1) + . . . αnφn(x1) = y1 (1)
.
.
. =
.
.
. (2)
α1φ1(xi) + α2φ2(xi) + . . . αnφn(xi) = yi (3)
.
.
. =
.
.
. (4)
α1φ1(xn) + α2φ2(xn) + . . . αnφn(xn) = yn (5)
[φ(x1), . . . , φ(xn)]T
α = [y1, . . . , yn]T
⇒ Least squares!
[Colab: 00_DFO.ipynb]
18/24
Polynomial Models - Simplified Setting
after we build a model m(x) ≈ f(x) we can minimize it!
19/24
Polynomial Models - Simplified Setting
after we build a model m(x) ≈ f(x) we can minimize it!
the minimum of m(x) will define a new point xn+1 and we can evaluate f(xn+1)
Q: what should we do next? – any suggestions are welcome!
[back to Colab]
19/24
Derivative-free
model-based
trust-region method
Motivation
for any analytic function f(x) we know that Taylor approximation is locally good
for given point xk and radius ∆ we have that ∀x ∈ B(xk, ∆) it holds
f(x) ≈ f(xk) + h∇f(xk), x − xki +
1
2
(x − xk)T
∇2
f(xk)(x − xk)
21/24
Motivation
for any analytic function f(x) we know that Taylor approximation is locally good
for given point xk and radius ∆ we have that ∀x ∈ B(xk, ∆) it holds
f(x) ≈ f(xk) + h∇f(xk), x − xki +
1
2
(x − xk)T
∇2
f(xk)(x − xk)
given n = (d + 1)(d + 2)/2 points we can pick polynomial model
m(x) =
X
i
αiφi(x)
with
φ(x) = [1, x1, . . . , xd, x2
1, . . . , x2
d, x1x2, . . . , xd−1xd]T
21/24
Trust-Region Radius ∆
we build model mk(x) using only points in B(xk, ∆)
we may need to evaluate f(x) on multiple new points that we may sample from B(xk, ∆)
let sk ∈ B(0, ∆) is such that it minimize model mk(xk + s)
Q: what do we expect about following quantity?
ρk =
f(xk) − f(xk + s)
mk(xk) − mk(xk + s)
22/24
Trust-Region Radius ∆
we build model mk(x) using only points in B(xk, ∆)
we may need to evaluate f(x) on multiple new points that we may sample from B(xk, ∆)
let sk ∈ B(0, ∆) is such that it minimize model mk(xk + s)
Q: what do we expect about following quantity?
ρk =
f(xk) − f(xk + s)
mk(xk) − mk(xk + s)
if ρk  η1 (with 0  η1  1) we can trust the model and we can increase ∆
otherwise we have multiple options - improve model mk(x) or even decrease ∆
22/24
Algorithm [LMW19]
23/24
Bibliography
This lecture was based on [LMW19]
Jeffrey Larson, Matt Menickelly, and Stefan M Wild.
Derivative-free optimization methods.
Acta Numerica, 28:287–404, 2019.
24/24
mbzuai.ac.ae
Mohamed bin Zayed
University of Artificial Intelligence
Masdar City
Abu Dhabi
United Arab Emirates

Mais conteúdo relacionado

Semelhante a Derivative free optimization

Quadratic Function Presentation
Quadratic Function PresentationQuadratic Function Presentation
Quadratic Function Presentation
RyanWatt
 
Image Processing 3
Image Processing 3Image Processing 3
Image Processing 3
jainatin
 
C2 st lecture 4 handout
C2 st lecture 4 handoutC2 st lecture 4 handout
C2 st lecture 4 handout
fatima d
 

Semelhante a Derivative free optimization (20)

Remainder theorem
Remainder theoremRemainder theorem
Remainder theorem
 
Differentiation.pptx
Differentiation.pptxDifferentiation.pptx
Differentiation.pptx
 
Quadratic Function Presentation
Quadratic Function PresentationQuadratic Function Presentation
Quadratic Function Presentation
 
QMC: Operator Splitting Workshop, Stochastic Block-Coordinate Fixed Point Alg...
QMC: Operator Splitting Workshop, Stochastic Block-Coordinate Fixed Point Alg...QMC: Operator Splitting Workshop, Stochastic Block-Coordinate Fixed Point Alg...
QMC: Operator Splitting Workshop, Stochastic Block-Coordinate Fixed Point Alg...
 
Image Processing 3
Image Processing 3Image Processing 3
Image Processing 3
 
Module 2 polynomial functions
Module 2   polynomial functionsModule 2   polynomial functions
Module 2 polynomial functions
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihood
 
Tensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantificationTensor Train data format for uncertainty quantification
Tensor Train data format for uncertainty quantification
 
138191 rvsp lecture notes
138191 rvsp lecture notes138191 rvsp lecture notes
138191 rvsp lecture notes
 
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
H2O World - Consensus Optimization and Machine Learning - Stephen BoydH2O World - Consensus Optimization and Machine Learning - Stephen Boyd
H2O World - Consensus Optimization and Machine Learning - Stephen Boyd
 
Lecture 5: Structured Prediction
Lecture 5: Structured PredictionLecture 5: Structured Prediction
Lecture 5: Structured Prediction
 
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
 
Efficient end-to-end learning for quantizable representations
Efficient end-to-end learning for quantizable representationsEfficient end-to-end learning for quantizable representations
Efficient end-to-end learning for quantizable representations
 
C2 st lecture 4 handout
C2 st lecture 4 handoutC2 st lecture 4 handout
C2 st lecture 4 handout
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Optimization tutorial
Optimization tutorialOptimization tutorial
Optimization tutorial
 
Hierarchical matrices for approximating large covariance matries and computin...
Hierarchical matrices for approximating large covariance matries and computin...Hierarchical matrices for approximating large covariance matries and computin...
Hierarchical matrices for approximating large covariance matries and computin...
 
IVR - Chapter 1 - Introduction
IVR - Chapter 1 - IntroductionIVR - Chapter 1 - Introduction
IVR - Chapter 1 - Introduction
 
Andrei rusu-2013-amaa-workshop
Andrei rusu-2013-amaa-workshopAndrei rusu-2013-amaa-workshop
Andrei rusu-2013-amaa-workshop
 
Multilinear Twisted Paraproducts
Multilinear Twisted ParaproductsMultilinear Twisted Paraproducts
Multilinear Twisted Paraproducts
 

Último

Último (20)

Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 

Derivative free optimization