SlideShare uma empresa Scribd logo
1 de 61
Reinforcement Learning
(2nd)
ujava.org Workshop
2016-08-12 www.idosi.com
CEO 강신동
Shindong KANG
(주)지능도시
www.idosi.comujava.org
www.idosi.comspaceapi.org
www.idosi.comReinforcement Learning for Brick Game
www.idosi.comReinforcement Learning for Brick Game
www.idosi.comTo Flip Pancake
www.idosi.comCrawling Robot on Carpet
www.idosi.comPavlov's Dog
www.idosi.comPavlov
www.idosi.comReinforcement (강화)
www.idosi.comReinforcement Learning
www.idosi.comForecast
www.idosi.comForecast with probability
www.idosi.comUnknown model & real facts
Deep Neural Network
Bayesian Probability
www.idosi.comVariance (분산)
www.idosi.comVariance (분산)
www.idosi.comRandrom Variable
www.idosi.comTypes of Randrom Variable
www.idosi.comDiscrete Probability Distribution
www.idosi.comContinuous Probability Distribution, Probability Density Function
Density (밀도)
www.idosi.comExpected value (기대값)
EV = xP/1
www.idosi.comExpected Value for Continuous variable
www.idosi.comCovariance (공분산)
www.idosi.comCovariance
www.idosi.comProbability (확률)
www.idosi.comConditional Probability (조건부 확률)
www.idosi.comBayes rule
www.idosi.comBayesian Probability (베이지안 확률)
www.idosi.comBayesian Probability (베이지안 확률)
P(fair|H) = ?
P(A) = P(fair) = ½
P(B) = P(H) = ¾
P(B|A) = P(H|fair) = ½
½ ½ 1
--- = –--
¾ 3
www.idosi.comBrownian motion (브라운 운동)
www.idosi.comBrownian motion, Gaussian distribution
www.idosi.comSnapshot of state
www.idosi.comMarkov Chain
www.idosi.comProcess Probability (과정 확률)
s1 s2 s3
Episode process :
s1, s2 = ?
s2, s3 = ?
s1, s3 = ?
www.idosi.comMarkov Process
www.idosi.comMarkov Process
www.idosi.comMath Product Symbol
www.idosi.comMarkov Process
www.idosi.comMarkov Process
www.idosi.comMarkov Process
www.idosi.comStochastic Matrix
www.idosi.comStochastic Matrix
0.4 0.6
0.7 0.3
www.idosi.com2 Snapshots of state
Direction using Second Order
www.idosi.comMarkov Process
www.idosi.com3 Snapshots of state
Acceleration using 3rd
order
www.idosi.comExploitation and Exploration (개발 and 탐험)
www.idosi.comState-action exploration vs. Parameter exploration
www.idosi.comMulti-armed bandit problem
www.idosi.comThompson sampling
www.idosi.comSimulated Bandit Performance
www.idosi.comMulti-armed bandit problem
www.idosi.comMulti-Armed Bandit Algorithms
www.idosi.comMAB Reward
www.idosi.comFunction's Probability Distribution
Function's Probability Distribution ?
www.idosi.comFunction's Probability Distribution
y = ax^2 +b
www.idosi.comFunction's Probability Distribution with Gaussian Distribution
y = ax^2 +b
www.idosi.comFunction's Probability Distribution with Gaussian Distribution
www.idosi.comGaussian Process Regreesion
www.idosi.comGaussian Process
From “C. E. Rasmussen & C. K. I. Williams, Gaussian Processes for Machine Learning, the MIT Press,
2006”
www.idosi.comThompson sampling
www.idosi.com
Thank you !
(주)지능도시
Intelligent City Ltd.
강신동
Shindong KANG
www.idosi.com
ceo@idosi.com

Mais conteúdo relacionado

Mais de 신동 강

Graph Convolutional Neural Networks
Graph Convolutional Neural Networks Graph Convolutional Neural Networks
Graph Convolutional Neural Networks 신동 강
 
Recurrent Neural Network tutorial (2nd)
Recurrent Neural Network tutorial (2nd) Recurrent Neural Network tutorial (2nd)
Recurrent Neural Network tutorial (2nd) 신동 강
 
Quantum Computer for Deep Learning
Quantum Computer for Deep Learning Quantum Computer for Deep Learning
Quantum Computer for Deep Learning 신동 강
 
ujava.org Drone Physics
ujava.org Drone Physicsujava.org Drone Physics
ujava.org Drone Physics신동 강
 
Recursive Neural Network : ujava.org 12th deep learning workshop
Recursive Neural Network : ujava.org 12th deep learning workshopRecursive Neural Network : ujava.org 12th deep learning workshop
Recursive Neural Network : ujava.org 12th deep learning workshop신동 강
 
NN Models with DL4J for Deep Learning
NN Models with DL4J for Deep LearningNN Models with DL4J for Deep Learning
NN Models with DL4J for Deep Learning신동 강
 
RBM with DL4J for Deep Learning
RBM with DL4J for Deep Learning RBM with DL4J for Deep Learning
RBM with DL4J for Deep Learning 신동 강
 
Deep Learning for Java (DL4J)
Deep Learning for Java (DL4J)Deep Learning for Java (DL4J)
Deep Learning for Java (DL4J)신동 강
 
Ujava.org reinforcement-learning
Ujava.org reinforcement-learningUjava.org reinforcement-learning
Ujava.org reinforcement-learning신동 강
 
Ujava.org tensor-analysis
Ujava.org tensor-analysisUjava.org tensor-analysis
Ujava.org tensor-analysis신동 강
 
Tensor Physics for Deep Learning
Tensor Physics for Deep Learning Tensor Physics for Deep Learning
Tensor Physics for Deep Learning 신동 강
 
ujava.org Deep Learning with Convolutional Neural Network
ujava.org Deep Learning with Convolutional Neural Network ujava.org Deep Learning with Convolutional Neural Network
ujava.org Deep Learning with Convolutional Neural Network 신동 강
 
Recurrent Neural Network, Fractal for Deep Learning
Recurrent Neural Network, Fractal for Deep LearningRecurrent Neural Network, Fractal for Deep Learning
Recurrent Neural Network, Fractal for Deep Learning신동 강
 
ujava.org workshop : Deep Learning [2015-03-08]
ujava.org workshop : Deep Learning  [2015-03-08]ujava.org workshop : Deep Learning  [2015-03-08]
ujava.org workshop : Deep Learning [2015-03-08]신동 강
 
IoT & Machine Learning
IoT & Machine LearningIoT & Machine Learning
IoT & Machine Learning신동 강
 
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함신동 강
 

Mais de 신동 강 (16)

Graph Convolutional Neural Networks
Graph Convolutional Neural Networks Graph Convolutional Neural Networks
Graph Convolutional Neural Networks
 
Recurrent Neural Network tutorial (2nd)
Recurrent Neural Network tutorial (2nd) Recurrent Neural Network tutorial (2nd)
Recurrent Neural Network tutorial (2nd)
 
Quantum Computer for Deep Learning
Quantum Computer for Deep Learning Quantum Computer for Deep Learning
Quantum Computer for Deep Learning
 
ujava.org Drone Physics
ujava.org Drone Physicsujava.org Drone Physics
ujava.org Drone Physics
 
Recursive Neural Network : ujava.org 12th deep learning workshop
Recursive Neural Network : ujava.org 12th deep learning workshopRecursive Neural Network : ujava.org 12th deep learning workshop
Recursive Neural Network : ujava.org 12th deep learning workshop
 
NN Models with DL4J for Deep Learning
NN Models with DL4J for Deep LearningNN Models with DL4J for Deep Learning
NN Models with DL4J for Deep Learning
 
RBM with DL4J for Deep Learning
RBM with DL4J for Deep Learning RBM with DL4J for Deep Learning
RBM with DL4J for Deep Learning
 
Deep Learning for Java (DL4J)
Deep Learning for Java (DL4J)Deep Learning for Java (DL4J)
Deep Learning for Java (DL4J)
 
Ujava.org reinforcement-learning
Ujava.org reinforcement-learningUjava.org reinforcement-learning
Ujava.org reinforcement-learning
 
Ujava.org tensor-analysis
Ujava.org tensor-analysisUjava.org tensor-analysis
Ujava.org tensor-analysis
 
Tensor Physics for Deep Learning
Tensor Physics for Deep Learning Tensor Physics for Deep Learning
Tensor Physics for Deep Learning
 
ujava.org Deep Learning with Convolutional Neural Network
ujava.org Deep Learning with Convolutional Neural Network ujava.org Deep Learning with Convolutional Neural Network
ujava.org Deep Learning with Convolutional Neural Network
 
Recurrent Neural Network, Fractal for Deep Learning
Recurrent Neural Network, Fractal for Deep LearningRecurrent Neural Network, Fractal for Deep Learning
Recurrent Neural Network, Fractal for Deep Learning
 
ujava.org workshop : Deep Learning [2015-03-08]
ujava.org workshop : Deep Learning  [2015-03-08]ujava.org workshop : Deep Learning  [2015-03-08]
ujava.org workshop : Deep Learning [2015-03-08]
 
IoT & Machine Learning
IoT & Machine LearningIoT & Machine Learning
IoT & Machine Learning
 
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
IoT In-Depth Conference 강연 자료 (주)지능도시 강신동 양계장 비닐하우스 포함
 

Último

Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 

Último (20)

Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 

ujava.org Reinforcement Learning (2nd)