SlideShare uma empresa Scribd logo
1 de 23
Baixar para ler offline
Louis Monier
@louis_monier
https://www.linkedin.com/in/louismonier
Deep Learning for Images
I see what you mean...
Gregory Renard
@redo
https://www.linkedin.com/in/gregoryrenard
Class 2 - Q2 - 2016
Fun with Images
Image Classification: kayak, boy Entity Detection
kayak boy
Face Recognition
Leo
Gollum
More Fun with Images
Pose DetectionImage SegmentationImage Captioning: “A young boy
wearing an orange vest riding a
yellow kayak on water, with sunlight
reflections.”
Yet more Fun with Images
Optical Character Recognition (OCR):
Astronomy is the science which treats of the nature and
properties of the heavenly bodies.
Autonomous Vehicles
Handwriting Recognition:
combustible: “able to catch fire”, adjective for being capable of
igniting and burning.
Our Wet Hardware
Alternating layers of
- simple cells (filters)
- complex cells (combination)
Simple patterns to abstract concepts
~ 5B neurons for vision
Convolutional Neural Network (ConvNet, CNN)
Suggested by Kunihiko Fukushima, 1980
LeNet, by Yann LeCun, 1998, to classify hand-written digits
filterimage
= 6.6
= -7.8
1.0 - really want
0.2 - sort of want
-1.0 - don’t want
Convolution: Applying a Filter to a Signal
through
through
=
=
image filter
1.0
0.5
0.0
through
through
Convolutional Layer - Basic Unit
5x5x3 chunk of inputs
Layer N Layer N+1
ReLU
neuron
(3)
(3)
(3)
(3)
5 x 5 x 3 = 75 inputs
76 weights
Convolutional Layer: Add Depth
5x5x3 chunk of inputs
Layer N Layer N+1
Depth = 7 ReLU neurons in parallel,
with different weights
Stride = 1
Convolutional Layer: Repeat over entire image
L=W=5
D=4
zero
padding
D=7
Shared weights!!!
Pooling Layer: Squeeeeeze!
Max Pooling Average Pooling
Layer N+1Layer N
Classical CNN topology - VGGNet (2013)
224x224 112x112 56x56 28x28 14x14 FC
D=64
D=128
D=256
D=512
D=512
D=4096 D=4096 D=1000
FC FC + Softmax
ConvNet
Pool
Layer 1
Filter Matching images
Layer 2
Layer 3
Layer 4
Layer 5
Modern ConvNet - GoogLeNet
GoogLeNet (2014)
ResNet-34 (2015)
Manifolds
Real-life Data vs Random Data
If music be the food of love, play on!
-- William Shakespeare
3Flr'kI5;LS3oLj1xK52,BA1 Rea5IYSf
-- 1000 monkeys typing
-- Real world -- Random Pixels
Workshop : Keras & MNIST
https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232
Workshop : Keras & CIFAR 10
https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232

Mais conteúdo relacionado

Semelhante a Deep Learning Class #2 - Deep learning for Images, I See What You Mean

Semelhante a Deep Learning Class #2 - Deep learning for Images, I See What You Mean (8)

Neuroaesthetics: Sciences embraces art
Neuroaesthetics: Sciences embraces artNeuroaesthetics: Sciences embraces art
Neuroaesthetics: Sciences embraces art
 
A new technology for a new era
A new technology for a new eraA new technology for a new era
A new technology for a new era
 
PHYSIOLOGY(OCULAR) PPT.pptx
PHYSIOLOGY(OCULAR) PPT.pptxPHYSIOLOGY(OCULAR) PPT.pptx
PHYSIOLOGY(OCULAR) PPT.pptx
 
4. darwin and the eye, part 1
4. darwin and the eye, part 14. darwin and the eye, part 1
4. darwin and the eye, part 1
 
Thesis
ThesisThesis
Thesis
 
Unit 1 How to Use Microscope
Unit 1 How to Use MicroscopeUnit 1 How to Use Microscope
Unit 1 How to Use Microscope
 
Night vision technology
Night vision technologyNight vision technology
Night vision technology
 
CHAPTER 3PERCEPTIONGraham Pike, Graham Edgar, and Helen
CHAPTER 3PERCEPTIONGraham Pike, Graham Edgar, and Helen CHAPTER 3PERCEPTIONGraham Pike, Graham Edgar, and Helen
CHAPTER 3PERCEPTIONGraham Pike, Graham Edgar, and Helen
 

Último

On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 

Último (20)

On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 

Deep Learning Class #2 - Deep learning for Images, I See What You Mean

  • 1. Louis Monier @louis_monier https://www.linkedin.com/in/louismonier Deep Learning for Images I see what you mean... Gregory Renard @redo https://www.linkedin.com/in/gregoryrenard Class 2 - Q2 - 2016
  • 2. Fun with Images Image Classification: kayak, boy Entity Detection kayak boy Face Recognition Leo Gollum
  • 3. More Fun with Images Pose DetectionImage SegmentationImage Captioning: “A young boy wearing an orange vest riding a yellow kayak on water, with sunlight reflections.”
  • 4. Yet more Fun with Images Optical Character Recognition (OCR): Astronomy is the science which treats of the nature and properties of the heavenly bodies. Autonomous Vehicles Handwriting Recognition: combustible: “able to catch fire”, adjective for being capable of igniting and burning.
  • 5. Our Wet Hardware Alternating layers of - simple cells (filters) - complex cells (combination) Simple patterns to abstract concepts ~ 5B neurons for vision
  • 6. Convolutional Neural Network (ConvNet, CNN) Suggested by Kunihiko Fukushima, 1980 LeNet, by Yann LeCun, 1998, to classify hand-written digits
  • 7. filterimage = 6.6 = -7.8 1.0 - really want 0.2 - sort of want -1.0 - don’t want Convolution: Applying a Filter to a Signal through through = = image filter 1.0 0.5 0.0 through through
  • 8. Convolutional Layer - Basic Unit 5x5x3 chunk of inputs Layer N Layer N+1 ReLU neuron (3) (3) (3) (3) 5 x 5 x 3 = 75 inputs 76 weights
  • 9. Convolutional Layer: Add Depth 5x5x3 chunk of inputs Layer N Layer N+1 Depth = 7 ReLU neurons in parallel, with different weights
  • 10. Stride = 1 Convolutional Layer: Repeat over entire image L=W=5 D=4 zero padding D=7 Shared weights!!!
  • 11. Pooling Layer: Squeeeeeze! Max Pooling Average Pooling Layer N+1Layer N
  • 12. Classical CNN topology - VGGNet (2013) 224x224 112x112 56x56 28x28 14x14 FC D=64 D=128 D=256 D=512 D=512 D=4096 D=4096 D=1000 FC FC + Softmax ConvNet Pool
  • 18. Modern ConvNet - GoogLeNet GoogLeNet (2014) ResNet-34 (2015)
  • 20. Real-life Data vs Random Data If music be the food of love, play on! -- William Shakespeare 3Flr'kI5;LS3oLj1xK52,BA1 Rea5IYSf -- 1000 monkeys typing -- Real world -- Random Pixels
  • 21.
  • 22. Workshop : Keras & MNIST https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232
  • 23. Workshop : Keras & CIFAR 10 https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232