Deep Learning Class #2 - Deep learning for Images, I See What You Mean

•

4 gostaram•1,906 visualizações

Slides by Louis Monier (Altavista Co-Founder & CTO) for Deep Learning keynote #2 at Holberton School. http://www.meetup.com/Holberton-School/events/230547621/ The keynote was followed by a workshop prepared by Gregory Renard. If you want to assist to similar keynote for free, checkout http://www.meetup.com/Holberton-School/

Educação

Louis Monier
@louis_monier
https://www.linkedin.com/in/louismonier
Deep Learning for Images
I see what you mean...
Gregory Renard
@redo
https://www.linkedin.com/in/gregoryrenard
Class 2 - Q2 - 2016

Fun with Images
Image Classification: kayak, boy Entity Detection
kayak boy
Face Recognition
Leo
Gollum

More Fun with Images
Pose DetectionImage SegmentationImage Captioning: “A young boy
wearing an orange vest riding a
yellow kayak on water, with sunlight
reflections.”

Yet more Fun with Images
Optical Character Recognition (OCR):
Astronomy is the science which treats of the nature and
properties of the heavenly bodies.
Autonomous Vehicles
Handwriting Recognition:
combustible: “able to catch fire”, adjective for being capable of
igniting and burning.

Our Wet Hardware
Alternating layers of
- simple cells (filters)
- complex cells (combination)
Simple patterns to abstract concepts
~ 5B neurons for vision

Convolutional Neural Network (ConvNet, CNN)
Suggested by Kunihiko Fukushima, 1980
LeNet, by Yann LeCun, 1998, to classify hand-written digits

filterimage
= 6.6
= -7.8
1.0 - really want
0.2 - sort of want
-1.0 - don’t want
Convolution: Applying a Filter to a Signal
through
through
=
=
image filter
1.0
0.5
0.0
through
through

Convolutional Layer - Basic Unit
5x5x3 chunk of inputs
Layer N Layer N+1
ReLU
neuron
(3)
(3)
(3)
(3)
5 x 5 x 3 = 75 inputs
76 weights

Convolutional Layer: Add Depth
5x5x3 chunk of inputs
Layer N Layer N+1
Depth = 7 ReLU neurons in parallel,
with different weights

Stride = 1
Convolutional Layer: Repeat over entire image
L=W=5
D=4
zero
padding
D=7
Shared weights!!!

Pooling Layer: Squeeeeeze!
Max Pooling Average Pooling
Layer N+1Layer N

Classical CNN topology - VGGNet (2013)
224x224 112x112 56x56 28x28 14x14 FC
D=64
D=128
D=256
D=512
D=512
D=4096 D=4096 D=1000
FC FC + Softmax
ConvNet
Pool

Modern ConvNet - GoogLeNet
GoogLeNet (2014)
ResNet-34 (2015)

Real-life Data vs Random Data
If music be the food of love, play on!
-- William Shakespeare
3Flr'kI5;LS3oLj1xK52,BA1 Rea5IYSf
-- 1000 monkeys typing
-- Real world -- Random Pixels

Workshop : Keras & MNIST
https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232

Workshop : Keras & CIFAR 10
https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232

Mais conteúdo relacionado

Semelhante a Deep Learning Class #2 - Deep learning for Images, I See What You Mean

Neuroaesthetics: Sciences embraces artNomensa

A new technology for a new era*** -"A NEW TECHNOLOGY FOR A NEW ERA"- / "A BREAK-LOOPER"/-QUANTUM PHYSICS-/

PHYSIOLOGY(OCULAR) PPT.pptxAlmaazAhmed

4. darwin and the eye, part 1Ariel Roth

ThesisRob Chhokar

Unit 1 How to Use MicroscopeShally Rahmawaty

Night vision technologypadma gade

CHAPTER 3PERCEPTIONGraham Pike, Graham Edgar, and Helen samirapdcosden

Semelhante a Deep Learning Class #2 - Deep learning for Images, I See What You Mean (8)

Neuroaesthetics: Sciences embraces art

A new technology for a new era

PHYSIOLOGY(OCULAR) PPT.pptx

4. darwin and the eye, part 1

Thesis

Unit 1 How to Use Microscope

Night vision technology

CHAPTER 3PERCEPTIONGraham Pike, Graham Edgar, and Helen

Último

On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection

Class 11th Physics NEET formula sheet pdfAyushMahapatra5

Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid

Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K

Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic

Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417

Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande

How to Give a Domain for a Field in Odoo 17Celine George

Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong

INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxRAM LAL ANAND COLLEGE, DELHI UNIVERSITY.

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade

ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22

Making and Justifying Mathematical Decisions.pdfChris Hunter

1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh

Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George

Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417

microwave assisted reaction. General introductionMaksud Ahmed

Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K

Deep Learning Class #2 - Deep learning for Images, I See What You Mean

1. Louis Monier @louis_monier https://www.linkedin.com/in/louismonier Deep Learning for Images I see what you mean... Gregory Renard @redo https://www.linkedin.com/in/gregoryrenard Class 2 - Q2 - 2016

2. Fun with Images Image Classification: kayak, boy Entity Detection kayak boy Face Recognition Leo Gollum

3. More Fun with Images Pose DetectionImage SegmentationImage Captioning: “A young boy wearing an orange vest riding a yellow kayak on water, with sunlight reflections.”

4. Yet more Fun with Images Optical Character Recognition (OCR): Astronomy is the science which treats of the nature and properties of the heavenly bodies. Autonomous Vehicles Handwriting Recognition: combustible: “able to catch fire”, adjective for being capable of igniting and burning.

5. Our Wet Hardware Alternating layers of - simple cells (filters) - complex cells (combination) Simple patterns to abstract concepts ~ 5B neurons for vision

6. Convolutional Neural Network (ConvNet, CNN) Suggested by Kunihiko Fukushima, 1980 LeNet, by Yann LeCun, 1998, to classify hand-written digits

7. filterimage = 6.6 = -7.8 1.0 - really want 0.2 - sort of want -1.0 - don’t want Convolution: Applying a Filter to a Signal through through = = image filter 1.0 0.5 0.0 through through

8. Convolutional Layer - Basic Unit 5x5x3 chunk of inputs Layer N Layer N+1 ReLU neuron (3) (3) (3) (3) 5 x 5 x 3 = 75 inputs 76 weights

9. Convolutional Layer: Add Depth 5x5x3 chunk of inputs Layer N Layer N+1 Depth = 7 ReLU neurons in parallel, with different weights

10. Stride = 1 Convolutional Layer: Repeat over entire image L=W=5 D=4 zero padding D=7 Shared weights!!!

11. Pooling Layer: Squeeeeeze! Max Pooling Average Pooling Layer N+1Layer N

12. Classical CNN topology - VGGNet (2013) 224x224 112x112 56x56 28x28 14x14 FC D=64 D=128 D=256 D=512 D=512 D=4096 D=4096 D=1000 FC FC + Softmax ConvNet Pool

13. Layer 1 Filter Matching images

14. Layer 2

15. Layer 3

16. Layer 4

17. Layer 5

18. Modern ConvNet - GoogLeNet GoogLeNet (2014) ResNet-34 (2015)

19. Manifolds

20. Real-life Data vs Random Data If music be the food of love, play on! -- William Shakespeare 3Flr'kI5;LS3oLj1xK52,BA1 Rea5IYSf -- 1000 monkeys typing -- Real world -- Random Pixels

21.

22. Workshop : Keras & MNIST https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232

23. Workshop : Keras & CIFAR 10 https://github.com/holbertonschool/deep-learning/tree/master/Class%20%232

Deep Learning Class #2 - Deep learning for Images, I See What You Mean

Recomendados

Recomendados

Mais conteúdo relacionado

Semelhante a Deep Learning Class #2 - Deep learning for Images, I See What You Mean

Semelhante a Deep Learning Class #2 - Deep learning for Images, I See What You Mean (8)

Último

Último (20)

Deep Learning Class #2 - Deep learning for Images, I See What You Mean