SlideShare uma empresa Scribd logo
1 de 19
The image part with relationship ID rId14 was not found in the file.
{
Deep  Learning
And  Its  Applications:  Computer  Vision
Adam  Gibson
{  deeplearning4j.org  //  skymind.io  //  zipfian  academy
The image part with relationship ID rId14 was not found in the file.
•  Object  Recognition
•  Image  Categorization
•  Scene  Parsing
•  Face  Recognition
Computer  Vision:  A  Primer
The image part with relationship ID rId14 was not found in the file.
•  OpenCV  
•  SIFT
•  Filters/Edge  Detection
•  Feature  Extraction
What’s  currently  done?
The image part with relationship ID rId14 was not found in the file.
•  Representation  Learning  
•  More  precise  than  hand-­‐‑done  
features
•  Non-­‐‑linearities  and  higher-­‐‑order  
trends
•  Pretrain  and  Hessian  Free
This  is  manual!
The image part with relationship ID rId14 was not found in the file.
•  Representation  Learning
•  Position  Invariance  with  convolutions
•  Semantic  Hashing  
Deep  Learning  and  Images
The image part with relationship ID rId14 was not found in the file.
•  Normal  pixels  –  0-­‐‑255  –  
normalization
•  Sparse  –  binarization  (depending  on  
pixel  presence)
Different  kinds  of  images
The image part with relationship ID rId14 was not found in the file.
•  Faces  =  a  collection  of  images.
•  With  persistent  pa_erns  of  pixels.
•  Pixel  pa_erns  =  features.
•  Nets  learn  to  identify  features  in  data,  to  
classify  faces  as  faces  and  label  them:  John  or  
Sarah.
•  Nets  train  by  reconstructing  faces  from  features  
many  times.	
•  Measuring  their  work  against  a  benchmark.
Facial  recognition
The image part with relationship ID rId14 was not found in the file.
DL4J’s  Facial  Reconstructions
The image part with relationship ID rId14 was not found in the file.
•  Slices  of  a  feature  space  (Max  pooling)	
•  Learns  different  portions  for  easily  scalable  
and  robust  feature  engineering.	
Position  Invariance  -­‐‑  Convolutions
The image part with relationship ID rId14 was not found in the file.
Visual  Example  -­‐‑  Convolutions
The image part with relationship ID rId14 was not found in the file.
Pen  Strokes
The image part with relationship ID rId14 was not found in the file.
•  Facebook  uses  facial  recognition  to  make  
itself  stickier  and  know  more  about  us.
•  Government  agencies  use  it  to  secure  
national  borders.
•  Video  game  makers  use  it  to  construct  more  
realistic  worlds.
•  Stores  use  it  to  identify  customers  and  track  
behavior.
What  are  faces  for?
The image part with relationship ID rId14 was not found in the file.
•  2  layers  of  neuron-­‐‑like  nodes.	
•  The  1st  is  the  visible,  or  input,  layer	
•  The  2nd  is  “hidden.”  It  identifies  features  in  input	
•  Symmetrically  connected.	
•  “Restricted”  =  no  visible-­‐‑visible  or  hidden-­‐‑hidden  
ties	
•  All  connections  happen  between  layers.	
Restricted  Bolgmann  
Machines  (RBMs)
The image part with relationship ID rId14 was not found in the file.
•  A  stack  of  RBMs.	
•  Each  RBM’s  hidden  layer  à  Next  RBM’s  visible/input  
layer.  	
•  DBNs  learn  more  &  more  complex  features	
•  Example:  	
•  1)  Pixels  =  input;  	
•  2)  H1  learns  an  edge  or  line;  	
•  3)  H2  learns  a  corner  or  set  of  lines;  	
•  4)  H3  learns  two  groups  of  lines  forming  an  object  
-­‐‑-­‐‑  a  face!	
•  Final  layer  classifies  feature  groups:  sunset,  elephant,  
flower,  John,  Sarah.	
Deep-­‐‑Belief  Net  (DBN)
The image part with relationship ID rId14 was not found in the file.
•  2  DBNs.
•  1st  DBN  *encodes*  data  into  vector  of  10-­‐‑30  
numbers  =  Pre-­‐‑training.
•  2nd  DBN  decodes  data  into  original  state.
•  Backprop  only  happens  on  2nd  DBN
•  2nd  is  the  fine-­‐‑tuning  stage  (reconstruction  entropy).
•  Reduces  documents  or  images  to  compact  vectors  .
•  Useful  in  search,  QA  and  information  retrieval.
Deep  Autoencoder
The image part with relationship ID rId14 was not found in the file.
Deep  Autoencoder  Architecture
The image part with relationship ID rId14 was not found in the file.
Image  Search  Results
The image part with relationship ID rId14 was not found in the file.
•  Top-­‐‑down  &  hierarchical  rather  than  feed-­‐‑forward  (DBNs).
•  Handles  sequence-­‐‑based  classification,  windows  of  several  
events,  entire  scenes  (multiple  objects).
•  Features  themselves  are  vectors.  
•  A  tensor  =  a  multi-­‐‑dimensional  matrix,  or  multiple  matrices  of  
the  same  size.
Recursive  Neural  Tensor  Net
The image part with relationship ID rId14 was not found in the file.
RNTNs  &  Scene  Composition

Mais conteúdo relacionado

Destaque

General introduction to computer vision
General introduction to computer visionGeneral introduction to computer vision
General introduction to computer vision
butest
 
applications of computer graphics
applications of computer graphicsapplications of computer graphics
applications of computer graphics
Aaina Katyal
 

Destaque (20)

General introduction to computer vision
General introduction to computer visionGeneral introduction to computer vision
General introduction to computer vision
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
COM2304: Introduction to Computer Vision & Image Processing
COM2304: Introduction to Computer Vision & Image Processing COM2304: Introduction to Computer Vision & Image Processing
COM2304: Introduction to Computer Vision & Image Processing
 
Nonverbal
NonverbalNonverbal
Nonverbal
 
Computer Vision Introduction
Computer Vision IntroductionComputer Vision Introduction
Computer Vision Introduction
 
Cross platform computer vision optimization
Cross platform computer vision optimizationCross platform computer vision optimization
Cross platform computer vision optimization
 
Understanding hearing impairment
Understanding hearing impairmentUnderstanding hearing impairment
Understanding hearing impairment
 
Deaf culture
Deaf cultureDeaf culture
Deaf culture
 
Sign Language in Communication
Sign Language in CommunicationSign Language in Communication
Sign Language in Communication
 
Sign Language
Sign LanguageSign Language
Sign Language
 
Matrix and Tensor Tools for Computer Vision
Matrix and Tensor Tools for Computer VisionMatrix and Tensor Tools for Computer Vision
Matrix and Tensor Tools for Computer Vision
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Hearing impairment
Hearing impairmentHearing impairment
Hearing impairment
 
Dbms Project
Dbms ProjectDbms Project
Dbms Project
 
American Sign Language
American Sign LanguageAmerican Sign Language
American Sign Language
 
What Is Sign Language
What Is Sign LanguageWhat Is Sign Language
What Is Sign Language
 
Sign language ppt
Sign language pptSign language ppt
Sign language ppt
 
Image processing ppt
Image processing pptImage processing ppt
Image processing ppt
 
applications of computer graphics
applications of computer graphicsapplications of computer graphics
applications of computer graphics
 

Mais de Adam Gibson

Mais de Adam Gibson (20)

End to end MLworkflows
End to end MLworkflowsEnd to end MLworkflows
End to end MLworkflows
 
World Artificial Intelligence Conference Shanghai 2018
World Artificial Intelligence Conference Shanghai 2018World Artificial Intelligence Conference Shanghai 2018
World Artificial Intelligence Conference Shanghai 2018
 
Deploying signature verification with deep learning
Deploying signature verification with deep learningDeploying signature verification with deep learning
Deploying signature verification with deep learning
 
Self driving computers active learning workflows with human interpretable ve...
Self driving computers  active learning workflows with human interpretable ve...Self driving computers  active learning workflows with human interpretable ve...
Self driving computers active learning workflows with human interpretable ve...
 
Anomaly Detection and Automatic Labeling with Deep Learning
Anomaly Detection and Automatic Labeling with Deep LearningAnomaly Detection and Automatic Labeling with Deep Learning
Anomaly Detection and Automatic Labeling with Deep Learning
 
Strata Beijing 2017: Jumpy, a python interface for nd4j
Strata Beijing 2017: Jumpy, a python interface for nd4jStrata Beijing 2017: Jumpy, a python interface for nd4j
Strata Beijing 2017: Jumpy, a python interface for nd4j
 
Boolan machine learning summit
Boolan machine learning summitBoolan machine learning summit
Boolan machine learning summit
 
Advanced deeplearning4j features
Advanced deeplearning4j featuresAdvanced deeplearning4j features
Advanced deeplearning4j features
 
Deep Learning with GPUs in Production - AI By the Bay
Deep Learning with GPUs in Production - AI By the BayDeep Learning with GPUs in Production - AI By the Bay
Deep Learning with GPUs in Production - AI By the Bay
 
Big Data Analytics Tokyo
Big Data Analytics TokyoBig Data Analytics Tokyo
Big Data Analytics Tokyo
 
Wrangleconf Big Data Malaysia 2016
Wrangleconf Big Data Malaysia 2016Wrangleconf Big Data Malaysia 2016
Wrangleconf Big Data Malaysia 2016
 
Distributed deep rl on spark strata singapore
Distributed deep rl on spark   strata singaporeDistributed deep rl on spark   strata singapore
Distributed deep rl on spark strata singapore
 
Deep learning in production with the best
Deep learning in production   with the bestDeep learning in production   with the best
Deep learning in production with the best
 
Dl4j in the wild
Dl4j in the wildDl4j in the wild
Dl4j in the wild
 
SKIL - Dl4j in the wild meetup
SKIL - Dl4j in the wild meetupSKIL - Dl4j in the wild meetup
SKIL - Dl4j in the wild meetup
 
Strata Beijing - Deep Learning in Production on Spark
Strata Beijing - Deep Learning in Production on SparkStrata Beijing - Deep Learning in Production on Spark
Strata Beijing - Deep Learning in Production on Spark
 
Anomaly detection in deep learning (Updated) English
Anomaly detection in deep learning (Updated) EnglishAnomaly detection in deep learning (Updated) English
Anomaly detection in deep learning (Updated) English
 
Skymind - Udacity China presentation
Skymind - Udacity China presentationSkymind - Udacity China presentation
Skymind - Udacity China presentation
 
Anomaly Detection in Deep Learning (Updated)
Anomaly Detection in Deep Learning (Updated)Anomaly Detection in Deep Learning (Updated)
Anomaly Detection in Deep Learning (Updated)
 
Hadoop summit 2016
Hadoop summit 2016Hadoop summit 2016
Hadoop summit 2016
 

Último

AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 

Deep Learning and its Applications - Computer Vision

  • 1. The image part with relationship ID rId14 was not found in the file. { Deep  Learning And  Its  Applications:  Computer  Vision Adam  Gibson {  deeplearning4j.org  //  skymind.io  //  zipfian  academy
  • 2. The image part with relationship ID rId14 was not found in the file. •  Object  Recognition •  Image  Categorization •  Scene  Parsing •  Face  Recognition Computer  Vision:  A  Primer
  • 3. The image part with relationship ID rId14 was not found in the file. •  OpenCV   •  SIFT •  Filters/Edge  Detection •  Feature  Extraction What’s  currently  done?
  • 4. The image part with relationship ID rId14 was not found in the file. •  Representation  Learning   •  More  precise  than  hand-­‐‑done   features •  Non-­‐‑linearities  and  higher-­‐‑order   trends •  Pretrain  and  Hessian  Free This  is  manual!
  • 5. The image part with relationship ID rId14 was not found in the file. •  Representation  Learning •  Position  Invariance  with  convolutions •  Semantic  Hashing   Deep  Learning  and  Images
  • 6. The image part with relationship ID rId14 was not found in the file. •  Normal  pixels  –  0-­‐‑255  –   normalization •  Sparse  –  binarization  (depending  on   pixel  presence) Different  kinds  of  images
  • 7. The image part with relationship ID rId14 was not found in the file. •  Faces  =  a  collection  of  images. •  With  persistent  pa_erns  of  pixels. •  Pixel  pa_erns  =  features. •  Nets  learn  to  identify  features  in  data,  to   classify  faces  as  faces  and  label  them:  John  or   Sarah. •  Nets  train  by  reconstructing  faces  from  features   many  times. •  Measuring  their  work  against  a  benchmark. Facial  recognition
  • 8. The image part with relationship ID rId14 was not found in the file. DL4J’s  Facial  Reconstructions
  • 9. The image part with relationship ID rId14 was not found in the file. •  Slices  of  a  feature  space  (Max  pooling) •  Learns  different  portions  for  easily  scalable   and  robust  feature  engineering. Position  Invariance  -­‐‑  Convolutions
  • 10. The image part with relationship ID rId14 was not found in the file. Visual  Example  -­‐‑  Convolutions
  • 11. The image part with relationship ID rId14 was not found in the file. Pen  Strokes
  • 12. The image part with relationship ID rId14 was not found in the file. •  Facebook  uses  facial  recognition  to  make   itself  stickier  and  know  more  about  us. •  Government  agencies  use  it  to  secure   national  borders. •  Video  game  makers  use  it  to  construct  more   realistic  worlds. •  Stores  use  it  to  identify  customers  and  track   behavior. What  are  faces  for?
  • 13. The image part with relationship ID rId14 was not found in the file. •  2  layers  of  neuron-­‐‑like  nodes. •  The  1st  is  the  visible,  or  input,  layer •  The  2nd  is  “hidden.”  It  identifies  features  in  input •  Symmetrically  connected. •  “Restricted”  =  no  visible-­‐‑visible  or  hidden-­‐‑hidden   ties •  All  connections  happen  between  layers. Restricted  Bolgmann   Machines  (RBMs)
  • 14. The image part with relationship ID rId14 was not found in the file. •  A  stack  of  RBMs. •  Each  RBM’s  hidden  layer  à  Next  RBM’s  visible/input   layer.   •  DBNs  learn  more  &  more  complex  features •  Example:   •  1)  Pixels  =  input;   •  2)  H1  learns  an  edge  or  line;   •  3)  H2  learns  a  corner  or  set  of  lines;   •  4)  H3  learns  two  groups  of  lines  forming  an  object   -­‐‑-­‐‑  a  face! •  Final  layer  classifies  feature  groups:  sunset,  elephant,   flower,  John,  Sarah. Deep-­‐‑Belief  Net  (DBN)
  • 15. The image part with relationship ID rId14 was not found in the file. •  2  DBNs. •  1st  DBN  *encodes*  data  into  vector  of  10-­‐‑30   numbers  =  Pre-­‐‑training. •  2nd  DBN  decodes  data  into  original  state. •  Backprop  only  happens  on  2nd  DBN •  2nd  is  the  fine-­‐‑tuning  stage  (reconstruction  entropy). •  Reduces  documents  or  images  to  compact  vectors  . •  Useful  in  search,  QA  and  information  retrieval. Deep  Autoencoder
  • 16. The image part with relationship ID rId14 was not found in the file. Deep  Autoencoder  Architecture
  • 17. The image part with relationship ID rId14 was not found in the file. Image  Search  Results
  • 18. The image part with relationship ID rId14 was not found in the file. •  Top-­‐‑down  &  hierarchical  rather  than  feed-­‐‑forward  (DBNs). •  Handles  sequence-­‐‑based  classification,  windows  of  several   events,  entire  scenes  (multiple  objects). •  Features  themselves  are  vectors.   •  A  tensor  =  a  multi-­‐‑dimensional  matrix,  or  multiple  matrices  of   the  same  size. Recursive  Neural  Tensor  Net
  • 19. The image part with relationship ID rId14 was not found in the file. RNTNs  &  Scene  Composition