SlideShare uma empresa Scribd logo
1 de 38
Robust Object Recognition with Cortex-like Mechanisms (PAMI, 06) Presented by Ala Stolpnik T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, and T. Poggio
Introduction ,[object Object],[object Object],[object Object]
Scene Understanding Watch Out! Probably Hanging Out
The StreetScenes Database 3,547 Images, all taken with the same  camera , of the same type of  scene , and hand labeled with the same  objects , using the same labeling  rules . Database Performance Measures Approach sky road tree building bicycle pedestrian car Object 2562 3400 4932 5067 209 1449 5799 # Labeled Examples
More StreetScenes Examples Database Performance Measures Approach
Even More Street Scenes Examples Database Performance Measures Approach
Challenges: In-class variability Partial, or weak labeling Includes Rigid, Articulated and Amorphous objects  Database Performance Measures Approach
Challenges: In-class variability Partial, or weak labeling Includes  Rigid ,  Articulated  and  Amorphous  objects   Database Performance Measures Approach
Texture Sample Locations Building, Tree, Road and Sky  Hand-drawn Labels Training  Sample   Locations Database Performance Measures Approach
Input image Segmented image Texture classification Windowing Crop classification Output Texture-based objects pathway (e.g., trees, road..) Shape-based objects pathway (e.g., pedestrians, cars..) car car ped Approach Two Slightly Different Pathways
Texture-based Object Detection Input image Classification Smoothing Over Segmentation Tree / Not-Tree Standard Model  Feature Extraction Classification Database Performance Measures Approach Feature Vector Decision Feature Vector Decision
Shape-based Object Detection Windowing Crop classification Output car car ped Car / Not-Car Standard Model  Feature Extraction Statistical learning Classification Database Performance Measures Approach Feature Vector Decision
Standard Model Features from a neuroscience view. Retina Complexity Approach
Standard Model Features from a neuroscience view. ,[object Object],[object Object],[object Object],[object Object],C1 S2 C2 S1
Overview ,[object Object],[object Object],[object Object],Approach
S1 - Gabor filter ,[object Object],[object Object],[object Object],[object Object],Approach
Gabor filter - rotation Input sample Thetha = 0 Thetha = 90 Approach We use 4 different orientations: 0, 45, 90, 135
Gabor filter - scaling Lambda = 3.5 Lambda = 22.8 Lambda = 10.3 Approach We use 16 different scales from Lambda=3.5 to 22.8
S1 Input Image ,[object Object],C1 S2 C2 Approach Apply Gebor filter to gray scale image
Apply Gebor filter to gray scale image Input Image S1 C1 S2 C2 Approach ,[object Object],[object Object]
S1 S1 C1 S2 C2 Approach ,[object Object],[object Object],[object Object]
Input Image S1 C1 S1 C1 S2 C2 Local maximization takes place in each orientation channel separately, and also over nearby scales.  Approach
C1 S1 C1 S2 C2 Approach ,[object Object],[object Object],[object Object]
S1 -> C1 Approach S1 C1 S2 C2
S1 C1 S2 C2 Approach ,[object Object],[object Object],[object Object],C1 S2 Prototype  =
S2 Approach ,[object Object],[object Object],[object Object],[object Object],S1 C1 S2 C2
C2 C2 is simply the global maximum of the S2 response image. S1 C1 S2 C2 Each Prototype gives rise to one C2 value. C2 = max ( ) Size of patch, sampling rate, etc. are Parameters of the system. Approach
Overview ,[object Object],Approach
The learning stage ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Approach
Model overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Approach
Overview ,[object Object],[object Object],[object Object]
StreetScenes Database. Subjective Results Results
StreetScenes Database. Subjective Results Results
C2 vs. Sift – number of features Results
C2 vs. Sift – number of training examples Results
Object specific vs. universal features  Results
Conclusion ,[object Object],[object Object],[object Object],[object Object]
Thanks!

Mais conteúdo relacionado

Mais procurados

Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
Matthias Trapp
 
WE2.TO9.2.pptx
WE2.TO9.2.pptxWE2.TO9.2.pptx
WE2.TO9.2.pptx
grssieee
 

Mais procurados (20)

Orb feature by nitin
Orb feature by nitinOrb feature by nitin
Orb feature by nitin
 
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
 
Evaluation of illumination uniformity metrics in design and optimization of ...
 Evaluation of illumination uniformity metrics in design and optimization of ... Evaluation of illumination uniformity metrics in design and optimization of ...
Evaluation of illumination uniformity metrics in design and optimization of ...
 
ICRA 2015 interactive presentation
ICRA 2015 interactive presentationICRA 2015 interactive presentation
ICRA 2015 interactive presentation
 
Optical modeling and design of freeform surfaces using anisotropic Radial Bas...
Optical modeling and design of freeform surfaces using anisotropic Radial Bas...Optical modeling and design of freeform surfaces using anisotropic Radial Bas...
Optical modeling and design of freeform surfaces using anisotropic Radial Bas...
 
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
 
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
Build Your Own 3D Scanner: 3D Scanning with Structured LightingBuild Your Own 3D Scanner: 3D Scanning with Structured Lighting
Build Your Own 3D Scanner: 3D Scanning with Structured Lighting
 
Build Your Own 3D Scanner: The Mathematics of 3D Triangulation
Build Your Own 3D Scanner: The Mathematics of 3D TriangulationBuild Your Own 3D Scanner: The Mathematics of 3D Triangulation
Build Your Own 3D Scanner: The Mathematics of 3D Triangulation
 
PCL (Point Cloud Library)
PCL (Point Cloud Library)PCL (Point Cloud Library)
PCL (Point Cloud Library)
 
Design and optimization of compact freeform lens array for laser beam splitti...
Design and optimization of compact freeform lens array for laser beam splitti...Design and optimization of compact freeform lens array for laser beam splitti...
Design and optimization of compact freeform lens array for laser beam splitti...
 
Hidden line removal algorithm
Hidden line removal algorithmHidden line removal algorithm
Hidden line removal algorithm
 
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
Computer Graphics and Multimedia Techniques Paper (RTU VI Semester)
 
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present..."Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
 
Point cloud library
Point cloud libraryPoint cloud library
Point cloud library
 
Low-Resolution Contour Recognition for Hexagonal Grid Images
Low-Resolution Contour Recognition for Hexagonal Grid Images Low-Resolution Contour Recognition for Hexagonal Grid Images
Low-Resolution Contour Recognition for Hexagonal Grid Images
 
OpenCV presentation series- part 5
OpenCV presentation series- part 5OpenCV presentation series- part 5
OpenCV presentation series- part 5
 
Hidden Surfaces
Hidden SurfacesHidden Surfaces
Hidden Surfaces
 
Hidden surface removal algorithm
Hidden surface removal algorithmHidden surface removal algorithm
Hidden surface removal algorithm
 
WE2.TO9.2.pptx
WE2.TO9.2.pptxWE2.TO9.2.pptx
WE2.TO9.2.pptx
 
Computer Graphics - Hidden Line Removal Algorithm
Computer Graphics - Hidden Line Removal AlgorithmComputer Graphics - Hidden Line Removal Algorithm
Computer Graphics - Hidden Line Removal Algorithm
 

Semelhante a Ala Stolpnik's Standard Model talk

Real-time Face Recognition & Detection Systems 1
Real-time Face Recognition & Detection Systems 1Real-time Face Recognition & Detection Systems 1
Real-time Face Recognition & Detection Systems 1
Suvadip Shome
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
Hirantha Pradeep
 
Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...
Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...
Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...
Tarek Gaber
 
Questions On The Equation For Regression
Questions On The Equation For RegressionQuestions On The Equation For Regression
Questions On The Equation For Regression
Tiffany Sandoval
 
nilesh-Mtech-presentation
nilesh-Mtech-presentationnilesh-Mtech-presentation
nilesh-Mtech-presentation
Nilesh Heda
 

Semelhante a Ala Stolpnik's Standard Model talk (20)

Scalable machine learning
Scalable machine learningScalable machine learning
Scalable machine learning
 
2D/Multi-view Segmentation and Tracking
2D/Multi-view Segmentation and Tracking2D/Multi-view Segmentation and Tracking
2D/Multi-view Segmentation and Tracking
 
Visualizing the Model Selection Process
Visualizing the Model Selection ProcessVisualizing the Model Selection Process
Visualizing the Model Selection Process
 
Real-time Face Recognition & Detection Systems 1
Real-time Face Recognition & Detection Systems 1Real-time Face Recognition & Detection Systems 1
Real-time Face Recognition & Detection Systems 1
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...
Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...
Sift based arabic sign language recognition aecia 2014 –november17-19, addis ...
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
Object Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online LearningObject Tracking with Instance Matching and Online Learning
Object Tracking with Instance Matching and Online Learning
 
Lec07 aggregation-and-retrieval-system
Lec07 aggregation-and-retrieval-systemLec07 aggregation-and-retrieval-system
Lec07 aggregation-and-retrieval-system
 
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
Tracking Robustness and Green View Index Estimation of Augmented and Diminish...
 
Questions On The Equation For Regression
Questions On The Equation For RegressionQuestions On The Equation For Regression
Questions On The Equation For Regression
 
Mvs adas
Mvs adasMvs adas
Mvs adas
 
final_presentation
final_presentationfinal_presentation
final_presentation
 
Vehicle Recognition Using VIBE and SVM
Vehicle Recognition Using VIBE and SVMVehicle Recognition Using VIBE and SVM
Vehicle Recognition Using VIBE and SVM
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMVEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM
 
VEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVMVEHICLE RECOGNITION USING VIBE AND SVM
VEHICLE RECOGNITION USING VIBE AND SVM
 
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
A Hybrid Trademark Retrieval System Using Four-Gray-Level Zernike Moments & ...
 
nilesh-Mtech-presentation
nilesh-Mtech-presentationnilesh-Mtech-presentation
nilesh-Mtech-presentation
 

Mais de wolf

A bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categoriesA bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categories
wolf
 
Moshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfaceMoshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenface
wolf
 
Object recognition seminar S2006E01
Object recognition seminar S2006E01Object recognition seminar S2006E01
Object recognition seminar S2006E01
wolf
 

Mais de wolf (13)

Eigenfaces and Fisherfaces
Eigenfaces and FisherfacesEigenfaces and Fisherfaces
Eigenfaces and Fisherfaces
 
Shai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble trackingShai Avidan's Support vector tracking and ensemble tracking
Shai Avidan's Support vector tracking and ensemble tracking
 
Constellation Models and Unsupervised Learning for Object Class Recognition
Constellation Models and Unsupervised Learning for Object Class RecognitionConstellation Models and Unsupervised Learning for Object Class Recognition
Constellation Models and Unsupervised Learning for Object Class Recognition
 
A bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categoriesA bayesian framework for unsupervised one-shot learning of object categories
A bayesian framework for unsupervised one-shot learning of object categories
 
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image DescriptorsPCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
 
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
 
Recovering 3D human body configurations using shape contexts
Recovering 3D human body configurations using shape contextsRecovering 3D human body configurations using shape contexts
Recovering 3D human body configurations using shape contexts
 
Rafi Zachut's slides on class specific segmentation
Rafi Zachut's slides on class specific segmentationRafi Zachut's slides on class specific segmentation
Rafi Zachut's slides on class specific segmentation
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slides
 
Michal Erel's SIFT presentation
Michal Erel's SIFT presentationMichal Erel's SIFT presentation
Michal Erel's SIFT presentation
 
Gil Shapira's Active Appearance Model slides
Gil Shapira's Active Appearance Model slidesGil Shapira's Active Appearance Model slides
Gil Shapira's Active Appearance Model slides
 
Moshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfaceMoshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenface
 
Object recognition seminar S2006E01
Object recognition seminar S2006E01Object recognition seminar S2006E01
Object recognition seminar S2006E01
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Ala Stolpnik's Standard Model talk

  • 1. Robust Object Recognition with Cortex-like Mechanisms (PAMI, 06) Presented by Ala Stolpnik T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, and T. Poggio
  • 2.
  • 3. Scene Understanding Watch Out! Probably Hanging Out
  • 4. The StreetScenes Database 3,547 Images, all taken with the same camera , of the same type of scene , and hand labeled with the same objects , using the same labeling rules . Database Performance Measures Approach sky road tree building bicycle pedestrian car Object 2562 3400 4932 5067 209 1449 5799 # Labeled Examples
  • 5. More StreetScenes Examples Database Performance Measures Approach
  • 6. Even More Street Scenes Examples Database Performance Measures Approach
  • 7. Challenges: In-class variability Partial, or weak labeling Includes Rigid, Articulated and Amorphous objects Database Performance Measures Approach
  • 8. Challenges: In-class variability Partial, or weak labeling Includes Rigid , Articulated and Amorphous objects Database Performance Measures Approach
  • 9. Texture Sample Locations Building, Tree, Road and Sky Hand-drawn Labels Training Sample Locations Database Performance Measures Approach
  • 10. Input image Segmented image Texture classification Windowing Crop classification Output Texture-based objects pathway (e.g., trees, road..) Shape-based objects pathway (e.g., pedestrians, cars..) car car ped Approach Two Slightly Different Pathways
  • 11. Texture-based Object Detection Input image Classification Smoothing Over Segmentation Tree / Not-Tree Standard Model Feature Extraction Classification Database Performance Measures Approach Feature Vector Decision Feature Vector Decision
  • 12. Shape-based Object Detection Windowing Crop classification Output car car ped Car / Not-Car Standard Model Feature Extraction Statistical learning Classification Database Performance Measures Approach Feature Vector Decision
  • 13. Standard Model Features from a neuroscience view. Retina Complexity Approach
  • 14.
  • 15.
  • 16.
  • 17. Gabor filter - rotation Input sample Thetha = 0 Thetha = 90 Approach We use 4 different orientations: 0, 45, 90, 135
  • 18. Gabor filter - scaling Lambda = 3.5 Lambda = 22.8 Lambda = 10.3 Approach We use 16 different scales from Lambda=3.5 to 22.8
  • 19.
  • 20.
  • 21.
  • 22. Input Image S1 C1 S1 C1 S2 C2 Local maximization takes place in each orientation channel separately, and also over nearby scales. Approach
  • 23.
  • 24. S1 -> C1 Approach S1 C1 S2 C2
  • 25.
  • 26.
  • 27. C2 C2 is simply the global maximum of the S2 response image. S1 C1 S2 C2 Each Prototype gives rise to one C2 value. C2 = max ( ) Size of patch, sampling rate, etc. are Parameters of the system. Approach
  • 28.
  • 29.
  • 30.
  • 31.
  • 34. C2 vs. Sift – number of features Results
  • 35. C2 vs. Sift – number of training examples Results
  • 36. Object specific vs. universal features Results
  • 37.

Notas do Editor

  1. אני אדבר על שיטה לזיהוי עצמים בתמונות . נתרכז במיוחד בזיהוי עצמים בסצנות של רחוב .