SlideShare uma empresa Scribd logo
1 de 30
Introduction to Face Detection
By: Amar Jindal
NIT Patna
Face Detaction or Object Detection
 Combination of computer vision and image processing to find out object in
a image or video.
 The Object detection includes classifying an object in the image and
localize its poition in the image.
 This can be achived using image segmentation with AI.
 Classify the face.
 Localize the face
 List of facial
features.
How it Looks like?
Semantic
Segmentation
Instance
Segmentation
Types of Segmentation
Computer Vision Tasks
Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf
Classification using LeNet on MNIST dataset
Computer Vision Tasks
Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf
Classification + Localization
● Classification:
○ Input: Image
○ Output: Class label
○ Loss: Cross entropy (Softmaxlog)
○ Evaluation metric: Accuracy
● Localization:
○ Input: Image
○ Output: Box in the image (x, y, w, h)
○ Loss: L2 Loss (Euclidean distance)
○ Evaluation metric: Intersection over Union
● Classification + Localization:
○ Input: Image
○ Output: Class label + box in the image
○ Loss: Sum of both losses
Convolution
Operation
Poolin
g
Cross Entropy
 Measures the performance of a
 classification model whose output
 is a probability value between 0 &
1.
 Increases as the predicted P(e)
 diverges from the actual label.
Euclidean Distance
 This loss function is usually used for regression problems,It measures the
 distance between two distinct points.
Shallow
Features
Deep
Features
Feature
1
Feature
2
Classification + Localization: Model
Classification
Head:
● C Scores for C
classes
Localization Head:
● Class agnostic:
(x,y,w,h)
● Class specific:
(x,y,w,h) X C
Computer Vision Tasks
Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf
Intersection Over Union (IoU)
● Important measurement for object localization.
● Used in both training and evaluation.
IoU
Algorithm
Face Recognition using Siamese Networks
 This network allows to calculate the degree of similarity between two
inputs
 Since we can not have multiple images of the same person in
our database,
Yolo Based Face Detection
Each grid cell has 5 associated values. The first one is the probability p of that cell
containing the center of a face.The other 4 values are the (x_center, y_center, width,
height) of the detected face (relative to the cell).
 2x [8 filter convolutional layer on 288x288 image]
 Max pooling (288x288 to 144x144 feature map)
 2x [16 filter convolutional layer on 144x144 feature map]
 Max pooling (144x144 to 72x72 feature map)
 2x [32 filter convolutional layer on 72x72 feature map]
 Max pooling (72x72 to 36x36 feature map)
 2x [64 filter convolutional layer on 36x36 feature map]
 Max pooling (36x36 to 18x18 feature map)
 2x [128 filter convolutional layer on 18x18 feature map]
 Max pooling (18x18 to 9x9 feature map)
 4x [192 filter convolutional layer on 9x9 feature map]
 5 filter convolutional layer on 9x9 feature map for the final
grid
Auxiliary network
Outputs of the main network were not as accurate as expected. Hence, a small
CNN network was implemented to take as input a small image containing a face
Object Detection
l
● Input: Image
● Output: For each object class c and each
image i,an algorithm returns predicted
detections: ocations
with confidence scores .
Object Detection: Evaluation
● Today new metrics are emerging
○ Averaging precision over all IoU thresholds: 0.5:0.05:0.95
○ Averaging precision for different object sizes (small, medium, big)
○ Averaging recall as a metric to measure object proposal quality.
Average Precision (AP)
● [In the vision community] AP is the estimated area under the PR curve
Mean Average Precision (mAP)
● The winner of each object class is the team with the highest average precision
● The winner of the challenge is the team with the highest mean Average
Precision (mAP) across all classes.
Object Detection: Evaluation
● Mean Average Precision (mAP) across all classes, based on Average Precision
(AP) per class, based on Precision and Recall.
Applications
 Face Unlock
 Person Identification
 Video Survillance
Doubts..?

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detection
 
Anchor free object detection by deep learning
Anchor free object detection by deep learningAnchor free object detection by deep learning
Anchor free object detection by deep learning
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
 
You only look once
You only look onceYou only look once
You only look once
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
 
Yolov3
Yolov3Yolov3
Yolov3
 
PR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental ImprovementPR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental Improvement
 
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)
 
You only look once: Unified, real-time object detection (UPC Reading Group)
You only look once: Unified, real-time object detection (UPC Reading Group)You only look once: Unified, real-time object detection (UPC Reading Group)
You only look once: Unified, real-time object detection (UPC Reading Group)
 
Object detection
Object detectionObject detection
Object detection
 
Yolo
YoloYolo
Yolo
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
 
Anatomy of YOLO - v1
Anatomy of YOLO - v1Anatomy of YOLO - v1
Anatomy of YOLO - v1
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object Tracking
 
Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421Yolo v2 ai_tech_20190421
Yolo v2 ai_tech_20190421
 
PR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox DetectorPR-132: SSD: Single Shot MultiBox Detector
PR-132: SSD: Single Shot MultiBox Detector
 
Object Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkObject Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning Framework
 
Image classification using convolutional neural network
Image classification using convolutional neural networkImage classification using convolutional neural network
Image classification using convolutional neural network
 

Semelhante a Introduction to object detection

Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Universitat Politècnica de Catalunya
 
project_final_seminar
project_final_seminarproject_final_seminar
project_final_seminar
MUKUL BICHKAR
 
Skin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptxSkin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptx
VishalLabde
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
Hirantha Pradeep
 
Application of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysisApplication of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysis
IAEME Publication
 
Application of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysisApplication of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysis
IAEME Publication
 

Semelhante a Introduction to object detection (20)

物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
Plant Disease Detection.pptx
Plant Disease Detection.pptxPlant Disease Detection.pptx
Plant Disease Detection.pptx
 
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
 
cvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptxcvpresentation-190812154654 (1).pptx
cvpresentation-190812154654 (1).pptx
 
ppt 20BET1024.pptx
ppt 20BET1024.pptxppt 20BET1024.pptx
ppt 20BET1024.pptx
 
Computer Vision - Real Time Face Recognition using Open CV and Python
Computer Vision - Real Time Face Recognition using Open CV and PythonComputer Vision - Real Time Face Recognition using Open CV and Python
Computer Vision - Real Time Face Recognition using Open CV and Python
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
Avihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slidesAvihu Efrat's Viola and Jones face detection slides
Avihu Efrat's Viola and Jones face detection slides
 
object-detection.pptx
object-detection.pptxobject-detection.pptx
object-detection.pptx
 
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in Images
 
J017426467
J017426467J017426467
J017426467
 
project_final_seminar
project_final_seminarproject_final_seminar
project_final_seminar
 
Week2- Deep Learning Intuition.pptx
Week2- Deep Learning Intuition.pptxWeek2- Deep Learning Intuition.pptx
Week2- Deep Learning Intuition.pptx
 
Convolution Neural Network (CNN)
Convolution Neural Network (CNN)Convolution Neural Network (CNN)
Convolution Neural Network (CNN)
 
Visualizing the Model Selection Process
Visualizing the Model Selection ProcessVisualizing the Model Selection Process
Visualizing the Model Selection Process
 
Skin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptxSkin melanoma stage detection - CNN.pptx
Skin melanoma stage detection - CNN.pptx
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
 
Application of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysisApplication of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysis
 
Application of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysisApplication of gaussian filter with principal component analysis
Application of gaussian filter with principal component analysis
 
“Can You See What I See? The Power of Deep Learning,” a Presentation from Str...
“Can You See What I See? The Power of Deep Learning,” a Presentation from Str...“Can You See What I See? The Power of Deep Learning,” a Presentation from Str...
“Can You See What I See? The Power of Deep Learning,” a Presentation from Str...
 

Último

Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
MayuraD1
 

Último (20)

Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptxOrlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
Orlando’s Arnold Palmer Hospital Layout Strategy-1.pptx
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Computer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to ComputersComputer Lecture 01.pptxIntroduction to Computers
Computer Lecture 01.pptxIntroduction to Computers
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Wadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxWadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptx
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 

Introduction to object detection

  • 1. Introduction to Face Detection By: Amar Jindal NIT Patna
  • 2. Face Detaction or Object Detection  Combination of computer vision and image processing to find out object in a image or video.  The Object detection includes classifying an object in the image and localize its poition in the image.  This can be achived using image segmentation with AI.
  • 3.  Classify the face.  Localize the face  List of facial features. How it Looks like?
  • 5. Computer Vision Tasks Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf
  • 6. Classification using LeNet on MNIST dataset
  • 7. Computer Vision Tasks Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf
  • 8. Classification + Localization ● Classification: ○ Input: Image ○ Output: Class label ○ Loss: Cross entropy (Softmaxlog) ○ Evaluation metric: Accuracy ● Localization: ○ Input: Image ○ Output: Box in the image (x, y, w, h) ○ Loss: L2 Loss (Euclidean distance) ○ Evaluation metric: Intersection over Union ● Classification + Localization: ○ Input: Image ○ Output: Class label + box in the image ○ Loss: Sum of both losses
  • 9.
  • 12. Cross Entropy  Measures the performance of a  classification model whose output  is a probability value between 0 & 1.  Increases as the predicted P(e)  diverges from the actual label.
  • 13. Euclidean Distance  This loss function is usually used for regression problems,It measures the  distance between two distinct points.
  • 15. Classification + Localization: Model Classification Head: ● C Scores for C classes Localization Head: ● Class agnostic: (x,y,w,h) ● Class specific: (x,y,w,h) X C
  • 16. Computer Vision Tasks Source: CS231n Object detection http://cs231n.stanford.edu/slides/2016/winter1516_lecture8.pdf
  • 17. Intersection Over Union (IoU) ● Important measurement for object localization. ● Used in both training and evaluation.
  • 19. Face Recognition using Siamese Networks  This network allows to calculate the degree of similarity between two inputs
  • 20.  Since we can not have multiple images of the same person in our database,
  • 21. Yolo Based Face Detection Each grid cell has 5 associated values. The first one is the probability p of that cell containing the center of a face.The other 4 values are the (x_center, y_center, width, height) of the detected face (relative to the cell).
  • 22.  2x [8 filter convolutional layer on 288x288 image]  Max pooling (288x288 to 144x144 feature map)  2x [16 filter convolutional layer on 144x144 feature map]  Max pooling (144x144 to 72x72 feature map)  2x [32 filter convolutional layer on 72x72 feature map]  Max pooling (72x72 to 36x36 feature map)  2x [64 filter convolutional layer on 36x36 feature map]  Max pooling (36x36 to 18x18 feature map)  2x [128 filter convolutional layer on 18x18 feature map]  Max pooling (18x18 to 9x9 feature map)  4x [192 filter convolutional layer on 9x9 feature map]  5 filter convolutional layer on 9x9 feature map for the final grid
  • 23. Auxiliary network Outputs of the main network were not as accurate as expected. Hence, a small CNN network was implemented to take as input a small image containing a face
  • 24. Object Detection l ● Input: Image ● Output: For each object class c and each image i,an algorithm returns predicted detections: ocations with confidence scores .
  • 25. Object Detection: Evaluation ● Today new metrics are emerging ○ Averaging precision over all IoU thresholds: 0.5:0.05:0.95 ○ Averaging precision for different object sizes (small, medium, big) ○ Averaging recall as a metric to measure object proposal quality.
  • 26. Average Precision (AP) ● [In the vision community] AP is the estimated area under the PR curve
  • 27. Mean Average Precision (mAP) ● The winner of each object class is the team with the highest average precision ● The winner of the challenge is the team with the highest mean Average Precision (mAP) across all classes.
  • 28. Object Detection: Evaluation ● Mean Average Precision (mAP) across all classes, based on Average Precision (AP) per class, based on Precision and Recall.
  • 29. Applications  Face Unlock  Person Identification  Video Survillance