Talk 2009-monash-seminar-intelligent-video-surveillance

Gippsland School of Information Technology (GSIT)

Behaviour Recognition
Framework for Intelligent
Visual Surveillance
06 April 2009
www.monash.edu.au

Project Team
• Mahfuzul Haque, PhD student (2 yrs, 1 m)
• A/Prof. Manzur Murshed
• Dr. Manoranjan Paul


Project Motivation
“Behaviour Recognition Framework for
Intelligent Visual Surveillance”

Why “Intelligent” Surveillance?
Why “Behaviour Recognition”?
What type of “Behaviours”?


Surveillance Everywhere

Are we really protected?


Too Many Cameras

Deployment of a large number of surveillance cameras in recent years
London Heathrow Airport has more than 5,000 cameras!

Behind the Scene: Worried Human Monitor

Dependence on human monitors has increased.
Reliability of surveillance systems has decreased.

Project Goals
• Aiding human monitors by automatic detection of specific abnormal behaviours
• Reducing dependence on human monitors
• Improving the reliability of surveillance systems for ensuring human security

Project Scope
Group Behaviours
• Mob Violence
• Crowding
• Sudden Group Formation
• Sudden Group Deformation
• Shooting
• Panic Driven Behaviours


Research Question

How to recognize specific group behaviours from surveillance video streams in real time?
Research Area
- Computer Vision
- Application of Machine Learning

System Architecture

[Diagram: Surveillance Video Stream → Framework → Behaviour Profile]


Behaviour Profile
Surveillance Video Stream (System Input)
[Figure: timeline of the input stream with time markers 0, 20, 60, 140 and 320, and behaviour labels in order: Unknown, Group Appearing, Group Appearing, Group Merging, Group Splitting]
Behaviour Profile (System Output)

Behaviour Recognition Framework

Framework Components
• Background Modelling
• Frame Level Feature Extraction
• Temporal Feature Extraction
• Behaviour Classification


[Diagram: Background Modelling → Frame Level Feature Extraction → Temporal Feature Extraction → Behaviour Classification]


Background Modelling

How to extract the active regions from a surveillance video stream?

Background Subtraction
Current frame − Background = Moving foreground
Challenges!!
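The subtraction above can be sketched on a toy greyscale frame. This is only a minimal illustration, not the method used in the talk: frames are nested lists of intensities, and the function name `subtract_background` and the threshold of 25 are assumptions made for the example.

```python
def subtract_background(frame, background, threshold=25):
    """Return a binary mask: 1 where the frame differs from the background."""
    mask = []
    for frame_row, bg_row in zip(frame, background):
        mask.append([
            1 if abs(pixel - bg_pixel) > threshold else 0
            for pixel, bg_pixel in zip(frame_row, bg_row)
        ])
    return mask


# Toy 3x4 greyscale images (hypothetical values in the range 0-255).
background = [
    [100, 100, 100, 100],
    [100, 100, 100, 100],
    [100, 100, 100, 100],
]
current_frame = [
    [102, 100, 99, 100],
    [100, 200, 210, 100],
    [100, 205, 100, 98],
]

foreground_mask = subtract_background(current_frame, background)
for row in foreground_mask:
    print(row)
```

A single fixed background image like this cannot absorb gradual scene changes, which is one reason adaptive models such as Gaussian mixtures are used instead.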


[Figure: three plots of P(x) against x (pixel intensity), each showing Gaussian curves with mean µ and variance σ². Curve labels: Sky, Cloud, Leaf, Moving Person; Road, Shadow, Moving Car; Floor, Shadow, Walking People]
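For reference, each curve in the figure is a Gaussian density over the pixel intensity x with mean µ and variance σ². The formula is not written on the slide; in standard notation it is:

```latex
P(x) = \frac{1}{\sqrt{2\pi\sigma^{2}}}
       \exp\!\left( -\frac{(x-\mu)^{2}}{2\sigma^{2}} \right)
```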


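Background Model
[Figure: Background Model and Current frame give the Moving foreground]
[Figure: Frame 1 to Frame N; labels in sequence: road, shadow, car, shadow, road]

Background Models
Models are ordered by ω/σ
1. road: weight ω1, variance σ1², mean µ1 (65%)
2. shadow: weight ω2, variance σ2², mean µ2 (20%)
3. car: weight ω3, variance σ3², mean µ3 (15%)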
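The ordering rule can be illustrated with a small sketch. It follows the general Stauffer and Grimson style of selection, where components are sorted by ω/σ and the leading ones whose weights sum past a threshold are treated as background. Whether the method proposed in the talk selects background components in exactly this way is not stated on the slide. Only the weights 65%, 20% and 15% are taken from the slide; the means, the σ values, the threshold `T = 0.8` and all names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class GaussianComponent:
    label: str     # what the component happens to model (illustration only)
    weight: float  # omega
    mean: float    # mu
    sigma: float   # standard deviation


def select_background(components, T=0.8):
    """Sort by weight/sigma, then keep leading components until weights exceed T."""
    ordered = sorted(components, key=lambda c: c.weight / c.sigma, reverse=True)
    background, cumulative = [], 0.0
    for component in ordered:
        background.append(component)
        cumulative += component.weight
        if cumulative > T:
            break
    return ordered, background


def matches(component, x, k=2.5):
    """A value matches a component if it lies within k standard deviations."""
    return abs(x - component.mean) <= k * component.sigma


# Means and sigmas are made-up values; weights follow the slide.
components = [
    GaussianComponent("car", 0.15, 200.0, 20.0),
    GaussianComponent("road", 0.65, 120.0, 4.0),
    GaussianComponent("shadow", 0.20, 80.0, 6.0),
]
ordered, background = select_background(components)


def is_foreground(x):
    """A pixel value is foreground if no background component explains it."""
    return not any(matches(c, x) for c in background)


print([c.label for c in ordered])
print([c.label for c in background])
print(is_foreground(122), is_foreground(200))
```

With these numbers the stable, heavily weighted components come first, so a value near the car's mean is reported as foreground while a value near the road's mean is not.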

[Figure: detection results. Columns: First Frame, Test Frame, Ground Truth, S&G, Lee, Proposed. Rows: (1) PETS2000; (2) PETS2006-B1; (3) PETS2006-B2; (4) PETS2006-B3; and (5) PETS2006-B4.]



Frame Level Feature Extraction

• Feature Categories:
– Count
– Area
– Density
– Bounding Box
– Filling Ratio
– Aspect Ratio
• 30 frame level features
[Figure: Bounding Boxes]

Foreground Count
• FC (Foreground Count)

Foreground Area
• TFA (Total Foreground Area)
• AFA (Average Foreground Area)
• VFA (Variance of Foreground Area)
• MAXFA (Maximum Foreground Area)
• MINFA (Minimum Foreground Area)

Foreground Density
• AFD (Average Foreground Density)
• VFD (Variance of Foreground Density)

Filling Ratio
• TFR (Total Filling Ratio)
• AFR (Average Filling Ratio)
• VFR (Variance of Filling Ratio)
• MAXFR (Maximum Filling Ratio)
• MINFR (Minimum Filling Ratio)

Bounding Box – Area
• TBBA (Total Bounding Box Area)
• ABBA (Average Bounding Box Area)
• VBBA (Variance of Bounding Box Area)
• MAXBBA (Maximum Bounding Box Area)
• MINBBA (Minimum Bounding Box Area)

Bounding Box – Width
• ABBW (Average Bounding Box Width)
• VBBW (Variance of Bounding Box Width)
• MAXBBW (Maximum Bounding Box Width)
• MINBBW (Minimum Bounding Box Width)

Bounding Box – Height
• ABBH (Average Bounding Box Height)
• VBBH (Variance of Bounding Box Height)
• MAXBBH (Maximum Bounding Box Height)
• MINBBH (Minimum Bounding Box Height)

Aspect Ratio
• AAR (Average Aspect Ratio)
• VAR (Variance of Aspect Ratio)
• MAXAR (Maximum Aspect Ratio)
• MINAR (Minimum Aspect Ratio)
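A few of the features listed above can be sketched from a binary foreground mask. This is a hedged illustration rather than the talk's implementation: the slides do not define the features, so the sketch assumes 4-connected blobs, filling ratio as blob area divided by bounding box area, aspect ratio as width divided by height, and population variance. Density is left out because its definition is not given. The function names are hypothetical.

```python
from statistics import mean, pvariance


def find_blobs(mask):
    """Label 4-connected foreground blobs; return (area, width, height) per blob."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    blobs = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != 1 or seen[r][c]:
                continue
            stack, pixels = [(r, c)], []
            seen[r][c] = True
            while stack:  # iterative flood fill from the seed pixel
                y, x = stack.pop()
                pixels.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
            ys = [p[0] for p in pixels]
            xs = [p[1] for p in pixels]
            width = max(xs) - min(xs) + 1
            height = max(ys) - min(ys) + 1
            blobs.append((len(pixels), width, height))
    return blobs


def frame_features(mask):
    """Compute a subset of the frame level features for one mask."""
    blobs = find_blobs(mask)
    if not blobs:
        return {"FC": 0}
    areas = [a for a, _, _ in blobs]
    box_areas = [w * h for _, w, h in blobs]
    filling = [a / (w * h) for a, w, h in blobs]
    aspect = [w / h for _, w, h in blobs]
    return {
        "FC": len(blobs),
        "TFA": sum(areas), "AFA": mean(areas), "VFA": pvariance(areas),
        "MAXFA": max(areas), "MINFA": min(areas),
        "TBBA": sum(box_areas), "ABBA": mean(box_areas),
        "AFR": mean(filling), "MAXFR": max(filling), "MINFR": min(filling),
        "AAR": mean(aspect), "MAXAR": max(aspect), "MINAR": min(aspect),
    }


# Toy mask with two blobs (hypothetical data).
mask = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 1, 1],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
features = frame_features(mask)
print(features)
```

Areas here are raw pixel counts; the talk's plots show TFA as a percentage, which would only need a division by the frame size.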



Temporal Feature Extraction

• Fixed length, partially overlapped sliding window
• Temporal data smoothing – polynomial curve fitting
• 9 temporal features for each frame level feature
• Output: 270 temporal features
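The windowing step and the feature count can be sketched as follows. The window length of 100 frames matches the figure axis used in the talk; the step of 50 frames, the function name and the input series are assumptions, and the polynomial smoothing step is not shown.

```python
def sliding_windows(series, window=100, step=50):
    """Yield (start_index, values) for fixed-length, partially overlapping windows."""
    for start in range(0, len(series) - window + 1, step):
        yield start, series[start:start + window]


# Hypothetical per-frame values of one frame level feature, e.g. TFA.
tfa_per_frame = [float(i % 40) for i in range(250)]

starts = [start for start, _ in sliding_windows(tfa_per_frame)]
lengths = [len(values) for _, values in sliding_windows(tfa_per_frame)]
print(starts)

# 30 frame level features, each summarised by 9 temporal features.
FRAME_LEVEL_FEATURES = 30
TEMPORAL_FEATURES_PER_SERIES = 9
total_temporal_features = FRAME_LEVEL_FEATURES * TEMPORAL_FEATURES_PER_SERIES
print(total_temporal_features)
```

With a step smaller than the window, consecutive windows share frames, so a behaviour that straddles a window boundary is still seen whole by at least one window.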


TFA (Total Foreground Area)

Temporal Features
• MAX
• MIN
• AVG
• VAR
• RATE
• TIME(MAX)
• TIME(MIN)
• D = TIME(MAX) - TIME(MIN)
• SLOPE ( D/2 )

[Figure: TFA (%) against Time (window = 100 frames), with MAX, MIN, TIME(MAX) and TIME(MIN) marked on the curve]
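Most of the temporal features listed above are simple statistics of one window. The sketch below computes seven of them for a single feature series; RATE and SLOPE are omitted because the slide does not define them precisely. It assumes time is the frame index within the window, that ties resolve to the first occurrence, and that VAR is the population variance. The function name and the data are hypothetical.

```python
from statistics import mean, pvariance


def temporal_features(window_values):
    """Compute window statistics named on the slide for one feature series."""
    max_value = max(window_values)
    min_value = min(window_values)
    time_of_max = window_values.index(max_value)  # first frame reaching the max
    time_of_min = window_values.index(min_value)  # first frame reaching the min
    return {
        "MAX": max_value,
        "MIN": min_value,
        "AVG": mean(window_values),
        "VAR": pvariance(window_values),
        "TIME(MAX)": time_of_max,
        "TIME(MIN)": time_of_min,
        "D": time_of_max - time_of_min,
    }


# A short, made-up window of TFA values (a real window would hold 100 frames).
window = [2.0, 4.0, 9.0, 7.0, 1.0, 3.0]
window_features = temporal_features(window)
print(window_features)
```

The sign of D records whether the peak precedes or follows the trough, which is one plausible reason the timing features rank highly for group behaviours.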


Behaviour Classification

• Individual classifiers for each behaviour
• Supervised training
• Feature ranking
• Top 100 features from 270 features
• Dimension reduction (PCA)
• Max dimension 30
• SVM classifier
• Output: Behaviour Profile
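The feature ranking step can be sketched for one binary (behaviour versus not behaviour) classifier. The slides do not name the ranking criterion, so the Fisher score used here is an assumption, as are the function names, the feature names and the toy data. The PCA and SVM stages are not shown; in the talk the selection keeps 100 of 270 features, whereas this example keeps 2 of 3.

```python
from statistics import mean, pvariance


def fisher_score(values, labels):
    """Squared gap between class means divided by the summed class variances."""
    positives = [v for v, y in zip(values, labels) if y == 1]
    negatives = [v for v, y in zip(values, labels) if y == 0]
    gap = (mean(positives) - mean(negatives)) ** 2
    spread = pvariance(positives) + pvariance(negatives)
    if spread > 0:
        return gap / spread
    # Zero spread: perfectly separated if the means differ, useless otherwise.
    return float("inf") if gap > 0 else 0.0


def top_features(samples, labels, names, k):
    """Return the k feature names with the highest scores, best first."""
    scores = {}
    for index, name in enumerate(names):
        column = [sample[index] for sample in samples]
        scores[name] = fisher_score(column, labels)
    return sorted(names, key=lambda name: scores[name], reverse=True)[:k]


# Made-up training windows: one row per window, one column per temporal feature.
names = ["TIME(MAX)-VFD", "AVG-TFA", "MIN-MINAR"]
samples = [
    [10.0, 5.0, 0.4],
    [12.0, 6.0, 0.9],
    [80.0, 5.5, 0.5],
    [85.0, 6.5, 0.8],
]
labels = [0, 0, 1, 1]  # 1 = behaviour present in the window

selected = top_features(samples, labels, names, k=2)
print(selected)
```

Ranking each behaviour's features separately fits the use of individual classifiers, since different behaviours need not share the same informative features.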

Experiments
GROUP FORMING
• Accuracy: 0.9767
• Top 3 features
– TIME(MAX)-VFD
– TIME(MAX)-AFD
– TIME(MAX) - TIME(MIN)-VFD

GROUP SPLITTING AND SPREADING
• Top 3 features
– TIME(MAX)-VFD
– TIME(MIN)-ABBA
– TIME(MIN)-AFA

BLOCKED EXIT
• Top 3 features
– TIME(MIN)-TFA
– MIN-MINAR
– TIME(MAX)-TFA


Summary: Framework Components

Background Modelling
• Multiple Background Models
• Gaussian Mixture Models (GMM)
• Unsupervised
• Output: Foreground Region/Mask

Frame Level Feature Extraction
• Count
• Area
• Density
• Bounding Box
• Filling Ratio
• Aspect Ratio
• Output: 30 Frame Level Features

Temporal Feature Extraction
• Fixed Length, Partially Overlapped Sliding Window
• Temporal Data Smoothing – Polynomial Curve Fitting
• 9 Temporal Features for Each Frame Level Feature
• Output: 270 Temporal Features

Behaviour Classification
• Individual Classifiers for Each Behaviour
• Each Classifier is Trained Using Supervised Learning
• Feature Ranking
• Top 100 Features
• Dimension Reduction (PCA)
• Max Dimension 30
• SVM classifier
• Output: Behaviour Profile


Research Challenges
• No tracking/trajectory
• Simple behaviours
– Group appear/disappear
– Group merge/split

• Panic driven behaviours
– Fire/Blocked exit
– Fighting/Shooting

• Context variation
– Speed
– Direction
– Object Resolution

Implemented System: VSTK


Publications
1. Mahfuzul Haque, Manzur Murshed, and Manoranjan Paul, Improved Gaussian Mixtures for Robust Object Detection by Adaptive Multi-Background Generation, International Conference on Pattern Recognition (ICPR), Tampa, Florida, USA, 2008. (CORE A)
2. Mahfuzul Haque, Manzur Murshed, and Manoranjan Paul, A Hybrid Object Detection Technique from Dynamic Background Using Gaussian Mixture Models, IEEE International Workshop on Multimedia Signal Processing (MMSP), Cairns, Australia, 2008. (CORE A)
3. Mahfuzul Haque, Manzur Murshed, and Manoranjan Paul, On Stable Dynamic Background Generation Technique using Gaussian Mixture Models for Robust Object Detection, IEEE International Conference On Advanced Video and Signal Based Surveillance (AVSS), Santa Fe, New Mexico, USA, 2008. (CORE A)
CORE - COmputing Research and Education Association

Thank you!

Q&A
Mahfuzul.Haque@infotech.monash.edu.au
http://www.mahfuzulhaque.com


Acknowledgments
URLs of the images used in this presentation
• http://www.fotosearch.com/DGV464/766029/
• http://www.cyprus-trader.com/images/alert.gif
• http://security.polito.it/~lioy/img/einstein8ci.jpg
• http://www.dtsc.ca.gov/PollutionPrevention/images/question.jpg
• http://www.unmikonline.org/civpol/photos/thematic/violence/streetvio2.jpg
• http://www.airports-worldwide.com/img/uk/heathrow00.jpg
• http://www.highprogrammer.com/alan/gaming/cons/trips/genconindy2003/exhibithall-crowd-2.jpg
• http://www.bhopal.org/fcunited/archives/fcu-crowd.jpg
• http://img.dailymail.co.uk/i/pix/2006/08/passaPA_450x300.jpg
• http://www.defenestrator.org/drp/files/surveillance-cameras-400.jpg
• http://www.cityofsound.com/photos/centre_poin/crowd.jpg
• http://www.hindu.com/2007/08/31/images/2007083156401501.jpg
• http://paulaoffutt.com/pics/images/crowd-surfing.jpg
• http://msnbcmedia1.msn.com/j/msnbc/Components/Photos/070225/070225_surveillance_hmed.hmedium.jpg
