Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

•Download as PPTX, PDF•

0 likes•234 views

The document proposes a novel coarse-to-fine sparse representation approach for efficient human action recognition. It reduces the computational complexity of testing sparse representation-based classification (SRC) by constructing sub-sampled dictionaries at multiple levels of granularity. Specifically, it first builds a coarse-grained dictionary by randomly projecting and sub-sampling atoms from the training data. Then it selects a small number of candidate actions using the coarse dictionary before classifying the action using a pruned fine-grained dictionary constructed from the candidate classes only. Experimental results on a benchmark dataset show the proposed method achieves efficient recognition with little loss in accuracy compared to the conventional SRC approach.

I. INTRODUCTION
 Sparse representation-based classification (SRC) has
recently attracted substantial research attention
 However, the computational complexity of testing makes it
challenging to deploy SRC in practice
 We propose a novel method for human action recognition,
leveraging coarse-to-fine sparse representations that have
been obtained through dictionary sub-sampling
 The proposed method reduces the time complexity of
testing at no substantial loss in recognition accuracy
JongHo Leea, Hyun-seok Mina, Jeong-jik Seoa, Wesley De Nevea,b, and Yong Man Roa
aImage and Video Systems Lab, KAIST, Republic of Korea
bMultimedia Lab, Ghent University-iMinds, Belgium
website: http://ivylab.kaist.ac.kr
IEEE International Conference on Multimedia & Expo (ICME), July 2014, Chengdu, China
SUB-SAMPLED DICTIONARIES FOR COARSE-TO-FINE
SPARSE REPRESENTATION-BASED HUMAN ACTION RECOGNITION
e-mail: ymro@ee.kaist.ac.kr
II. PROPOSED APPROACH
1. Training
Fig. 2. Time complexity of different human action recognition approaches.
Fig. 1. Accuracy of different human action recognition approaches.
0
10
20
30
40
50
60
70
150 300 450 600 750 900 1050 1200 1350 1500
Timecomplexity(s)
Number of atoms(ls)
Proposed method with ds =48
Proposed method with ds =72
Proposed method with ds =144
Conventional method
0.76
0.78
0.8
0.82
0.84
0.86
0.88
0.9
150 300 450 600 750 900 1050 1200 1350 1500
Recognitionaccuracy
Number of atoms(ls)
Proposed method with ds =48
Proposed method with ds =72
Proposed method with ds =144
Conventional method
III. EXPERIMENTS
1. Experimental setup
 Dataset: UCF-50
 Feature: Cuboid detector + HOG descriptor
 Homotopy-based 𝑙1-norm minimization
2. Experimental results
 Conventional method: classification only uses the FGD
IV. CONCLUSIONS
 We proposed a novel method for human action recognition
using coarse-to-fine sparse representations
 The proposed method achieves efficient human action
recognition at no substantial loss in recognition accuracy
2. Testing
Y
Y𝑠
Random projection
Feature Extraction
Test video clip
…
Class 1 Class 2
Φ 𝑠,1 Φ 𝑠,2 Φ 𝑠,3 Φ 𝑠,𝐾
Sparse
Coefficients
Y𝑠
Ranking 1 𝐻+1 𝐻+4 𝑯
Candidate
Actions
Candidate Action Selection
Coarse-Grained
Dictionary (CGD)
O X X O
We select 𝐻
candidate
actions
Feature Extraction
…
Action 1 Action 2 Action 3 Action 𝐾
Training Dataset
Action 1 Action 2 Action 3 Action 𝐾
… … … …
…
Action 1Action 2Action 3 Action 𝐾
Fine-Grained
Dictionary
(FGD)
Coarse-Grained
Dictionary (CGD)Φ 𝑠,1 Φ 𝑠,2 Φ 𝑠,3 Φ 𝑠,𝐾
Φ 𝑜,1 Φ 𝑜,2 Φ 𝑜,3 Φ 𝑜,𝐾
Random projection (for reducing the dimension of the atoms)
Random sampling (for reducing the number of atoms)
Dictionary Construction
Action 1 Action 2 Action 3 Action 𝐾
…
…
Pruned FGD
Φ 𝑜,1 Φ 𝑜,2 Φ 𝑜,3 Φ 𝑜,𝐾
Action 1 Action 2 Action 3 Action 𝐾
…
Candidate
Actions
O X X O
Φ 𝑝𝑟,1
Action 1
Φ 𝑝𝑟,𝐻
Action 𝐾
… Pruned FGD𝐃 𝑝𝑟
 Classification
 We can find the sparse representation 𝐗 𝑝𝑟 of 𝐘 with 𝐃 𝑝𝑟
𝐘 = 𝐲1, 𝐲2, … , 𝐲 𝑚 , 𝐗 𝑝𝑟 = [𝐱 𝑝𝑟,1, 𝐱 𝑝𝑟,2, … , 𝐱 𝑝𝑟,𝑚]
 We label 𝐕 with the action 𝑘 that comes with the smallest
residual error 𝒓 𝑘 𝐲
𝒓 𝑘 𝐘 =
1
𝑚 𝑖=1
𝑚
𝐲𝑖 − 𝐃 𝑝𝑟 𝜹 𝑘 𝐱 𝑝𝑟,𝑖 𝟏
 𝜹 𝒌 𝐱 𝑝𝑟,𝑖 is a new vector whose only nonzero entries are the
entries in 𝐱 𝑝𝑟,𝑖 associated with the action 𝑘
Φ 𝑜,1 Φ 𝑜,2 Φ 𝑜,3 Φ 𝑜,𝐾

Similar to Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

Surveillance using Video Analyticsidescitation

Flow Trajectory Approach for Human Action RecognitionIRJET Journal

Human Action Recognition Using Deep LearningIRJET Journal

5 ijaems sept-2015-9-video feature extraction based on modified lle using ada...INFOGAIN PUBLICATION

Dance With AI – An interactive dance learning platformIRJET Journal

Review of Pose Recognition Systemsvivatechijri

IRJET - Automating the Identification of Forest Animals and Alerting in Case ...IRJET Journal

IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline

IRJET- A Review on Moving Object Detection in Video Forensics IRJET Journal

IRJET- A Real Time Yolo Human Detection in Flood Affected Areas based on Vide...IRJET Journal

Fast Human Detection in Surveillance VideoIOSR Journals

IRJET- Survey Paper on Anomaly Detection in Surveillance VideosIRJET Journal

lec_11_self_supervised_learning.pdfAlamgirAkash3

IRJET- Object Detection in Real Time using AI and Deep LearningIRJET Journal

IJSRED-V2I3P80IJSRED

IRJET- Prediction of Anomalous Activities in a VideoIRJET Journal

Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...ijtsrd

Video inpainting using backgroung registrationeSAT Publishing House

Avihu Efrat's Viola and Jones face detection slideswolf

Gan seminarSan Kim

Similar to Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition (20)

Surveillance using Video Analytics

Flow Trajectory Approach for Human Action Recognition

Human Action Recognition Using Deep Learning

5 ijaems sept-2015-9-video feature extraction based on modified lle using ada...

Dance With AI – An interactive dance learning platform

Review of Pose Recognition Systems

IRJET - Automating the Identification of Forest Animals and Alerting in Case ...

IJCER (www.ijceronline.com) International Journal of computational Engineerin...

IRJET- A Review on Moving Object Detection in Video Forensics

IRJET- A Real Time Yolo Human Detection in Flood Affected Areas based on Vide...

Fast Human Detection in Surveillance Video

IRJET- Survey Paper on Anomaly Detection in Surveillance Videos

lec_11_self_supervised_learning.pdf

IRJET- Object Detection in Real Time using AI and Deep Learning

IJSRED-V2I3P80

IRJET- Prediction of Anomalous Activities in a Video

Discovering Anomalies Based on Saliency Detection and Segmentation in Surveil...

Video inpainting using backgroung registration

Avihu Efrat's Viola and Jones face detection slides

Gan seminar

More from Wesley De Neve

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...Wesley De Neve

Investigating the biological relevance in trained embedding representations o...Wesley De Neve

Impact of adversarial examples on deep learning models for biomedical image s...Wesley De Neve

Learning Biologically Relevant Features Using Convolutional Neural Networks f...Wesley De Neve

The 5th Aslla SymposiumWesley De Neve

Ghent University Global Campus 101Wesley De Neve

Booklet for the First GUGC Research SymposiumWesley De Neve

Center for Biotech Data Science at Ghent University Global CampusWesley De Neve

Learning biologically relevant features using convolutional neural networks f...Wesley De Neve

Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve

Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Wesley De Neve

GUGC Info Session - Informatics and BioinformaticsWesley De Neve

Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...Wesley De Neve

Ghent University and GUGC-K: Overview of Teaching and Research ActivitiesWesley De Neve

Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...Wesley De Neve

Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...Wesley De Neve

Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...Wesley De Neve

Towards using multimedia technology for biological data processingWesley De Neve

Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...Wesley De Neve

More from Wesley De Neve (20)

Towards diagnosis of rotator cuff tears in 3-D MRI using 3-D convolutional ne...

Investigating the biological relevance in trained embedding representations o...

Impact of adversarial examples on deep learning models for biomedical image s...

Learning Biologically Relevant Features Using Convolutional Neural Networks f...

The 5th Aslla Symposium

Ghent University Global Campus 101

Booklet for the First GUGC Research Symposium

Center for Biotech Data Science at Ghent University Global Campus

Learning biologically relevant features using convolutional neural networks f...

Towards reading genomic data using deep learning-driven NLP techniques

Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...

GUGC Info Session - Informatics and Bioinformatics

Ghent University Global Campus - Sungkyunkwan University: Workshop on Researc...

Ghent University and GUGC-K: Overview of Teaching and Research Activities

Biotech Data Science @ GUGC in Korea: Deep Learning for Prediction of Drug-Ta...

Exploring Deep Machine Learning for Automatic Right Whale Recognition and No...

Deep Machine Learning for Automating Biotech Tasks Through Self-Learning Expe...

Towards using multimedia technology for biological data processing

Multimedia Lab @ Ghent University - iMinds - Organizational Overview & Outlin...

Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

1. I. INTRODUCTION  Sparse representation-based classification (SRC) has recently attracted substantial research attention  However, the computational complexity of testing makes it challenging to deploy SRC in practice  We propose a novel method for human action recognition, leveraging coarse-to-fine sparse representations that have been obtained through dictionary sub-sampling  The proposed method reduces the time complexity of testing at no substantial loss in recognition accuracy JongHo Leea, Hyun-seok Mina, Jeong-jik Seoa, Wesley De Nevea,b, and Yong Man Roa aImage and Video Systems Lab, KAIST, Republic of Korea bMultimedia Lab, Ghent University-iMinds, Belgium website: http://ivylab.kaist.ac.kr IEEE International Conference on Multimedia & Expo (ICME), July 2014, Chengdu, China SUB-SAMPLED DICTIONARIES FOR COARSE-TO-FINE SPARSE REPRESENTATION-BASED HUMAN ACTION RECOGNITION e-mail: ymro@ee.kaist.ac.kr II. PROPOSED APPROACH 1. Training Fig. 2. Time complexity of different human action recognition approaches. Fig. 1. Accuracy of different human action recognition approaches. 0 10 20 30 40 50 60 70 150 300 450 600 750 900 1050 1200 1350 1500 Timecomplexity(s) Number of atoms(ls) Proposed method with ds =48 Proposed method with ds =72 Proposed method with ds =144 Conventional method 0.76 0.78 0.8 0.82 0.84 0.86 0.88 0.9 150 300 450 600 750 900 1050 1200 1350 1500 Recognitionaccuracy Number of atoms(ls) Proposed method with ds =48 Proposed method with ds =72 Proposed method with ds =144 Conventional method III. EXPERIMENTS 1. Experimental setup  Dataset: UCF-50  Feature: Cuboid detector + HOG descriptor  Homotopy-based 𝑙1-norm minimization 2. Experimental results  Conventional method: classification only uses the FGD IV. CONCLUSIONS  We proposed a novel method for human action recognition using coarse-to-fine sparse representations  The proposed method achieves efficient human action recognition at no substantial loss in recognition accuracy 2. Testing Y Y𝑠 Random projection Feature Extraction Test video clip … Class 1 Class 2 Φ 𝑠,1 Φ 𝑠,2 Φ 𝑠,3 Φ 𝑠,𝐾 Sparse Coefficients Y𝑠 Ranking 1 𝐻+1 𝐻+4 𝑯 Candidate Actions Candidate Action Selection Coarse-Grained Dictionary (CGD) O X X O We select 𝐻 candidate actions Feature Extraction … Action 1 Action 2 Action 3 Action 𝐾 Training Dataset Action 1 Action 2 Action 3 Action 𝐾 … … … … … Action 1Action 2Action 3 Action 𝐾 Fine-Grained Dictionary (FGD) Coarse-Grained Dictionary (CGD)Φ 𝑠,1 Φ 𝑠,2 Φ 𝑠,3 Φ 𝑠,𝐾 Φ 𝑜,1 Φ 𝑜,2 Φ 𝑜,3 Φ 𝑜,𝐾 Random projection (for reducing the dimension of the atoms) Random sampling (for reducing the number of atoms) Dictionary Construction Action 1 Action 2 Action 3 Action 𝐾 … … Pruned FGD Φ 𝑜,1 Φ 𝑜,2 Φ 𝑜,3 Φ 𝑜,𝐾 Action 1 Action 2 Action 3 Action 𝐾 … Candidate Actions O X X O Φ 𝑝𝑟,1 Action 1 Φ 𝑝𝑟,𝐻 Action 𝐾 … Pruned FGD𝐃 𝑝𝑟  Classification  We can find the sparse representation 𝐗 𝑝𝑟 of 𝐘 with 𝐃 𝑝𝑟 𝐘 = 𝐲1, 𝐲2, … , 𝐲 𝑚 , 𝐗 𝑝𝑟 = [𝐱 𝑝𝑟,1, 𝐱 𝑝𝑟,2, … , 𝐱 𝑝𝑟,𝑚]  We label 𝐕 with the action 𝑘 that comes with the smallest residual error 𝒓 𝑘 𝐲 𝒓 𝑘 𝐘 = 1 𝑚 𝑖=1 𝑚 𝐲𝑖 − 𝐃 𝑝𝑟 𝜹 𝑘 𝐱 𝑝𝑟,𝑖 𝟏  𝜹 𝒌 𝐱 𝑝𝑟,𝑖 is a new vector whose only nonzero entries are the entries in 𝐱 𝑝𝑟,𝑖 associated with the action 𝑘 Φ 𝑜,1 Φ 𝑜,2 Φ 𝑜,3 Φ 𝑜,𝐾

Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

Recommended

Recommended

More Related Content

Similar to Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition

Similar to Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition (20)

More from Wesley De Neve

More from Wesley De Neve (20)

Sub-sampled dictionaries for coarse-to-fine sparse representation-based human action recognition