SlideShare uma empresa Scribd logo
1 de 15
Baixar para ler offline
Frontiers of
Human Activity Analysis

                J. K. Aggarwal
               Michael S. Ryoo
                 Kris M. Kitani
Overview


           2
Machine point of view
 Activities as videos
   Activity = a particular set of videos
    T                                          Hugging
                               Pushing




    T
                              Punching   Shaking hands




                               video space (640*480*100 D)   3
Activity classification
 Simple task of identifying videos
   Categorize given videos into their types.
      Known, limited number of classes
      Assumes that each video contains a single activity

                                             ?
                                                   Push
                                                   Punch
                                                    Kick
                                                   Shake
                                                    Hug


                                                            4
Activity classification
 Activity categorization
   Input = a video segment containing 1 activity

                                Hugging
                 Pushing

                           ?




                Punching   Shaking hands




                                                    5
Activity detection
  Search for the particular time interval
     <starting time, ending time>
     Video segment containing the activity
                                                 Hugging
                                  Pushing


Input:
 continuous video stream

                                 Punching   Shaking hands




                                                            6
Activity detection by classification
 Binary classifier

                          Push/not-push
                                                Yes, a push.
                                                No, not a push.
                            Classifier


 Sliding window technique
    Classify all possible time intervals




                                      Pushing
                                                                  7
Recognition process
 Represent videos in terms of features
   Captures properties of activity videos




                                Running
 Recognize activities
  by comparing video
  representations
                                          Not running
   Decision boundary

                                                        8
Taxonomy
  Approach based taxonomy
      Recognition approaches can be categorized.
                                      Human activity recognition



                     Single-layered                                Hierarchical
                      approaches                                   approaches


        Space-time                     Sequential    Statistical    Syntactic     Description
        approaches                     approaches                                   -based

Trajecto- Volumes Local       Exemplar        State-based
  ries            features     -based                                Aggarwal and Ryoo,
                                                                     ACM CSUR 2011         9
Single layered vs. hierarchical
 Single layered approaches
 T

               Feature
              extraction




 Hierarchical approaches
 T


          …
                           Stretching
                            <1, 20>
 T                         Withdrawing
                            <21, 30>
          …


                                         10
Taxonomy – single layered
 These approaches recognize actions directly from
  a sequence of images.




                                                     11
Single layered approaches
 Action representation
   Video volumes themselves
   Features directly extracted from videos




 Action classification
   Machine learning techniques
      Support vector machines
      Hidden Markov models

                                              12
Taxonomy – Hierarchical
                                             Hierarchical approaches


                       Statistical                  Syntactic               Description-based
                      approaches                   approaches                  approaches
                                                                            [Pinhanez and Bobick ’98]
Human actions        [Nguyen et al. ’05]
                                                                                [Gupta et al. ’09]

                                                                             [Intille and Bobick ’99]
                                                [Ivanov and Bobick ’00]            [Vu et al. ’03]
Human-Human           [Oliver et al. ’02]
 interactions                                   [Joo and Chellapha ’06]         [Ghanem et al. ’04]
                                                                          [Ryoo and Aggarwal ’06, ’09a]

                      [Shi et al. ’04]O          [Moore and Essa ’02]O            [Siskind ’01]O
Human-Object       [Yu and Aggarwal ’06]O         [Minnen et al. ’03]O       [Nevatia et al. ’03, ’04]O
 interactions
                   [Damen and Hogg ’09]O          [Kitani et al. ’07]O      [Ryoo and Aggarwal ’07]O

                   [Cupillard et al. ’02]G
                   [Gong and Xiang ’03]G
Group activities                                                          [Ryoo and Aggarwal ’08, ’10]G
                     [Zhang et al.’06]G
                      [Dai et al.’08]G

                                                                                                          13
Hierarchical approaches
 Layered approaches
   Activities in terms of sub-events.
   Human interactions
      Multiple agents


 Suitable for
  activity-level
  recognition


                                         14
Hierarchical approaches
 Activities as semantic structures
    Activity = a concatenation of its sub-events
    Human-oriented: high-level
    Hierarchically organized representations
                 Hand shake = “two persons do shake-action
                 (stretches, stays stretched, withdraw) simultaneously,
                 while touching”.
                                                        Fighting ->   Punching                : 0.3
        this==Push_interactions(p1,p2)
                                                                 |    Punching Fighting       : 0.7
  i=Stretch(p1’s arm) j=Stay_Stretched(p1’s arm)
                                                        Punching ->   stretch withdraw        : 0.8
                k =Touching(p1, p2) l =Depart(p2, p1)
                                                                 |    stretch stay_withdrawn : 0.1
                                                                 |    stay_stretched withdraw : 0.1
                                                                                                      15

Mais conteúdo relacionado

Mais procurados

Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...
Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...
Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...Abdallah El Ali
 
Ontology 101 - New York Semantic Technology Conference
Ontology 101 - New York Semantic Technology ConferenceOntology 101 - New York Semantic Technology Conference
Ontology 101 - New York Semantic Technology ConferenceRobert Kost
 
Ontology 101 - Kendall & McGuiness
Ontology 101 - Kendall & McGuinessOntology 101 - Kendall & McGuiness
Ontology 101 - Kendall & McGuinessthematixpartners
 
Engelbrecht.joe
Engelbrecht.joeEngelbrecht.joe
Engelbrecht.joeNASAPMC
 
Demola affective robotics_20120502
Demola affective robotics_20120502Demola affective robotics_20120502
Demola affective robotics_20120502Rod Walsh
 
MCN 2012 Swiss Army Knife Approach
MCN 2012 Swiss Army Knife ApproachMCN 2012 Swiss Army Knife Approach
MCN 2012 Swiss Army Knife ApproachAdrian Kingston
 
Coupling Eye-Motion and Ego-Motion Features forFirst-Person Activity Recognition
Coupling Eye-Motion and Ego-Motion Features forFirst-Person Activity RecognitionCoupling Eye-Motion and Ego-Motion Features forFirst-Person Activity Recognition
Coupling Eye-Motion and Ego-Motion Features forFirst-Person Activity RecognitionKeisuke Ogaki
 
Haptic Interaction in Mixed Environments and Virtual Reality
Haptic Interaction in Mixed Environments and Virtual RealityHaptic Interaction in Mixed Environments and Virtual Reality
Haptic Interaction in Mixed Environments and Virtual RealityMarios Bikos
 
SEDE: An Ontology For Scholarly Event Description
SEDE:  An Ontology For Scholarly Event DescriptionSEDE:  An Ontology For Scholarly Event Description
SEDE: An Ontology For Scholarly Event DescriptionSenator Jeong 정상원
 
TEL Developments & Trends
TEL Developments & TrendsTEL Developments & Trends
TEL Developments & Trendstimku
 
Lean Inventive Systems Thinking Work Book
Lean Inventive Systems Thinking Work BookLean Inventive Systems Thinking Work Book
Lean Inventive Systems Thinking Work BookCrafitti Consulting
 
Fcv taxo zisserman
Fcv taxo zissermanFcv taxo zisserman
Fcv taxo zissermanzukun
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1zukun
 
cvpr2011: human activity recognition - part 4: syntactic
cvpr2011: human activity recognition - part 4: syntacticcvpr2011: human activity recognition - part 4: syntactic
cvpr2011: human activity recognition - part 4: syntacticzukun
 
Algortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
Algortimos bio-inspirados para clustering y visualizacion de datos geoespacialesAlgortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
Algortimos bio-inspirados para clustering y visualizacion de datos geoespacialesaskroll
 
Design Scripts: Designing (inter)action with intent
Design Scripts: Designing (inter)action with intent Design Scripts: Designing (inter)action with intent
Design Scripts: Designing (inter)action with intent Bas Leurs
 
Translating Related Words to Videos and Back through Latent Topics
Translating Related Words to Videos and Back through Latent TopicsTranslating Related Words to Videos and Back through Latent Topics
Translating Related Words to Videos and Back through Latent TopicsPradipto Das
 
Excursions into Blended Reality
Excursions into Blended RealityExcursions into Blended Reality
Excursions into Blended RealityLarry Smarr
 

Mais procurados (20)

Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...
Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...
Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...
 
Ontology 101 - New York Semantic Technology Conference
Ontology 101 - New York Semantic Technology ConferenceOntology 101 - New York Semantic Technology Conference
Ontology 101 - New York Semantic Technology Conference
 
Ontology 101 - Kendall & McGuiness
Ontology 101 - Kendall & McGuinessOntology 101 - Kendall & McGuiness
Ontology 101 - Kendall & McGuiness
 
Engelbrecht.joe
Engelbrecht.joeEngelbrecht.joe
Engelbrecht.joe
 
Lecture21
Lecture21Lecture21
Lecture21
 
Demola affective robotics_20120502
Demola affective robotics_20120502Demola affective robotics_20120502
Demola affective robotics_20120502
 
MCN 2012 Swiss Army Knife Approach
MCN 2012 Swiss Army Knife ApproachMCN 2012 Swiss Army Knife Approach
MCN 2012 Swiss Army Knife Approach
 
Meet GeekPhysical
Meet GeekPhysicalMeet GeekPhysical
Meet GeekPhysical
 
Coupling Eye-Motion and Ego-Motion Features forFirst-Person Activity Recognition
Coupling Eye-Motion and Ego-Motion Features forFirst-Person Activity RecognitionCoupling Eye-Motion and Ego-Motion Features forFirst-Person Activity Recognition
Coupling Eye-Motion and Ego-Motion Features forFirst-Person Activity Recognition
 
Haptic Interaction in Mixed Environments and Virtual Reality
Haptic Interaction in Mixed Environments and Virtual RealityHaptic Interaction in Mixed Environments and Virtual Reality
Haptic Interaction in Mixed Environments and Virtual Reality
 
SEDE: An Ontology For Scholarly Event Description
SEDE:  An Ontology For Scholarly Event DescriptionSEDE:  An Ontology For Scholarly Event Description
SEDE: An Ontology For Scholarly Event Description
 
TEL Developments & Trends
TEL Developments & TrendsTEL Developments & Trends
TEL Developments & Trends
 
Lean Inventive Systems Thinking Work Book
Lean Inventive Systems Thinking Work BookLean Inventive Systems Thinking Work Book
Lean Inventive Systems Thinking Work Book
 
Fcv taxo zisserman
Fcv taxo zissermanFcv taxo zisserman
Fcv taxo zisserman
 
NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1NIPS2009: Understand Visual Scenes - Part 1
NIPS2009: Understand Visual Scenes - Part 1
 
cvpr2011: human activity recognition - part 4: syntactic
cvpr2011: human activity recognition - part 4: syntacticcvpr2011: human activity recognition - part 4: syntactic
cvpr2011: human activity recognition - part 4: syntactic
 
Algortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
Algortimos bio-inspirados para clustering y visualizacion de datos geoespacialesAlgortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
Algortimos bio-inspirados para clustering y visualizacion de datos geoespaciales
 
Design Scripts: Designing (inter)action with intent
Design Scripts: Designing (inter)action with intent Design Scripts: Designing (inter)action with intent
Design Scripts: Designing (inter)action with intent
 
Translating Related Words to Videos and Back through Latent Topics
Translating Related Words to Videos and Back through Latent TopicsTranslating Related Words to Videos and Back through Latent Topics
Translating Related Words to Videos and Back through Latent Topics
 
Excursions into Blended Reality
Excursions into Blended RealityExcursions into Blended Reality
Excursions into Blended Reality
 

Mais de zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVzukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Informationzukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statisticszukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibrationzukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionzukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluationzukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-softwarezukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptorszukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectorszukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-introzukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video searchzukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video searchzukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video searchzukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learningzukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionzukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick startzukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysiszukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structureszukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities zukun
 

Mais de zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Último

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Último (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

cvpr2011: human activity recognition - part 2: overview

  • 1. Frontiers of Human Activity Analysis J. K. Aggarwal Michael S. Ryoo Kris M. Kitani
  • 3. Machine point of view  Activities as videos  Activity = a particular set of videos T Hugging Pushing T Punching Shaking hands video space (640*480*100 D) 3
  • 4. Activity classification  Simple task of identifying videos  Categorize given videos into their types.  Known, limited number of classes  Assumes that each video contains a single activity ? Push Punch Kick Shake Hug 4
  • 5. Activity classification  Activity categorization  Input = a video segment containing 1 activity Hugging Pushing ? Punching Shaking hands 5
  • 6. Activity detection  Search for the particular time interval  <starting time, ending time>  Video segment containing the activity Hugging Pushing Input: continuous video stream Punching Shaking hands 6
  • 7. Activity detection by classification  Binary classifier Push/not-push Yes, a push. No, not a push. Classifier  Sliding window technique  Classify all possible time intervals Pushing 7
  • 8. Recognition process  Represent videos in terms of features  Captures properties of activity videos Running  Recognize activities by comparing video representations Not running  Decision boundary 8
  • 9. Taxonomy  Approach based taxonomy  Recognition approaches can be categorized. Human activity recognition Single-layered Hierarchical approaches approaches Space-time Sequential Statistical Syntactic Description approaches approaches -based Trajecto- Volumes Local Exemplar State-based ries features -based Aggarwal and Ryoo, ACM CSUR 2011 9
  • 10. Single layered vs. hierarchical  Single layered approaches T Feature extraction  Hierarchical approaches T … Stretching <1, 20> T Withdrawing <21, 30> … 10
  • 11. Taxonomy – single layered  These approaches recognize actions directly from a sequence of images. 11
  • 12. Single layered approaches  Action representation  Video volumes themselves  Features directly extracted from videos  Action classification  Machine learning techniques  Support vector machines  Hidden Markov models 12
  • 13. Taxonomy – Hierarchical Hierarchical approaches Statistical Syntactic Description-based approaches approaches approaches [Pinhanez and Bobick ’98] Human actions [Nguyen et al. ’05] [Gupta et al. ’09] [Intille and Bobick ’99] [Ivanov and Bobick ’00] [Vu et al. ’03] Human-Human [Oliver et al. ’02] interactions [Joo and Chellapha ’06] [Ghanem et al. ’04] [Ryoo and Aggarwal ’06, ’09a] [Shi et al. ’04]O [Moore and Essa ’02]O [Siskind ’01]O Human-Object [Yu and Aggarwal ’06]O [Minnen et al. ’03]O [Nevatia et al. ’03, ’04]O interactions [Damen and Hogg ’09]O [Kitani et al. ’07]O [Ryoo and Aggarwal ’07]O [Cupillard et al. ’02]G [Gong and Xiang ’03]G Group activities [Ryoo and Aggarwal ’08, ’10]G [Zhang et al.’06]G [Dai et al.’08]G 13
  • 14. Hierarchical approaches  Layered approaches  Activities in terms of sub-events.  Human interactions  Multiple agents  Suitable for activity-level recognition 14
  • 15. Hierarchical approaches  Activities as semantic structures  Activity = a concatenation of its sub-events  Human-oriented: high-level  Hierarchically organized representations Hand shake = “two persons do shake-action (stretches, stays stretched, withdraw) simultaneously, while touching”. Fighting -> Punching : 0.3 this==Push_interactions(p1,p2) | Punching Fighting : 0.7 i=Stretch(p1’s arm) j=Stay_Stretched(p1’s arm) Punching -> stretch withdraw : 0.8 k =Touching(p1, p2) l =Depart(p2, p1) | stretch stay_withdrawn : 0.1 | stay_stretched withdraw : 0.1 15