Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

•Transferir como PPTX, PDF•

1 gostou•598 visualizações

zukun

Educação Tecnologia

What does the world look like? Scaling to billions of images Object Recognition for large-scale search Focus on scaling rather than understanding image High level image statistics

Content-Based Image Retrieval Variety of simple/hand-designed cues: Color and/or Texture histograms, Shape, PCA, etc. Various distance metrics Earth Movers Distance (Rubner et al. ‘98) QBIC from IBM (1999) Blobworld, Carson et al. 2002

Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors

Matching features incategory-level recognition

Comparing sets of local features Previous strategies: ,[object Object], [Schmid, Lowe, Tuytelaars et al.] ,[object Object], [Rubner et al., Belongie et al., Gold & Rangarajan, Wallraven & Caputo, Berg et al., Zhang et al.,…] ,[object Object], [Csurka et al., Sivic & Zisserman, Lazebnik & Ponce] Slide credit: Kristen Grauman

Pyramid match kernel optimal partial matching [Grauman & Darrell, ICCV 2005] Optimal match: O(m3) Pyramid match: O(mL) m = # features L = # levels in pyramid Slide credit: Kristen Grauman

Pyramid match: main idea Feature space partitions serve to “match” the local descriptors within successively wider regions. descriptor space Slide credit: Kristen Grauman

Pyramid match: main idea Histogram intersection counts number of possible matches at a given partitioning. Slide credit: Kristen Grauman

Computing the partial matching Earth Mover’s Distance [Rubner, Tomasi, Guibas 1998] Hungarian method [Kuhn, 1955] Greedy matching … Pyramid match [Grauman and Darrell, ICCV 2005] for sets with features of dimension

Recognition on the ETH-80 Complexity Kernel Match [Wallraven et al.] Pyramid match Recognition accuracy (%) Testing time (s) Mean number of features per set (m) Mean number of features per set (m) Slide credit: Kristen Grauman

Pyramid match kernel: examples ofextensions and applications by other groups Spatial Pyramid Match Kernel Lazebnik, Schmid, Ponce, 2006. Dual-space Pyramid Matching Hu et al., 2007. Representing Shape with a Pyramid Kernel Bosch & Zisserman, 2007. Scene recognition Shape representation Medical image classification Slide credit: Kristen Grauman L=1 L=2 L=0

Pyramid match kernel: examples of extensions and applications by other groups wave sit down From Omnidirectional Images to Hierarchical Localization, Murillo et al. 2007. Single View Human Action Recognition using Key Pose Matching, Lv & Nevatia, 2007. Spatio-temporal Pyramid Matching for Sports Videos, Choi et al., 2008. Action recognition Video indexing Robot localization Slide : Kristen Grauman

Learning how to compare images ,[object Object]

Number of existing techniques for metric learningdissimilar similar [Weinberger et al. 2004, Hertz et al. 2004, Frome et al. 2007, Varma & Ray 2007, Kumar et al. 2007]

Problem-specific knowledge Example sources of similarity constraints Fully labeled image databases Partially labeled image databases

Locality Sensitive Hashing (LSH) Gionis, A. & Indyk, P. & Motwani, R. (1999) ,[object Object]

Quantize each projection with few bits101 0 Descriptor in high D space 1 0 1 1 0

Fast Image Search for Learned MetricsJain, Kulis, & Grauman, CVPR 2008 Learn a Malhanobis metric for LSH h( ) = h( ) h( ) ≠h( ) Less likely to split pairs like those with similarity constraint More likely to split pairs like those with dissimilarity constraint Slide : Kristen Grauman

Results: Flickr dataset Query time: Error rate ,[object Object]

Categorize scene based on nearest exemplars

Base metric: Ling & Soatto’s Proximity Distribution Kernel (PDK)slower search faster search 30% of data 2% of data Slide : Kristen Grauman

Mais conteúdo relacionado

Mais procurados

A Quantitative analysis and performance study for similarity search methods i...Jungyeol

Saliency-based Models of Image Content and their Application to Auto-Annotati...Jonathon Hare

Depth estimation using deep learningUniversity of Oklahoma

[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object DetectionDeep Learning JP

Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Jonathon Hare

Seed net automatic seed generation with deep reinforcement learning for robus...NAVER Engineering

Mind the Gap: Another look at the problem of the semantic gap in image retrievalJonathon Hare

A brief introduction to extracting information from imagesJonathon Hare

posterYanira Corvera

CLUSTERING HYPERSPECTRAL DATAcsandit

Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...Jonathon Hare

Real-Time Volumetric Tests (EG 2008)Matthias Trapp

Deep LearningPierre de Lacaze

Object detection with deep learningSushant Shrivastava

Object detection - RCNNs vs RetinanetRishabh Indoria

V2 v posenetNAVER Engineering

Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Jia-Bin Huang

MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...multimediaeval

Automatic Building detection for satellite Images using IGV and DSMAmit Raikar

MultiModal Retrieval ImageKonstantinos Zagoris

Mais procurados (20)

A Quantitative analysis and performance study for similarity search methods i...

Saliency-based Models of Image Content and their Application to Auto-Annotati...

Depth estimation using deep learning

[DL輪読会]PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection

Spot the Dog: An overview of semantic retrieval of unannotated images in the ...

Seed net automatic seed generation with deep reinforcement learning for robus...

Mind the Gap: Another look at the problem of the semantic gap in image retrieval

A brief introduction to extracting information from images

poster

CLUSTERING HYPERSPECTRAL DATA

Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...

Real-Time Volumetric Tests (EG 2008)

Deep Learning

Object detection with deep learning

Object detection - RCNNs vs Retinanet

V2 v posenet

Image Completion using Planar Structure Guidance (SIGGRAPH 2014)

MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...

Automatic Building detection for satellite Images using IGV and DSM

MultiModal Retrieval Image

Destaque

CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...zukun

Tuto cvpr part1zukun

CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...zukun

NIPS2008: tutorial: statistical models of visual imageszukun

NIPS2009: Understand Visual Scenes - Part 2zukun

ECCV2010: distance function and metric learning part 2zukun

Destaque (6)

CVPR2010: Learnings from founding a computer vision startup: Chapter 8: Softw...

Tuto cvpr part1

CVPR2010: Learnings from founding a computer vision startup: Chapter 1, 2: Wh...

NIPS2008: tutorial: statistical models of visual images

NIPS2009: Understand Visual Scenes - Part 2

ECCV2010: distance function and metric learning part 2

Semelhante a Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

20110220 computer vision_eruhimov_lecture01Computer Science Club

Soundarya m.scsowfi

Multimedia searchingUniversity PARIS-SUD

Machine Learning ProjectAdeyemi Fowe

Multilabel Image Retreval Using HashingSurbhi Bhosale

Representative Previous Workbutest

image processing image processing image processingSportsAcademy1

Wang midterm-defenceZhipeng Wang

Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015) Konrad Wenzel

A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...Kitware Kitware

OBJECT DETECTION AND RECOGNITION: A SURVEYJournal For Research

Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...AIST

VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...grssieee

Semantics In Digital Photos A Contenxtual AnalysisAllenWu

Query Image Searching With Integrated Textual and Visual Relevance Feedback f...IJERA Editor

A Review on Matching For Sketch TechniqueIOSR Journals

[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-ResolutionTaegyun Jeon

Computer vision,,summer training programmePraveen Pandey

Mit6870 orsu lecture2zukun

Semelhante a Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes (20)

20110220 computer vision_eruhimov_lecture01

Soundarya m.sc

Multimedia searching

Machine Learning Project

Multilabel Image Retreval Using Hashing

Representative Previous Work

image processing image processing image processing

Wang midterm-defence

Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)

A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...

OBJECT DETECTION AND RECOGNITION: A SURVEY

Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...

VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...

Semantics In Digital Photos A Contenxtual Analysis

Query Image Searching With Integrated Textual and Visual Relevance Feedback f...

A Review on Matching For Sketch Technique

[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution

Computer vision,,summer training programme

Mit6870 orsu lecture2

Mais de zukun

My lyn tutorial 2009zukun

ETHZ CV2012: Tutorial openCVzukun

ETHZ CV2012: Informationzukun

Siwei lyu: natural image statisticszukun

Lecture9 camera calibrationzukun

Brunelli 2008: template matching techniques in computer visionzukun

Modern features-part-4-evaluationzukun

Modern features-part-3-softwarezukun

Modern features-part-2-descriptorszukun

Modern features-part-1-detectorszukun

Modern features-part-0-introzukun

Lecture 02 internet video searchzukun

Lecture 01 internet video searchzukun

Lecture 03 internet video searchzukun

Icml2012 tutorial representation_learningzukun

Advances in discrete energy minimisation for computer visionzukun

Gephi tutorial: quick startzukun

EM algorithm and its application in probabilistic latent semantic analysiszukun

Object recognition with pictorial structureszukun

Iccv2011 learning spatiotemporal graphs of human activities zukun

Mais de zukun (20)

My lyn tutorial 2009

ETHZ CV2012: Tutorial openCV

ETHZ CV2012: Information

Siwei lyu: natural image statistics

Lecture9 camera calibration

Brunelli 2008: template matching techniques in computer vision

Modern features-part-4-evaluation

Modern features-part-3-software

Modern features-part-2-descriptors

Modern features-part-1-detectors

Modern features-part-0-intro

Lecture 02 internet video search

Lecture 01 internet video search

Lecture 03 internet video search

Icml2012 tutorial representation_learning

Advances in discrete energy minimisation for computer vision

Gephi tutorial: quick start

EM algorithm and its application in probabilistic latent semantic analysis

Object recognition with pictorial structures

Iccv2011 learning spatiotemporal graphs of human activities

Último

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82

Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha

Paris 2024 Olympic Geographies - an activityGeoBlogs

Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K

Interactive Powerpoint_How to Master effective communicationnomboosow

Holdier Curriculum Vitae (April 2024).pdfagholdier

Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622

The basics of sentences session 2pptx copy.pptxheathfieldcps1

Código Creativo y Arte de Software | Unidad 1Maestría en Comunicación Digital Interactiva - UNR

General AI for Medical Educators April 2024Janet Corral

Advanced Views - Calendar View in Odoo 17Celine George

1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh

Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron

Sports & Fitness Value Added Course FY..Disha Kariya

The Most Excellent Way | 1 Corinthians 13Steve Thomason

A Critique of the Proposed National Education Policy ReformChameera Dedduwage

Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic

9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt

Grant Readiness 101 TechSoup and Remy ConsultingTechSoup

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD

Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

1. Large Scale Recognition and Retrieval

2. What does the world look like? Scaling to billions of images Object Recognition for large-scale search Focus on scaling rather than understanding image High level image statistics

3. Content-Based Image Retrieval Variety of simple/hand-designed cues: Color and/or Texture histograms, Shape, PCA, etc. Various distance metrics Earth Movers Distance (Rubner et al. ‘98) QBIC from IBM (1999) Blobworld, Carson et al. 2002

4. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors

5. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors

6. Matching features incategory-level recognition

8. Pyramid match kernel optimal partial matching [Grauman & Darrell, ICCV 2005] Optimal match: O(m3) Pyramid match: O(mL) m = # features L = # levels in pyramid Slide credit: Kristen Grauman

9. Pyramid match: main idea Feature space partitions serve to “match” the local descriptors within successively wider regions. descriptor space Slide credit: Kristen Grauman

10. Pyramid match: main idea Histogram intersection counts number of possible matches at a given partitioning. Slide credit: Kristen Grauman

11. Computing the partial matching Earth Mover’s Distance [Rubner, Tomasi, Guibas 1998] Hungarian method [Kuhn, 1955] Greedy matching … Pyramid match [Grauman and Darrell, ICCV 2005] for sets with features of dimension

12. Recognition on the ETH-80 Complexity Kernel Match [Wallraven et al.] Pyramid match Recognition accuracy (%) Testing time (s) Mean number of features per set (m) Mean number of features per set (m) Slide credit: Kristen Grauman

13. Pyramid match kernel: examples ofextensions and applications by other groups Spatial Pyramid Match Kernel Lazebnik, Schmid, Ponce, 2006. Dual-space Pyramid Matching Hu et al., 2007. Representing Shape with a Pyramid Kernel Bosch & Zisserman, 2007. Scene recognition Shape representation Medical image classification Slide credit: Kristen Grauman L=1 L=2 L=0

14. Pyramid match kernel: examples of extensions and applications by other groups wave sit down From Omnidirectional Images to Hierarchical Localization, Murillo et al. 2007. Single View Human Action Recognition using Key Pose Matching, Lv & Nevatia, 2007. Spatio-temporal Pyramid Matching for Sports Videos, Choi et al., 2008. Action recognition Video indexing Robot localization Slide : Kristen Grauman

15. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors

16.

17. Number of existing techniques for metric learningdissimilar similar [Weinberger et al. 2004, Hertz et al. 2004, Frome et al. 2007, Varma & Ray 2007, Kumar et al. 2007]

18. Problem-specific knowledge Example sources of similarity constraints Fully labeled image databases Partially labeled image databases

19.

20. Quantize each projection with few bits101 0 Descriptor in high D space 1 0 1 1 0

21. Fast Image Search for Learned MetricsJain, Kulis, & Grauman, CVPR 2008 Learn a Malhanobis metric for LSH h( ) = h( ) h( ) ≠h( ) Less likely to split pairs like those with similarity constraint More likely to split pairs like those with dissimilarity constraint Slide : Kristen Grauman

22.

23. Categorize scene based on nearest exemplars

24. Base metric: Ling & Soatto’s Proximity Distribution Kernel (PDK)slower search faster search 30% of data 2% of data Slide : Kristen Grauman

25.

26. Categorize scene based on nearest exemplars

27. Base metric: Ling & Soatto’s Proximity Distribution Kernel (PDK)slower search faster search 30% of data 2% of data Slide : Kristen Grauman

28. Some vision techniques for large scale recognition Efficient matching methods Pyramid Match Kernel Learning to compare images Metrics for retrieval Learning compact descriptors

29. Semantic Hashing [Salakhutdinov & Hinton, 2007] for text documents Query Image Semantic HashFunction Address Space Binary code Images in database Query address Semantically similar images Quite differentto a (conventional)randomizing hash

30. Exploring different choices of semantic hash function Torralba, Fergus, Weiss, CVPR 2008 <10μs Binary code Image 1 3. RBM 2. BoostSSC 1.LSH Retrieved images <1ms Semantic Hash Query Image Gist descriptor Compute Gist ~1ms (in Matlab)

31.

32. LabelMe retrieval comparison 32-bit learned codes do as well as 512-dim real-valued input descriptor Learning methods outperform LSH % of 50 true neighbors in retrieval set 0 2,000 10,000 20,0000 Size of retrieval set

33.

34. Jain, Kulis, & Grauman, CVPR 2008: Learn Malhanobis distance for LSH.

35. Torralba, Fergus, Weiss, CVPR 2008: Directly learn mapping from image to binary code.

36. Use Hamming distance (binary codes) for speed

37. Learning metric really helps over plain LSH

38. Learning only applied to metric, not representation

Notas do Editor

Goal – match image to others in order to name locationMake point of high dimensions here.Smaller gap with ML because also serves as dim reduction, so need fewer bits. Here num bits is the same for both. PDK is higher d than PMK. So we didn’t observe this gap discrepancy for ML PMK hash because num bits used was enough for both.To evaluate our learned hash functions when the base kernel is theproximity distribution kernel (PDK), we performed experiments with a dataset of 5400 images of 18 different tourist attractions from the photo-sharing site Flickr. We took three cities inEurope that have major tourist attractions: Rome, London, and Paris. The tourist sites for eachcity were taken from the top attractions in www.TripAdvisor.com under the headings Religioussite, Architectural building, Historic site, Opera, Museum, and Theater. Overall, the list yielded18 classes: eight from Rome, five from London, and five from Paris. The classes are: Arc deTriomphe, Basilica San Pietro, Castel SantAngelo, Colosseum, Eiffel Tower, Globe Theatre,Hotel des Invalides, House of Parliament, Louvre, Notre Dame Cathedral, Pantheon, PiazzaCampidoglio, Roman Forum, Santa Maria Maggiore, Spanish Steps, St. Pauls Cathedral, Tower
Left side – 30% of data searched. Right side – 2-3% searched

Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Destaque

Destaque (6)

Semelhante a Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

Semelhante a Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes (20)

Mais de zukun

Mais de zukun (20)

Último

Último (20)

Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

Notas do Editor