SlideShare uma empresa Scribd logo
1 de 30
Baixar para ler offline
Cluster-based Landmark and Event Detection
on Tagged Photo Collections
Symeon Papadopoulos, Christos Zigkolis,
Yiannis Kompatsiaris, Athena Vakali
user generated content creates new
opportunities
real-world depicted in users’ online collections
potential for many insights into what people
see, do and like




       need new tools for content organization
image clustering
clusters  landmarks + events




                                landmark

                                event
the framework
+          +


photos       tags       geo
overview




1              2

    landmark           landmark


               event
4              3
step 1: create photo similarity graph




            1                  2

                   landmark             landmark


                                event
            4                  3
SURF
     SIFT

visual similarity   casa mila, la pedrera



                      tag similarity

                         co-occurrence
                    latent semantic indexing
step 2: use graph to cluster the photos




            1                  2

                   landmark             landmark


                                event
            4                  3
the concept of node structure


 neighborhood of node v   + node itself   = structure of node v



                 v             v                    v




          N(v)                  v                  Γ(v)
the concept of structural similarity (1)




            v
                         u




            Γ(v) ∩ Γ(u)
                              structural similarity between nodes v and u

                Γ(v)  Γ(u)
the concept of structural similarity (2)


 high structural similarity
                                       photo cluster 1
                        C
                               A
                                   B



             photo cluster 2
 low structural similarity
# edges
complexity
                                     O (km  m)            graph-based clustering


                                  average node degree

                                                        # dimensions
                                         # clusters


             k-means clustering          O (I  C  n  D)

                                    # iterations
                                               # nodes


                                     O (n2  log n)
                         hierarchical agglomerative clustering
step 3: detect landmarks & events




            1                 2

                  landmark            landmark


                              event
            4                 3
#users / #photos                baseline features



                                            [2 years, 50 users / 120 photos]
         [1 day, 2 users / 10 photos]




      Quack et al., CIVR 2008                                 duration
Landmark Tags   additional
                  features




                    Event Tags
step 4: post-process landmark clusters




            1                 2

                  landmark             landmark


                               event
            4                 3
cluster merging based on proximity
cluster tag filtering




                                 CLUSTER TAGS

            helado   tropical   barcelona        cielos   spain      field

            park güell          jaume oller   park   sclupture   el beso



       low frequency tags
                                                     generic tags
results
207,750 photos
7,768 users
33,959 unique tags

compare graph-based vs. k-means clustering
     user study             geospatial coherence

                  high geospatial
                  coherence

                                              low geospatial
                                              coherence
user study
                             VISUAL
                 precision    recall   κ-statistic

   graph-based    1.000       0.110      1.000

   k-means        0.806       0.324      0.226


                              TAG
                 precision    recall   κ-statistic

   graph-based    0.950       0.182      0.820

   k-means        0.848       0.307      0.564
geospatial coherence
                                  VISUAL
                              radius     std. deviation

                graph-based   357 m          1.18 km

                k-means       2.4 km         1.73 km

                                       TAG

                graph-based   456 m          1.15 km

                k-means       767 m          1.76 km
classification performance



     16% - 23%
            improvement thanks to tag features
landmark localization accuracy


                     sagrada familia, cathedral, catholic   15.2m



                     la pedrera, casa mila                  31.8m



                     parc guell                              9.6m



                     boqueria, market, mercado, ramblas     82.1m



                     camp nou, fc barcelona, nou camp       18.7m
event category composition



                        music, concert, gigs, dj     43.1%




                        conference, presentation     6.5%




                        local traditional, parades   4.6%




                        racing, motorbikes, f1       3.3%
clusttour




                www.clusttour.gr

twitter.com/clusttour       facebook.com/clusttour

Mais conteúdo relacionado

Mais de Symeon Papadopoulos

Mais de Symeon Papadopoulos (20)

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their Detection
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering Localization
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact Tracing
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia content
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air Quality
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media Content
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the Internet
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on Twitter
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016
 
Multimedia Privacy
Multimedia PrivacyMultimedia Privacy
Multimedia Privacy
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging Performance
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News Professionals
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online Discussions
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 

Cluster based landmark and event detection for tagged photo collections

  • 1. Cluster-based Landmark and Event Detection on Tagged Photo Collections Symeon Papadopoulos, Christos Zigkolis, Yiannis Kompatsiaris, Athena Vakali
  • 2. user generated content creates new opportunities
  • 3. real-world depicted in users’ online collections
  • 4. potential for many insights into what people see, do and like need new tools for content organization
  • 6. clusters  landmarks + events landmark event
  • 8. + + photos tags geo
  • 9. overview 1 2 landmark landmark event 4 3
  • 10. step 1: create photo similarity graph 1 2 landmark landmark event 4 3
  • 11. SURF SIFT visual similarity casa mila, la pedrera tag similarity co-occurrence latent semantic indexing
  • 12. step 2: use graph to cluster the photos 1 2 landmark landmark event 4 3
  • 13. the concept of node structure neighborhood of node v + node itself = structure of node v v v v N(v) v Γ(v)
  • 14. the concept of structural similarity (1) v u Γ(v) ∩ Γ(u) structural similarity between nodes v and u Γ(v)  Γ(u)
  • 15. the concept of structural similarity (2) high structural similarity photo cluster 1 C A B photo cluster 2 low structural similarity
  • 16. # edges complexity O (km  m) graph-based clustering average node degree # dimensions # clusters k-means clustering O (I  C  n  D) # iterations # nodes O (n2  log n) hierarchical agglomerative clustering
  • 17. step 3: detect landmarks & events 1 2 landmark landmark event 4 3
  • 18. #users / #photos baseline features [2 years, 50 users / 120 photos] [1 day, 2 users / 10 photos] Quack et al., CIVR 2008 duration
  • 19. Landmark Tags additional features Event Tags
  • 20. step 4: post-process landmark clusters 1 2 landmark landmark event 4 3
  • 21. cluster merging based on proximity
  • 22. cluster tag filtering CLUSTER TAGS helado tropical barcelona cielos spain field park güell jaume oller park sclupture el beso low frequency tags generic tags
  • 24. 207,750 photos 7,768 users 33,959 unique tags compare graph-based vs. k-means clustering user study geospatial coherence high geospatial coherence low geospatial coherence
  • 25. user study VISUAL precision recall κ-statistic graph-based 1.000 0.110 1.000 k-means 0.806 0.324 0.226 TAG precision recall κ-statistic graph-based 0.950 0.182 0.820 k-means 0.848 0.307 0.564
  • 26. geospatial coherence VISUAL radius std. deviation graph-based 357 m 1.18 km k-means 2.4 km 1.73 km TAG graph-based 456 m 1.15 km k-means 767 m 1.76 km
  • 27. classification performance 16% - 23% improvement thanks to tag features
  • 28. landmark localization accuracy sagrada familia, cathedral, catholic 15.2m la pedrera, casa mila 31.8m parc guell 9.6m boqueria, market, mercado, ramblas 82.1m camp nou, fc barcelona, nou camp 18.7m
  • 29. event category composition music, concert, gigs, dj 43.1% conference, presentation 6.5% local traditional, parades 4.6% racing, motorbikes, f1 3.3%
  • 30. clusttour www.clusttour.gr twitter.com/clusttour facebook.com/clusttour