MediaEval 2015 Workshop, Retrieving Diverse Social Images Task
14-15 September 2015, Wurzen, Germany
USEMP: Finding Diverse Images at MediaEval 2015
Eleftherios Spyromitros-Xioufis1, Adrian Popescu2,
Symeon Papadopoulos1, Yiannis Kompatsiaris1
1 CERTH-ITI, Thermi-Thessaloniki, Greece, {espyromi,papadop,ikom}@iti.gr
2 CEA, LIST, 91190 Gif-sur-Yvette, France, adrian.popescu@cea.fr
Summary of our participation
• supervised Maximal Marginal Relevance (sMMR) [1]:
– A supervised diversification method that jointly optimizes
relevance and diversity
• The runs
– Fully automated, no external data*
– Each run corresponds to a different instantiation of sMMR
Run id | Run type       | Relevance features | Diversity features
1      | visual-only    | CNN* [1]           | VLAD+CSURF [2]
2      | text-only      | BOW                | BOW
3 & 5  | visual+textual | CNN, BOW, META     | VLAD+CSURF
[1] E. Spyromitros-Xioufis et al., “Improving diversity in image search via supervised relevance scoring”, ICMR 2015
[2] E. Spyromitros-Xioufis et al., “A comprehensive study over VLAD and Product Quantization in large-scale image
retrieval”, IEEE Transactions on Multimedia, 2014
Overview of our approach
• sMMR incrementally builds a refined set $S \subset I$, $|S| = K$
• At each step $J = 1, \dots, K$ it selects the image $im^{*}$ that
scores highest on the following criterion:
$$U(im^{*} \mid q) = w \cdot R(im^{*} \mid q) + (1 - w) \cdot \min_{im_j \in S_{J-1}} d(im^{*}, im_j)$$
• Relevance to the query: output of a task- and query-specific classifier
• Diversity in $S$: distance to the most similar image already selected
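The greedy selection described on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function name `smmr_select`, the dictionary of precomputed relevance scores, and the toy 1-D features are all hypothetical. The slide does not say how the diversity term is defined at the first step, when nothing has been selected yet; the sketch assumes it is 0, so the first pick is simply the most relevant image.

```python
from typing import Callable, Dict, List, Sequence


def smmr_select(
    candidates: Sequence[str],
    relevance: Dict[str, float],
    distance: Callable[[str, str], float],
    k: int,
    w: float,
) -> List[str]:
    """Greedily pick up to k images maximizing w*R + (1-w)*min-distance."""
    selected: List[str] = []
    remaining = list(candidates)

    def utility(im: str) -> float:
        # Diversity term: distance to the closest (most similar) image
        # already selected. Assumption: 0.0 while nothing is selected.
        diversity = min((distance(im, s) for s in selected), default=0.0)
        return w * relevance[im] + (1 - w) * diversity

    while remaining and len(selected) < k:
        best = max(remaining, key=utility)  # ties go to the earliest candidate
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy example: three images with hypothetical 1-D features.
features = {"a": 0.0, "b": 0.1, "c": 1.0}
scores = {"a": 0.9, "b": 0.85, "c": 0.5}


def toy_distance(x: str, y: str) -> float:
    return abs(features[x] - features[y])


balanced = smmr_select(["a", "b", "c"], scores, toy_distance, k=2, w=0.5)
relevance_only = smmr_select(["a", "b", "c"], scores, toy_distance, k=2, w=1.0)
```

With `w = 0.5` the second pick is `c`, which is less relevant than `b` but far from `a`; with `w = 1.0` the diversity term is ignored and the result is the plain relevance ranking.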
Learning relevance from ground truth
[Figure: Wikipedia images and Flickr images for the devset queries q1, q2, q3 and for a test query, e.g. “Eiffel Tower”, with Flickr images of unknown relevance marked “?”. A marked region shows the training set for $h_{eiffel}$.]
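As a rough sketch of how such a training set might be assembled: the speaker notes say the classifier uses the relevance ground truth given for the task and adds the Wikipedia images of the query location to the positive examples. The function name `build_training_set` and the data layout below are hypothetical, and pooling all devset examples without weighting is an assumption; the slides do not specify the classifier or how the examples are combined.

```python
from typing import List, Sequence, Tuple

FeatureVector = Tuple[float, ...]


def build_training_set(
    devset_ground_truth: Sequence[Tuple[FeatureVector, bool]],
    wikipedia_features: Sequence[FeatureVector],
) -> List[Tuple[FeatureVector, int]]:
    """Combine task-level ground truth with query-specific positives."""
    # Task-specific part: devset images labeled relevant (1) or irrelevant (0).
    examples = [(x, 1 if relevant else 0) for x, relevant in devset_ground_truth]
    # Query-specific part: Wikipedia images of the test query, all positive.
    examples += [(x, 1) for x in wikipedia_features]
    return examples


# Hypothetical 2-D features, for illustration only.
devset = [((0.1, 0.2), True), ((0.9, 0.8), False)]
wiki_eiffel = [((0.15, 0.25),)]
training_set = build_training_set(devset, [w[0] for w in wiki_eiffel])
```

The resulting list of (features, label) pairs could then be passed to any binary classifier to obtain the relevance scorer for that query.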
This work was supported by the USEMP FP7 project
More details at the poster session!


Editor's Notes

  1. Here is an overview of our approach. sMMR builds a refined set of images S with K elements from a larger set of images I, incrementally in K steps. At each step, the method greedily adds to S the image (among the unselected ones) that maximizes the following criterion, which jointly considers relevance and diversity. The criterion is a weighted combination of a relevance score and a diversity score. The relevance score is the output of a classifier trained to distinguish relevant from irrelevant images. It is task-specific because it uses the relevance ground truth given for this task, and query-specific because it includes the Wikipedia images/page given for each location in the set of positive (relevant) examples. For the diversity part, we define the diversity score of an image at step J as the distance of this image to the most similar image among those already included in S.