SlideShare uma empresa Scribd logo
1 de 54
Baixar para ler offline
M I N I N G E V E N T S F R O M
M U LT I M E D I A S T R E A M S
J O N A T H O N H A R E A N D S I N A S A M A N G O O E I
A P P L E W W D C K E Y N O T E 2 0 1 3
“Life is full of special moments. So is your photo
library.”
B U T W H AT I F W E G O B E Y O N D
P E R S O N A L M E D I A L I B R A R I E S ?
C A N W E A U T O M AT I C A L LY
F I N D M E A N I N G F U L E V E N T S
A N D T R E N D S I N S T R E A M S
O F S O C I A L M U LT I M E D I A ?
C O N T E N T S
• Research challenges
• Case studies using shared social media:
• Monitoring Twitter’s visual pulse
• Detecting social events
• Open questions & future directions
R E S E A R C H C H A L L E N G E S 1
• How do we deal with massive amounts of data?
• There is a temporal aspect to the data we’re
dealing with…
• Develop streaming algorithms that can be:
• Incrementally updated
• Allowed to forget
R E S E A R C H C H A L L E N G E S 2
• How can we effectively make use of contextual data
and metadata from different modalities?
• Develop techniques for exploiting all modalities
and fusing features from each modality effectively
• Develop techniques that are robust to missing or
inaccurate features
M O N I T O R I N G T W I T T E R ’ S
V I S U A L P U L S E
P H O T O S H A R I N G ( B R O A D C A S T I N G ? )
O N T W I T T E R
• Photos are heavily shared on Twitter
• In 2012 we were capturing 100,000 images per day
from the sample stream
• Equates to 70GB of data
• Sample stream is supposedly ~1% of all tweets
• Implying on the firehose upwards of 10,000,000
images per day
• Thats 7TB of data per day
W H AT T Y P E S O F P H O T O S
A R E S H A R E D O N T W I T T E R ?
• Small study performed in September 2012
• Aim to get a feel for what is being Tweeted
• ~1 Day’s worth of data chosen (81368 images)
• 14 trusted Human annotators from our research
group
• Uniform random sample of 667 images annotated
0"
50"
100"
150"
200"
250"
300"
350"
400"
450"
animal:bird"
animal:bird:several"
animal:dinsour"
animal:ki7ens"
animal:rabbit"
color:blue"
object:glasses"
object:money"
object:spectacles"
object:watch"
scene:outdoors:nature"
sen@ment:sad"
sport"
text:dutch"
text:german"
text:italian"
text:portuguese"
@me:night"
animal:fish"
art:sculpture"
Baby"
object:jewellery"
object:motorbike"
object:newspaper"
object:vegetable"
scene:concert"
text:cyrillic"
type:wD"
art:s@llElife"
object:albumEcover"
object:sign"
type:awesome"
type:manga"
art:drawing"
art:pain@ng"
object:furniture"
type:advert"
animal:dog"
art:filmEframes"
color:blackEandEwhite"
object:car"
object:ornament"
object:toy"
sen@ment:disturbing"
object:book"
object:vehicle"
ac@vity:music"
object:fashion"
people:porn"
type:promoEposter"
object:building"
object:drink"
object:shoes"
type:meme"
object:clothes"
sen@ment:humorous"
ac@vity:sport"
text:spanish"
people:child"
people:children"
people:par@allyEnaked"
object:technology"
people:partEofEhuman"
animal:cat"
art:poster"
text:arabic"
people:crowd"
scene:outdoor:suburban"
type:screenshot"
scene:outdoor:landscape"
people:groupEpose"
scene:outdoors:city"
type:cartoon"
people:mugEshot"
object:food"
text:unknown"
type:collage"
scene:indoor"
type:graphic"
people:several"
text:english"
people:individual"
type:photograph"
1 D AY ’ S W O R T H O F D ATA C H O S E N
!
1 4 A N N O TAT O R S
!
0"
50"
100"
150"
200"
250"
300"
350"
400"
450"
type:advert"type:aw
esom
e"type:cartoon"
type:collage"
type:graphic"
type:m
anga"
type:m
em
e"
type:photograph"
type:prom
o;poster"type:screenshot"
type:w
<"
TYPES
0"
10"
20"
30"
40"
50"
60"
70"
80"
90"
text:arabic"
text:cyrillic"
text:dutch"
text:english"
text:germ
an"
text:italian"
text:portuguese"
text:spanish"
text:unknow
n"
TEXT
0"
20"
40"
60"
80"
100"
120"
140"
160"
people:child"people:children"
people:crow
d"
people:group6pose"people:individual"people:m
ug6shot"
people:part6of6hum
an"
people:par=ally6naked"
people:porn"people:several"
PEOPLE
0"
2"
4"
6"
8"
10"
12"
14"
16"
18"
anim
al:bird"anim
al:bird:several"
anim
al:cat"
anim
al:dinsour"
anim
al:dog"
anim
al:fish"
anim
al:ki<ens"
anim
al:rabbit"
ANIMALS
0"
5"
10"
15"
20"
25"
30"
35"
40"object:album
3cover"
object:book"
object:building"
object:car"
object:clothes"
object:drink"
object:fashion"
object:food"
object:furniture"
object:glasses"
object:jew
ellery"
object:m
oney"
object:m
otorbike"
object:new
spaper"
object:ornam
ent"
object:shoes"
object:sign"
object:spectacles"
object:technology"
object:toy"
object:vegetable"
object:vehicle"
object:w
atch"
OBJECTS
C A N W E F I N D W H AT I M A G E S
A R E T R E N D I N G O N T W I T T E R ?
• Look for near duplicates of images we’ve seen before
• can’t just use the URL…
• the same image appears at many different URLs
• potentially with slight changes
• compression
• added text
• scaling
• rotation/warps
• Don’t need to consider all images we’ve seen in the
past…
• maybe just those in a sliding time window
• Need a fast and robust near duplicate detection
technique that can be adapted to working with a stream.
• Want to use a local feature (i.e. DoG-SIFT) for
robustness
• BoVW+Inverted index, VLAD+PQ not ideal
• …use locality sensitive hashing…
• Idea inspired by method of Dong et al at ICMR’12
PicSlurper
Feature
Extractor
Random LSHs
HashTables
1 2 3 4
BitString Partitioner
images
features
LSH sketches
Duplicate Graph
Construction
Connected Component
Labelling
Duplicates
Pruning of
Old Images
W O R K F L O W
HashTable
1
212
...
image1, image4
image2, image54, ...
...
783 image4, image4096
image1 image4
image54image2
image4096
4 HASH TABLES
32-BIT KEY
VALUES ARE
LISTS OF
IMAGES
ANY COLLISION RESULTS IN A
GRAPH EDGE
CONNECTED
COMPONENTS
DUPLICATES
D E M O
S O C I A L E V E N T
D E T E C T I O N
M E D I A E VA L 2 0 1 3 : S E D TA S K
• Open evaluation challenge at MediaEval 2013:
• http://www.multimediaeval.org/
• Event clustering of multimodal social streams
!
• Specifically:
• Detect social events…
• Given 500k flickr images with
• image, tags, (some) geo, (some) time take, time
posted etc.
W H AT A R E S O C I A L E V E N T S ?
We define social events as events that are
planned by people, attended by people and the
media illustrating the events are captured by
people
S E D 2 0 1 3 E X A M P L E S
< P H O T O I D = " 4 3 0 2 7 4 6 4 2 9 " P H O T O _ U R L = " H T T P : / /
FA R M 5 . S TAT I C F L I C K R . C O M /
4 0 6 8 / 4 3 0 2 7 4 6 4 2 9 _ F 8 C D 7 F 2 5 8 2 . J P G " U S E R N A M E = " AT
T H E F O O T O F T H E H I L L " D AT E TA K E N = " 2 0 1 0 - 0 1 - 2 3
2 1 : 3 6 : 0 5 . 0 " D AT E U P L O A D E D = " 2 0 1 0 - 0 1 - 2 5 1 0 : 4 3 : 3 4 . 0 " >
< T I T L E > P O O , D E A D V O I C E S O N A I R @ A 4 < / T I T L E >
< D E S C R I P T I O N > W H O L E S E T . . . < / D E S C R I P T I O N >
< TA G S >
< TA G > 2 / 5 8 < / TA G >
< TA G > 5 8 M M < / TA G >
< TA G > C O N C E RT < / TA G >
< TA G > F 2 < / TA G >
< TA G > P O O < / TA G >
< / TA G S >
< / P H O T O >
< P H O T O I D = " 2 2 8 0 0 6 0 8 5 2 " P H O T O _ U R L = " H T T P : / /
FA R M 3 . S TAT I C F L I C K R . C O M /
2 1 1 0 / 2 2 8 0 0 6 0 8 5 2 _ 1 6 5 3 9 F 5 F 0 D . J P G "
U S E R N A M E = " P I L O T _ 1 0 " D AT E TA K E N = " 2 0 0 8 - 0 2 - 1 9
2 3 : 3 4 : 0 9 . 0 " D AT E U P L O A D E D = " 2 0 0 8 - 0 2 - 2 0 1 8 : 3 9 : 5 0 . 0 " >
< T I T L E > J E N S L E K M A N @ T E AT R O R A S I , R AV E N N A < /
T I T L E >
< D E S C R I P T I O N > 1 9 F E B B R A I O 2 0 0 8 < / D E S C R I P T I O N >
< TA G S >
< TA G > A L L R I G H T S R E S E R V E D < / TA G >
< TA G > C O N C E RT I < / TA G >
< TA G > C O O L P I X _ 4 3 0 0 < / TA G >
< TA G > E M I L I A R O M A G N A < / TA G >
< / TA G S >
< L O C AT I O N L AT I T U D E = " 4 4 . 4 1 5 3 "
L O N G I T U D E = " 1 2 . 2 0 5 2 " > < / L O C AT I O N >
< / P H O T O >
B U I L D S PA R S E M A T R I X
Sparse
Feature
Matrix
Combined
Sparse
Matrix
Sparse
Feature
Matrix
Sparse
Feature
Matrix
Sparse
Feature
Matrix
B U I L D S PA R S E M A T R I X
Sparse
Feature
Matrix
Combined
Sparse
Matrix
Sparse
Feature
Matrix
Sparse
Feature
Matrix
Sparse
Feature
Matrix
DBSCAN
Spectral
+
Incremental
C L U S T E R
B U I L D S PA R S E M A T R I X
Sparse
Feature
Matrix
Combined
Sparse
Matrix
Sparse
Feature
Matrix
Sparse
Feature
Matrix
Sparse
Feature
Matrix
DBSCAN
Spectral
+
Incremental
C L U S T E R
F E AT U R E S
• Events potentially separable using:
• Images: should look similar?
• Time: should be temporally close?
• Location: should be geographically close?
• Text: should be described similarly?
• Flickr multimodal social stream contains:
• Time taken (potentially inaccurate or missing entirely)
• Time posted (accurate, though may be event agnostic)
• Geo (often inaccurate, sparse)
• Tags, title, description (multi-tag, spelling etc.)
• Image features (these didn’t work so well!)
H T T P S : / / G I T H U B . C O M / S I N J A X / S O T O N - M E D I A E VA L 2 0 1 3
F E AT U R E S
• Events potentially separable using:
• Images: should look similar?
• Time: should be temporally close?
• Location: should be geographically close?
• Text: should be described similarly?
• Flickr multimodal social stream contains:
• Time taken (potentially inaccurate or missing entirely)
• Time posted (accurate, though may be event agnostic)
• Geo (often inaccurate, sparse)
• Tags, title, description (multi-tag, spelling etc.)
• Image features (these didn’t work so well!)
H T T P S : / / G I T H U B . C O M / S I N J A X / S O T O N - M E D I A E VA L 2 0 1 3
E V E N T S E PA R AT I O N W I T H F E AT U R E S
• January 2007 until February 2007
• Random color assigned to clusters
E V E N T S E PA R AT I O N W I T H F E AT U R E S
E V E N T S L I K E T H I S C O U L D E A S I LY B E C O N F U S E D
E V E N T S E PA R AT I O N W I T H F E AT U R E S
M O R E F E A T U R E S M A Y H E L P S E PA R A T E E V E N T S
F E AT U R E W E I G H T S
• The features matter for different reasons
• Some are more important than others
• This is a feature fusion problem!
• We went with a simple strategy: weighted
summation
• Use simplex search to find what feature weights
give the best cluster quality
F E AT U R E W E I G H T S I M P L E X
F E AT U R E W E I G H T S I M P L E X
F E AT U R E W E I G H T S I M P L E X
F E AT U R E W E I G H T S I M P L E X
F E AT U R E W E I G H T S
• Time taken is apparently most important
• Time posted + geo seem to hold the same
information
• Tags beat titles and descriptions
C L U S T E R I N G – F I N D I N G E V E N T S
• Clustering specific challenges
• Number of clusters hard to estimate
• But, it’s a parameter in many clustering algorithms
• Many noise points
• 2% clusters with 1 member
• Clustering a stream
• The size of the streams
• Updating clusters
C L U S T E R I N G – D B S C A N
• DBSCAN is an old, well studied
clustering algorithm
• Detects clusters and identify
noise
• No knowledge of cluster count
needed
• Requirements:
• Neighbourhood function
(e.g. thresholded sparse
similarity matrix)
• Neighbourhood density
counts
• Find mutually density-
connected items
H T T P : / / B I T. LY / O I C L U S T E R I N G
C L U S T E R I N G - S P E C T R A L
• Theoretically appealing non-
parametric clustering algorithm
• Rooted in graph theory
• Potentially auto detects
cluster count
• Basic premise is:
• Use the smallest (near zero)
eigenvalued eigenvectors of
the graph laplacian of the
similarity matrix of some
data as a space within which
to apply another clustering
algorithm
H T T P : / / B I T. LY / O I C L U S T E R I N G
C L U S T E R I N G - I N C R E M E N TA L
• Practical restrictions of
spectral clustering mean we
can’t apply it to the whole
dataset
• Make an assumption about
the data style
• Images likely to be
clustered together will
appear sequentially in
terms of upload time
• Leverage this to cluster sub
windows of data
H T T P : / / B I T. LY / O I C L U S T E R I N G
Cluster Initial
Window
1) Grow window,
cluster again
2)
...
Identify stable
clusters
3) Continue, ignoring
stable clusters
4)
M E D I A E VA L S E D TA S K R E S U LT S
• Calibration on 300k training items to optimise:
• Feature weightings
• Clustering parameters
!
• Clustering performed on a 200k item test set
• In the 2013 SED task, this technique:
• produced the best F1 scores;
• was one of the only streaming approaches;
• was one of the few open-source approaches
R E S U LT S
TABLE II: Results from MediaEval 2013. Our 4 submitted
runs are presented last and in bold. The best results over all
runs are also highlighted.
Group/Technique F1 NMI divF1
CERTH-ITI(1) [13] 0.8865 0.8739 0.5701
CERTH-ITI(2) [14] 0.7031 0.9131 0.6367
UPC [15] 0.8798 0.9720 0.8268
UNITN [16] 0.9320 0.9849 0.8793
TUWIEN [17] - 0.94 0.78
ADMRG [18] 0.812 0.954 0.758
ISMLL [19] 0.8755 0.9641 -
NTUA [20] 0.2364 0.6644 -
VIT [21] 0.1426 0.1802 0.0724
QM [22] - 0.94 0.78
DBSCAN (best-weight) 0.9454 0.9851 0.8865
Spectral (best-weight) 0.9114 0.9765 0.8534
DBSCAN (average-weight) 0.9461 0.9852 0.8864
Spectral (average-weight) 0.9024 0.9737 0.8455
of the relative methods of other participants who came close
of t
and
sepa
eithe
loca
In
work
high
2013
desig
A. P
In
dete
temp
a co
are c
O P E N Q U E S T I O N S A N D
F U T U R E D I R E C T I O N S
I S E V E N T D E T E C T I O N S O LV E D ?
• Scores in the 98% range certainly indicate so!
• But that’s a bit of a fallacy - the data was biased
• It only contained image of social events
• We need to do more exploration of real data
streams which also contain noise in the form of
images that don’t belong to a social event
W H Y D O N ’ T I M A G E F E AT U R E S W O R K
W E L L I N T H E S E D TA S K ?
• The features we tried using were too strict.
• Designed for near duplicate detection
• but, social events are highly visually diverse
• highly visually similar images already being
clustered through the other features
• Need to look at better engineered or more specific
features
• Biometrics, Facial matching, Text, colour…
W H E R E N E X T W I T H T H E M U LT I M E D I A
T W I T T E R A N A LY S I S ?
• Involve multimodal features
• Tweets are rather rich in data that can be combined
with the image analysis
• Applications:
• Meme-tracking
• Detection of world events
• Exploiting multimodal feature co-occurrence
A C K N O W L E D G E M E N T S
David Dupplaw, Paul Lewis, Kirk Martinez, Jamie Davies,
Neha Jain, John Preston, Laurence Wilmore, Ash Smith,
Heather Packer and other members of the WAIS research
group at the University of Southampton
A N Y Q U E S T I O N S ?

Mais conteúdo relacionado

Mais procurados

10 d bs in 30 minutes
10 d bs in 30 minutes10 d bs in 30 minutes
10 d bs in 30 minutes
David Simons
 

Mais procurados (19)

Gain Maximum Visibility into Your Applications
Gain Maximum Visibility into Your Applications Gain Maximum Visibility into Your Applications
Gain Maximum Visibility into Your Applications
 
April Wensel - Crafting Compassionate Code
April Wensel - Crafting Compassionate CodeApril Wensel - Crafting Compassionate Code
April Wensel - Crafting Compassionate Code
 
Network x python_meetup_2015-08-27
Network x python_meetup_2015-08-27Network x python_meetup_2015-08-27
Network x python_meetup_2015-08-27
 
AWS Seminar Series 2015 Melbourne
AWS Seminar Series 2015 MelbourneAWS Seminar Series 2015 Melbourne
AWS Seminar Series 2015 Melbourne
 
Beyond the Retrospective: Embracing Complexity on the Road to Service Ownership
Beyond the Retrospective: Embracing Complexity on the Road to Service OwnershipBeyond the Retrospective: Embracing Complexity on the Road to Service Ownership
Beyond the Retrospective: Embracing Complexity on the Road to Service Ownership
 
10 d bs in 30 minutes
10 d bs in 30 minutes10 d bs in 30 minutes
10 d bs in 30 minutes
 
AWS SeMINAR SERIES 2015 Sydney
AWS SeMINAR SERIES 2015 SydneyAWS SeMINAR SERIES 2015 Sydney
AWS SeMINAR SERIES 2015 Sydney
 
Thinking like a Network
Thinking like a NetworkThinking like a Network
Thinking like a Network
 
CIA For WordPress Developers
CIA For WordPress DevelopersCIA For WordPress Developers
CIA For WordPress Developers
 
100% Visibility - Jason Yee - Codemotion Amsterdam 2018
100% Visibility - Jason Yee - Codemotion Amsterdam 2018100% Visibility - Jason Yee - Codemotion Amsterdam 2018
100% Visibility - Jason Yee - Codemotion Amsterdam 2018
 
SearchLove San Diego 2018 | Ashley Ward | Reuse, Recycle: How to Repurpose Yo...
SearchLove San Diego 2018 | Ashley Ward | Reuse, Recycle: How to Repurpose Yo...SearchLove San Diego 2018 | Ashley Ward | Reuse, Recycle: How to Repurpose Yo...
SearchLove San Diego 2018 | Ashley Ward | Reuse, Recycle: How to Repurpose Yo...
 
Whispers in Chaos
Whispers in ChaosWhispers in Chaos
Whispers in Chaos
 
AWS Seminar Series 2015 Brisbane
AWS Seminar Series 2015 BrisbaneAWS Seminar Series 2015 Brisbane
AWS Seminar Series 2015 Brisbane
 
Auckland AWS Seminar Series
Auckland AWS Seminar SeriesAuckland AWS Seminar Series
Auckland AWS Seminar Series
 
Transforming developer from Commodity to Premium - A tale of micorservices
Transforming developer from Commodity to Premium - A tale of micorservicesTransforming developer from Commodity to Premium - A tale of micorservices
Transforming developer from Commodity to Premium - A tale of micorservices
 
You Created a Plugin. Now What? WordCamp Sacramento
You Created a Plugin. Now What? WordCamp SacramentoYou Created a Plugin. Now What? WordCamp Sacramento
You Created a Plugin. Now What? WordCamp Sacramento
 
Interactive media : information and libraries (#bobcatsss2017)
Interactive media : information and libraries (#bobcatsss2017)Interactive media : information and libraries (#bobcatsss2017)
Interactive media : information and libraries (#bobcatsss2017)
 
You Created a Plugin. Now What? WordCamp Orange County
You Created a Plugin. Now What? WordCamp Orange CountyYou Created a Plugin. Now What? WordCamp Orange County
You Created a Plugin. Now What? WordCamp Orange County
 
Online Marketing For Financial Planners
Online Marketing For Financial PlannersOnline Marketing For Financial Planners
Online Marketing For Financial Planners
 

Destaque

OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
Jonathon Hare
 

Destaque (20)

Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
Mining Events from Multimedia Streams (WAIS Research group seminar June 2014)
 
A brief introduction to extracting information from images
A brief introduction to extracting information from imagesA brief introduction to extracting information from images
A brief introduction to extracting information from images
 
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
Spot the Dog: An overview of semantic retrieval of unannotated images in the ...
 
WAISFest 2011: Southampton Goggles
WAISFest 2011: Southampton GogglesWAISFest 2011: Southampton Goggles
WAISFest 2011: Southampton Goggles
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...
 
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIASIMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
IMAGE DIVERSITY ANALYSIS: CONTEXT, OPINION AND BIAS
 
A Linear-Algebraic Technique with an Application in Semantic Image Retrieval
A Linear-Algebraic Technique with an Application in Semantic Image RetrievalA Linear-Algebraic Technique with an Application in Semantic Image Retrieval
A Linear-Algebraic Technique with an Application in Semantic Image Retrieval
 
Searching Images: Recent research at Southampton
Searching Images: Recent research at SouthamptonSearching Images: Recent research at Southampton
Searching Images: Recent research at Southampton
 
Searching Images: Recent research at Southampton
Searching Images: Recent research at SouthamptonSearching Images: Recent research at Southampton
Searching Images: Recent research at Southampton
 
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
 
The Art and Science of Image Retrieval
The Art and Science of Image RetrievalThe Art and Science of Image Retrieval
The Art and Science of Image Retrieval
 
Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...
Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...
Scale Saliency: Applications in Visual Matching,Tracking and View-Based Objec...
 
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORYBUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
BUILDING A SCALABLE MULTIMEDIA WEB OBSERVATORY
 
Searching Images: Recent research at Southampton
Searching Images: Recent research at SouthamptonSearching Images: Recent research at Southampton
Searching Images: Recent research at Southampton
 
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia A...
 
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gapSharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
Sharp images and fuzzy concepts: Multimedia retrieval and the semantic gap
 
Content-based image retrieval using a mobile device as a novel interface
Content-based image retrieval using a mobile device as a novel interfaceContent-based image retrieval using a mobile device as a novel interface
Content-based image retrieval using a mobile device as a novel interface
 
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrievalMind the Gap: Another look at the problem of the semantic gap in image retrieval
Mind the Gap: Another look at the problem of the semantic gap in image retrieval
 
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
Multimodal Searching and Semantic Spaces: ...or how to find images of Dalmati...
 
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
Bridging the Semantic Gap in Multimedia Information Retrieval: Top-down and B...
 

Semelhante a SEWM'14 keynote: Mining Events from Multimedia Streams

From Content Strategy to Drupal Site Building - Connecting the dots
From Content Strategy to Drupal Site Building - Connecting the dotsFrom Content Strategy to Drupal Site Building - Connecting the dots
From Content Strategy to Drupal Site Building - Connecting the dots
Ronald Ashri
 
From Content Strategy to Drupal Site Building - Connecting the Dots
From Content Strategy to Drupal Site Building - Connecting the DotsFrom Content Strategy to Drupal Site Building - Connecting the Dots
From Content Strategy to Drupal Site Building - Connecting the Dots
Ronald Ashri
 

Semelhante a SEWM'14 keynote: Mining Events from Multimedia Streams (20)

Backpack Reporting (Updated)
Backpack Reporting (Updated)Backpack Reporting (Updated)
Backpack Reporting (Updated)
 
How GZIP compression works - JS Conf EU 2014
How GZIP compression works - JS Conf EU 2014How GZIP compression works - JS Conf EU 2014
How GZIP compression works - JS Conf EU 2014
 
Choosing the Right Database
Choosing the Right DatabaseChoosing the Right Database
Choosing the Right Database
 
Focus on story, don't be seduced by sexy tools: How journalists can create gr...
Focus on story, don't be seduced by sexy tools: How journalists can create gr...Focus on story, don't be seduced by sexy tools: How journalists can create gr...
Focus on story, don't be seduced by sexy tools: How journalists can create gr...
 
The Ethics of Everybody Else
The Ethics of Everybody ElseThe Ethics of Everybody Else
The Ethics of Everybody Else
 
Keynote: Von Scrum und Tütensuppen - Warum Scrummaster die besseren Köche sin...
Keynote: Von Scrum und Tütensuppen - Warum Scrummaster die besseren Köche sin...Keynote: Von Scrum und Tütensuppen - Warum Scrummaster die besseren Köche sin...
Keynote: Von Scrum und Tütensuppen - Warum Scrummaster die besseren Köche sin...
 
The New Norm(al): Confronting What Open Means for Higher Education
The New Norm(al): Confronting What Open Means for Higher EducationThe New Norm(al): Confronting What Open Means for Higher Education
The New Norm(al): Confronting What Open Means for Higher Education
 
The Ethics of Everybody Else | Wrangle Conference 2017
The Ethics of Everybody Else | Wrangle Conference 2017The Ethics of Everybody Else | Wrangle Conference 2017
The Ethics of Everybody Else | Wrangle Conference 2017
 
STC-PMC November 2016 Presentation - Mobile First Content
STC-PMC November 2016 Presentation - Mobile First ContentSTC-PMC November 2016 Presentation - Mobile First Content
STC-PMC November 2016 Presentation - Mobile First Content
 
Metaverse (A comprehensive Introduction)
Metaverse (A comprehensive Introduction)Metaverse (A comprehensive Introduction)
Metaverse (A comprehensive Introduction)
 
Mirko Lorenz Data Driven Journalism Overview Seminar Ordine dei Giornalisti d...
Mirko Lorenz Data Driven Journalism Overview Seminar Ordine dei Giornalisti d...Mirko Lorenz Data Driven Journalism Overview Seminar Ordine dei Giornalisti d...
Mirko Lorenz Data Driven Journalism Overview Seminar Ordine dei Giornalisti d...
 
Trends, organisatie impact en social media presentatie
Trends, organisatie impact en social media presentatieTrends, organisatie impact en social media presentatie
Trends, organisatie impact en social media presentatie
 
From Content Strategy to Drupal Site Building - Connecting the dots
From Content Strategy to Drupal Site Building - Connecting the dotsFrom Content Strategy to Drupal Site Building - Connecting the dots
From Content Strategy to Drupal Site Building - Connecting the dots
 
From Content Strategy to Drupal Site Building - Connecting the Dots
From Content Strategy to Drupal Site Building - Connecting the DotsFrom Content Strategy to Drupal Site Building - Connecting the Dots
From Content Strategy to Drupal Site Building - Connecting the Dots
 
INCLUSIVE TRADE: THE RISE OF FAB LABS
INCLUSIVE TRADE: THE RISE OF FAB LABSINCLUSIVE TRADE: THE RISE OF FAB LABS
INCLUSIVE TRADE: THE RISE OF FAB LABS
 
uso de materiales en condiciones asépticas
uso de materiales en condiciones asépticasuso de materiales en condiciones asépticas
uso de materiales en condiciones asépticas
 
Hacking with Love
Hacking with LoveHacking with Love
Hacking with Love
 
Development and Deployment: The Human Factor
Development and Deployment: The Human FactorDevelopment and Deployment: The Human Factor
Development and Deployment: The Human Factor
 
Data Interoperability for Learning Analytics and Lifelong Learning
Data Interoperability for Learning Analytics and Lifelong LearningData Interoperability for Learning Analytics and Lifelong Learning
Data Interoperability for Learning Analytics and Lifelong Learning
 
Data Interoperability for Learning Analytics and Lifelong Learning
Data Interoperability for Learning Analytics and Lifelong LearningData Interoperability for Learning Analytics and Lifelong Learning
Data Interoperability for Learning Analytics and Lifelong Learning
 

Último

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 

SEWM'14 keynote: Mining Events from Multimedia Streams

  • 1. M I N I N G E V E N T S F R O M M U LT I M E D I A S T R E A M S J O N A T H O N H A R E A N D S I N A S A M A N G O O E I
  • 2. A P P L E W W D C K E Y N O T E 2 0 1 3 “Life is full of special moments. So is your photo library.”
  • 3. B U T W H AT I F W E G O B E Y O N D P E R S O N A L M E D I A L I B R A R I E S ?
  • 4. C A N W E A U T O M AT I C A L LY F I N D M E A N I N G F U L E V E N T S A N D T R E N D S I N S T R E A M S O F S O C I A L M U LT I M E D I A ?
  • 5. C O N T E N T S • Research challenges • Case studies using shared social media: • Monitoring Twitter’s visual pulse • Detecting social events • Open questions & future directions
  • 6. R E S E A R C H C H A L L E N G E S 1 • How do we deal with massive amounts of data? • There is a temporal aspect to the data we’re dealing with… • Develop streaming algorithms that can be: • Incrementally updated • Allowed to forget
  • 7. R E S E A R C H C H A L L E N G E S 2 • How can we effectively make use of contextual data and metadata from different modalities? • Develop techniques for exploiting all modalities and fusing features from each modality effectively • Develop techniques that are robust to missing or inaccurate features
  • 8. M O N I T O R I N G T W I T T E R ’ S V I S U A L P U L S E
  • 9. P H O T O S H A R I N G ( B R O A D C A S T I N G ? ) O N T W I T T E R • Photos are heavily shared on Twitter • In 2012 we were capturing 100,000 images per day from the sample stream • Equates to 70GB of data • Sample stream is supposedly ~1% of all tweets • Implying on the firehose upwards of 10,000,000 images per day • Thats 7TB of data per day
  • 10. W H AT T Y P E S O F P H O T O S A R E S H A R E D O N T W I T T E R ?
  • 11. • Small study performed in September 2012 • Aim to get a feel for what is being Tweeted • ~1 Day’s worth of data chosen (81368 images) • 14 trusted Human annotators from our research group • Uniform random sample of 667 images annotated
  • 12. 0" 50" 100" 150" 200" 250" 300" 350" 400" 450" animal:bird" animal:bird:several" animal:dinsour" animal:ki7ens" animal:rabbit" color:blue" object:glasses" object:money" object:spectacles" object:watch" scene:outdoors:nature" sen@ment:sad" sport" text:dutch" text:german" text:italian" text:portuguese" @me:night" animal:fish" art:sculpture" Baby" object:jewellery" object:motorbike" object:newspaper" object:vegetable" scene:concert" text:cyrillic" type:wD" art:s@llElife" object:albumEcover" object:sign" type:awesome" type:manga" art:drawing" art:pain@ng" object:furniture" type:advert" animal:dog" art:filmEframes" color:blackEandEwhite" object:car" object:ornament" object:toy" sen@ment:disturbing" object:book" object:vehicle" ac@vity:music" object:fashion" people:porn" type:promoEposter" object:building" object:drink" object:shoes" type:meme" object:clothes" sen@ment:humorous" ac@vity:sport" text:spanish" people:child" people:children" people:par@allyEnaked" object:technology" people:partEofEhuman" animal:cat" art:poster" text:arabic" people:crowd" scene:outdoor:suburban" type:screenshot" scene:outdoor:landscape" people:groupEpose" scene:outdoors:city" type:cartoon" people:mugEshot" object:food" text:unknown" type:collage" scene:indoor" type:graphic" people:several" text:english" people:individual" type:photograph"
  • 13. 1 D AY ’ S W O R T H O F D ATA C H O S E N ! 1 4 A N N O TAT O R S ! 0" 50" 100" 150" 200" 250" 300" 350" 400" 450" type:advert"type:aw esom e"type:cartoon" type:collage" type:graphic" type:m anga" type:m em e" type:photograph" type:prom o;poster"type:screenshot" type:w <" TYPES
  • 18. C A N W E F I N D W H AT I M A G E S A R E T R E N D I N G O N T W I T T E R ?
  • 19. • Look for near duplicates of images we’ve seen before • can’t just use the URL… • the same image appears at many different URLs • potentially with slight changes • compression • added text • scaling • rotation/warps • Don’t need to consider all images we’ve seen in the past… • maybe just those in a sliding time window
  • 20. • Need a fast and robust near duplicate detection technique that can be adapted to working with a stream. • Want to use a local feature (i.e. DoG-SIFT) for robustness • BoVW+Inverted index, VLAD+PQ not ideal • …use locality sensitive hashing… • Idea inspired by method of Dong et al at ICMR’12
  • 21. PicSlurper Feature Extractor Random LSHs HashTables 1 2 3 4 BitString Partitioner images features LSH sketches Duplicate Graph Construction Connected Component Labelling Duplicates Pruning of Old Images W O R K F L O W
  • 22. HashTable 1 212 ... image1, image4 image2, image54, ... ... 783 image4, image4096 image1 image4 image54image2 image4096 4 HASH TABLES 32-BIT KEY VALUES ARE LISTS OF IMAGES ANY COLLISION RESULTS IN A GRAPH EDGE CONNECTED COMPONENTS DUPLICATES
  • 23. D E M O
  • 24. S O C I A L E V E N T D E T E C T I O N
  • 25. M E D I A E VA L 2 0 1 3 : S E D TA S K • Open evaluation challenge at MediaEval 2013: • http://www.multimediaeval.org/ • Event clustering of multimodal social streams ! • Specifically: • Detect social events… • Given 500k flickr images with • image, tags, (some) geo, (some) time take, time posted etc.
  • 26. W H AT A R E S O C I A L E V E N T S ? We define social events as events that are planned by people, attended by people and the media illustrating the events are captured by people
  • 27. S E D 2 0 1 3 E X A M P L E S < P H O T O I D = " 4 3 0 2 7 4 6 4 2 9 " P H O T O _ U R L = " H T T P : / / FA R M 5 . S TAT I C F L I C K R . C O M / 4 0 6 8 / 4 3 0 2 7 4 6 4 2 9 _ F 8 C D 7 F 2 5 8 2 . J P G " U S E R N A M E = " AT T H E F O O T O F T H E H I L L " D AT E TA K E N = " 2 0 1 0 - 0 1 - 2 3 2 1 : 3 6 : 0 5 . 0 " D AT E U P L O A D E D = " 2 0 1 0 - 0 1 - 2 5 1 0 : 4 3 : 3 4 . 0 " > < T I T L E > P O O , D E A D V O I C E S O N A I R @ A 4 < / T I T L E > < D E S C R I P T I O N > W H O L E S E T . . . < / D E S C R I P T I O N > < TA G S > < TA G > 2 / 5 8 < / TA G > < TA G > 5 8 M M < / TA G > < TA G > C O N C E RT < / TA G > < TA G > F 2 < / TA G > < TA G > P O O < / TA G > < / TA G S > < / P H O T O > < P H O T O I D = " 2 2 8 0 0 6 0 8 5 2 " P H O T O _ U R L = " H T T P : / / FA R M 3 . S TAT I C F L I C K R . C O M / 2 1 1 0 / 2 2 8 0 0 6 0 8 5 2 _ 1 6 5 3 9 F 5 F 0 D . J P G " U S E R N A M E = " P I L O T _ 1 0 " D AT E TA K E N = " 2 0 0 8 - 0 2 - 1 9 2 3 : 3 4 : 0 9 . 0 " D AT E U P L O A D E D = " 2 0 0 8 - 0 2 - 2 0 1 8 : 3 9 : 5 0 . 0 " > < T I T L E > J E N S L E K M A N @ T E AT R O R A S I , R AV E N N A < / T I T L E > < D E S C R I P T I O N > 1 9 F E B B R A I O 2 0 0 8 < / D E S C R I P T I O N > < TA G S > < TA G > A L L R I G H T S R E S E R V E D < / TA G > < TA G > C O N C E RT I < / TA G > < TA G > C O O L P I X _ 4 3 0 0 < / TA G > < TA G > E M I L I A R O M A G N A < / TA G > < / TA G S > < L O C AT I O N L AT I T U D E = " 4 4 . 4 1 5 3 " L O N G I T U D E = " 1 2 . 2 0 5 2 " > < / L O C AT I O N > < / P H O T O >
  • 28.
  • 29. B U I L D S PA R S E M A T R I X Sparse Feature Matrix Combined Sparse Matrix Sparse Feature Matrix Sparse Feature Matrix Sparse Feature Matrix
  • 30. B U I L D S PA R S E M A T R I X Sparse Feature Matrix Combined Sparse Matrix Sparse Feature Matrix Sparse Feature Matrix Sparse Feature Matrix DBSCAN Spectral + Incremental C L U S T E R
  • 31. B U I L D S PA R S E M A T R I X Sparse Feature Matrix Combined Sparse Matrix Sparse Feature Matrix Sparse Feature Matrix Sparse Feature Matrix DBSCAN Spectral + Incremental C L U S T E R
  • 32. F E AT U R E S • Events potentially separable using: • Images: should look similar? • Time: should be temporally close? • Location: should be geographically close? • Text: should be described similarly? • Flickr multimodal social stream contains: • Time taken (potentially inaccurate or missing entirely) • Time posted (accurate, though may be event agnostic) • Geo (often inaccurate, sparse) • Tags, title, description (multi-tag, spelling etc.) • Image features (these didn’t work so well!) H T T P S : / / G I T H U B . C O M / S I N J A X / S O T O N - M E D I A E VA L 2 0 1 3
  • 33. F E AT U R E S • Events potentially separable using: • Images: should look similar? • Time: should be temporally close? • Location: should be geographically close? • Text: should be described similarly? • Flickr multimodal social stream contains: • Time taken (potentially inaccurate or missing entirely) • Time posted (accurate, though may be event agnostic) • Geo (often inaccurate, sparse) • Tags, title, description (multi-tag, spelling etc.) • Image features (these didn’t work so well!) H T T P S : / / G I T H U B . C O M / S I N J A X / S O T O N - M E D I A E VA L 2 0 1 3
  • 34. E V E N T S E PA R AT I O N W I T H F E AT U R E S • January 2007 until February 2007 • Random color assigned to clusters
  • 35. E V E N T S E PA R AT I O N W I T H F E AT U R E S E V E N T S L I K E T H I S C O U L D E A S I LY B E C O N F U S E D
  • 36. E V E N T S E PA R AT I O N W I T H F E AT U R E S M O R E F E A T U R E S M A Y H E L P S E PA R A T E E V E N T S
  • 37. F E AT U R E W E I G H T S • The features matter for different reasons • Some are more important than others • This is a feature fusion problem! • We went with a simple strategy: weighted summation • Use simplex search to find what feature weights give the best cluster quality
  • 38. F E AT U R E W E I G H T S I M P L E X
  • 39. F E AT U R E W E I G H T S I M P L E X
  • 40. F E AT U R E W E I G H T S I M P L E X
  • 41. F E AT U R E W E I G H T S I M P L E X
  • 42. F E AT U R E W E I G H T S • Time taken is apparently most important • Time posted + geo seem to hold the same information • Tags beat titles and descriptions
  • 43. C L U S T E R I N G – F I N D I N G E V E N T S • Clustering specific challenges • Number of clusters hard to estimate • But, it’s a parameter in many clustering algorithms • Many noise points • 2% clusters with 1 member • Clustering a stream • The size of the streams • Updating clusters
  • 44. C L U S T E R I N G – D B S C A N • DBSCAN is an old, well studied clustering algorithm • Detects clusters and identify noise • No knowledge of cluster count needed • Requirements: • Neighbourhood function (e.g. thresholded sparse similarity matrix) • Neighbourhood density counts • Find mutually density- connected items H T T P : / / B I T. LY / O I C L U S T E R I N G
  • 45. C L U S T E R I N G - S P E C T R A L • Theoretically appealing non- parametric clustering algorithm • Rooted in graph theory • Potentially auto detects cluster count • Basic premise is: • Use the smallest (near zero) eigenvalued eigenvectors of the graph laplacian of the similarity matrix of some data as a space within which to apply another clustering algorithm H T T P : / / B I T. LY / O I C L U S T E R I N G
  • 46. C L U S T E R I N G - I N C R E M E N TA L • Practical restrictions of spectral clustering mean we can’t apply it to the whole dataset • Make an assumption about the data style • Images likely to be clustered together will appear sequentially in terms of upload time • Leverage this to cluster sub windows of data H T T P : / / B I T. LY / O I C L U S T E R I N G Cluster Initial Window 1) Grow window, cluster again 2) ... Identify stable clusters 3) Continue, ignoring stable clusters 4)
  • 47. M E D I A E VA L S E D TA S K R E S U LT S • Calibration on 300k training items to optimise: • Feature weightings • Clustering parameters ! • Clustering performed on a 200k item test set • In the 2013 SED task, this technique: • produced the best F1 scores; • was one of the only streaming approaches; • was one of the few open-source approaches
  • 48. R E S U LT S TABLE II: Results from MediaEval 2013. Our 4 submitted runs are presented last and in bold. The best results over all runs are also highlighted. Group/Technique F1 NMI divF1 CERTH-ITI(1) [13] 0.8865 0.8739 0.5701 CERTH-ITI(2) [14] 0.7031 0.9131 0.6367 UPC [15] 0.8798 0.9720 0.8268 UNITN [16] 0.9320 0.9849 0.8793 TUWIEN [17] - 0.94 0.78 ADMRG [18] 0.812 0.954 0.758 ISMLL [19] 0.8755 0.9641 - NTUA [20] 0.2364 0.6644 - VIT [21] 0.1426 0.1802 0.0724 QM [22] - 0.94 0.78 DBSCAN (best-weight) 0.9454 0.9851 0.8865 Spectral (best-weight) 0.9114 0.9765 0.8534 DBSCAN (average-weight) 0.9461 0.9852 0.8864 Spectral (average-weight) 0.9024 0.9737 0.8455 of the relative methods of other participants who came close of t and sepa eithe loca In work high 2013 desig A. P In dete temp a co are c
  • 49. O P E N Q U E S T I O N S A N D F U T U R E D I R E C T I O N S
  • 50. I S E V E N T D E T E C T I O N S O LV E D ? • Scores in the 98% range certainly indicate so! • But that’s a bit of a fallacy - the data was biased • It only contained image of social events • We need to do more exploration of real data streams which also contain noise in the form of images that don’t belong to a social event
  • 51. W H Y D O N ’ T I M A G E F E AT U R E S W O R K W E L L I N T H E S E D TA S K ? • The features we tried using were too strict. • Designed for near duplicate detection • but, social events are highly visually diverse • highly visually similar images already being clustered through the other features • Need to look at better engineered or more specific features • Biometrics, Facial matching, Text, colour…
  • 52. W H E R E N E X T W I T H T H E M U LT I M E D I A T W I T T E R A N A LY S I S ? • Involve multimodal features • Tweets are rather rich in data that can be combined with the image analysis • Applications: • Meme-tracking • Detection of world events • Exploiting multimodal feature co-occurrence
  • 53. A C K N O W L E D G E M E N T S David Dupplaw, Paul Lewis, Kirk Martinez, Jamie Davies, Neha Jain, John Preston, Laurence Wilmore, Ash Smith, Heather Packer and other members of the WAIS research group at the University of Southampton
  • 54. A N Y Q U E S T I O N S ?