Defense Powepoint

A Generalized Multidimensional Index Structure
for Multimedia Data to Support Content-Based
Similarity Searches in a Collaborative
Environment
Kasturi Chatterjee
Distributed Multimedia Information Systems Laboratory
School of Computing and Information Sciences
Florida International University

Committee Members

• Dr. Shu-Ching Chen (Advisor)
• Dr. Jainendra K. Navlakha
• Dr. Xudong He
• Dr. Keqi Zhang
• Dr. Mei-Ling Shyu

2

Acknowledgment

School of Computing and Information Sciences
Continuing Graduate Assistantship (GA, RA)
Awards recognizing research

Florida International University
Dissertation Year Fellowship
Travel Grants (GSA)

Members of DMIS Lab

SCIS staffs
Special thanks to Olga
3

Outline
i. Motivation
ii. Contributions
a. Generalized Index Structure
b. Query Refinement
c. Visualizing & Analyzing Multimedia
Semantic Relationships in
Collaborative Environments
iii. Discussions
iv. Future Direction

4

What is so special about
multimedia data?

i. Expressive

ii. Attractive

Which medium is more
helpful?! 5

Everything comes at a price

i. Multidimensional
Representation
ii. Perception Subjectivity
iii.Semantic Gap

Very different from
traditional data!
6

Multidimensional
Representation
Imag
e
Y
Z

Apply feature extraction
(HSV color space) <3.5,0,8>

X
(0.1602,0.0818,0.0405,0.0536,0.0685,0.0667,0,0,0.0287,0,0,0)

black red yellow green blue purple

white red-yellow yellow- green- blue-
green blue purple

purple-red

7

Multidimensional
Representation
Video

Videos

Key Shot

Shots

Frames Frames Frames

temporally
related frames Apply feature extraction
(multi-modal)
(color-features, video-features, audio-features, ……)

average-volume, average-energy, ….
pixel-change, histogram-change, 8

Perception Subjectivity

• Togetherness • Sunset
• Baking • Dolphins
• Family
• Quality Time
• ………….
• ………….

9

Semantic Gap

Similar feature
representation

Very different
semantic
information

10

Are existing DBMS frameworks
able to handle Multimedia Data?

A Typical Query

Traditional alpha-numeric Multimedia queries
queries

SELECT image FROM table
SELECT studentName FROM
WHERE red „is-close-to‟ 0.245
table WHERE studentAge >
AND black „is-close-to‟ 0.356
20 AND studentMajor =
AND red-yellow „is-close-to‟
„Computer Science‟;
0.5672 AND …….. AND
semanticInterpretation =
„something‟….etc.
11

Communication Manager

What is missing? Application Front Ends

SQL Interface

SQL Compiler/Interpreter

i. Suitable data organization (index
structure)
Query Evaluation Engine
Query Query Query
Optimizer Processor Evaluator

ii. Suitable query handling
Catalog
Manager
Transaction
Manager
Lock
manager
Buffer
Manager

Access Structure
iii. Suitable handling of semantic contents
Recovery
Manager Manager
Storage
Manager

Index Structure

Index Access

12

Outline
i. Motivation
ii. Contributions
b. Query Refinement
iii. Discussions

13

Generalized Index Structure
GeM-Tree [chat09c]
Expectations
i. Provide a single framework to
manage different types of
multimedia data

separate index structures for different
data types are inefficient to embed
into the database kernel

14

GeM-Tree
Expectations
ii. Accommodate varied
Multidimensional
Representation
existing multidimensional
existing index structures index structures cannot
for database kernels are handle retrieval
mostly single-dimensional requirements of multimedia
data
plethora of feature
representations call for a
flexible structure 15

GeM-Tree
Expectations

iii.Accommodate CBR of individual data
type along with concept retrievals
involving cross-similarity between
multimedia data

query handling need to existing index structures
consider low-level features & cannot handle such retrieval
semantic-information approaches
16

What has been done so far

First generation Multi-dimensional index
index structures structures

Feature- Distance-
B-Tree [1]
• tree-based index Based Based
structure • feature space • metric-space
• single-dimensional indexed based formed from the
• currently used in on feature distances
relational databases dimension between data
• KDB-Tree [2], objects is
R-Tree[3], indexed
Hybrid-Tree[4] • M-Tree [5], VP-
Tree[6] 17

KDB-Tree

3 4 7 8
F I
12345678
G H J K
N

D A 1234 5678
L O
C M

12 34 56 78
E B

T P Q
1 2 5 6
DE ABC FGH IJ ST PQR KLM NO
S R

18

VP-Tree

I

J
Data Space
E
Partition for VP-Tree

H B

V (A,B,C,D) closest to V
A C

D (E,F,G,H) next close
G F
(I,J,K) farthest
K

19

Issues?

Feature-Based Indexes Distance-Based Indexes
Semantic Information during CBR
low-level feature values no existing semantics capturing
correlated to semantics model embedded into search
queries

Different data types
none designed for handling
videos/documents
Seamless solution
none designed to handle multiple data types from a
single framework 20

GeM-Tree
how does it accomplish the goals?
Expectation I
Provide a single framework to manage
different types of multimedia data
Using a data-signature to
represent multimedia data
objects
F image ( x 1 , x 2 ,......... , x i ) , ( 0 , 0 , 0 ,......., 0 ) , (1, 0 , 0 ) ,1
            
   
Image part: FA = {x1 ,x2 ,…….,xi F A F B F C

}

Video part: FB = {y1,y2,…….,yj}

Ids: FC = {object_id, v_id,
s_id} F shot ( z 1 , z 2 ,......... , z i ) , ( y , y ,...., y ) , (1,1, 0 ) ,1
           j   1 2
 
F A
F B
21F C

GeM-Tree
how does it accomplish the
goals?
Expectation II
Accommodate varied Multidimensional
Representation
Using Earth Mover‟s Distance (EMD) to calculate (dis)similarity

• Derived from Monge-Kantorovich, a transportation problem
• Calculates distance between 2 distributions
• Distributions can be of variable lengths
K ,n
Given two distributions
x X ,w D
K ,m
andY , u D
y , a flow between x
and is y aFmatrix R
f ij
mxn
, find a flow that minimizes the overall
m n
flow, W ork x, y, F d ij f ij
i 1 j i

m n m n
EMD x , y d ij f ij
f ij 22
i 1 j i i 1 j i
EMD is calculated by:

GeM-Tree
how does it accomplish the
goals?
Expectation III
Accommodate CBR of individual data type along
with concept retrievals involving cross-similarity
between + EMD +
data-signature multimedia data Affinity Relationship[8][9]
 a stochastic construct called
Markov Model Mediator [12]
 extended into HMMM for videos
 determines the closeness of
two multimedia objects (affinity)
by following the access patterns
 “more frequently two objects
are accessed together, greater is
their semantic closeness/affinity”
23

How GeM-Tree supports CBR

Range Search: select all the appropriate
database objects within a given range
from the query
k-NN Search: search the entire
database to select k database objects
most similar to the query

if ((d(Findex_object, Fquery) <= dk) && (A(data, query) >= affinityk ))
add index_object to priority queue;
update dk and affinityk;
else
check next index_object from priority queue;
24

How GeM-Tree supports cross-
multimedia similarity search

Low-level Similarity High-level Similarity

Euclidean distance between HMMM [9] framework is
F of data objects take care of traversed
the image and video (upwards/downwards)
components according to the information
gathered from FC part

FC={object_id, v_id,
s_id} 25

Performance of GeM-Tree
Index structure Index structure
handling only handling only
images videos

Query # of Distance Computations Accuracy

GeM AH HAH Seq GeM AH HAH Seq

Only 98 80 X 147 90% 93% X 98%
Image

Only 63 X 50 147 90% X 91% 95%
Video

Mixed 80 X X 147 80% X X 90%
Types

26

Performance of GeM-Tree

Capability of handling variable-length features and
supporting queries such as region-based/object-
based queries
Distance Computing during Developing Index
Structure
Data Type GeM-Tree

Only Images 145

Only Videos 240

Both 960

27

Outline
i. Motivation
ii. Contributions
b. Query Refinement
iii. Discussions

28

What is Query Refinement

To Alleviate….
i. Number of queries in each iteration
Semantic Gap
increases
Perception Subjectivity
ii. High-level semantic requirement of the
userFuzziness of multimedia query
is modified

29

Where do we stand?

Existing Query Refinement Models
for Index Structures [7]
attempts to capture user requirements by
ONLY adjusting the inter and intra-level
feature-weights

30

Query Refinement in GeM-Tree

Requirement I
Number of queries in each iteration
increases
i. Introduces the concept of multi-point
query

ii. Modifies the (dis)similarity computation
approach

n 2
D IS T M U L T I ( Q , O ) i 1
Wi | C Fi | r
31

Query Refinement in GeM-Tree

Requirement
II
High-level semantic requirement of the user is
modified
i. Introduces affinity update method

aff m , n t 1 x1 x ( access t 1
1)

ii. Embeds semantic information into the
index structure considering multi-point
query
n
m ax i 1
(m ax( affinity a , q i , affinity b , q i ), m ax( affinity a , q i 1 , affinity b , q i 1
))
32

Evaluation

Evaluation score proposed to compare the
utility of different multimedia data management
frameworks
T T m in F Fm ax
M odel _ Score (1 ) x (1 | |)
n 2 n 2
3x ( i 1
(Ti T m in ) ) n 3x ( i 1
( Fi Fm ax ) ) n

• Compares based on both computation time and
accuracy
• One can be improved at the cost of other
• A balance is necessary 33

Experimental Analysis

AccuracyComparisons
CPU Time Comparison
0.45
120%
0.4

100%
0.35

0.3
80%
AH-Tree
CPU Time
Accuracy

0.25
Refine
AH-Tree Refine
60% HybridTree
0.2 HybridTree Refine
Refine
AH-Tree
AH-Tree
0.15
40%
Naive
Naive
0.1
20%
0.05

00%
1 1 2 2 3 3
Iterations
Iterations

34

Experimental Analysis

35

Outline
i. Motivation
ii. Contributions
b. Query Refinement
iii. Discussions

36

Why?
~ 400 million
users *
Collaborative Environment

 Explosion of social network applications
 Multimedia Data an important communication
medium shared
 Data management no longer an isolated task
youtube video

The way a multimedia data is used in a
social network can be used to generate
A Multimedia Data Network

* http://www.facebook.com/press/info.php?statistics 37

Multimedia Data Network

Multimedia Data shared/accessed
among a particular user group can nodes
form a social network

Each data object acts as edges
an actor (node)

Their relationship the
link (edge)

38

What kind of relationship?

The edges defining the relationships
vary with applications

Want to utilize information for
User behavior collected for over 5 years
customizing Multimedia Database
using Multimedia Retrieval Application
developed at DMIS strategies Dataset
Management for COREL
having 10,000 images
Used semantic similarity, as
perceived and reported by users, as
the relationship
39

Multimedia Data Network for
10,000 images
How the relationship information was presented
before
affinity.txt

1 2 ……………………… 10000
1 24 34 ……………………… 0
2 12 0 ……………………… 45
3 ………………………………………………………….
4 ………………………………………………………….
. ………………………………………………………….
. ………………………………………………………….
. ………………………………………………………….
. ………………………………………………………….
. ………………………………………………………….
. ………………………………………………………….
. ………………………………………………………….
10000 ………………………………………………………….
40

Multimedia Data Network

Characteristics of the generated
network structure

 A weighted Disconnected Graph Structure
 Large Size
 Visual Interpretation/Analysis becomes
challenging

41

Graph Preview
Solution
Approach

Reduce number of
nodes

Maintain network
characteristics

Maximize similarity
between original
and represented
networks 42

Existing Approaches
Using
semantic
information Identifying
associated disjoint
with data clusters
(content-
Using based) Represent
structural clusters as
information of glyphs or
data compound graph
(structure-
based)

Discovering Clustered
Use node
groupings/clas Graph metrics
ses in data
Layouts
43

Issues with Clustered Graph
representations

Determining the cluster size

Preserving overall structural
similarity/equivalence
Determining the representative
nodes
Preserving the network
characteristics
44

Proposed Approach
Node Filtering

Similarity Calculation

Node Assignment
Determine Metric

Graph Layout
Pick nodes Calculate Calculate Assign Generate
based on structural structural & filtered the
network and semantic nodes to representati
structure/us semantic similarity original ve graph
er choice metric nodes to
maximize
overall
similarity

45

Detailed Algorithm
Sample nodes to
capture overall
Step 1 network
characteristics
Node Filtering

Pick nodes
based on Select nodes
network
structure/us representing different
er choice groups in the
network

Random sampling
approaches which
preserve the
distribution
46

Detailed Algorithm

Step 2 Structural metrics
Determine Node Metric

Calculate • Adjacency Matrices:
structural
and edge source &
semantic edge terminus
metric
Semantic metrics

• A matrix of scores
of different
centrality values
47

Detailed Algorithm
y ij ( k ) xs (i ) s ( j ) ( k 1) xt ( i ) t ( j ) ( k 1)

x ij ( k ) y kl ( k 1) y kl ( k 1)
Step 3 t ( k ) i ,t ( l ) j s ( k ) i,s (l ) j
Similarity Calculation

Calculate
structural & Structural similarity
semantic • Coupled node-edge
similarity
score [11]

Semantic metrics
• Euclidean distance
between semantic
values
48

Detailed Algorithm

Step 4
Hungarian Algorithm
Assign
Node Assignment

filtered nodes • Pick up m nodes from
to original
nodes to the set of n nodes
maximize which maximizes the
overall total similarity score
similarity between the original
graph and the sub-
graph formed
• Assignment Problem
applying Munkres
Algorithm
49

Detailed Algorithm

Step 5
SPi , j
Connect node i and j with edgei,j if threshold
Max ( SPi , j k )

Generate Shortest Path
Graph Layout

the Approach
representati
ve graph • Preserve the ties
between nodes
• Consider the
overall
reach/strength of
each node
50

Evaluation

• Overall structural comparison
• Degree of similarity between connected nodes
(dyads)
• Using Euclidean distance between the centrality
values
What is Centrality? [10]
• Centrality measures the power/importance of a node with
respect to the entire network it belongs to
• Measure of holistic behavior of a node
M
2
c ik c jk
k 1
Ec 1 M
2
max c ik c jk 51
k 1

Generated Previews

low error value ~ 0.02 52

How is the Multimedia Data
Network utilized ?

• identify mutual relationships and role of a
particular
multimedia data object in a database
• design decisions of operations of the index
structures
Index structure is built on ONLY the low-level
features

Semantic relationship was introduced during
querying

No existing insertion policies consider the
53
semantic information stored in a data object

Insertion policies

degree centrality is defined as the
Use degree centrality number of links incident upon a node
(i.e., the number of ties that a node
has)
For a Multimedia Data Network, degree centrality
identifies the power/importance of a particular data
object in the entire network image to be inserted
node 1 node 2

insert
higher centrality
54

Deletion policies

Current Status

Any delete request from the users is entertained

That the user and hence the data might belong to a
collaborative environment is not considered

55

Deletion policies

betweenness centrality is
Use betweenness centrality defined as the number of
vertices that connect via a
particular node

For a delete request, if betweenness centrality of the
node is high, ask the user to reconsider

56

Outline
i. Motivation
ii. Contributions
b. Query Refinement
iii. Discussions

57

Assumptions and Limitations

• Assumed that features used for indexing represent
the multimedia data well
• Accuracy calculations are not quantitative and it
may vary from person to person
• Can handle only Numeric Data
• Only Soccer videos were used as test bed, other
domains were not checked

58

Outline
i. Motivation
ii. Contributions
b. Query Refinement
iii. Discussions

59

Future Direction

• Intelligent multimedia index structure optimizer
• Document indexing
• Support traditional alpha-numeric data
• Query optimizer for multimedia database
• Multimedia data management framework for
Collaborative Applications

60

Publications
Journals & Book Chapters

i. [chat10] Kasturi Chatterjee, Shixia Liu, Shu-Ching Chen, “Social Network Preview using Graph
Similarity,” (submitted to ACM Transactions on Information Systems), 2010.
ii. [chat09a] Kasturi Chatterjee, S. Masoud Sadjadi, Shu-Ching Chen, “A Distributed Multimedia
Data Management over Grid,” Multimedia Services in Intelligent Environments – Integrated
Systems, 2009 (in press).
iii. [chat09b] Kasturi Chatterjee, Shu-Ching Chen, “HAH-tree: Towards a Multidimensional Index
Structure Supporting Different Video Modeling Approaches in a Video Database Management
System,” IJIDS, vol. 2, no. 2, pp. 188-207, 2010.
iv. [chat09c] Kasturi Chatterjee, Shu-Ching Chen, “A Multimedia Data Management Approach
with GeM-Tree,” JMM, 2010 (in press).
v. [chat09d] Shu-Ching Chen, Min Chen, Na Zhao, Shahid Hamid, Kasturi Chatterjee, and Michael
Armella, “Florida Public Hurricane Loss Model: Research in Multi-Disciplinary System
Integration Assisting Government Policy Making,” Special Issue on Building the Next
Generation Infrastructure for Digital Government, Government Information Quarterly, Volume
26, Issue 2, pp. 285-294, April 2009.
vi. [chat 07a] Kasturi Chatterjee and Shu-Ching Chen, “A Novel Indexing and Access Mechanism
using Affinity Hybrid Tree for Content-Based Image Retrieval in Multimedia Databases,”
International Journal of Semantic Computing (IJSC), Vol. 1, Issue 2, pp. 147-170, June 2007.
61

Publications
Conferences

i. [chat09d] Yudan Li, Kasturi Chatterjee, Shu-Ching Chen, and Keqi Zhang, “A 3-D Traffic
Animation System with Storm Surge Response,” accepted for publication, IEEE International
Symposium on Multimedia (ISM2009), 2009.
ii. [chat08a] Kasturi Chatterjee and Shu-Ching Chen, “GeM-Tree: Towards a Generalized
Multidimensional Index Structure Supporting Image and Video Retrieval,” the Fourth IEEE
Publications
International Workshop on Multimedia Information Processing and Retrieval (MIPR2008), in
conjunction with IEEE International Symposium on Multimedia (ISM2008), 2008.
iii. [chat08c] Kasturi Chatterjee and Shu-Ching Chen, “Hierarchical Affinity-Hybrid Tree: A
Multidimensional Index Structure to Organize Videos and Support Content-Based Retrievals,”
Proceedings of the 2008 IEEE International Conference on Information Reuse and Integration
(IEEE IRI-08), 2008.
iv. [chat08d] Shu-Ching Chen, Min Chen, Na Zhao, Shahid Hamid, Khalid Saleem, and Kasturi
Chatterjee, “Florida Public Hurricane Loss Model (FPHLM): Research Experience in System
Integration,” the 9th Annual International Conference on Digital Government Research, 2008.

62

Publications
Conferences

v. [chat08e] Kasturi Chatterjee, Shixia Liu, and Shu-Ching Chen, “Using Graph Similarity for
Social Network Analysis,” in 6th LA Grid Summit, (First Place), 2008.
vi. [chat06a] Kasturi Chatterjee and Shu-Ching Chen, “Affinity Hybrid Tree: An Indexing
Technique for Content-Based Image Retrieval in Multimedia Databases,” in proceedings of
IEEE International Symposium on Multimedia (ISM2006), (Best Paper Award), 2006.
vii. [chat06b] Kasturi Chatterjee, Khalid Saleem, Na Zhao, Min Chen, Shu-Ching Chen, and Shahid
Hamid, “Modeling Methodology for Component Reuse and System Integration for Hurricane
Loss Projection Application,” in proceedings of IEEE International Conference on Information
Reuse and Integration (IEEE IRI-2006),2006.

63

References

[1] R. Bayer, “Binary B-Trees for Virtual Memory,” in ACM-SIGFIDET Workshop, San
Diego, California, Session 5B, pp. 219-235, 1971.
[2] J. Robinson, “The k-d-b-tree: A search structure for large multidimensional dynamic indexes,” in
Proceedings of the 1981 ACM SIGMOD International Conference on Management of Data, Ann
Arbor, United States, pp. 10–18, 1981.
[3] Y. N. Peter, "Data structures and algorithms for nearest neighbor search in general metric
spaces,“ in Proceedings of the fourth annual ACM-SIAM Symposium on Discrete algorithms, pp. 311-
321, 1993.
[4] C. Patella, et al., “M-tree: An efficient access method for similarity search in metric spaces,’’ in
Proceedings of 23rd VLDB, pp. 426-435, 1997.
[5] A. Guttman, “R-Trees: A Dynamic Index Structure for Spatial Searching,” in Proc. 1984 ACM
SIGMOD International Conference on Management of Data, pp. 47-57, 1984.
[6] K. Chakrabarti, S. Mehrotra, “The Hybrid Tree: An Index Structure for High Dimensional Feature
Spaces,” in ICDE 1999, pp. 440-447, 1999.
[7] K. Chakbarti, et al., “ Efficient Query Refinement in Multimedia Databases,” in Proc. International
Conference on Data Engineering, pp. 196-200, 2000.
[8] M-L. Shyu, S-C. Chen, M. Chen, C. Zhang, and C-M. Shu, "MMM: A Stochastic Mechanism for
Image Database Queries," Proceedings of the IEEE Fifth International Symposium on Multimedia
Software Engineering (MSE2003), pp. 188-195, December 10-12, 2003, Taichung, Taiwan, ROC.

65

References

[9] Shu-Ching Chen, Na Zhao, and Mei-Ling Shyu, "Modeling Semantic Concepts and User
Preferences in Content-Based Video Retrieval," International Journal of Semantic Computing
(IJSC), Vol. 1, Issue 3, pp. 377-402, September 2007.
[10] L. C. Freeman, “Centrality in Social Network: Conceptual Classification,” Social
Networks, vol. 1, no. 3, pp. 215-239, 1979.
[12] L. A . Zager, et. sl., “Graph Similarity Scoring and Matching,” Applied Mathematics
Letters, vol. 21, no.1, pp. 86-94, 2007.

66

Defense Powepoint

Recomendados

Recomendados

Mais conteúdo relacionado

Destaque

Destaque (6)

Semelhante a Defense Powepoint

Semelhante a Defense Powepoint (20)

Último

Último (20)

Defense Powepoint

Notas do Editor