SlideShare a Scribd company logo
1 of 21
Download to read offline
Mathias Lux
This work is licensed under a Creative
Commons Attribution 3.0 Unported License.
What is LIRE?
• Library for CBIR
• Easy access & instant “success”
• Few loc to index & search
It’s based on Lucene
• Java text retrieval framework
– based on inverted lists

• Top level Apache project

• Extends to Solr
Modular Feature
Architecture
LireFeature as the basic Interface
• Extraction,
• Distance function,

• Serialization (byte[] based)
• toString(), field name, …
Fast Access & Linear
Search
• Efficient coding of serialization
– transformation to byte[]
– run length coding for sparse vectors

• Custom Lucene codec
– Lucene field compression
– update to DocValues in v1.0
Search with sub Linear
T ime Complexity
• Hashing based approach for global features
– Locality sensitive hashing
• bit sampling

– Proximity based hashing

• nearest neighbors as “buckets”,
• cp. work of G. Amato

• Local features supported

– SIFT, SURF, k-means, VLAD
Tools
• Parallel Indexing
– consumer-producer based
– up to the capabilities of the VM / HDD

• Intermediate byte based data format
– small footprint, efficient, relative paths
Extending LIRE
• Implement a global feature
– extraction, distance function, serialization

• Lire takes care of the rest
– Parallel indexing, hashing, search
Using Parts of LIRE
Take what you need …

• Feature implementations

– cp. work of Xinchao Li et al. at Mediaeval 2013

• Image processing

– Canny Edge Detector, SWT (coming soon),

• Tools & code base

– FastMap, Suffix Tree Clustering, …
UCID Data Set

MAP

precision 10

ER

CEDD

0,431

0,420

0,553

CEDD

Color Correlogram

0,586

0,480

0,370

Color Correlogram

Color Layout

0,277

0,285

0,679

Color Layout

Edge Histogram

0,180

0,202

0,813

Edge Histogram

FCTH

0,447

0,415

0,531

FCTH

JCD

0,470

0,435

0,508

JCD

Joint Histogram

0,348

0,313

0,603

Joint Histogram

LBP Opponent Joined

0,266

0,267

0,729

LBP Opponent Joined

Local Binary Patterns (LBP)

0,228

0,221

0,714

Local Binary Patterns (LBP)

Opponent Histogram

0,319

0,309

0,649

Opponent Histogram

PHOG

0,232

0,235

0,725

PHOG

RGB Color Histogram

0,403

0,358

0,550

RGB Color Histogram

Rotation Invariant LBP

0,165

0,174

0,813

Rotation Invariant LBP

Scalable Color

0,172

0,183

0,840

Scalable Color

SPCEDD

0,575

0,487

0,366

SPCEDD

SPLBP

0,264

0,251

0,683

SPLBP

Surf BoVW

0,348

0,313

0,634

Surf BoVW

VLAD-SURF

0,370

0,356

0,603

VLAD-SURF
SIMPLICity Data Set

MAP

precision 10

ER

CEDD

0,513

0,706

0,193

Color Correlogram

0,498

0,740

0,159

Color Layout

0,439

0,612

0,303

Edge Histogram

0,333

0,500

0,401

FCTH

0,499

0,703

0,207

JCD

0,520

0,730

0,183

JCD

Joint Histogram

0,449

0,689

0,197

Joint Histogram

LBP Opponent Joined

0,418

0,569

0,347

LBP Opponent Joined

Local Binary Patterns (LBP)

0,358

0,587

0,295

Local Binary Patterns (LBP)

OpponentHistogram

0,450

0,635

0,270

OpponentHistogram

PHOG

0,365

0,547

0,355

PHOG

RGB Color Histogram

0,450

0,704

0,191

RGB Color Histogram

Rotation Invariant LBP

0,338

0,520

0,375

Rotation Invariant LBP

Scalable Color

0,305

0,470

0,464

Scalable Color

SPCEDD

0,599

0,772

0,144

SPCEDD

SPLBP

0,395

0,556

0,348

SPLBP

SURF BoVW

0,338

0,464

0,475

SURF BoVW

VLAD-SURF

0,365

0,518

0,407

VLAD-SURF

CEDD

Color Correlogram
Color Layout
Edge Histogram

FCTH
Hashing - BitSampling
1,000
0,900

JCD

0,800

CEDD

0,700

FCTH

0,600

ACC

0,500

PHOG

0,400

OPH

0,300

ColHist

0,200

ColLay

0,100

EH
SPCEDD

0,000

0

500

1000

1500

2000

2500

100k images from flickr, 50 results cp. to linear search

3000
Hashing - Proximity
1,000

JCD

0,900

CEDD

0,800
0,700

FCTH

0,600

ACC

0,500

PHOG

0,400

OPHIST

0,300

ColHist

0,200

Collay

0,100

EH

0,000

SPCEDD

0

500

1000

1500

2000

2500

100k images from flickr, 50 results cp. to linear search

3000
Apache Solr Integration
• Motivation:
– Use a search and retrieval server with all its tools

• Objectives:
– indexing & management

– efficient content based image search
– content based ranking of results
Solr Plugin
• Custom Request Handler
– Uses Solr’s request and response framework

– Allows for content based retrieval

• Custom ValueSourceFunction
– Added to text based search queries
– Allows for ranking based on the distance function
Solr Plugin
• Custom type of index field
– DocValue based binary field
– transmission base64 encoded

• Custom Indexer
– XML documents to be uploaded to Solr
SOLR Plugin
• http://demo-itec.uni-

klu.ac.at/liredemo/wipo.html

• Local demo
Future Work
• DocValues based indexing

– make linear search faster

• Proximity hashing

– metric spaces approach
– more accurate

• Release version 1.0

– adding docs & features freeze
Acknowledgements
I’d like to thank Anna-Maria Pasterk, Arthur Li, Arthur Pitman,
Bastian Hösch, Benjamin Sznajder, Christian Penz, Christine

Keim, Christoph Kofler, Dan Hanley, Daniel Pötzinger, Fabrizio

Falchi, Franz Graf, Giuseppe Amato, Glenn Macstravic, James
Charters, Janine Lachner, Katharina Tomanec, Lukas Esterle,

Manuel Oraze, Marian Kogler, Marko Keuschnig, Michael Riegler,
Rodrigo Carvalho Rezende, Roman Divotkey, Roman Kern,
Savvas Chatzichristofis and Sandeep Gupta.
Lecture Book
T hanks for listening …
• Mathias Lux
• mlux@itec.uni-klu.ac.at

More Related Content

Similar to LIRE presentation at the ACM Multimedia Open Source Software Competition 2013

OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)NetProtocol Xpert
 
Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011Andy Parsons
 
Conceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQLConceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQLMongoDB
 
[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming Overview[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming OverviewStratio
 
OrientDB - The 2nd generation of (multi-model) NoSQL
OrientDB - The 2nd generation of  (multi-model) NoSQLOrientDB - The 2nd generation of  (multi-model) NoSQL
OrientDB - The 2nd generation of (multi-model) NoSQLRoberto Franchini
 
Bgpcep odl summit 2015
Bgpcep odl summit 2015Bgpcep odl summit 2015
Bgpcep odl summit 2015Giles Heron
 

Similar to LIRE presentation at the ACM Multimedia Open Source Software Competition 2013 (11)

OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)OSPF Summary LSA (Type 3 LSA)
OSPF Summary LSA (Type 3 LSA)
 
Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011Andy Parsons Pivotal June 2011
Andy Parsons Pivotal June 2011
 
Mpls technology
Mpls technologyMpls technology
Mpls technology
 
SenseiDB
SenseiDBSenseiDB
SenseiDB
 
Mpls te
Mpls teMpls te
Mpls te
 
Conceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQLConceptos básicos. Seminario web 1: Introducción a NoSQL
Conceptos básicos. Seminario web 1: Introducción a NoSQL
 
Osp fv3 cs
Osp fv3 csOsp fv3 cs
Osp fv3 cs
 
[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming Overview[Spark meetup] Spark Streaming Overview
[Spark meetup] Spark Streaming Overview
 
OrientDB - The 2nd generation of (multi-model) NoSQL
OrientDB - The 2nd generation of  (multi-model) NoSQLOrientDB - The 2nd generation of  (multi-model) NoSQL
OrientDB - The 2nd generation of (multi-model) NoSQL
 
Bgpcep odl summit 2015
Bgpcep odl summit 2015Bgpcep odl summit 2015
Bgpcep odl summit 2015
 
Cisco ospf
Cisco ospf Cisco ospf
Cisco ospf
 

More from dermotte

Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015dermotte
 
CBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in MultimediaCBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in Multimediadermotte
 
Why did you record this video?
Why did you record this video?Why did you record this video?
Why did you record this video?dermotte
 
Content based image retrieval with LIRe
Content based image retrieval with LIReContent based image retrieval with LIRe
Content based image retrieval with LIRedermotte
 
Ohne LIRe keine Bildsuche
Ohne LIRe keine BildsucheOhne LIRe keine Bildsuche
Ohne LIRe keine Bildsuchedermotte
 
Callisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for ImagesCallisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for Imagesdermotte
 
User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."dermotte
 
Visual Information Retrieval
Visual Information RetrievalVisual Information Retrieval
Visual Information Retrievaldermotte
 
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...dermotte
 
Power Laws Popularity And Interestingness
Power Laws Popularity And InterestingnessPower Laws Popularity And Interestingness
Power Laws Popularity And Interestingnessdermotte
 
Aspects of broad folksonomies
Aspects of broad folksonomiesAspects of broad folksonomies
Aspects of broad folksonomiesdermotte
 

More from dermotte (11)

Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015
 
CBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in MultimediaCBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in Multimedia
 
Why did you record this video?
Why did you record this video?Why did you record this video?
Why did you record this video?
 
Content based image retrieval with LIRe
Content based image retrieval with LIReContent based image retrieval with LIRe
Content based image retrieval with LIRe
 
Ohne LIRe keine Bildsuche
Ohne LIRe keine BildsucheOhne LIRe keine Bildsuche
Ohne LIRe keine Bildsuche
 
Callisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for ImagesCallisto: Content Based Tag Recommendation for Images
Callisto: Content Based Tag Recommendation for Images
 
User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."
 
Visual Information Retrieval
Visual Information RetrievalVisual Information Retrieval
Visual Information Retrieval
 
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
 
Power Laws Popularity And Interestingness
Power Laws Popularity And InterestingnessPower Laws Popularity And Interestingness
Power Laws Popularity And Interestingness
 
Aspects of broad folksonomies
Aspects of broad folksonomiesAspects of broad folksonomies
Aspects of broad folksonomies
 

Recently uploaded

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Recently uploaded (20)

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

LIRE presentation at the ACM Multimedia Open Source Software Competition 2013

  • 1. Mathias Lux This work is licensed under a Creative Commons Attribution 3.0 Unported License.
  • 2. What is LIRE? • Library for CBIR • Easy access & instant “success” • Few loc to index & search
  • 3. It’s based on Lucene • Java text retrieval framework – based on inverted lists • Top level Apache project • Extends to Solr
  • 4. Modular Feature Architecture LireFeature as the basic Interface • Extraction, • Distance function, • Serialization (byte[] based) • toString(), field name, …
  • 5. Fast Access & Linear Search • Efficient coding of serialization – transformation to byte[] – run length coding for sparse vectors • Custom Lucene codec – Lucene field compression – update to DocValues in v1.0
  • 6. Search with sub Linear T ime Complexity • Hashing based approach for global features – Locality sensitive hashing • bit sampling – Proximity based hashing • nearest neighbors as “buckets”, • cp. work of G. Amato • Local features supported – SIFT, SURF, k-means, VLAD
  • 7. Tools • Parallel Indexing – consumer-producer based – up to the capabilities of the VM / HDD • Intermediate byte based data format – small footprint, efficient, relative paths
  • 8. Extending LIRE • Implement a global feature – extraction, distance function, serialization • Lire takes care of the rest – Parallel indexing, hashing, search
  • 9. Using Parts of LIRE Take what you need … • Feature implementations – cp. work of Xinchao Li et al. at Mediaeval 2013 • Image processing – Canny Edge Detector, SWT (coming soon), • Tools & code base – FastMap, Suffix Tree Clustering, …
  • 10. UCID Data Set MAP precision 10 ER CEDD 0,431 0,420 0,553 CEDD Color Correlogram 0,586 0,480 0,370 Color Correlogram Color Layout 0,277 0,285 0,679 Color Layout Edge Histogram 0,180 0,202 0,813 Edge Histogram FCTH 0,447 0,415 0,531 FCTH JCD 0,470 0,435 0,508 JCD Joint Histogram 0,348 0,313 0,603 Joint Histogram LBP Opponent Joined 0,266 0,267 0,729 LBP Opponent Joined Local Binary Patterns (LBP) 0,228 0,221 0,714 Local Binary Patterns (LBP) Opponent Histogram 0,319 0,309 0,649 Opponent Histogram PHOG 0,232 0,235 0,725 PHOG RGB Color Histogram 0,403 0,358 0,550 RGB Color Histogram Rotation Invariant LBP 0,165 0,174 0,813 Rotation Invariant LBP Scalable Color 0,172 0,183 0,840 Scalable Color SPCEDD 0,575 0,487 0,366 SPCEDD SPLBP 0,264 0,251 0,683 SPLBP Surf BoVW 0,348 0,313 0,634 Surf BoVW VLAD-SURF 0,370 0,356 0,603 VLAD-SURF
  • 11. SIMPLICity Data Set MAP precision 10 ER CEDD 0,513 0,706 0,193 Color Correlogram 0,498 0,740 0,159 Color Layout 0,439 0,612 0,303 Edge Histogram 0,333 0,500 0,401 FCTH 0,499 0,703 0,207 JCD 0,520 0,730 0,183 JCD Joint Histogram 0,449 0,689 0,197 Joint Histogram LBP Opponent Joined 0,418 0,569 0,347 LBP Opponent Joined Local Binary Patterns (LBP) 0,358 0,587 0,295 Local Binary Patterns (LBP) OpponentHistogram 0,450 0,635 0,270 OpponentHistogram PHOG 0,365 0,547 0,355 PHOG RGB Color Histogram 0,450 0,704 0,191 RGB Color Histogram Rotation Invariant LBP 0,338 0,520 0,375 Rotation Invariant LBP Scalable Color 0,305 0,470 0,464 Scalable Color SPCEDD 0,599 0,772 0,144 SPCEDD SPLBP 0,395 0,556 0,348 SPLBP SURF BoVW 0,338 0,464 0,475 SURF BoVW VLAD-SURF 0,365 0,518 0,407 VLAD-SURF CEDD Color Correlogram Color Layout Edge Histogram FCTH
  • 14. Apache Solr Integration • Motivation: – Use a search and retrieval server with all its tools • Objectives: – indexing & management – efficient content based image search – content based ranking of results
  • 15. Solr Plugin • Custom Request Handler – Uses Solr’s request and response framework – Allows for content based retrieval • Custom ValueSourceFunction – Added to text based search queries – Allows for ranking based on the distance function
  • 16. Solr Plugin • Custom type of index field – DocValue based binary field – transmission base64 encoded • Custom Indexer – XML documents to be uploaded to Solr
  • 18. Future Work • DocValues based indexing – make linear search faster • Proximity hashing – metric spaces approach – more accurate • Release version 1.0 – adding docs & features freeze
  • 19. Acknowledgements I’d like to thank Anna-Maria Pasterk, Arthur Li, Arthur Pitman, Bastian Hösch, Benjamin Sznajder, Christian Penz, Christine Keim, Christoph Kofler, Dan Hanley, Daniel Pötzinger, Fabrizio Falchi, Franz Graf, Giuseppe Amato, Glenn Macstravic, James Charters, Janine Lachner, Katharina Tomanec, Lukas Esterle, Manuel Oraze, Marian Kogler, Marko Keuschnig, Michael Riegler, Rodrigo Carvalho Rezende, Roman Divotkey, Roman Kern, Savvas Chatzichristofis and Sandeep Gupta.
  • 21. T hanks for listening … • Mathias Lux • mlux@itec.uni-klu.ac.at