SlideShare uma empresa Scribd logo
1 de 83
Baixar para ler offline
Sketching out Your Search Intent
  - Towards bridging intent and semantic gaps in
          web-scale multimedia search


                          Xian-Sheng Hua
                Lead Researcher, Microsoft Research Asia

  June 24 2010 – ACM Multimedia 2010 TPC Meeting – University of Amsterdam
Xian-Sheng Hua, MSR Asia




Evolution of CBIR/CBVR

                                      Annotation Based
  Commercial Search                                            ~2006 
      Engines
                                                ~2003     (1) Tagging Based
                                                          (2) Large-Scale CBIR/CBVR
       Conventional CBIR      ~2001
                            Direct-Text Based
                  1990’                           Academia
                                                  Prototypes

        1970~
        1980’
    Manual Labeling Based




                                                                                            2
Xian-Sheng Hua, MSR Asia




Three Schemes

                Example-Based


            Annotation-Based


                 Text-Based
Xian-Sheng Hua, MSR Asia




Limitation of Text-Based Search
• High noises
   ̵   Surrounding text: frequently not related to the visual content
   ̵   Social tags: also noisy, and a large portion don’t have tags

• Mostly only covers simple semantics
   ̵   Surrounding text: limited
   ̵   Social tags: People tend to only input simple and personal tags
   ̵   Query association: powerful but coverage is still limited (non-clicked
       images will never be well indexed)


       A complex example:
          Finding Lady Gaga walking on red carpet with black dress
Xian-Sheng Hua, MSR Asia




Limitation of Annotation-Based Search
• Low accuracy
  ̵   The accuracy of automatic annotation is still far from satisfactory
  ̵   Difficult to solve due

• Limited coverage
  ̵   The coverage of the semantic concepts is still far from the rich content
      that is contain in an image
  ̵   Difficult to extend

• High computation
  ̵   Learning costs high computation
  ̵   Recognition costs high if the number of concepts is large

                   Still has a long way to go
Xian-Sheng Hua, MSR Asia




Limitation of Large-Scale Example-Based Search
• You need an example first
   ̵   How to do it if I don’t have an initial example?

• Large-scale (semantic) similarity search is still difficult
   ̵   Though large-scale (near) duplicate detection is good




                     An example: TinEye
Xian-Sheng Hua, MSR Asia




Possible Way-Outs?
• Large-scale high-performance annotation?
  ̵ Model-based? Data-driven? Hybrid? …

• Large-scale manual labeling?
  ̵ ESP game? Label Me? Machinery Turk? …

• New query interface?
  ̵ Interactive search?


Or … To combine text, content and interface?
Xian-Sheng Hua, MSR Asia




Simple Attempts
• Exemplary attempts based on this idea
  ̵ Search by dominant color (Google/Bing)
   ̵ Show similar images (Bing)/Find similar images (Google)
    ̵ Show more sizes (Bing)
Xian-Sheng Hua, MSR Asia




Two Recent Projects
• Sketching out your search intent
  ̵ Image Search by Color Sketch
   ̵ Image Search by Concept Sketch
Search by Color Sketch
      WWW 2010 Demo
Xian-Sheng Hua, MSR Asia




Seems It Is Not New?
• “Query by sketch” has been proposed long time ago
  ̵   But on small datasets and difficult to scale up
  ̵   And seldom work well actually
  ̵   Not use to use – “object sketch”
It is based on a SIGGRAPH 95’s paper:
       C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995
on small scale image set and difficult to scale-up (0.44 second for 10,000 images)
Xian-Sheng Hua, MSR Asia
Xian-Sheng Hua, MSR Asia




Our Approach
• Color sketch …
  ̵   Is not “object sketch”
  ̵   But only rough color distribution
  ̵   Supports large-scale data
  ̵   And uses text at the same time
Xian-Sheng Hua, MSR Asia




Our Approach
• Color sketch …
  ̵   Is not “object sketch”
  ̵   But only rough color distribution
  ̵   Supports large-scale data
  ̵   And uses text at the same time

• Advantages of “Color Sketch”
  ̵   More presentative than dominant color (and text only)
  ̵   More robust than “object sketch”
  ̵   Easy to use compared with “object sketch”
  ̵   Easy to scale up
Xian-Sheng Hua, MSR Asia




Example: Blue flower with green leaf
Xian-Sheng Hua, MSR Asia




Example: Brooke Hogan walking on red carpet
Xian-Sheng Hua, MSR Asia




Color Sketch Is More Powerful
  Blue Flower with Green Leaf   Brooke Hogan walking on red carpet
Xian-Sheng Hua, MSR Asia




Color Sketch Is More Powerful




                                                  30
Xian-Sheng Hua, MSR Asia




Demo
• It has been productized in Bing
Gambia flag
Xian-Sheng Hua, MSR Asia




 Summary – Why it Works
• What are difficult
   ̵   Semantics is difficult
   ̵   Intent prediction is difficult

• What are easier
   ̵   Use color to describe images’ content
   ̵   Use color to describe users’ intent
                   content                color sketch              intent




   Color sketch is an intermediate representation to connect images’ content and users’ intent.
Xian-Sheng Hua, MSR Asia




Limitation of Color Sketch
• Only works well for those semantics that can
  be described by color distribution
• Only works well when people is able to transfer
  their desire into color distribution



     A more complex example:
        A flying butterfly on the top-left of a flower.
Search by Concept Sketch
         SIGIR 2010
Xian-Sheng Hua, MSR Asia




Search By Concept Sketch
• A system for the image search intentions concerning:
   ̵   The presence of the semantic concepts (semantic constraints)
   ̵   the spatial layout of the concepts (spatial constraints)




        butterfly

              flower



 A concept map query             Image search results of our system
Xian-Sheng Hua, MSR Asia




 Framework
A concept map query                                                           A candidate image
                                1
    A: butterfly                    Create Candidate
    B: flower                          Image Set
       butterfly
    A: X=0.5, Y=0.5
    B: X=0.8, flower
              Y=0.8

                                2                                        3
                                    Instant Concept                          Concept Distribution
                                       Modeling                                  Estimation
                                                  Concept Models                      Estimated distributions

                                               butterfly                               butterfly
                   4                              flower
                            Intent                                                      flower
                        Interpretation
        Desired distributions
                                              5
                                                  Spatial distribution
         butterfly                                   comparison

           flower
                                                     Relevance score
Xian-Sheng Hua, MSR Asia




 Search Results Comparison
  Task: searching for the images with “snoopy appear at right”



    snoopy
 Text-based image search




   “show similar image”




              snoopy




Image search by concept map
Xian-Sheng Hua, MSR Asia




Search Results
Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a jeep shown above a piece of grass”




  jeep grass
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a car parked in front of a house”




  house car
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “a keyboard and a mouse being side by side”




  keyboard mouse
Xian-Sheng Hua, MSR Asia




Search Results Comparison
Task: searching for the images with “sky/Colosseum/grass appear from top to bottom”




  Colosseum grass
Xian-Sheng Hua, MSR Asia




 Advanced Function: Influence Scope
 Indication
• Allow users to explicitly specify the occupied
  region of a concept

           house
                   Default
                                          house


    lawn                           lawn




                                          house


                                   lawn




                             Search for the long narrow lawn at the bottom of the image
Xian-Sheng Hua, MSR Asia




Advanced Function:
Influence Scope Adjustment



      Searching for small windmill




      Searching for large windmill
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment
                               sky


                           house


                               lawn


                 Clear                       Search

                         Advanced Function
                 Visual Assistor




                                                      Previous   Next




  Allow users to indicate or narrow down
  what a desired concept looks like (visual constraints)
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment


                Searching for
                front-view jeeps

 jeep




               Searching for
               side-view jeeps




        Visual instances
Xian-Sheng Hua, MSR Asia




Advanced Function:
Visual Modeling Adjustment


                  Searching for
                  butterfly with
                  yellow flower

 butterfly


       flower
                  Searching for
                  butterfly with
                  red flower




                Visual instances
Xian-Sheng Hua, MSR Asia




Conclusions
• To bridge the gap between users’ intent and
  visual semantics of images by
  ̵ Sketching out your search intent, and
   ̵ Combining text and visual content

• Two exemplary approaches introduced
  ̵ Search by color sketch
   ̵ Search by concept sketch
Xian-Sheng Hua, MSR Asia




Thank You

Mais conteúdo relacionado

Destaque

Ptc creo fmx sales presentation
Ptc creo fmx sales presentationPtc creo fmx sales presentation
Ptc creo fmx sales presentationVictor Mitov
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsPaul Hofmann
 
Network Intent Composition in OpenDaylight
Network Intent Composition in OpenDaylightNetwork Intent Composition in OpenDaylight
Network Intent Composition in OpenDaylightOpenDaylight
 
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...OPNFV
 
Droidcon it 2015: Android Lollipop for Enterprise
Droidcon it 2015: Android Lollipop for EnterpriseDroidcon it 2015: Android Lollipop for Enterprise
Droidcon it 2015: Android Lollipop for EnterpriseConsulthinkspa
 
Object-oriented design patterns in UML [Software Modeling] [Computer Science...
Object-oriented design patterns  in UML [Software Modeling] [Computer Science...Object-oriented design patterns  in UML [Software Modeling] [Computer Science...
Object-oriented design patterns in UML [Software Modeling] [Computer Science...Ivano Malavolta
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Yun-Nung (Vivian) Chen
 
Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebLeeFeigenbaum
 

Destaque (8)

Ptc creo fmx sales presentation
Ptc creo fmx sales presentationPtc creo fmx sales presentation
Ptc creo fmx sales presentation
 
Dynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & StatisticsDynamic Search Using Semantics & Statistics
Dynamic Search Using Semantics & Statistics
 
Network Intent Composition in OpenDaylight
Network Intent Composition in OpenDaylightNetwork Intent Composition in OpenDaylight
Network Intent Composition in OpenDaylight
 
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
Summit 16: Applying Machine Learning to Intent-based Networking and Nfv Scali...
 
Droidcon it 2015: Android Lollipop for Enterprise
Droidcon it 2015: Android Lollipop for EnterpriseDroidcon it 2015: Android Lollipop for Enterprise
Droidcon it 2015: Android Lollipop for Enterprise
 
Object-oriented design patterns in UML [Software Modeling] [Computer Science...
Object-oriented design patterns  in UML [Software Modeling] [Computer Science...Object-oriented design patterns  in UML [Software Modeling] [Computer Science...
Object-oriented design patterns in UML [Software Modeling] [Computer Science...
 
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
Unsupervised Learning and Modeling of Knowledge and Intent for Spoken Dialogu...
 
Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic Web
 

Semelhante a Bridging Intent and Semantic Gaps in Multimedia Search

Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Sujit Pal
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Jonathon Hare
 
Reinvent The Wheel: Sketching Your Own Design Process
Reinvent The Wheel: Sketching Your Own Design ProcessReinvent The Wheel: Sketching Your Own Design Process
Reinvent The Wheel: Sketching Your Own Design ProcessChristopher Fahey
 
Using content-based multimedia similarity search for learning
Using content-based multimedia similarity search for learningUsing content-based multimedia similarity search for learning
Using content-based multimedia similarity search for learningsuzreader
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)paddu123
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017StampedeCon
 
Analyzing visuals ppt
Analyzing visuals pptAnalyzing visuals ppt
Analyzing visuals pptjjory7
 

Semelhante a Bridging Intent and Semantic Gaps in Multimedia Search (8)

Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...Learning a Joint Embedding Representation for Image Search using Self-supervi...
Learning a Joint Embedding Representation for Image Search using Self-supervi...
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...
 
Reinvent The Wheel: Sketching Your Own Design Process
Reinvent The Wheel: Sketching Your Own Design ProcessReinvent The Wheel: Sketching Your Own Design Process
Reinvent The Wheel: Sketching Your Own Design Process
 
Using content-based multimedia similarity search for learning
Using content-based multimedia similarity search for learningUsing content-based multimedia similarity search for learning
Using content-based multimedia similarity search for learning
 
Content based image retrieval(cbir)
Content based image retrieval(cbir)Content based image retrieval(cbir)
Content based image retrieval(cbir)
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Analyzing visuals ppt
Analyzing visuals pptAnalyzing visuals ppt
Analyzing visuals ppt
 

Último

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 

Último (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 

Bridging Intent and Semantic Gaps in Multimedia Search

  • 1. Sketching out Your Search Intent - Towards bridging intent and semantic gaps in web-scale multimedia search Xian-Sheng Hua Lead Researcher, Microsoft Research Asia June 24 2010 – ACM Multimedia 2010 TPC Meeting – University of Amsterdam
  • 2. Xian-Sheng Hua, MSR Asia Evolution of CBIR/CBVR Annotation Based Commercial Search ~2006  Engines ~2003 (1) Tagging Based (2) Large-Scale CBIR/CBVR Conventional CBIR ~2001 Direct-Text Based 1990’ Academia Prototypes 1970~ 1980’ Manual Labeling Based 2
  • 3. Xian-Sheng Hua, MSR Asia Three Schemes Example-Based Annotation-Based Text-Based
  • 4. Xian-Sheng Hua, MSR Asia Limitation of Text-Based Search • High noises ̵ Surrounding text: frequently not related to the visual content ̵ Social tags: also noisy, and a large portion don’t have tags • Mostly only covers simple semantics ̵ Surrounding text: limited ̵ Social tags: People tend to only input simple and personal tags ̵ Query association: powerful but coverage is still limited (non-clicked images will never be well indexed) A complex example: Finding Lady Gaga walking on red carpet with black dress
  • 5.
  • 6. Xian-Sheng Hua, MSR Asia Limitation of Annotation-Based Search • Low accuracy ̵ The accuracy of automatic annotation is still far from satisfactory ̵ Difficult to solve due • Limited coverage ̵ The coverage of the semantic concepts is still far from the rich content that is contain in an image ̵ Difficult to extend • High computation ̵ Learning costs high computation ̵ Recognition costs high if the number of concepts is large Still has a long way to go
  • 7. Xian-Sheng Hua, MSR Asia Limitation of Large-Scale Example-Based Search • You need an example first ̵ How to do it if I don’t have an initial example? • Large-scale (semantic) similarity search is still difficult ̵ Though large-scale (near) duplicate detection is good An example: TinEye
  • 8. Xian-Sheng Hua, MSR Asia Possible Way-Outs? • Large-scale high-performance annotation? ̵ Model-based? Data-driven? Hybrid? … • Large-scale manual labeling? ̵ ESP game? Label Me? Machinery Turk? … • New query interface? ̵ Interactive search? Or … To combine text, content and interface?
  • 9. Xian-Sheng Hua, MSR Asia Simple Attempts • Exemplary attempts based on this idea ̵ Search by dominant color (Google/Bing) ̵ Show similar images (Bing)/Find similar images (Google) ̵ Show more sizes (Bing)
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Xian-Sheng Hua, MSR Asia Two Recent Projects • Sketching out your search intent ̵ Image Search by Color Sketch ̵ Image Search by Concept Sketch
  • 17. Search by Color Sketch WWW 2010 Demo
  • 18. Xian-Sheng Hua, MSR Asia Seems It Is Not New? • “Query by sketch” has been proposed long time ago ̵ But on small datasets and difficult to scale up ̵ And seldom work well actually ̵ Not use to use – “object sketch”
  • 19.
  • 20. It is based on a SIGGRAPH 95’s paper: C. Jacobs et al. Fast Multiresolution Image Querying. SIGGRAPH 1995 on small scale image set and difficult to scale-up (0.44 second for 10,000 images)
  • 22.
  • 23. Xian-Sheng Hua, MSR Asia Our Approach • Color sketch … ̵ Is not “object sketch” ̵ But only rough color distribution ̵ Supports large-scale data ̵ And uses text at the same time
  • 24.
  • 25.
  • 26. Xian-Sheng Hua, MSR Asia Our Approach • Color sketch … ̵ Is not “object sketch” ̵ But only rough color distribution ̵ Supports large-scale data ̵ And uses text at the same time • Advantages of “Color Sketch” ̵ More presentative than dominant color (and text only) ̵ More robust than “object sketch” ̵ Easy to use compared with “object sketch” ̵ Easy to scale up
  • 27. Xian-Sheng Hua, MSR Asia Example: Blue flower with green leaf
  • 28. Xian-Sheng Hua, MSR Asia Example: Brooke Hogan walking on red carpet
  • 29. Xian-Sheng Hua, MSR Asia Color Sketch Is More Powerful Blue Flower with Green Leaf Brooke Hogan walking on red carpet
  • 30. Xian-Sheng Hua, MSR Asia Color Sketch Is More Powerful 30
  • 31. Xian-Sheng Hua, MSR Asia Demo • It has been productized in Bing
  • 32.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63.
  • 64.
  • 65.
  • 66. Xian-Sheng Hua, MSR Asia Summary – Why it Works • What are difficult ̵ Semantics is difficult ̵ Intent prediction is difficult • What are easier ̵ Use color to describe images’ content ̵ Use color to describe users’ intent content color sketch intent Color sketch is an intermediate representation to connect images’ content and users’ intent.
  • 67. Xian-Sheng Hua, MSR Asia Limitation of Color Sketch • Only works well for those semantics that can be described by color distribution • Only works well when people is able to transfer their desire into color distribution A more complex example: A flying butterfly on the top-left of a flower.
  • 68. Search by Concept Sketch SIGIR 2010
  • 69. Xian-Sheng Hua, MSR Asia Search By Concept Sketch • A system for the image search intentions concerning: ̵ The presence of the semantic concepts (semantic constraints) ̵ the spatial layout of the concepts (spatial constraints) butterfly flower A concept map query Image search results of our system
  • 70. Xian-Sheng Hua, MSR Asia Framework A concept map query A candidate image 1 A: butterfly Create Candidate B: flower Image Set butterfly A: X=0.5, Y=0.5 B: X=0.8, flower Y=0.8 2 3 Instant Concept Concept Distribution Modeling Estimation Concept Models Estimated distributions butterfly butterfly 4 flower Intent flower Interpretation Desired distributions 5 Spatial distribution butterfly comparison flower Relevance score
  • 71. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “snoopy appear at right” snoopy Text-based image search “show similar image” snoopy Image search by concept map
  • 72. Xian-Sheng Hua, MSR Asia Search Results Task: searching for the images with “snoopy appear at top” / “snoopy appear at left”
  • 73. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a jeep shown above a piece of grass” jeep grass
  • 74. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a car parked in front of a house” house car
  • 75. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “a keyboard and a mouse being side by side” keyboard mouse
  • 76. Xian-Sheng Hua, MSR Asia Search Results Comparison Task: searching for the images with “sky/Colosseum/grass appear from top to bottom” Colosseum grass
  • 77. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Indication • Allow users to explicitly specify the occupied region of a concept house Default house lawn lawn house lawn Search for the long narrow lawn at the bottom of the image
  • 78. Xian-Sheng Hua, MSR Asia Advanced Function: Influence Scope Adjustment Searching for small windmill Searching for large windmill
  • 79. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment sky house lawn Clear Search Advanced Function Visual Assistor Previous Next Allow users to indicate or narrow down what a desired concept looks like (visual constraints)
  • 80. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment Searching for front-view jeeps jeep Searching for side-view jeeps Visual instances
  • 81. Xian-Sheng Hua, MSR Asia Advanced Function: Visual Modeling Adjustment Searching for butterfly with yellow flower butterfly flower Searching for butterfly with red flower Visual instances
  • 82. Xian-Sheng Hua, MSR Asia Conclusions • To bridge the gap between users’ intent and visual semantics of images by ̵ Sketching out your search intent, and ̵ Combining text and visual content • Two exemplary approaches introduced ̵ Search by color sketch ̵ Search by concept sketch
  • 83. Xian-Sheng Hua, MSR Asia Thank You