SlideShare uma empresa Scribd logo
1 de 22
A service-based Architecture for
Multi-domain Search on the Web
ICSOC 2010 - 8th International Conference on Service Oriented Computing
San Francisco
December 7th - 10th 2010
Motivation

• Search is a fundamental activity for information management


• Search engines are superb in their ability of extracting documents that most
  closely match with user’s query


• But they fail in going beyond such simple paradigm!


• Example:


         a theater close to Union Square, San Francisco,
   • “Find
    showing a recent thriller movie, close to a steak house”
Search for a solution using keywords




                                       Not useful....
Let’s split the tasks, and search for
theaters using keywords




                           We got results
Inspect theatre details
Check for showtimes




down in the result set... no thriller movies! Search for a new one
Now let’s look for a steak house
Inspect restaurant details
What is Google doing?

• Guessing the user’s intention

  • Solving her atomic task (find - and book - a restaurant, find a theatre)


• Yahoo: We’re Moving From Web Of Pages To Web Of Objects

  • http://techcrunch.com/2010/09/16/live-from-yahoos-product-runway-
    event/


  • “There’s going to be a blurring between typing in a query and getting a
    bunch of a links… People just want answers. Let you buy a ticket right
    from a movie result.”
What should we - as users - do?

•Work on subproblems

• Start a search process

• Use available search services ... and there’s a lot of!
  • ProgrammableWeb
  • YQL
  • GoogleBase
  • Linked Data
  • ...


•Try to be smart!
The Search Computing Project

• Build theories, methods and tools to support search
  processes
  • Given a multi-domain query
  • Given a set of search services
  • Build several “solutions” which already integrate all the search
    process dimensions
  • Ranks “solutions” according to a global rank function and output
    results in rank order
  • Support user-friendly query definition and result browsing
  • Add search domains while the search process proceeds
  • Possibly change the relative weight of each ranking
Why search services deserve special handling?

 •Ranking
 •Chunked results
 •Data and control driven orchestration
     • Need for ad-hoc optimization strategies of query plans [1]
     • Algorithms for the computation of composite results (based on
       different join strategies) [2]

 •Novel user interaction paradigms [3]

[1] Braga, D., Ceri, S., Corcoglioniti, F., Grossniklaus, M. 2010. Panta Rhei: Flexible Execution Engine
for Search Computing Queries, in Ceri, S., Brambilla, M. (eds.) 2010. Search Computing Challenges and
Directions. Springer LNCS vol. 5950

[2] Martinenghi, D., Tagliasacchi, M. 2010. Proximity Rank Join. VLDB 2010, Singapore, September 2010
[3] Bozzon, A., Brambilla, M., Ceri, S., Fraternali, P. 2010. Liquid Query: Multi-Domain Exploratory Search
on the Web, WWW 2010, Raleigh, North Carolina, April 2010.
An architecture for multi-domain search

                         On-line                 Off-line
   Client




                         Client                   Tools



                      Load Balancer
   Server




              Cache                  Query      Profiler /
                                   Processor    Analyzer


                  Service Wrappers             Materializer
   Services




                        Services
Client Tier
                                                              Browser
“Search as a process” enactment        Liquid Query and Result Application
                                       View                                                    Model
                                                   Widget                 Transient
 • Query submission                       Liquid
                                          Query         Liquid
                                                                                          Persistent

                                          Module        Result
                                                        Module               Data Management


 • Result browsing and query




                                                                                                       Controller
                                                            Interaction          Application State
                                          UI Builder
   refinement                                                 Controller             Manager

                                                            Communication


 • Interaction primitives that support result manipulation,
   exploration and expansion of the search results

 • Alternative data visualizations


www.search-computing.net/demo/UI
Server Tier
                                                             QUERY ANALYZER
                                                          Query Analyzer API
                                                                  Query Parser

                                                              Query Planner

                                                             Query Optimizer



     EXECUTION ENGINE
                                Engine API
                                                                                     Service Repository
      Session
      Manager                                                                     Service Repository API
                                                 Engine Controller
                                                                                    Service Repository
       Cache                       Chain Of                                             Controller
      Manager                    Responsability



    Persistency
                                            Local
     Manager                                Functs
                                                                                         Persistent
                   Service Invoker                                                    Query Repository
    Custom      Custom                                                    QEP      Query Repository API
     Legacy       Web           Standard Wrappers
                                                                        Invoker
    Wrappers    Wrappers                                                              Query Repository
                           REST YQL GBase   DB    WSDL   SPARQL
                                                                                         Controller
Server Tier
• Query analyzer and optimizer
• translate SQL-like queries into an executable plan

      DEFINE QUERY NightPlan($X:String, $Y: Integer , $U:String, $V:String, $W:String) AS
          SELECT M.*, T.*, R.*, TotalPrice=T.Price + R.AvgPrice
          FROM ((Movie (iGenre: $X, iYear: $Y) AS M USING IMDB_MOVIES,
              JOIN
              Theatre (iAddress: $U, iCity: $V, iCountry: $W) AS T USING GOOGLE_DISPLAYING
              ON M.Title=T.Title)
              JOIN
              Restaurant (iCountry: $W, iCategory: "Italian Restaurant") AS R USING YQL_LOCAL
              ON T.address=R.Address AND T.city=R.City)
          WHERE R.Rating>3
          RANK BY (R=0.4, T=0.3, M=0.3)
          LIMIT 20 TUPLES AND 50 CALLS




• transformation takes into account data dependencies, execution
  constraints, service profile informations
Server Tier
• Execution Engine                               Movie


• Executes service execution plans while                   Restaurant

  satisfying the execution constraints
                                                 Theater
• Synchronous and asynchronous search
  mechanism
• Service Repository
• Abstraction of actual search services
   • Input/Output/Ranking
• Set of connection patterns, pairwise
  coupling of services attributes to pre-define
  joins
• Reference to actual search services
Some toughs on distribution

                         On-line                 Off-line
   Client




                         Client                   Tools



                      Load Balancer
   Server




              Cache                  Query      Profiler /
                                   Processor    Analyzer


                  Service Wrappers             Materializer
   Services




                        Services
Experimental Evaluation
• Query
   • “Find a theater close to Union Square, San Francisco, showing a
     recent thriller movie, close to a steak house”


• Services
   • IMDB Archive, Google Movie Search, Yahoo Local Search


• Demo
   • http://www.search-computing.com/UIMovie
Results
              • Service invocation is the most time-consuming                                                                          Movie


                task                                                                                                                                    Restaurant



              • Number of service invocation depends on the                                                                            Theater



                topology of execution plan
              • Cache can lead to huge performance
                improvements
                      40                                                                                40




                                                                                 Execution time (sec)
Service invocations




                      30                                                                                30


                      20                                                                                20


                      10                                                                                10


                       0                                                                                 0
                           0   10   20   30    40   50   60   70   80   90 100                               0        0.25      0.5              0.75           1
                                              Combinations                                                             Cache hit probability
                                movie            theater           restaurant                                0.25 s    0.5 s      1s              2s          4s
Conclusion and Future Works

• We propose an architecture supporting the execution of
  multi-domain queries using search services service
  based environment


• Our implementation, demonstrations, and experimental
  results show the feasibility of the approach


• Architecturally speaking, there is a lot of space for
  improvements
   • Smart control strategies, such as top-k joins
   • Efficient search service pre-fetching and
     materialization
   • Dynamic evolution of execution plans according to an
     exploratory search approach
Thanks!

Questions?

www.search-computing.net!

bozzon@elet.polimi.it!

Mais conteúdo relacionado

Destaque

Top 7 solution architect interview questions answers
Top 7 solution architect interview questions answersTop 7 solution architect interview questions answers
Top 7 solution architect interview questions answerstomhandsome70
 
Design Thinking Process
Design Thinking ProcessDesign Thinking Process
Design Thinking ProcessHeyy Gus
 
An introduction to fundamental architecture concepts
An introduction to fundamental architecture conceptsAn introduction to fundamental architecture concepts
An introduction to fundamental architecture conceptswweinmeyer79
 
Generating Architectural Concepts & Design Ideas
Generating Architectural Concepts & Design IdeasGenerating Architectural Concepts & Design Ideas
Generating Architectural Concepts & Design IdeasWan Muhammad / Asas-Khu™
 
Solution Architecture – Approach to Rapidly Scoping The Initial Solution Options
Solution Architecture – Approach to Rapidly Scoping The Initial Solution OptionsSolution Architecture – Approach to Rapidly Scoping The Initial Solution Options
Solution Architecture – Approach to Rapidly Scoping The Initial Solution OptionsAlan McSweeney
 
Structured Approach to Solution Architecture
Structured Approach to Solution ArchitectureStructured Approach to Solution Architecture
Structured Approach to Solution ArchitectureAlan McSweeney
 

Destaque (7)

Ms Lync 101
Ms Lync 101Ms Lync 101
Ms Lync 101
 
Top 7 solution architect interview questions answers
Top 7 solution architect interview questions answersTop 7 solution architect interview questions answers
Top 7 solution architect interview questions answers
 
Design Thinking Process
Design Thinking ProcessDesign Thinking Process
Design Thinking Process
 
An introduction to fundamental architecture concepts
An introduction to fundamental architecture conceptsAn introduction to fundamental architecture concepts
An introduction to fundamental architecture concepts
 
Generating Architectural Concepts & Design Ideas
Generating Architectural Concepts & Design IdeasGenerating Architectural Concepts & Design Ideas
Generating Architectural Concepts & Design Ideas
 
Solution Architecture – Approach to Rapidly Scoping The Initial Solution Options
Solution Architecture – Approach to Rapidly Scoping The Initial Solution OptionsSolution Architecture – Approach to Rapidly Scoping The Initial Solution Options
Solution Architecture – Approach to Rapidly Scoping The Initial Solution Options
 
Structured Approach to Solution Architecture
Structured Approach to Solution ArchitectureStructured Approach to Solution Architecture
Structured Approach to Solution Architecture
 

Semelhante a A service-based architecture for multi-domain search

Oracle Application Management Suite
Oracle Application Management SuiteOracle Application Management Suite
Oracle Application Management SuiteOracleVolutionSeries
 
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
Mike Taulty MIX10 Silverlight 4 Patterns Frameworks
Mike Taulty MIX10 Silverlight 4 Patterns FrameworksMike Taulty MIX10 Silverlight 4 Patterns Frameworks
Mike Taulty MIX10 Silverlight 4 Patterns Frameworksukdpe
 
ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
A Behind the Scenes Look at the Force.com Platform
A Behind the Scenes Look at the Force.com PlatformA Behind the Scenes Look at the Force.com Platform
A Behind the Scenes Look at the Force.com PlatformSalesforce Developers
 
Siddhi: A Second Look at Complex Event Processing Implementations
Siddhi: A Second Look at Complex Event Processing ImplementationsSiddhi: A Second Look at Complex Event Processing Implementations
Siddhi: A Second Look at Complex Event Processing ImplementationsSrinath Perera
 
SharePoint 2010 Online for Developer
SharePoint 2010 Online for DeveloperSharePoint 2010 Online for Developer
SharePoint 2010 Online for DeveloperK.Mohamed Faizal
 
SharePoint 2013 Search Architecture with Russ Houberg
SharePoint 2013  Search Architecture with Russ HoubergSharePoint 2013  Search Architecture with Russ Houberg
SharePoint 2013 Search Architecture with Russ Houbergknowledgelakemarketing
 
Real User Experience Insight External
Real User Experience Insight ExternalReal User Experience Insight External
Real User Experience Insight Externaloracleonthebrain
 
Azure Application insights - An Introduction
Azure Application insights - An IntroductionAzure Application insights - An Introduction
Azure Application insights - An IntroductionMatthias Güntert
 
Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012John Domingue
 
Creating an Ecosystem Platform with Vertical PaaS
Creating an Ecosystem Platform with Vertical PaaSCreating an Ecosystem Platform with Vertical PaaS
Creating an Ecosystem Platform with Vertical PaaSWSO2
 
Composite Applications with SOA, BPEL and Java EE
Composite  Applications with SOA, BPEL and Java EEComposite  Applications with SOA, BPEL and Java EE
Composite Applications with SOA, BPEL and Java EEDmitri Shiryaev
 
Summer School Scale Cloud Across the Enterprise
Summer School   Scale Cloud Across the EnterpriseSummer School   Scale Cloud Across the Enterprise
Summer School Scale Cloud Across the EnterpriseWSO2
 
The Magic Behind Faster API Development, Testing and Delivery with API Virtua...
The Magic Behind Faster API Development, Testing and Delivery with API Virtua...The Magic Behind Faster API Development, Testing and Delivery with API Virtua...
The Magic Behind Faster API Development, Testing and Delivery with API Virtua...SmartBear
 
CloudStack Collaboration Conference 12; Refactoring cloud stack
CloudStack Collaboration Conference 12; Refactoring cloud stackCloudStack Collaboration Conference 12; Refactoring cloud stack
CloudStack Collaboration Conference 12; Refactoring cloud stackbuildacloud
 
Sql Server 2008 Performance and Scaleability
Sql Server 2008 Performance and ScaleabilitySql Server 2008 Performance and Scaleability
Sql Server 2008 Performance and Scaleabilitydataplex systems limited
 
10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 SearchSPC Adriatics
 
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchSPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchAgnes Molnar
 

Semelhante a A service-based architecture for multi-domain search (20)

Oracle Application Management Suite
Oracle Application Management SuiteOracle Application Management Suite
Oracle Application Management Suite
 
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
 
Mike Taulty MIX10 Silverlight 4 Patterns Frameworks
Mike Taulty MIX10 Silverlight 4 Patterns FrameworksMike Taulty MIX10 Silverlight 4 Patterns Frameworks
Mike Taulty MIX10 Silverlight 4 Patterns Frameworks
 
ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 Search
 
A Behind the Scenes Look at the Force.com Platform
A Behind the Scenes Look at the Force.com PlatformA Behind the Scenes Look at the Force.com Platform
A Behind the Scenes Look at the Force.com Platform
 
Siddhi: A Second Look at Complex Event Processing Implementations
Siddhi: A Second Look at Complex Event Processing ImplementationsSiddhi: A Second Look at Complex Event Processing Implementations
Siddhi: A Second Look at Complex Event Processing Implementations
 
SharePoint 2010 Online for Developer
SharePoint 2010 Online for DeveloperSharePoint 2010 Online for Developer
SharePoint 2010 Online for Developer
 
SharePoint 2013 Search Architecture with Russ Houberg
SharePoint 2013  Search Architecture with Russ HoubergSharePoint 2013  Search Architecture with Russ Houberg
SharePoint 2013 Search Architecture with Russ Houberg
 
Real User Experience Insight External
Real User Experience Insight ExternalReal User Experience Insight External
Real User Experience Insight External
 
Azure Application insights - An Introduction
Azure Application insights - An IntroductionAzure Application insights - An Introduction
Azure Application insights - An Introduction
 
Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012
 
Creating an Ecosystem Platform with Vertical PaaS
Creating an Ecosystem Platform with Vertical PaaSCreating an Ecosystem Platform with Vertical PaaS
Creating an Ecosystem Platform with Vertical PaaS
 
Composite Applications with SOA, BPEL and Java EE
Composite  Applications with SOA, BPEL and Java EEComposite  Applications with SOA, BPEL and Java EE
Composite Applications with SOA, BPEL and Java EE
 
Summer School Scale Cloud Across the Enterprise
Summer School   Scale Cloud Across the EnterpriseSummer School   Scale Cloud Across the Enterprise
Summer School Scale Cloud Across the Enterprise
 
The Magic Behind Faster API Development, Testing and Delivery with API Virtua...
The Magic Behind Faster API Development, Testing and Delivery with API Virtua...The Magic Behind Faster API Development, Testing and Delivery with API Virtua...
The Magic Behind Faster API Development, Testing and Delivery with API Virtua...
 
Ideas for addictive series 40 web apps
Ideas for addictive series 40 web appsIdeas for addictive series 40 web apps
Ideas for addictive series 40 web apps
 
CloudStack Collaboration Conference 12; Refactoring cloud stack
CloudStack Collaboration Conference 12; Refactoring cloud stackCloudStack Collaboration Conference 12; Refactoring cloud stack
CloudStack Collaboration Conference 12; Refactoring cloud stack
 
Sql Server 2008 Performance and Scaleability
Sql Server 2008 Performance and ScaleabilitySql Server 2008 Performance and Scaleability
Sql Server 2008 Performance and Scaleability
 
10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search
 
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchSPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
 

Mais de Alessandro Bozzon

Weaving the Web of People and Things for Intelligent Cities
Weaving the Web of People and Things for Intelligent CitiesWeaving the Web of People and Things for Intelligent Cities
Weaving the Web of People and Things for Intelligent CitiesAlessandro Bozzon
 
Trustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Trustworthy Micro-task Crowdsourcing: Challenges and OpportunitiesTrustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Trustworthy Micro-task Crowdsourcing: Challenges and OpportunitiesAlessandro Bozzon
 
SAIL 2015 Crowdmanagement Experiment. Pitch slides
SAIL 2015 Crowdmanagement Experiment. Pitch slidesSAIL 2015 Crowdmanagement Experiment. Pitch slides
SAIL 2015 Crowdmanagement Experiment. Pitch slidesAlessandro Bozzon
 
Social Data Science For Intelligent Cities
Social Data Science For Intelligent CitiesSocial Data Science For Intelligent Cities
Social Data Science For Intelligent CitiesAlessandro Bozzon
 
Pattern-Based Specification of Crowdsourcing Applications
Pattern-Based Specification of Crowdsourcing ApplicationsPattern-Based Specification of Crowdsourcing Applications
Pattern-Based Specification of Crowdsourcing ApplicationsAlessandro Bozzon
 
ICWE 2013 - Slides From The Poster And Demo Session
ICWE 2013 - Slides From The Poster And Demo SessionICWE 2013 - Slides From The Poster And Demo Session
ICWE 2013 - Slides From The Poster And Demo SessionAlessandro Bozzon
 
An Introduction to Human Computation and Games With A Purpose - Part I
An Introduction to Human Computation and Games With A Purpose - Part IAn Introduction to Human Computation and Games With A Purpose - Part I
An Introduction to Human Computation and Games With A Purpose - Part IAlessandro Bozzon
 
Engineering Web Search Applications
Engineering Web Search ApplicationsEngineering Web Search Applications
Engineering Web Search ApplicationsAlessandro Bozzon
 
Liquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the WebLiquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the WebAlessandro Bozzon
 

Mais de Alessandro Bozzon (11)

Weaving the Web of People and Things for Intelligent Cities
Weaving the Web of People and Things for Intelligent CitiesWeaving the Web of People and Things for Intelligent Cities
Weaving the Web of People and Things for Intelligent Cities
 
Trustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Trustworthy Micro-task Crowdsourcing: Challenges and OpportunitiesTrustworthy Micro-task Crowdsourcing: Challenges and Opportunities
Trustworthy Micro-task Crowdsourcing: Challenges and Opportunities
 
SAIL 2015 Crowdmanagement Experiment. Pitch slides
SAIL 2015 Crowdmanagement Experiment. Pitch slidesSAIL 2015 Crowdmanagement Experiment. Pitch slides
SAIL 2015 Crowdmanagement Experiment. Pitch slides
 
Social Data Science For Intelligent Cities
Social Data Science For Intelligent CitiesSocial Data Science For Intelligent Cities
Social Data Science For Intelligent Cities
 
Pattern-Based Specification of Crowdsourcing Applications
Pattern-Based Specification of Crowdsourcing ApplicationsPattern-Based Specification of Crowdsourcing Applications
Pattern-Based Specification of Crowdsourcing Applications
 
ICWE 2013 - Slides From The Poster And Demo Session
ICWE 2013 - Slides From The Poster And Demo SessionICWE 2013 - Slides From The Poster And Demo Session
ICWE 2013 - Slides From The Poster And Demo Session
 
An Introduction to Human Computation and Games With A Purpose - Part I
An Introduction to Human Computation and Games With A Purpose - Part IAn Introduction to Human Computation and Games With A Purpose - Part I
An Introduction to Human Computation and Games With A Purpose - Part I
 
Reactive crowdsourcing
Reactive crowdsourcingReactive crowdsourcing
Reactive crowdsourcing
 
Engineering Web Search Applications
Engineering Web Search ApplicationsEngineering Web Search Applications
Engineering Web Search Applications
 
Search Computing
Search ComputingSearch Computing
Search Computing
 
Liquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the WebLiquid Query: Multi-domain Exploratory Search on the Web
Liquid Query: Multi-domain Exploratory Search on the Web
 

Último

Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Último (20)

Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

A service-based architecture for multi-domain search

  • 1. A service-based Architecture for Multi-domain Search on the Web ICSOC 2010 - 8th International Conference on Service Oriented Computing San Francisco December 7th - 10th 2010
  • 2. Motivation • Search is a fundamental activity for information management • Search engines are superb in their ability of extracting documents that most closely match with user’s query • But they fail in going beyond such simple paradigm! • Example: a theater close to Union Square, San Francisco, • “Find showing a recent thriller movie, close to a steak house”
  • 3. Search for a solution using keywords Not useful....
  • 4. Let’s split the tasks, and search for theaters using keywords We got results
  • 6. Check for showtimes down in the result set... no thriller movies! Search for a new one
  • 7. Now let’s look for a steak house
  • 9. What is Google doing? • Guessing the user’s intention • Solving her atomic task (find - and book - a restaurant, find a theatre) • Yahoo: We’re Moving From Web Of Pages To Web Of Objects • http://techcrunch.com/2010/09/16/live-from-yahoos-product-runway- event/ • “There’s going to be a blurring between typing in a query and getting a bunch of a links… People just want answers. Let you buy a ticket right from a movie result.”
  • 10. What should we - as users - do? •Work on subproblems • Start a search process • Use available search services ... and there’s a lot of! • ProgrammableWeb • YQL • GoogleBase • Linked Data • ... •Try to be smart!
  • 11. The Search Computing Project • Build theories, methods and tools to support search processes • Given a multi-domain query • Given a set of search services • Build several “solutions” which already integrate all the search process dimensions • Ranks “solutions” according to a global rank function and output results in rank order • Support user-friendly query definition and result browsing • Add search domains while the search process proceeds • Possibly change the relative weight of each ranking
  • 12. Why search services deserve special handling? •Ranking •Chunked results •Data and control driven orchestration • Need for ad-hoc optimization strategies of query plans [1] • Algorithms for the computation of composite results (based on different join strategies) [2] •Novel user interaction paradigms [3] [1] Braga, D., Ceri, S., Corcoglioniti, F., Grossniklaus, M. 2010. Panta Rhei: Flexible Execution Engine for Search Computing Queries, in Ceri, S., Brambilla, M. (eds.) 2010. Search Computing Challenges and Directions. Springer LNCS vol. 5950 [2] Martinenghi, D., Tagliasacchi, M. 2010. Proximity Rank Join. VLDB 2010, Singapore, September 2010 [3] Bozzon, A., Brambilla, M., Ceri, S., Fraternali, P. 2010. Liquid Query: Multi-Domain Exploratory Search on the Web, WWW 2010, Raleigh, North Carolina, April 2010.
  • 13. An architecture for multi-domain search On-line Off-line Client Client Tools Load Balancer Server Cache Query Profiler / Processor Analyzer Service Wrappers Materializer Services Services
  • 14. Client Tier Browser “Search as a process” enactment Liquid Query and Result Application View Model Widget Transient • Query submission Liquid Query Liquid Persistent Module Result Module Data Management • Result browsing and query Controller Interaction Application State UI Builder refinement Controller Manager Communication • Interaction primitives that support result manipulation, exploration and expansion of the search results • Alternative data visualizations www.search-computing.net/demo/UI
  • 15. Server Tier QUERY ANALYZER Query Analyzer API Query Parser Query Planner Query Optimizer EXECUTION ENGINE Engine API Service Repository Session Manager Service Repository API Engine Controller Service Repository Cache Chain Of Controller Manager Responsability Persistency Local Manager Functs Persistent Service Invoker Query Repository Custom Custom QEP Query Repository API Legacy Web Standard Wrappers Invoker Wrappers Wrappers Query Repository REST YQL GBase DB WSDL SPARQL Controller
  • 16. Server Tier • Query analyzer and optimizer • translate SQL-like queries into an executable plan DEFINE QUERY NightPlan($X:String, $Y: Integer , $U:String, $V:String, $W:String) AS SELECT M.*, T.*, R.*, TotalPrice=T.Price + R.AvgPrice FROM ((Movie (iGenre: $X, iYear: $Y) AS M USING IMDB_MOVIES, JOIN Theatre (iAddress: $U, iCity: $V, iCountry: $W) AS T USING GOOGLE_DISPLAYING ON M.Title=T.Title) JOIN Restaurant (iCountry: $W, iCategory: "Italian Restaurant") AS R USING YQL_LOCAL ON T.address=R.Address AND T.city=R.City) WHERE R.Rating>3 RANK BY (R=0.4, T=0.3, M=0.3) LIMIT 20 TUPLES AND 50 CALLS • transformation takes into account data dependencies, execution constraints, service profile informations
  • 17. Server Tier • Execution Engine Movie • Executes service execution plans while Restaurant satisfying the execution constraints Theater • Synchronous and asynchronous search mechanism • Service Repository • Abstraction of actual search services • Input/Output/Ranking • Set of connection patterns, pairwise coupling of services attributes to pre-define joins • Reference to actual search services
  • 18. Some toughs on distribution On-line Off-line Client Client Tools Load Balancer Server Cache Query Profiler / Processor Analyzer Service Wrappers Materializer Services Services
  • 19. Experimental Evaluation • Query • “Find a theater close to Union Square, San Francisco, showing a recent thriller movie, close to a steak house” • Services • IMDB Archive, Google Movie Search, Yahoo Local Search • Demo • http://www.search-computing.com/UIMovie
  • 20. Results • Service invocation is the most time-consuming Movie task Restaurant • Number of service invocation depends on the Theater topology of execution plan • Cache can lead to huge performance improvements 40 40 Execution time (sec) Service invocations 30 30 20 20 10 10 0 0 0 10 20 30 40 50 60 70 80 90 100 0 0.25 0.5 0.75 1 Combinations Cache hit probability movie theater restaurant 0.25 s 0.5 s 1s 2s 4s
  • 21. Conclusion and Future Works • We propose an architecture supporting the execution of multi-domain queries using search services service based environment • Our implementation, demonstrations, and experimental results show the feasibility of the approach • Architecturally speaking, there is a lot of space for improvements • Smart control strategies, such as top-k joins • Efficient search service pre-fetching and materialization • Dynamic evolution of execution plans according to an exploratory search approach

Notas do Editor

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. All these aspects also challenge the performance and scalability issues of traditional SOA solutions\n
  13. All these aspects also challenge the performance and scalability issues of traditional SOA solutions\n\n
  14. All these aspects also challenge the performance and scalability issues of traditional SOA solutions\n
  15. \n
  16. \n
  17. \n
  18. Cache working both for search service results and queries\n\nLoad balancer distributes the execution among several engine instances, which can share the same cache\n\nService profiling can enanche the performance of the system by updating run-time information about services and the query processor\n\nData prefetch and service materialization can reduce latency time\n
  19. \n
  20. SX: average number of service invocation required to get the top-k results with k ranging from 1 to 100, averaged out of several inputs\n\nRestaurants invocation is almost linear, since it is in pipe and it returns on average 10 query results (chunk size). Fewer invocation for Movie and Theathres are required (they are on the left side of the pipe). \n\nDX: impact of caching on the average execution time, for cache hit probabilities ranging from 0 to 1\n20 concurrent clients, top 10 results, synthetic services\n
  21. \n
  22. \n