Infrastructure and Workflow for the Formal Evaluation of Semantic Search Technologies
Stuart N. Wrigley (1), Raúl García-Castro (2) and Cassia Trojahn (3)
(1) University of Sheffield, UK
(2) Universidad Politécnica de Madrid, Spain
(3) INRIA, France

Data infrastructurEs for Supporting Information Retrieval Evaluation: DESIRE 2011 Workshop
SEALS Project
•   SEALS: Semantic Evaluation At Large Scale
•   EU FP7 funded Infrastructures project
•   June 2009 – June 2012.
•   Initial areas: ontology engineering, ontology storage and reasoning
    tools, ontology matching, semantic web service discovery, semantic
    search




• Objectives:
      – SEALS Platform.
             • A lasting reference infrastructure.
             • Evaluations executed on-demand on SEALS Platform.
      – SEALS Evaluation Campaigns.
             • Two public evaluation campaigns.
      – SEALS Community.
(Figure: Networking Activities, Research Activities, Service Activities)
28.10.2011

Key (non-technical) features
• Infrastructure characteristics:
      –   Open (both in terms of use and development – Apache 2.0 license)
      –   Scalable (to users and data size – cluster-based)
      –   Extensible (new evals, new tool types, new metrics)
      –   Sustainable (beyond funded period)
      –   Independent (unbiased, trustworthy)
      –   Repeatable (eval results can be reproduced)
• Core criteria:
      – Interoperability
      – Scalability
      – Tool-specific measures (e.g., alignment precision)

Evaluation dependencies
(Figure: Tools, Test data and Evaluation descriptions; Execution Request (ER); Evaluation; Results)

Platform architecture




Evaluation overview

(Figure: evaluation stages for an Execution Request (ER), listed row by row as in the diagram)
• Infrastructure request, Tools deployment, Tools undeployment, Infrastructure release
• Execution request analysis, Execution environment preparation, Evaluation description execution, Execution environment clean-up
• Test data stage-in, Activity execution, Results storage, Test data stage-out




REPOSITORIES

Repositories
(Figure: repository layers)
• External Interface (HTTP request, file): Restlet Resources
• Internal Interface (Java objects): Test Data Repository Implementation, Tools Repository Implementation, Results Repository Implementation
• Storage (RDF triples, file): RDF Repository (Metadata), File Repository (Artifacts)


Test Data Repository Service (TDRS)
• Storage of, and access to:
      – persistent test data sets (aka suites)
      – test data generators
• Suites are stored as ZIP files and accompanied by metadata.
• Suites can be versioned.
• ZIP-internal metadata allows structuring and repository-based iteration.
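
The bullets above describe suites as ZIP files that can be iterated over. A minimal Java sketch of that idea follows, using only `java.util.zip` from the standard library. The class name `ZipSuiteSketch`, the entry names (`Metadata.rdf`, `testcases/q1.txt`) and the in-memory suite are assumptions for illustration; the slides do not state the actual layout of a SEALS suite.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical sketch: packs a few files into a ZIP "suite" and iterates over it again.
class ZipSuiteSketch {
    // Builds an in-memory ZIP archive from entry name -> text content.
    static byte[] buildSuite(Map<String, String> files) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (Map.Entry<String, String> file : files.entrySet()) {
                zip.putNextEntry(new ZipEntry(file.getKey()));
                zip.write(file.getValue().getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Iterates over the archive and returns entry name -> text content, in archive order.
    static Map<String, String> readSuite(byte[] zipBytes) {
        Map<String, String> items = new LinkedHashMap<>();
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (entry.isDirectory()) {
                    continue; // directories carry no data
                }
                items.put(entry.getName(), new String(zip.readAllBytes(), StandardCharsets.UTF_8));
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return items;
    }

    public static void main(String[] args) {
        Map<String, String> files = new LinkedHashMap<>();
        files.put("Metadata.rdf", "<!-- hypothetical suite metadata -->"); // assumed file name
        files.put("testcases/q1.txt", "Which rivers flow through Texas?"); // assumed test item
        for (String name : readSuite(buildSuite(files)).keySet()) {
            System.out.println(name);
        }
    }
}
```

A real client would read the ZIP-internal metadata first and use it to decide which items to fetch; here the entries are simply visited in archive order.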


(Figure: Entity, Metadata, Data Artifact, Artifact Item; Discovery and Exploitation)


Results Repository Service (RRS)
• Storage of, and access to, suites of:
      – raw results
      – interpretations


• Suites stored as metadata and optional ZIP files.
• Metadata allows structuring and linking to ensure backlinks:
      – interpretation links to raw result dataItem
      – raw result links to tool and test suite dataItem
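
The backlinks listed above can be pictured as a small object graph. The following Java sketch is only an illustration: the class names, field names and identifiers are hypothetical and do not reflect the actual RRS metadata schema, which the slides do not show.

```java
// Hypothetical sketch of the backlink chain: interpretation -> raw result -> tool and test data item.
class ResultLinksSketch {
    // A raw result records which tool produced it and which test suite data item it was run on.
    static final class RawResult {
        final String id;
        final String toolId;
        final String testDataItemId;

        RawResult(String id, String toolId, String testDataItemId) {
            this.id = id;
            this.toolId = toolId;
            this.testDataItemId = testDataItemId;
        }
    }

    // An interpretation links back to the raw result it was computed from.
    static final class Interpretation {
        final String id;
        final RawResult rawResult;

        Interpretation(String id, RawResult rawResult) {
            this.id = id;
            this.rawResult = rawResult;
        }
    }

    // Follows the backlinks and renders the provenance chain as text.
    static String provenance(Interpretation interpretation) {
        RawResult raw = interpretation.rawResult;
        return interpretation.id + " <- " + raw.id
                + " <- tool:" + raw.toolId + ", data:" + raw.testDataItemId;
    }

    public static void main(String[] args) {
        RawResult raw = new RawResult("raw-1", "tool-A", "suite-item-7");
        System.out.println(provenance(new Interpretation("interp-1", raw)));
    }
}
```

Because every interpretation carries its raw result, and every raw result carries its tool and data item, a stored interpretation can be traced back to the inputs that produced it.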




TOOLS

Tool wrapping and storage
• Tools are evaluated within the Platform (i.e., locally).

• Tools must have bi-directional communication with Platform.
• Each campaign (e.g., search) defines its own Java API specific to its tool
  type.
• Participants create a tool ‘wrapper’ to implement the API.
• The bundle also includes ‘setup’ and ‘tear down’ scripts and any third-party libraries /
  packages required.

• Bundles stored in Tool Repository Service (TRS).
• Tutorials on SEALS portal (http://www.seals-project.eu/).


Semantic Search API
Method                             Functionality
boolean loadOntology(              Load an ontology
    URL ontology,
    String ontologyName,
    String ontologyNamespace)
void showGUI(boolean show)         Switch the GUI on or off
boolean executeQuery(String query) Execute a query
boolean isResultSetReady()         Are query results ready?
URL getResults()                   Retrieve the URL of the results file


boolean isUserInputComplete()      Has user hit ‘go’ (or equivalent)?
String getUserQuery()              Retrieve the query as entered by the user
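
Read as Java, the table above might look like the interface below. This is a sketch rather than the SEALS source: the method signatures come from the table, while the interface name `SemanticSearchTool`, the stub `StubSearchTool`, the helper `url` and the example URLs are assumptions made for illustration.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URL;

// Hypothetical demo class: runs one automated query against the stub below.
class SemanticSearchApiSketch {
    // Helper that builds a URL without a checked exception.
    static URL url(String spec) {
        try {
            return URI.create(spec).toURL();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) {
        SemanticSearchTool tool = new StubSearchTool();
        tool.showGUI(false);
        tool.loadOntology(url("http://example.org/onto.owl"), "example", "http://example.org/onto#");
        if (tool.executeQuery("SELECT * WHERE { ?s ?p ?o }") && tool.isResultSetReady()) {
            System.out.println("Results at: " + tool.getResults());
        }
    }
}

// Interface name is an assumption; the method signatures follow the table above.
interface SemanticSearchTool {
    boolean loadOntology(URL ontology, String ontologyName, String ontologyNamespace);
    void showGUI(boolean show);
    boolean executeQuery(String query);
    boolean isResultSetReady();
    URL getResults();
    boolean isUserInputComplete();
    String getUserQuery();
}

// Hypothetical stub standing in for a real tool wrapper.
class StubSearchTool implements SemanticSearchTool {
    private boolean ontologyLoaded = false;
    private String lastQuery = null;

    public boolean loadOntology(URL ontology, String ontologyName, String ontologyNamespace) {
        ontologyLoaded = ontology != null;
        return ontologyLoaded;
    }

    public void showGUI(boolean show) {
        // the stub has no GUI to switch
    }

    public boolean executeQuery(String query) {
        if (!ontologyLoaded || query == null) {
            return false; // refuse queries until an ontology has been loaded
        }
        lastQuery = query;
        return true;
    }

    public boolean isResultSetReady() {
        return lastQuery != null;
    }

    public URL getResults() {
        // placeholder location; a real wrapper would point at its results file
        return lastQuery == null ? null : SemanticSearchApiSketch.url("file:///tmp/stub-results.xml");
    }

    public boolean isUserInputComplete() {
        return lastQuery != null;
    }

    public String getUserQuery() {
        return lastQuery;
    }
}
```

The first five methods are enough for an automated run; the last two only matter when a person enters the query through the tool's own interface.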

WORKFLOWS

Evaluation overview (workflow)

(Figure: evaluation stages for an Execution Request (ER), listed row by row as in the diagram)
• Infrastructure request, Tools deployment, Tools undeployment, Infrastructure release
• Execution request analysis, Execution environment preparation, Evaluation description execution, Execution environment clean-up
• Test data stage-in, Activity execution, Results storage, Test data stage-out




Workflow
• Business Process Execution Language (BPEL)
      – orchestrates the manipulation of information using (only) web service interfaces.


• All entities involved must be exposed as web services (TDRS, RRS, the tool
  wrapper, custom services) defined using WSDL interfaces.
• ‘Custom services’ allow out-of-band processing:
      –   computation of analyses
      –   data / metadata manipulation
      –   timestamping
      –   etc.

• Workflow defined according to campaign requirements.
• Stored in the Evaluation Repository Service (ERS).

Conceptual workflow for search
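(Figure: two flowcharts, summarised below)
• Overall flow:
      – Start.
      – Load Test Suite (from the Test Data Repository).
      – Loaded successfully? If no: Record error.
      – If yes: Tool: Switch off GUI.
      – More test cases? If yes: Process Test Case, then check again.
      – If no: Store Results (in the Results Repository), then End.
• Process Test Case:
      – Start.
      – Ontology loaded? If no: Get Ontology URL (from the Test Data Repository), then Tool: Load Ontology.
      – Get Query (from the Test Data Repository).
      – Tool: Execute Query.
      – Tool: Results Ready? If no: Pause, then check again.
      – If yes: Tool: Get Results.
      – Store Results (in the Results Repository).
      – End.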
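
The flowchart can be approximated in plain Java as a loop with a polling step. The Platform itself expresses this logic in BPEL over web services, so the code below is only a sketch of the control flow. The nested `Tool` interface is a cut-down, hypothetical version of the search API; `DelayedTool`, the poll limit, the pause length and the decision to stop when the ontology fails to load are all assumptions.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URL;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the search workflow as a plain Java loop.
class SearchWorkflowSketch {
    // Cut-down tool API, limited to the calls that appear in the flowchart.
    interface Tool {
        boolean loadOntology(URL ontology, String name, String namespace);
        void showGUI(boolean show);
        boolean executeQuery(String query);
        boolean isResultSetReady();
        URL getResults();
    }

    static URL url(String spec) {
        try {
            return URI.create(spec).toURL();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    // Runs each query and returns the result URLs that would go to the Results Repository.
    static List<URL> run(Tool tool, URL ontology, List<String> queries, int maxPolls, long pauseMillis) {
        List<URL> stored = new ArrayList<>();
        tool.showGUI(false);                                   // "Tool: Switch off GUI"
        boolean ontologyLoaded = false;
        for (String query : queries) {                         // "More test cases?"
            if (!ontologyLoaded) {                             // "Ontology loaded?"
                ontologyLoaded = tool.loadOntology(ontology, "suite-ontology", "http://example.org/ns#");
                if (!ontologyLoaded) {
                    break;                                     // assumption: give up if loading fails
                }
            }
            if (!tool.executeQuery(query)) {                   // "Tool: Execute Query"
                continue;
            }
            boolean ready = tool.isResultSetReady();           // "Tool: Results Ready?"
            for (int polls = 0; !ready && polls < maxPolls; polls++) {
                pause(pauseMillis);                            // "Pause"
                ready = tool.isResultSetReady();
            }
            if (ready) {
                stored.add(tool.getResults());                 // "Tool: Get Results", "Store Results"
            }
        }
        return stored;
    }

    static void pause(long millis) {
        try {
            Thread.sleep(millis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    // Fake tool whose results become ready after a fixed number of readiness checks.
    static final class DelayedTool implements Tool {
        private final int checksUntilReady;
        private int remaining;
        private int answered;

        DelayedTool(int checksUntilReady) {
            this.checksUntilReady = checksUntilReady;
        }

        public boolean loadOntology(URL ontology, String name, String namespace) { return true; }
        public void showGUI(boolean show) { }
        public boolean executeQuery(String query) { remaining = checksUntilReady; return true; }
        public boolean isResultSetReady() {
            if (remaining > 0) {
                remaining--;
                return false;
            }
            return true;
        }
        public URL getResults() {
            answered++;
            return url("file:///tmp/result-" + answered + ".xml");
        }
    }

    public static void main(String[] args) {
        List<URL> stored = run(new DelayedTool(2), url("http://example.org/onto.owl"),
                List.of("query one", "query two"), 5, 10L);
        System.out.println(stored.size() + " result files stored");
    }
}
```

The poll limit is a design choice of this sketch: without it, a tool that never reports ready results would block the whole run.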




BPEL workflow snippets




(Figure: BPEL workflow snippets. Stored as XML.)
BPEL exception handling




Summary
• SEALS Platform provides functionality to simplify and automate
  evaluations.
• Powerful cluster-based compute.
• Storage of test data, results and interpretations in perpetuity.
• Workflows specified in industry-standard BPEL.



• All for free!




Thank you for your attention!

      http://www.seals-project.eu/
Links to resources
• Evaluation campaigns
      – http://www.seals-project.eu/seals-evaluation-campaigns/

• Test suites
      – Mooney (NL questions)
             • http://seals.sti2.at/tdrs-web/testdata/persistent/Mooney+NL+Testsuite/2010/
      – EvoOnt (SPARQL questions)
             •   http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+1K+SPARQL+Testsuite/2010/
             •   http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+10K+SPARQL+Testsuite/2010/
             •   http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+100K+SPARQL+Testsuite/2010/
             •   http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+1000K+SPARQL+Testsuite/2010/
             •   http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+10000K+SPARQL+Testsuite/2010/
      – QALD
             • http://seals.sti2.at/tdrs-web/testdata/persistent/QALD-MusicBrainz+SPARQL+Testsuite/2010/

• Last year’s campaign outcomes
      – http://www.seals-project.eu/seals-evaluation-campaigns/semantic-search-tools/results-2010
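
The test suite links above appear to share one pattern: a fixed base path, the suite name with spaces encoded as `+`, and a version segment. The Java sketch below rebuilds such a link from a name and a version. The pattern is inferred from the listed URLs only and is not a documented TDRS interface; the class and method names are hypothetical.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper that rebuilds the persistent test suite links listed above.
class SuiteUrlSketch {
    // Base path copied from the links on this slide.
    static final String BASE = "http://seals.sti2.at/tdrs-web/testdata/persistent/";

    // Form-encodes the suite name (spaces become '+') and appends the version segment.
    static String suiteUrl(String suiteName, String version) {
        return BASE
                + URLEncoder.encode(suiteName, StandardCharsets.UTF_8) + "/"
                + URLEncoder.encode(version, StandardCharsets.UTF_8) + "/";
    }

    public static void main(String[] args) {
        System.out.println(suiteUrl("Mooney NL Testsuite", "2010"));
        System.out.println(suiteUrl("EvoOnt 1K SPARQL Testsuite", "2010"));
    }
}
```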

 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Infrastructure and Workflow for the Formal Evaluation of Semantic Search Technologies

  • 1. Infrastructure and Workflow for the Formal Evaluation of Semantic Search Technologies. Stuart N. Wrigley (University of Sheffield, UK), Raúl García-Castro (Universidad Politécnica de Madrid, Spain) and Cassia Trojahn (INRIA, France). Data infrastructurEs for Supporting Information Retrieval Evaluation: DESIRE 2011 Workshop
  • 2. SEALS Project • SEALS: Semantic Evaluation At Large Scale • EU FP7 funded Infrastructures project • June 2009 – June 2012. • Initial areas: ontology engineering, ontology storage and reasoning tools, ontology matching, semantic web service discovery, semantic search • Objectives: – SEALS Platform. • A lasting reference infrastructure. • Evaluations executed on-demand on SEALS Platform. – SEALS Evaluation Campaigns. • Two public evaluation campaigns – SEALS Community. [Diagram labels: Networking Activities, Research Activities, Service Activities] 28.10.2011
  • 3. Key (non-technical) features • Infrastructure characteristics: – Open (both in terms of use and development; Apache 2.0 license) – Scalable (to users and data size; cluster-based) – Extensible (new evals, new tool types, new metrics) – Sustainable (beyond funded period) – Independent (unbiased, trustworthy) – Repeatable (eval results can be reproduced) • Core criteria: – Interoperability – Scalability – Tool-specific measures (e.g., alignment precision)
  • 4. Evaluation dependencies [Diagram labels: Tools, Execution Request (ER), Evaluation, Results, Test data, Evaluation descriptions]
  • 6. Evaluation overview [Diagram labels: Infrastructure request, Tools deployment, Tools undeployment, Infrastructure release; Execution request analysis, Execution environment preparation, Evaluation description execution, Execution environment clean-up; ER; Test data stage-in, Activity Execution, Results storage, Test data stage-out]
  • 8. Repositories [Diagram labels: External Interface (HTTP Request, File, Restlet Resources); Internal Interface (Java objects); Test Data Repository Implementation, Tools Repository Implementation, Results Repository Implementation; Storage (RDF Triples Repository for metadata, File Repository for artifacts)]
  • 9. Test Data Repository Service (TDRS) • Storage of, and access to: – persistent test data sets (aka suites) – test data generators • Suites are stored as ZIP files and accompanied by metadata. • Suites can be versioned. • ZIP-internal metadata allows structuring and repository-based iteration. [Diagram labels: Data, Metadata, Artifact, Entity, Item, Discovery, Exploitation]
  • 10. Results Repository Service (RRS) • Storage of, and access to, suites of: – raw results – interpretations • Suites stored as metadata and optional ZIP files. • Metadata allows structuring and linking to ensure backlinks: – interpretation links to raw result dataItem – raw result links to tool and test suite dataItem
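
The backlinks described on this slide can be pictured as a small chain of references. The following is only a sketch: the class and field names (`ToolRef`, `TestDataItem`, `RawResult`, `Interpretation`, `provenance`) are hypothetical and are not taken from the RRS metadata schema, which the slides do not show.

```java
/** Hypothetical reference to a tool bundle; the name is illustrative only. */
final class ToolRef {
    final String name;
    ToolRef(String name) { this.name = name; }
}

/** Hypothetical reference to one data item inside a test suite. */
final class TestDataItem {
    final String suite;
    final String itemId;
    TestDataItem(String suite, String itemId) {
        this.suite = suite;
        this.itemId = itemId;
    }
}

/** A raw result links back to the tool and the test suite data item that produced it. */
final class RawResult {
    final String id;
    final ToolRef tool;
    final TestDataItem testItem;
    RawResult(String id, ToolRef tool, TestDataItem testItem) {
        this.id = id;
        this.tool = tool;
        this.testItem = testItem;
    }
}

/** An interpretation links back to the raw result it was computed from. */
final class Interpretation {
    final String id;
    final RawResult rawResult;
    Interpretation(String id, RawResult rawResult) {
        this.id = id;
        this.rawResult = rawResult;
    }

    /** Follows the backlinks from interpretation to raw result to tool and test item. */
    String provenance() {
        return id + " <- " + rawResult.id + " <- " + rawResult.tool.name
                + " on " + rawResult.testItem.suite + "/" + rawResult.testItem.itemId;
    }
}
```

With links held this way, any interpretation can be traced back to the tool and test data behind it, which is what makes a stored result reproducible.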
  • 12. Tool wrapping and storage • Tools are evaluated within the Platform (i.e., locally). • Tools must have bi-directional communication with the Platform. • Each campaign (e.g., search) defines its own Java API specific to its tool type. • Participants create a tool ‘wrapper’ that implements the API. • The bundle also includes ‘setup’ and ‘tear down’ scripts and any 3rd party libraries / packages required. • Bundles are stored in the Tool Repository Service (TRS). • Tutorials on SEALS portal (http://www.seals-project.eu/).
  • 13. Semantic Search API (method: functionality)
      – boolean loadOntology(URL ontology, String ontologyName, String ontologyNamespace): Load an ontology
      – void showGUI(boolean show): Switch the GUI on or off
      – boolean executeQuery(String query): Execute a query
      – boolean isResultSetReady(): Are query results ready?
      – URL getResults(): Retrieve the URL of the results file
      – boolean isUserInputComplete(): Has user hit ‘go’ (or equivalent)?
      – String getUserQuery(): Retrieve the query as entered by the user
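
The method list above maps naturally onto a Java interface that a tool wrapper would implement. The sketch below is an assumption about how that might look: the seven method signatures come from the slide, but the interface name `SemanticSearchTool`, the class `StubSearchTool`, the helper `toUrl` and the placeholder results path are all hypothetical, and the real SEALS API may declare exceptions or packages not shown here.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URL;

/** Method signatures as listed on the slide; the interface name is hypothetical. */
interface SemanticSearchTool {
    boolean loadOntology(URL ontology, String ontologyName, String ontologyNamespace);
    void showGUI(boolean show);
    boolean executeQuery(String query);
    boolean isResultSetReady();
    URL getResults();
    boolean isUserInputComplete();
    String getUserQuery();
}

/** Hypothetical wrapper around an imaginary tool; it only tracks state. */
final class StubSearchTool implements SemanticSearchTool {
    private boolean ontologyLoaded;
    private boolean resultsReady;
    private String lastQuery;

    /** Helper (not part of the API): converts a string to a URL without checked exceptions. */
    static URL toUrl(String location) {
        try {
            return URI.create(location).toURL();
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    @Override
    public boolean loadOntology(URL ontology, String ontologyName, String ontologyNamespace) {
        ontologyLoaded = ontology != null; // a real wrapper would parse the ontology here
        return ontologyLoaded;
    }

    @Override
    public void showGUI(boolean show) {
        // The stub has no GUI; a real wrapper would show or hide the tool's window.
    }

    @Override
    public boolean executeQuery(String query) {
        if (!ontologyLoaded || query == null) {
            return false; // refuse queries until an ontology has been loaded
        }
        lastQuery = query;
        resultsReady = true; // the stub "finishes" immediately
        return true;
    }

    @Override
    public boolean isResultSetReady() {
        return resultsReady;
    }

    @Override
    public URL getResults() {
        // Placeholder location; a real wrapper would write a results file first.
        return resultsReady ? toUrl("file:///tmp/results.xml") : null;
    }

    @Override
    public boolean isUserInputComplete() {
        return lastQuery != null;
    }

    @Override
    public String getUserQuery() {
        return lastQuery;
    }
}
```

The split between `executeQuery` and `isResultSetReady` suggests that query execution may be asynchronous, so a caller would poll for readiness before fetching the results URL.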
  • 15. Evaluation overview (workflow) [Diagram labels: Infrastructure request, Tools deployment, Tools undeployment, Infrastructure release; Execution request analysis, Execution environment preparation, Evaluation description execution, Execution environment clean-up; ER; Test data stage-in, Activity Execution, Results storage, Test data stage-out]
  • 16. Workflow • Business Process Execution Language (BPEL) – orchestrates manipulation of information using (only) web service interfaces. • All entities involved must be exposed as web services (TDRS, RRS, the tool wrapper, custom services) defined using WSDL interfaces. • ‘Custom services’ allow out-of-band processing: – computation of analyses – data / metadata manipulation – timestamping – etc. • Workflow is defined according to campaign requirements. • Workflows are stored in the Evaluation Repository Service (ERS).
  • 17. Conceptual workflow for search [Flowchart nodes: Start; Load Test Suite (Test Data Repository); Ontology Loaded?; Get Ontology URL (Test Data Repository); Tool: Load Ontology; Loaded successfully?; Record error; Tool: Switch off GUI; Get Query (Test Data Repository); Tool: Execute Query; Tool: Results Ready?; Pause; Tool: Get Results; Store Results (Results Repository); More test cases?; Process Test Case; End]
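
One plausible reading of this flowchart is a loop that loads the ontology once, then executes each query and polls until results are ready. The Java sketch below illustrates that reading only; the actual workflow is a BPEL process calling web services, not Java code. The names `SearchTool`, `SearchWorkflow`, `FakeTool` and `maxPolls` are hypothetical, locations are simplified to strings, and a list stands in for the Results Repository.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified subset of the tool API (strings instead of URLs); hypothetical name. */
interface SearchTool {
    boolean loadOntology(String ontologyUrl, String name, String namespace);
    void showGUI(boolean show);
    boolean executeQuery(String query);
    boolean isResultSetReady();
    String getResults();
}

final class SearchWorkflow {
    /** Runs all queries and returns the entries that would be stored in the results repository. */
    static List<String> run(SearchTool tool, String ontologyUrl, List<String> queries, int maxPolls) {
        List<String> stored = new ArrayList<>();
        // "Tool: Load Ontology", then "Loaded successfully?" with "Record error" on failure.
        if (!tool.loadOntology(ontologyUrl, "test-ontology", "http://example.org/onto#")) {
            stored.add("ERROR: ontology could not be loaded");
            return stored;
        }
        tool.showGUI(false); // "Tool: Switch off GUI"
        for (String query : queries) { // "More test cases?"
            if (!tool.executeQuery(query)) { // "Tool: Execute Query"
                stored.add("ERROR: query rejected: " + query);
                continue;
            }
            // "Tool: Results Ready?" with "Pause" between checks, bounded by maxPolls.
            boolean ready = tool.isResultSetReady();
            for (int polls = 0; !ready && polls < maxPolls; polls++) {
                try {
                    Thread.sleep(10);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    break;
                }
                ready = tool.isResultSetReady();
            }
            // "Tool: Get Results" followed by "Store Results".
            stored.add(ready ? tool.getResults() : "ERROR: timed out: " + query);
        }
        return stored;
    }
}

/** Fake tool for illustration: results become ready after two readiness checks. */
final class FakeTool implements SearchTool {
    private final boolean ontologyLoads;
    private String currentQuery;
    private int checksBeforeReady;

    FakeTool(boolean ontologyLoads) { this.ontologyLoads = ontologyLoads; }

    public boolean loadOntology(String ontologyUrl, String name, String namespace) { return ontologyLoads; }
    public void showGUI(boolean show) { }
    public boolean executeQuery(String query) {
        currentQuery = query;
        checksBeforeReady = 2;
        return true;
    }
    public boolean isResultSetReady() {
        if (checksBeforeReady > 0) {
            checksBeforeReady--;
            return false;
        }
        return true;
    }
    public String getResults() { return "results-for:" + currentQuery; }
}
```

The poll limit is an addition of this sketch rather than something the flowchart shows; without one, a tool that never reports ready results would stall the whole evaluation.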
  • 18. BPEL workflow snippets (stored as XML)
  • 20. Summary • SEALS Platform provides functionality to simplify and automate evaluations. • Powerful cluster-based compute. • Storage of test data, results and interpretations in perpetuity. • Workflows specified in industry-standard BPEL. • All for free!
  • 21. Thank you for your attention! http://www.seals-project.eu/
  • 22. Links to resources • Evaluation campaigns – http://www.seals-project.eu/seals-evaluation-campaigns/ • Test suites – Mooney (NL questions) • http://seals.sti2.at/tdrs-web/testdata/persistent/Mooney+NL+Testsuite/2010/ – EvoOnt (SPARQL questions) • http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+1K+SPARQL+Testsuite/2010/ • http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+10K+SPARQL+Testsuite/2010/ • http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+100K+SPARQL+Testsuite/2010/ • http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+1000K+SPARQL+Testsuite/2010/ • http://seals.sti2.at/tdrs-web/testdata/persistent/EvoOnt+10000K+SPARQL+Testsuite/2010/ – QALD • http://seals.sti2.at/tdrs-web/testdata/persistent/QALD-MusicBrainz+SPARQL+Testsuite/2010/ • Last year’s campaign outcomes – http://www.seals-project.eu/seals-evaluation-campaigns/semantic-search-tools/results-2010
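
The test suite links above appear to share one pattern: a base path, the suite name with spaces written as `+`, and a version segment. The sketch below builds such a URL under that assumption. The class `TestSuiteUrls` and method `persistentSuiteUrl` are hypothetical, the pattern is inferred from the listed links rather than from TDRS documentation, and `URLEncoder` performs form-style encoding, which happens to match these names but is not a general path encoder.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper; the URL pattern is inferred from the links listed on the slide. */
final class TestSuiteUrls {
    private static final String BASE = "http://seals.sti2.at/tdrs-web/testdata/persistent/";

    /** Builds BASE + encoded suite name + "/" + encoded version + "/". */
    static String persistentSuiteUrl(String suiteName, String version) {
        return BASE
                + URLEncoder.encode(suiteName, StandardCharsets.UTF_8) + "/"
                + URLEncoder.encode(version, StandardCharsets.UTF_8) + "/";
    }
}
```

For example, `TestSuiteUrls.persistentSuiteUrl("Mooney NL Testsuite", "2010")` reproduces the Mooney link listed above.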