OWLIM

Mariana Damova, PhD



     DM2E
Vienna, November 2012
Ontotext
   – Top-5 provider of core Semantic Technology
   – Established in year 2000; offices in Bulgaria, UK, USA
   – Active both in research and commercial projects (FP7 funding for 10 years)

• 360° semantic technology – unique portfolio:
   – Semantic Databases: high-performance RDF DBMS, scalable reasoning
   – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR)
   – Web Mining: focused crawling, screen scraping, data fusion
   – Linked Data Management and Data Integration

• Good recognition in the SemTech community
   – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at
     GYM, #3 for “linked data management” at Google

• Several joint ventures and subsidiaries
   – Innovantage: leading online recruitment intelligence provider in UK
Ontotext Clients (selected)

          British Broadcasting Corporation (BBC)
                – Run its World Cup 2010 sites on top of OWLIM
                – Since Mar’12 BBC Sports
                – 2012 Olympics sections are driven
                  by OWLIM and a Concept Extraction service developed by Ontotext
          Press Association (UK)
                – Analysis of Sports news
                – Concept extraction
                – Linked data generation
          Top-3 USA media (not allowed to name)
          The National Archives (UK) contracted Ontotext to implement
          semantic KB and semantic search for the Government Web Archive
          British Museum (UK) Ontotext leads the development of Phase 3 of
          ResearchSpace project on collaborative research in cultural heritage;
          British Museum’s public SPARQL end-point is powered by OWLIM
          de Bibliothek (Holland) aggregation of data from 150 library databases
Semantic Technologies


•   Semantic technologies (RDF, LOD) allow for an unprecedented ease of
    integration of heterogeneous data sources
      – Already adopted in pharmaceuticals and publishing industries
      – Cultural heritage is next

     BBC – when MySQL was replaced with OWLIM in their “Dynamic Semantic
       Publishing” architecture, the BBC team observed a considerable reduction in
       the complexity of database design, query specification, and application
       development, and in query evaluation time.
       Source: Jem Rayfield, Senior Technical Architect, BBC News and Knowledge,
       “BBC World Cup 2010 dynamic semantic publishing”.
       http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html
OWLIM
Semantic Repository for RDFS and OWL

• OWLIM is a family of scalable semantic repositories
   • OWLIM-Lite: in-memory, fastest, scales to ~100 million statements
   • OWLIM-SE: file-based, sameAs & query optimizations, scales to 20 billion
     statements
   • OWLIM-Enterprise: replication cluster deployment for resilience and high
     performance parallel query-answering

• OWLIM provides
    – Management, integration and analysis of heterogeneous data
    – Combined with light-weight, high-performance reasoning
    – The inference is based on logical rule-entailment
    – Full RDFS, OWL Horst, restricted OWL-Lite, OWL2-QL and OWL2 RL
    – Custom semantics can be defined via rules and axiomatic triples
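To illustrate what rule-based entailment means, here is a minimal Python sketch of forward chaining over two standard RDFS rules (rdfs9 and rdfs11). It is an illustration only, not OWLIM's rule language or engine; the `ex:` resources and the `forward_chain` function are hypothetical names.

```python
RDF_TYPE = "rdf:type"
SUBCLASS = "rdfs:subClassOf"

def forward_chain(triples):
    """Apply two RDFS entailment rules repeatedly until no new triples appear."""
    inferred = set(triples)
    while True:
        new = set()
        for (s, p, o) in inferred:
            if p != SUBCLASS:
                continue
            for (s2, p2, o2) in inferred:
                # rdfs11: subClassOf is transitive
                if p2 == SUBCLASS and s2 == o:
                    new.add((s, SUBCLASS, o2))
                # rdfs9: an instance of a subclass is an instance of the superclass
                if p2 == RDF_TYPE and o2 == s:
                    new.add((s2, RDF_TYPE, o))
        if new <= inferred:
            return inferred  # fixpoint reached
        inferred |= new

# Hypothetical explicit statements (illustrative data only)
explicit = {
    ("ex:Painting", SUBCLASS, "ex:Artwork"),
    ("ex:Artwork", SUBCLASS, "ex:MuseumObject"),
    ("ex:obj1", RDF_TYPE, "ex:Painting"),
}
closure = forward_chain(explicit)
```

The closure holds the explicit statements plus everything the two rules derive from them, which is the kind of materialised result a rule-entailment repository makes available to queries.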
OWLIM in the Cultural Heritage Domain

Selected commercial projects
          ResearchSpace project funded by the Andrew W. Mellon Foundation
          Support for collaborative web-based research, information sharing and web publishing for
          the cultural heritage scholarly community. An Ontotext-led international consortium.
             The Polish Digital National Museum aggregates artifacts from over 70 contributing
           cultural institutions in the Digital Libraries Federation PIONIER Network using
           Ontotext's OWLIM repository
            LODAC (Linked Open Data in Academia), Japan's National Institute of Informatics
           aggregates various information across multiple Japanese resources as LOD. The system
           uses 8 OWLIM nodes and aggregates 19 collections with 700 000 entities and 15M triples.
            SemTech for Cultural Heritage project funded by ITCC
           Semantic publishing of Bulgarian cultural heritage to Europeana; establishing a Bulgarian
            technical aggregator for Europeana
Selected research projects
            MOLTO FP7 project: a cultural heritage use case providing a semantic knowledge
           representation infrastructure for querying RDF and presenting query results; includes close
           to 9K museum objects from two collections of The Gothenburg City
             Charisma (Cultural Heritage Advanced Research Infrastructures): an EU-funded
           integrating activity project with a consortium of 21 partners and metadata from 6 major
           European cultural institutions; it has selected Ontotext's OWLIM repository
OWLIM PERFORMANCE



•   OWLIM is a scalable, robust and efficient triple store
     – Serving the two most important web-sites for the London Olympic Games
         • Official Olympics website
         • BBC Olympics website
     – Performance highlights
         • OWLIM loads the 100M and the 200M datasets almost twice as fast as the next best product
           (17 min. for 100M)
         • Best query performance among those repositories that can handle update and multi-client
           query tasks (5,285 Query-mixes-per-hour, where a query mix contains 25 queries; e.g. about
           100 queries/sec)
         • OWLIM v5 is 43% faster than v.4.3 on the BSBM Explore and Update scenario
         • OWLIM v5 requires between 25% and 70% less storage space



•   OWL 2 RL-type languages have proven to be the only feasible approach for
    reasoning over billions of statements
Reasoning complexity
owl:sameAs Optimization

A way to handle equivalent statements through a single master node.
This makes the handling of inferred statements efficient and compact,
and results in 4-6 times more statements being available to query
than were explicitly introduced.
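The idea of a single master node per equivalence class can be sketched as follows. This is an assumption-laden illustration, not OWLIM's actual implementation: it uses a union-find structure, picks the lexicographically smallest identifier as master, and all resource names are hypothetical.

```python
def find(parent, x):
    """Return the master node of x's equivalence class (union-find with path halving)."""
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x

def canonicalise(triples, same_as="owl:sameAs"):
    """Store each non-sameAs statement once, rewritten to use master nodes."""
    parent = {}
    for s, p, o in triples:
        if p == same_as:
            rs, ro = find(parent, s), find(parent, o)
            if rs != ro:
                # Assumption of this sketch: the smaller identifier becomes master
                master, other = sorted((rs, ro))
                parent[other] = master
    stored = set()
    for s, p, o in triples:
        if p != same_as:
            stored.add((find(parent, s), p, find(parent, o)))
    return stored, parent

# Hypothetical data: three identifiers for the same city
triples = [
    ("dbp:Vienna", "owl:sameAs", "geo:2761369"),
    ("geo:2761369", "owl:sameAs", "fb:vienna"),
    ("dbp:Vienna", "ex:country", "ex:Austria"),
    ("fb:vienna", "ex:country", "ex:Austria"),
]
stored, parent = canonicalise(triples)
```

Here the two `ex:country` statements collapse into one stored statement on the master node, while a query phrased with any of the three identifiers can still be answered by mapping it to the master first.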
OWLIM Replication Cluster

• Distribution through data replication is used to ensure:
   – Better handling of concurrent user requests
   – Failover support
• How does it work?
   – Every user request is pushed in a transaction queue
   – Each data write request is multiplexed to all repository instances
   – Each read request is dispatched to one
     instance only
   – To ensure load-balancing, each
     read request is sent to the
     instance with the smallest execution
     queue at that point in time
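The dispatch policy described above can be sketched in a few lines of Python. This is a simplified model under stated assumptions, not the OWLIM-Enterprise code: the `Instance` and `ClusterDispatcher` classes are hypothetical, the global transaction queue is omitted, and requests are only enqueued, never executed.

```python
from collections import deque

class Instance:
    """One repository replica with its own execution queue."""
    def __init__(self, name):
        self.name = name
        self.queue = deque()

class ClusterDispatcher:
    def __init__(self, instances):
        self.instances = list(instances)

    def write(self, update):
        # Writes are multiplexed to every instance so all replicas stay identical
        for inst in self.instances:
            inst.queue.append(("write", update))

    def read(self, query):
        # Reads go to exactly one instance: the one with the shortest queue
        target = min(self.instances, key=lambda inst: len(inst.queue))
        target.queue.append(("read", query))
        return target.name

cluster = ClusterDispatcher([Instance("a"), Instance("b"), Instance("c")])
cluster.write("INSERT DATA { ... }")                  # lands on a, b and c
served_by = [cluster.read(f"q{i}") for i in range(4)]  # spread across replicas
```

Because ties are broken by list order, successive reads rotate through the replicas when all queues have equal length.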
Geo-spatial index

• Geo-spatial information concerns the geometry of points, shapes and distances relative to the
  surface of the Earth (or any spherical object).
• When using OWLIM-SE all angles are in decimal degrees with the latitude ranging from -90 to
  +90 degrees and the longitude ranging from -180 to +180 degrees.




• Airports have a reference point given by latitude, longitude and altitude;
• Political boundaries can be specified by polygons where each vertex is a two-dimensional
  latitude/longitude pair.
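As a small illustration of the coordinate conventions above, the following Python sketch validates decimal-degree coordinates and computes a great-circle distance with the haversine formula. It does not use the OWLIM-SE geo-spatial index; the function names and the mean Earth radius of 6371 km are assumptions of this sketch.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an assumption of this sketch

def check_point(lat, lon):
    """Reject coordinates outside the decimal-degree ranges given above."""
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    return lat, lon

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points on a sphere."""
    check_point(lat1, lon1)
    check_point(lat2, lon2)
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

# Two antipodal points on the equator are half a circumference apart
half_circumference = haversine_km(0.0, 0.0, 0.0, 180.0)
```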
RDF Rank

• OWLIM-SE includes a plug-in that allows for efficient
  calculation of a modification of PageRank over RDF graphs
• Computation of rank values is fast, e.g.
   – 400M LOD statements takes 310 sec (27 iterations)

• Results are available through a system predicate
• Example: get the 100 most important nodes in the RDF graph
      PREFIX rank: <http://www.ontotext.com/owlim/RDFRank#>
      SELECT ?n WHERE { ?n rank:hasRDFRank ?r }
      ORDER BY DESC(?r) LIMIT 100
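For readers unfamiliar with the underlying idea, here is a minimal Python sketch of plain PageRank by power iteration over the subject-to-object links of a set of triples. It is not the modified algorithm used by the OWLIM-SE plug-in, whose details the slides do not give; the damping factor of 0.85, the function name and the sample data are assumptions, and the default of 27 iterations merely echoes the figure quoted above.

```python
def rdf_rank(triples, damping=0.85, iterations=27):
    """Plain PageRank over the directed graph formed by subject -> object links."""
    nodes = sorted({s for s, _, _ in triples} | {o for _, _, o in triples})
    out_links = {n: [] for n in nodes}
    for s, _, o in triples:
        out_links[s].append(o)
    n_nodes = len(nodes)
    rank = {n: 1.0 / n_nodes for n in nodes}
    for _ in range(iterations):
        # Nodes without outgoing links spread their rank evenly over all nodes
        dangling = sum(rank[n] for n in nodes if not out_links[n])
        base = (1 - damping) / n_nodes + damping * dangling / n_nodes
        new_rank = {n: base for n in nodes}
        for n in nodes:
            for target in out_links[n]:
                new_rank[target] += damping * rank[n] / len(out_links[n])
        rank = new_rank
    return rank

# Hypothetical graph in which node "c" is referenced by three other nodes
triples = [("a", "p", "c"), ("b", "p", "c"), ("c", "p", "a"), ("d", "p", "c")]
ranks = rdf_rank(triples)
top = max(ranks, key=ranks.get)
```

Sorting nodes by the computed value in descending order plays the same role as the `ORDER BY DESC(?r)` clause in the query above.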
Define: nested repositories

“Nested repositories” represent a new data
   management concept for RDF data:
•   a mechanism for sharing data stored across
    multiple repositories
•   one repository contains a large body of
    knowledge, which gets embedded in the other
    repositories
•   each of the other repositories contains more
    specific data, which is interlinked with the
    common body of knowledge
http://www.ontotext.com/owlim




                       mariana.damova@ontotext.com

Mais conteúdo relacionado

Destaque

Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
Mariana Damova, Ph.D
 
HBMI_Workshop_Nagoya_Feb2015
HBMI_Workshop_Nagoya_Feb2015HBMI_Workshop_Nagoya_Feb2015
HBMI_Workshop_Nagoya_Feb2015
Shin Yamamoto
 
Empathy_Style_Management_09Jun2015_digest
Empathy_Style_Management_09Jun2015_digestEmpathy_Style_Management_09Jun2015_digest
Empathy_Style_Management_09Jun2015_digest
Shin Yamamoto
 
How to perform "Good_and_New"?
How to perform "Good_and_New"?How to perform "Good_and_New"?
How to perform "Good_and_New"?
Shin Yamamoto
 
Healthcare Business Innovation
Healthcare Business InnovationHealthcare Business Innovation
Healthcare Business Innovation
Shin Yamamoto
 
Empathy_Based_Management_Healthcare_Jull2015
Empathy_Based_Management_Healthcare_Jull2015Empathy_Based_Management_Healthcare_Jull2015
Empathy_Based_Management_Healthcare_Jull2015
Shin Yamamoto
 
Kohchi_BMGen_WS_28Jan2016
Kohchi_BMGen_WS_28Jan2016Kohchi_BMGen_WS_28Jan2016
Kohchi_BMGen_WS_28Jan2016
Shin Yamamoto
 
HBMI_WS_Kobe_07Feb2015
HBMI_WS_Kobe_07Feb2015HBMI_WS_Kobe_07Feb2015
HBMI_WS_Kobe_07Feb2015
Shin Yamamoto
 
Business Model Canvas 6 rules for your communication
Business Model Canvas 6 rules for your communicationBusiness Model Canvas 6 rules for your communication
Business Model Canvas 6 rules for your communication
Shin Yamamoto
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
Mariana Damova, Ph.D
 
Fact forge20 edf
Fact forge20 edfFact forge20 edf
Fact forge20 edf
Mariana Damova, Ph.D
 
Kobe_Medical_Device_Innovation_24Feb2016
Kobe_Medical_Device_Innovation_24Feb2016Kobe_Medical_Device_Innovation_24Feb2016
Kobe_Medical_Device_Innovation_24Feb2016
Shin Yamamoto
 
多職種連携に欠かせない共感_患医ねっと25Feb2016
多職種連携に欠かせない共感_患医ねっと25Feb2016多職種連携に欠かせない共感_患医ねっと25Feb2016
多職種連携に欠かせない共感_患医ねっと25Feb2016
Shin Yamamoto
 
Molbio_carrer_30Nov2016
Molbio_carrer_30Nov2016Molbio_carrer_30Nov2016
Molbio_carrer_30Nov2016
Shin Yamamoto
 
Create_Vision_by_Mindmap_5Feb2015商工会議所
Create_Vision_by_Mindmap_5Feb2015商工会議所Create_Vision_by_Mindmap_5Feb2015商工会議所
Create_Vision_by_Mindmap_5Feb2015商工会議所
Shin Yamamoto
 
1969 Lyndon High School class reunion
1969 Lyndon High School class reunion1969 Lyndon High School class reunion
1969 Lyndon High School class reunion
Doug Anstaett
 
Business Model_Innovation_JMA_6Jul2015
Business Model_Innovation_JMA_6Jul2015Business Model_Innovation_JMA_6Jul2015
Business Model_Innovation_JMA_6Jul2015Shin Yamamoto
 

Destaque (17)

Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
 
HBMI_Workshop_Nagoya_Feb2015
HBMI_Workshop_Nagoya_Feb2015HBMI_Workshop_Nagoya_Feb2015
HBMI_Workshop_Nagoya_Feb2015
 
Empathy_Style_Management_09Jun2015_digest
Empathy_Style_Management_09Jun2015_digestEmpathy_Style_Management_09Jun2015_digest
Empathy_Style_Management_09Jun2015_digest
 
How to perform "Good_and_New"?
How to perform "Good_and_New"?How to perform "Good_and_New"?
How to perform "Good_and_New"?
 
Healthcare Business Innovation
Healthcare Business InnovationHealthcare Business Innovation
Healthcare Business Innovation
 
Empathy_Based_Management_Healthcare_Jull2015
Empathy_Based_Management_Healthcare_Jull2015Empathy_Based_Management_Healthcare_Jull2015
Empathy_Based_Management_Healthcare_Jull2015
 
Kohchi_BMGen_WS_28Jan2016
Kohchi_BMGen_WS_28Jan2016Kohchi_BMGen_WS_28Jan2016
Kohchi_BMGen_WS_28Jan2016
 
HBMI_WS_Kobe_07Feb2015
HBMI_WS_Kobe_07Feb2015HBMI_WS_Kobe_07Feb2015
HBMI_WS_Kobe_07Feb2015
 
Business Model Canvas 6 rules for your communication
Business Model Canvas 6 rules for your communicationBusiness Model Canvas 6 rules for your communication
Business Model Canvas 6 rules for your communication
 
семантични технологии основи
семантични технологии   основисемантични технологии   основи
семантични технологии основи
 
Fact forge20 edf
Fact forge20 edfFact forge20 edf
Fact forge20 edf
 
Kobe_Medical_Device_Innovation_24Feb2016
Kobe_Medical_Device_Innovation_24Feb2016Kobe_Medical_Device_Innovation_24Feb2016
Kobe_Medical_Device_Innovation_24Feb2016
 
多職種連携に欠かせない共感_患医ねっと25Feb2016
多職種連携に欠かせない共感_患医ねっと25Feb2016多職種連携に欠かせない共感_患医ねっと25Feb2016
多職種連携に欠かせない共感_患医ねっと25Feb2016
 
Molbio_carrer_30Nov2016
Molbio_carrer_30Nov2016Molbio_carrer_30Nov2016
Molbio_carrer_30Nov2016
 
Create_Vision_by_Mindmap_5Feb2015商工会議所
Create_Vision_by_Mindmap_5Feb2015商工会議所Create_Vision_by_Mindmap_5Feb2015商工会議所
Create_Vision_by_Mindmap_5Feb2015商工会議所
 
1969 Lyndon High School class reunion
1969 Lyndon High School class reunion1969 Lyndon High School class reunion
1969 Lyndon High School class reunion
 
Business Model_Innovation_JMA_6Jul2015
Business Model_Innovation_JMA_6Jul2015Business Model_Innovation_JMA_6Jul2015
Business Model_Innovation_JMA_6Jul2015
 

Semelhante a Dm2 e ontotext-nov2012

ResearchSpace- Example of a VRE Based on CIDOC CRM
ResearchSpace- Example of a VRE Based on CIDOC CRMResearchSpace- Example of a VRE Based on CIDOC CRM
ResearchSpace- Example of a VRE Based on CIDOC CRM
Vladimir Alexiev, PhD, PMP
 
Do MORe with your data
Do MORe with your dataDo MORe with your data
Do MORe with your data
locloud
 
MARC records for archived websites on the Archive of Tomorrow project / Mark ...
MARC records for archived websites on the Archive of Tomorrow project / Mark ...MARC records for archived websites on the Archive of Tomorrow project / Mark ...
MARC records for archived websites on the Archive of Tomorrow project / Mark ...
CILIP MDG
 
Ee bdm ws-v1
Ee bdm ws-v1Ee bdm ws-v1
About company
About companyAbout company
About company
Ilya Klintsov
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
eswcsummerschool
 
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
Olaf Janssen
 
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
Blue BRIDGE
 
Uk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseUk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcase
RDTF-Discovery
 
Metadata and me
Metadata and meMetadata and me
Metadata and me
Nick Sheppard
 
Museum reasonableview
Museum reasonableviewMuseum reasonableview
Museum reasonableview
Mariana Damova, Ph.D
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
Chiara Del Vescovo
 
Mind the gap! Reflections on the state of repository data harvesting
Mind the gap! Reflections on the state of repository data harvestingMind the gap! Reflections on the state of repository data harvesting
Mind the gap! Reflections on the state of repository data harvesting
Simeon Warner
 
Infrastructure - A necessary platform for user empowerment
Infrastructure - A necessary platform for user empowermentInfrastructure - A necessary platform for user empowerment
Infrastructure - A necessary platform for user empowerment
RICHES
 
Edinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline WorkshopEdinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline Workshop
Petr Pridal
 
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UKThe Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
Andy Powell
 
Europeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsEuropeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom Views
Vladimir Alexiev, PhD, PMP
 
Semantic Technologies for Cultural Heritage
Semantic Technologies for Cultural HeritageSemantic Technologies for Cultural Heritage
Semantic Technologies for Cultural Heritage
Vladimir Alexiev, PhD, PMP
 
Elibrary technical strategy
Elibrary technical strategyElibrary technical strategy
Elibrary technical strategy
ziauddin farooqui
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
Adrian Stevenson
 

Semelhante a Dm2 e ontotext-nov2012 (20)

ResearchSpace- Example of a VRE Based on CIDOC CRM
ResearchSpace- Example of a VRE Based on CIDOC CRMResearchSpace- Example of a VRE Based on CIDOC CRM
ResearchSpace- Example of a VRE Based on CIDOC CRM
 
Do MORe with your data
Do MORe with your dataDo MORe with your data
Do MORe with your data
 
MARC records for archived websites on the Archive of Tomorrow project / Mark ...
MARC records for archived websites on the Archive of Tomorrow project / Mark ...MARC records for archived websites on the Archive of Tomorrow project / Mark ...
MARC records for archived websites on the Archive of Tomorrow project / Mark ...
 
Ee bdm ws-v1
Ee bdm ws-v1Ee bdm ws-v1
Ee bdm ws-v1
 
About company
About companyAbout company
About company
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
 
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
 
Uk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcaseUk discovery-jisc-project-showcase
Uk discovery-jisc-project-showcase
 
Metadata and me
Metadata and meMetadata and me
Metadata and me
 
Museum reasonableview
Museum reasonableviewMuseum reasonableview
Museum reasonableview
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
Mind the gap! Reflections on the state of repository data harvesting
Mind the gap! Reflections on the state of repository data harvestingMind the gap! Reflections on the state of repository data harvesting
Mind the gap! Reflections on the state of repository data harvesting
 
Infrastructure - A necessary platform for user empowerment
Infrastructure - A necessary platform for user empowermentInfrastructure - A necessary platform for user empowerment
Infrastructure - A necessary platform for user empowerment
 
Edinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline WorkshopEdinburgh OldMapsOnline Workshop
Edinburgh OldMapsOnline Workshop
 
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UKThe Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
The Open Archives Initiative Protocol for Metadata Harvesting and ePrints UK
 
Europeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom ViewsEuropeana Creative. EDM Endpoint. Custom Views
Europeana Creative. EDM Endpoint. Custom Views
 
Semantic Technologies for Cultural Heritage
Semantic Technologies for Cultural HeritageSemantic Technologies for Cultural Heritage
Semantic Technologies for Cultural Heritage
 
Elibrary technical strategy
Elibrary technical strategyElibrary technical strategy
Elibrary technical strategy
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 

Mais de Mariana Damova, Ph.D

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
Mariana Damova, Ph.D
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Mariana Damova, Ph.D
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
Mariana Damova, Ph.D
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
Mariana Damova, Ph.D
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
Mariana Damova, Ph.D
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
Mariana Damova, Ph.D
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
Mariana Damova, Ph.D
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
Mariana Damova, Ph.D
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
Mariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Mariana Damova, Ph.D
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
Mariana Damova, Ph.D
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
Mariana Damova, Ph.D
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
Mariana Damova, Ph.D
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
Mariana Damova, Ph.D
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
Mariana Damova, Ph.D
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
Mariana Damova, Ph.D
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
Mariana Damova, Ph.D
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
Mariana Damova, Ph.D
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Mariana Damova, Ph.D
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Mariana Damova, Ph.D
 

Mais de Mariana Damova, Ph.D (20)

ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамоваИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
ИКТ програма 2018-2020 Хоризонт 2020 мариана дамова
 
Geography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic MemoryGeography of Letters - The Spirituality of Sofia in the Historic Memory
Geography of Letters - The Spirituality of Sofia in the Historic Memory
 
Startup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - IntroductionStartup Europe Week Sofia 2017 - Introduction
Startup Europe Week Sofia 2017 - Introduction
 
IndustryInform Service of Mozaika
IndustryInform Service of MozaikaIndustryInform Service of Mozaika
IndustryInform Service of Mozaika
 
Семантични технологии основи
Семантични технологии   основи Семантични технологии   основи
Семантични технологии основи
 
IndustryInform Demo March 2016
IndustryInform Demo March 2016IndustryInform Demo March 2016
IndustryInform Demo March 2016
 
Startup Europe Week Sofia introduction
Startup Europe Week Sofia introductionStartup Europe Week Sofia introduction
Startup Europe Week Sofia introduction
 
Mozaika-Jan2016a
Mozaika-Jan2016aMozaika-Jan2016a
Mozaika-Jan2016a
 
Concordia july2015
Concordia july2015Concordia july2015
Concordia july2015
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23Industry informofmozaikathehumanizingtechnologieslab june23
Industry informofmozaikathehumanizingtechnologieslab june23
 
Communication channels for the european single digital market
Communication channels for the european single digital marketCommunication channels for the european single digital market
Communication channels for the european single digital market
 
Bulgariana europeana27112013 ним
Bulgariana europeana27112013 нимBulgariana europeana27112013 ним
Bulgariana europeana27112013 ним
 
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
NLIWoD ISWC 2014 - Multilingual Retrieval Interface for Structured data on th...
 
Mozaika june2014
Mozaika june2014Mozaika june2014
Mozaika june2014
 
Europeana in Bulgaria
Europeana in BulgariaEuropeana in Bulgaria
Europeana in Bulgaria
 
Bulgariana europeana02112013
Bulgariana europeana02112013Bulgariana europeana02112013
Bulgariana europeana02112013
 
проектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологиипроектиране на онтологии и връзката им с езиковите технологии
проектиране на онтологии и връзката им с езиковите технологии
 
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
Multilingual Access to Cultural Heritage Content on the Semantic Web - Acl2013
 
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
Support Europeana in Securing Funding for the Connecting Europe Facility (CEF)
 

Último

Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Dm2 e ontotext-nov2012

  • 1. OWLIM Mariana Damova, PhD DM2E Vienna, November 2012
  • 2. Ontotext – Top-5 provider of core Semantic Technology – Established in year 2000; offices in Bulgaria, UK, USA – Active both in research and commercial projects (FP7 funding for 10 years) • 360° semantic technology – unique portfolio: – Semantic Databases: high-performance RDF DBMS, scalable reasoning – Semantic Search: text-mining (IE), metadata generation, Information Retrieval (IR) – Web Mining: focused crawling, screen scraping, data fusion – Linked Data Management and Data Integration Good recognition in the SemTech community – Ontotext pages are ranked #1 for “semantic annotation” and “semantic repository” at GYM, #3 for “linked data management” at Google Several joint ventures and subsidiaries – Innovantage: leading online recruitment intelligence provider in UK
  • 3. Ontotext Clients (selected) British Broadcasting Corporation (BBC) – Run its World Cup 2010 sites on top of OWLIM – Since Mar’12 BBC Sports – 2012 Olympics sections are driven by OWLIM and a Concept Extraction service developed by Ontotext Press Association (UK) – Analysis of Sports news – Concept extraction – Linked data generation Top-3 USA media (not allowed to name) The National Archives (UK) contracted Ontotext to implement semantic KB and semantic search for the Government Web Archive British Museum (UK) Ontotext leads the development of Phase 3 of ResearchSpace project on collaborative research in cultural heritage; British Museum’s public SPARQL end-point is powered by OWLIM de Bibliothek (Holland) aggregation of data from 150 library databases
  • 4. Semantic Technologies • Semantic technologies (RDF, LOD) allow for an unprecedented ease of integration of heterogeneous data sources – Already adopted in pharmaceuticals and publishing industries – Cultural heritage is the next • BBC – when MySQL was replaced with OWLIM in their “Dynamic Semantic Publishing” architecture, the BBC team observed considerable reduction of complexity of database design, query specification, application development, and query evaluation time. BBC World Cup 2010 dynamic semantic publishing. Jem Rayfield, Senior Technical Architect BBC News and Knowledge. http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html
  • 6. Semantic Repository for RDFS and OWL • OWLIM is a family of scalable semantic repositories • OWLIM-Lite: in-memory, fastest, scales to ~100 million statements • OWLIM-SE: file-based, sameAs & query optimizations, scales to 20 billion statements • OWLIM-Enterprise: replication cluster deployment for resilience and high-performance parallel query answering • OWLIM provides – Management, integration and analysis of heterogeneous data – Combined with light-weight, high-performance reasoning – Inference based on logical rule entailment – Full RDFS, OWL Horst, restricted OWL-Lite, OWL 2 QL and OWL 2 RL – Custom semantics can be defined via rules and axiomatic triples
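To make "inference based on logical rule entailment" concrete, here is a minimal sketch assuming a forward-chaining (materialisation) strategy and covering only two standard RDFS rules (rdfs9 and rdfs11). The function name `forward_chain` and the `ex:` resources are hypothetical; this is not OWLIM's rule engine or its rule syntax.

```python
# Triples are plain (subject, predicate, object) tuples; all names are illustrative.
SUBCLASS = "rdfs:subClassOf"
TYPE = "rdf:type"


def forward_chain(explicit):
    """Return the closure of `explicit` under rules rdfs9 and rdfs11."""
    closure = set(explicit)
    while True:
        new = set()
        for (s, p, o) in closure:
            if p != SUBCLASS:
                continue
            for (s2, p2, o2) in closure:
                # rdfs11: rdfs:subClassOf is transitive
                if p2 == SUBCLASS and s2 == o:
                    new.add((s, SUBCLASS, o2))
                # rdfs9: an instance of a subclass is an instance of the superclass
                if p2 == TYPE and o2 == s:
                    new.add((s2, TYPE, o))
        if new <= closure:  # fixpoint reached: nothing left to infer
            return closure
        closure |= new


explicit = {
    ("ex:Painting", SUBCLASS, "ex:Artwork"),
    ("ex:Artwork", SUBCLASS, "ex:CulturalObject"),
    ("ex:MonaLisa", TYPE, "ex:Painting"),
}
inferred = forward_chain(explicit) - explicit
for triple in sorted(inferred):
    print(triple)
```

Custom semantics in this picture would amount to adding further rule bodies inside the loop, or adding axiomatic triples to `explicit` before chaining.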
  • 7. OWLIM in the Cultural Heritage Domain Selected commercial projects ResearchSpace project funded by the Andrew W. Mellon Foundation Support for collaborative web-based research, information sharing and web publishing for the cultural heritage scholarly community. An Ontotext-led international consortium. The Polish Digital National Museum aggregates artifacts from over 70 contributing cultural institutions in the Digital Libraries Federation PIONIER Network using OWLIM repository of Ontotext LODAC (Linked Open Data in Academia), Japan's National Institute of Informatics aggregates various information across multiple Japanese resources as LOD. The system uses 8 OWLIM nodes and aggregates 19 collections with 700 000 entities and 15M triples. SemTech for Cultural Heritage project funded by ITCC Semantic publishing of Bulgarian cultural heritage to Europeana Establishing a Bulgarian technical aggregator for Europeana Selected research projects MOLTO FP7 project, a use case in cultural heritage for a semantic knowledge representation infrastructure for querying RDF and presenting query results, includes close to 9K museum objects from two collections of The Gothenburg City Charisma (Cultural Heritage Advanced Research Infrastructures) an EU-funded integrating activity project, a consortium of 21 partners, metadata from 6 major European cultural institutions has selected OWLIM repository of Ontotext
  • 8. OWLIM PERFORMANCE • OWLIM is a scalable, robust and efficient triple store – Serving the two most important web-sites for the London Olympic Games • Official Olympics website • BBC Olympics website – Performance highlights • OWLIM loads the 100M and the 200M datasets almost twice as fast as the next best product (17 min. for 100M) • Best query performance among those repositories that can handle update and multi-client query tasks (5,285 Query-mixes-per-hour, where a query mix contains 25 queries; e.g. about 100 queries/sec) • OWLIM v5 is 43% faster than v4.3 on the BSBM Explore and Update scenario • OWLIM v5 requires between 25% and 70% less storage space • OWL 2 RL-type languages have proven to be the only feasible approach for reasoning with billions of statements
  • 10. owl:sameAs Optimization • Equivalent resources are represented by a single master node • This makes the handling of inferred statements efficient and compact • As a result, 4-6 times more statements are available to query than were explicitly introduced
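The single-master-node idea can be sketched with a union-find structure: every owl:sameAs link merges two equivalence classes, and the remaining statements are stored once against the class representative. This is only an illustration under that assumption; the class `SameAsIndex`, the function `compact`, the rule for picking the master, and all resource names are hypothetical and do not describe OWLIM's internals.

```python
SAME_AS = "owl:sameAs"


class SameAsIndex:
    """Union-find over resources linked by owl:sameAs."""

    def __init__(self):
        self.parent = {}

    def find(self, node):
        """Return the master node of the equivalence class containing `node`."""
        self.parent.setdefault(node, node)
        root = node
        while self.parent[root] != root:
            root = self.parent[root]
        # Path compression: point every visited node straight at the master.
        while self.parent[node] != root:
            self.parent[node], node = root, self.parent[node]
        return root

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            # Arbitrary but deterministic choice: the smaller identifier is master.
            master, other = sorted((ra, rb))
            self.parent[other] = master


def compact(triples):
    """Store each non-sameAs statement once, rewritten onto master nodes."""
    index = SameAsIndex()
    for s, p, o in triples:
        if p == SAME_AS:
            index.union(s, o)
    stored = {(index.find(s), p, index.find(o))
              for s, p, o in triples if p != SAME_AS}
    return index, stored


triples = [
    ("a:Vienna", SAME_AS, "b:Vienna"),
    ("b:Vienna", SAME_AS, "c:Vienna"),
    ("b:Vienna", "ex:country", "ex:Austria"),
    ("c:Vienna", "ex:label", "Wien"),
]
index, stored = compact(triples)
print(sorted(stored))
```

A query mentioning `c:Vienna` would first be rewritten to its master via `index.find`, so the two stored statements answer for all three identifiers without being copied three times.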
  • 11. OWLIM Replication Cluster • Distribution through data replication is used to ensure: – Better handling of concurrent user requests – Failover support • How does it work? – Every user request is pushed onto a transaction queue – Each data write request is multiplexed to all repository instances – Each read request is dispatched to only one of the instances – To ensure load balancing, each read request is sent to the instance with the smallest execution queue at that point in time
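The dispatch rules on this slide can be sketched as follows. This is a toy, single-process model under stated assumptions: the classes `Instance` and `ReplicationMaster`, their methods, and the node names are all hypothetical, and nothing here reflects the actual OWLIM-Enterprise API or its failover logic.

```python
from collections import deque


class Instance:
    """One repository instance with its own execution queue."""

    def __init__(self, name):
        self.name = name
        self.queue = deque()


class ReplicationMaster:
    def __init__(self, instances):
        self.instances = list(instances)
        self.transactions = deque()  # every user request lands here first

    def submit(self, kind, payload):
        self.transactions.append((kind, payload))

    def dispatch(self):
        """Route queued requests; return the instance names chosen per request."""
        routed = []
        while self.transactions:
            kind, payload = self.transactions.popleft()
            if kind == "write":
                # Writes are multiplexed to all instances to keep replicas in sync.
                targets = self.instances
            else:
                # Reads go to the single instance with the shortest queue
                # (ties resolve to the first such instance in the list).
                targets = [min(self.instances, key=lambda i: len(i.queue))]
            for inst in targets:
                inst.queue.append((kind, payload))
            routed.append([inst.name for inst in targets])
        return routed


master = ReplicationMaster(Instance(n) for n in ("node-a", "node-b", "node-c"))
master.submit("write", "INSERT DATA { ... }")
for i in range(4):
    master.submit("read", f"query-{i}")
routed = master.dispatch()
print(routed)
```

Because no instance drains its queue in this sketch, the four reads simply rotate across the nodes; in a running system the queue lengths would reflect how quickly each instance finishes its work.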
  • 12. Geo-spatial index • Geo-spatial information concerns the geometry of points, shapes and distances relative to the surface of the Earth (or any spherical object). • When using OWLIM-SE all angles are in decimal degrees with the latitude ranging from -90 to +90 degrees and the longitude ranging from -180 to +180 degrees. • Airports have a reference point given by latitude, longitude and altitude. • Political boundaries can be specified by polygons where each vertex is a two-dimensional latitude/longitude pair.
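As an illustration of the coordinate convention above, the sketch below validates decimal-degree latitude/longitude pairs and computes a great-circle distance with the haversine formula. The function names and the Earth-radius constant are assumptions of this example; it does not show how the OWLIM-SE geo-spatial index is built or queried.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius, an assumption of this sketch


def validate(lat, lon):
    """Check that a point uses decimal degrees within the stated ranges."""
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    return lat, lon


def great_circle_km(p1, p2):
    """Haversine distance between two (lat, lon) points on a sphere."""
    lat1, lon1 = map(math.radians, validate(*p1))
    lat2, lon2 = map(math.radians, validate(*p2))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    # min() guards against rounding pushing the value slightly above 1.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


# A polygon is just a list of (lat, lon) vertices, each validated the same way.
boundary = [validate(lat, lon) for lat, lon in [(0, 0), (0, 10), (10, 10), (10, 0)]]
half_way_round = great_circle_km((0, 0), (0, 180))
print(round(half_way_round, 1))
```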
  • 13. RDF Rank • OWLIM-SE includes a plug-in that allows for efficient calculation of a modification of PageRank over RDF graphs • Computation of rank values is fast, e.g. – 400M LOD statements take 310 sec (27 iterations) • Results are available through a system predicate • Example: get the 100 most important nodes in the RDF graph SELECT ?n {?n rank:hasRDFRank ?r} ORDER BY DESC(?r) LIMIT 100
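The slide does not say how the plug-in modifies PageRank, so the following is a sketch of plain PageRank by power iteration over a triple set, treating each statement as a link from subject to object. The function `rdf_rank`, the damping factor, the iteration count, and the sample triples are all assumptions made for illustration.

```python
def rdf_rank(triples, damping=0.85, iterations=50):
    """Plain PageRank over the directed graph subject -> object."""
    nodes = sorted({s for s, _, _ in triples} | {o for _, _, o in triples})
    out_links = {node: [] for node in nodes}
    for s, _, o in triples:
        out_links[s].append(o)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing links is spread over all nodes.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        nxt = {node: (1.0 - damping) / n + damping * dangling / n for node in nodes}
        for src in nodes:
            if out_links[src]:
                share = damping * rank[src] / len(out_links[src])
                for dst in out_links[src]:
                    nxt[dst] += share
        rank = nxt
    return rank


triples = [
    ("ex:a", "ex:links", "ex:hub"),
    ("ex:b", "ex:links", "ex:hub"),
    ("ex:c", "ex:links", "ex:hub"),
    ("ex:hub", "ex:links", "ex:a"),
]
rank = rdf_rank(triples)
# Counterpart of ORDER BY DESC(?r) LIMIT 2 in the SPARQL example.
top = sorted(rank, key=rank.get, reverse=True)[:2]
print(top)
```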
  • 14. Define: nested repositories “Nested repositories” represent a new data management concept for RDF data: • a mechanism for sharing data stored across multiple repositories, where • one of them contains a large body of knowledge which gets embedded in other repositories • each containing more specific data, which are being interlinked with the common body of knowledge
  • 15. http://www.ontotext.com/owlim mariana.damova@ontotext.com