2. About Ontotext
• Provides products & solutions for content enrichment and metadata
management
− 70 employees, headquarters in Sofia (Bulgaria)
− Sales presence in NYC and London
• Major clients and industries
− Media & Publishing
− Health Care & Life Sciences
− Cultural Heritage & Digital Libraries
− Financial information providers
− Government
− Education
Aug 2015 2Smart Data Week 2015
3. Ontotext’s Vision for Smart Data Management
3Aug 2015
Graph Database
• Flexible RDF graph data
model
• Ontology metadata layer
Semantic Search
• Semantic, exploratory search
• Metadata driven content
Text Mining & Interlinking
• People, locations,
organisations, topics
• Discover implicit relations
• Reuse open knowledge
graphs
4. SPARQL – the Good & the Bad
• Very good for complex graph pattern matching
• Not so good for
−Full-text search
−Snippet extraction
−Faceted search
−Complex aggregations
−Range queries
Aug 2015 4Smart Data Week 2015
5. What If…
• We could get the full power of SPARQL
• … and extremely fast
−Full-text search / snippet extraction
−Faceted search
−Complex aggregations
−Range queries
• … while using only SPARQL (query + update)
Aug 2015 5Smart Data Week 2015
6. Ontotext GraphDB Connectors
• Provide extremely fast full-text search, range, faceted search,
aggregations
• Utilise an external engine like Lucene, Solr or Elasticsearch
• Flexible schema mapping: index only what you need
• Real-time synchronization of data in GraphDB and the external engine
• Connector management via SPARQL
• Data querying & update via SPARQL
• Based on the GraphDB plug-in architecture
Aug 2015 6Smart Data Week 2015
7. Workflow
Selective
replication
Query Processor
Graph indexesInternal indexes
SPARQL SELECT with or without an
embedded
Lucene/Solr/Elasticsearch query
Solr/Elasticsearch direct
queries
Lucene/Solr/Elasticsearch GraphDB engine
SPARQL INSERT/DELETE
Aug 2015 7Smart Data Week 2015
8. Interface
• All interaction via SPARQL queries
−INSERT for creating connectors
−SELECT for getting connector configuration parameters
−INSERT/SELECT/DELETE for managing & querying RDF data
Aug 2015 8Smart Data Week 2015
16. PREFIX : <http://www.ontotext.com/connectors/elasticsearch#>
PREFIX inst: <http://www.ontotext.com/connectors/elasticsearch/instance#>
SELECT ?facetName ?facetValue ?facetCount
WHERE {
?r a inst:my_index2 ;
:facetFields "year,sugar" ;
:facets _:f .
_:f :facetName ?facetName .
_:f :facetValue ?facetValue .
_:f :facetCount ?facetCount .
}
Faceted Search with SPARQL
Aug 2015 16Smart Data Week 2015
?facetName ?facetValue ?facetCount
year 2012 3
year 2013 2
sugar Dry 3
sugar medium 2
17. Summary
• High-performance full-text search, faceted search & aggregations
within SPARQL are important
• Ontotext GraphDB Connectors provide a solution for the problem,
utilising external engines like Elasticsearch and Solr
• Data access only via SPARQL, external engine component
transparent to applications and users
Aug 2015 17Smart Data Week 2015