This document provides an overview of Semantic Web Company (SWC) and their PoolParty Semantic Suite product. It discusses SWC's background, customers, and partners. It then describes the key components and functionalities of PoolParty, including maintaining vocabularies, entity extraction, linked data integration, and advanced features like custom ontologies and corpus analysis. The document explains how PoolParty can integrate with databases like MarkLogic and Virtuoso, as well as content management systems like Drupal. Overall, the document aims to introduce SWC and PoolParty and demonstrate how their semantic technologies can provide benefits for tasks like data integration, search, and knowledge management.
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
1. Language Technology Industry
Summit 2016, 17.5.2016,
Brussels
Martin Kaltenböck
CFO, Semantic Web Company
POOLPARTY
SEMANTIC SUITE
Solution
Spotlight
2. INTRODUCING SEMANTIC WEB COMPANY (SWC) AND POOLPARTY
Basic facts about the Company
▸ Founded in 2004
▸ Based in Vienna
▸ Privately held
▸ >30 employees, experts in text mining & linked data
▸ SWC participates in EU projects with a total funding of over €17.0 million
▸ SWC named to KMWorld’s 2016 "100 Companies That Matter in Knowledge Management"
▸ Organising SEMANTiCS conference series since 2005 (13-14.9.2016 in Leipzig, DE)
About PoolParty Software Suite
▸ First release in 2009
▸ Current version 5.3
▸ W3C standards compliant
▸ >100 installations world-wide
▸ 50% of SWC’s revenue is reinvested into development of PoolParty
▸ PoolParty can be installed on-premises or used as cloud service
3. SELECTED CUSTOMER REFERENCES AND PARTNERS
SWC headquarters
Customer References
● Credit Suisse
● Boehringer Ingelheim
● Roche
● adidas
● The Pokémon Company
● Canadian Broadcasting Corporation
● Red Bull Media House
● Wolters Kluwer
● Bank of America
● HealthStream
● TC Media
● Techtarget
● BMJ Publishing Group
● CafePress
● Pearson - Always Learning
● Education Services Australia
● American Physical Society
● Healthdirect Australia
● World Bank Group
● Inter-American Development Bank
● Renewable Energy Partnership
● Wood MacKenzie
● Oxford University Press
● International Atomic Energy Agency
● Norwegian Directorate of Immigration
● Ministry of Finance (AT)
● Council of the E.U.
● Australian National Data Service
Partners
● Accenture
● EPAM Systems
● Enterprise Knowledge
● Term Management
● Taxonomy Strategies
● MarkLogic
● Solnet Solutions
● Wolters Kluwer
● Mekon
● Tellura
Regions: US East, US West, AUS/NZL, UK
5. TECHNICAL CORE COMPONENTS
Bain Capital is a venture capital
company based in Boston, MA.
Since inception it has invested in
hundreds of companies including
AMC Entertainment, Brookstone,
and Burger King. The company was
co-founded by Mitt Romney.
Taxonomy &
Ontology Server
Entity Extraction &
Text Mining
6. PoolParty as a supervised learning system
[Diagram: the roles Content Manager, Integrator, and Taxonomist/Ontologist and the components Thesaurus Server, Extractor, PowerTagging, Index, Reference Corpus, and CMS, connected by the relations "uses API", "is user of", "is basis of", "annotates", "enriches", "extends", and "analyzes".]
10. SEMANTIC SEARCH
Beyond simple search over documents: faceted search, smart search assistants, and combined search over unstructured and structured content.
11. TOPIC PAGES
Dynamic Semantic Publishing: Create landing pages on-the-fly
from different content sources and information streams.
15. Use Cases for SKOS, Linked Data, and for Vocabulary Hubs
▸ EIP Water Marketplace
Matchmaking of Supply and Demand in Water Innovation
▸ Climate Tagger (PDF)
Streamline and catalogue data and information resources
▸ CTCN Matchmaking Assistant
Accurate matchmaking between ‘problem statements’ and solution providers
▸ healthdirect Australia (PDF)
Semantic Search based on the Australian Health Thesaurus
▸ Wolters Kluwer (PDF)
Vocabularies as a backbone for enterprise linked data & visualization
▸ Boehringer Ingelheim (PDF)
Vocabularies as means for data integration
▸ A Retailer
Personalization based on controlled vocabularies
16. Climate Tagger
Help organizations in the
climate and development
arenas catalogue, categorize,
contextualize, and connect
data and information
resources.
Climate Tagger is backed by
the expansive Climate
Compatible Development
Thesaurus.
http://www.climatetagger.net
17. EIP Water Matchmaking
Controlled vocabularies
enable accurate matchmaking
between Supply and Demand
for Water Innovation in
Europe.
Matchmaking is based upon
the EIP Water Innovation
Thesaurus (GEMET based).
http://www.eip-water.eu
18. CTCN Matchmaking
Controlled vocabularies
enable accurate
matchmaking between
‘problem statements’ and
capabilities of solution
providers.
Matchmaking is based upon
the Climate Compatible
Development Thesaurus.
Reference
19. healthdirect Australia
Integrated views and
semantic search over more
than 100 trusted sources.
Harmonization of various
metadata systems through
the use of a central
vocabulary hub:
Australian Health Thesaurus.
http://www.healthdirect.gov.au
20. Wolters Kluwer
Usage of controlled
vocabularies as part of the
semantic search
architecture.
Provision of Topics Browser
to navigate topics, relations
and related documents.
Reference: http://vocabulary.wolterskluwer.de
21. Boehringer Ingelheim
Data integration based on
controlled vocabularies:
Linking of structured and
unstructured data.
Semantic search and data
analytics based on RDF
graphs and SPARQL.
Reference
22. A Retailer
Controlled vocabularies
enable personalization,
searchability of localized
content, data governance
and standardization.
Personalizing user
experiences with brands and
products is a data driven
task.
See example
23. SUMMARY: WHY TAXONOMISTS AND INFORMATION ARCHITECTS LIKE POOLPARTY
Read more
Different project stakeholders expect specific
qualities from a semantic technology platform:
I am a taxonomist. I need a tool that
provides convenient functionalities and
intuitive user interfaces for my daily work.
I am an information architect. Enterprise
metadata management deserves scalable
technologies, which provide semantic services
on top of rich APIs based on standards.
27. Maintaining Vocabularies
Taxonomies and controlled
vocabularies are maintained by
using the SKOS standard of W3C.
The intuitive user interface
provides comfortable control
elements like drag & drop or
autocomplete.
A tree view on the taxonomy
plays a central part in navigation
and orientation.
28. SKOS Editor
The SKOS View on a concept
allows the management of
labels (e.g. synonyms),
hierarchies and non-hierarchical
relations, and mappings to other
vocabularies.
Also more complex actions like
merging of concepts, moving of
subtrees or the creation of poly-
hierarchies are supported.
PoolParty fully covers the SKOS
standard of W3C incl. SKOS-XL
and SKOS Collections.
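To make the SKOS building blocks above concrete, here is a minimal sketch of a concept with labels, hierarchy, relations, and mappings, serialized as Turtle. The `Concept` class, its field names, and the example.org URIs are assumptions made for illustration; they are not PoolParty types or API calls. Only the SKOS property names and namespace come from the W3C standard.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """Minimal stand-in for a SKOS concept (hypothetical class, not a PoolParty type)."""
    uri: str
    pref_label: str
    alt_labels: list[str] = field(default_factory=list)   # synonyms
    broader: list[str] = field(default_factory=list)      # parent URIs; several parents form a poly-hierarchy
    related: list[str] = field(default_factory=list)      # non-hierarchical relations
    exact_match: list[str] = field(default_factory=list)  # mappings to other vocabularies

    def to_turtle(self, lang: str = "en") -> str:
        """Serialize the concept as a Turtle snippet (labels are not escaped in this sketch)."""
        parts = ["a skos:Concept", f'skos:prefLabel "{self.pref_label}"@{lang}']
        parts += [f'skos:altLabel "{label}"@{lang}' for label in self.alt_labels]
        parts += [f"skos:broader <{uri}>" for uri in self.broader]
        parts += [f"skos:related <{uri}>" for uri in self.related]
        parts += [f"skos:exactMatch <{uri}>" for uri in self.exact_match]
        return f"<{self.uri}> " + " ;\n    ".join(parts) + " ."


PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."

# Example data: the example.org URIs are invented for this sketch.
solar = Concept(
    uri="http://example.org/vocab/solar-energy",
    pref_label="Solar energy",
    alt_labels=["Solar power"],
    broader=["http://example.org/vocab/renewable-energy"],
    exact_match=["http://dbpedia.org/resource/Solar_energy"],
)
print(PREFIX)
print(solar.to_turtle())
```

Adding a second URI to `broader` is all it takes to express a poly-hierarchy in this representation.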
29. History & Audit Trails
Every change being made on a
concept of a thesaurus is stored
and can be tracked.
A full history containing the author,
timestamp and action being taken
can be displayed for each concept
and for the whole project.
Recovery and rollback can be
managed by PoolParty’s snapshot
mechanism.
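The idea of a tracked history plus snapshot-based rollback can be sketched as follows. This is an illustrative in-memory model only: the `Thesaurus` and `Change` classes and their methods are hypothetical names, and nothing here reflects how PoolParty stores its history or snapshots internally.

```python
import copy
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Change:
    """One audit-trail entry: who did what, when, and to which concept."""
    author: str
    timestamp: datetime
    action: str
    concept: str


class Thesaurus:
    """Hypothetical in-memory thesaurus that records every change."""

    def __init__(self):
        self.labels = {}      # concept id -> preferred label
        self.history = []     # project-wide list of Change entries
        self._snapshots = {}  # snapshot name -> copy of labels

    def _log(self, author, action, concept):
        self.history.append(Change(author, datetime.now(timezone.utc), action, concept))

    def set_label(self, author, concept, label):
        self.labels[concept] = label
        self._log(author, f"set label to {label!r}", concept)

    def snapshot(self, name):
        self._snapshots[name] = copy.deepcopy(self.labels)

    def rollback(self, author, name):
        """Restore a snapshot; the rollback itself is also logged."""
        self.labels = copy.deepcopy(self._snapshots[name])
        self._log(author, f"rollback to {name!r}", "*")

    def history_for(self, concept):
        return [change for change in self.history if change.concept == concept]


thesaurus = Thesaurus()
thesaurus.set_label("anna", "c1", "Water")
thesaurus.snapshot("v1")
thesaurus.set_label("ben", "c1", "H2O")
thesaurus.rollback("anna", "v1")
for change in thesaurus.history:
    print(change.author, change.action, change.concept)
```

Logging the rollback as a change of its own keeps the project-level history complete, so the trail shows both the edit and its reversal.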
30. Linking & Mapping
The same concept can occur in
several taxonomies and can be put
in different contexts.
PoolParty provides a comfortable
dialogue for the semi-automatic
linking between concepts from
several thesauri.
Additionally, concepts can also be
mapped to linked data sources like
DBpedia or Geonames, or even to
non-RDF sources provided by you.
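One way to picture semi-automatic linking is a routine that proposes candidate matches between two thesauri and leaves the final decision to a person. The sketch below scores label pairs with Python's standard `difflib.SequenceMatcher`; the function names, the threshold of 0.85, and the sample URIs are assumptions, and this is not the matching method PoolParty itself uses.

```python
from difflib import SequenceMatcher


def normalize(label: str) -> str:
    """Lowercase a label and collapse whitespace before comparing."""
    return " ".join(label.lower().split())


def suggest_links(source, target, threshold=0.85):
    """Propose (source URI, target URI, score) candidates for a human to confirm or reject.

    Both arguments map a concept URI to its list of labels.
    """
    suggestions = []
    for s_uri, s_labels in source.items():
        for t_uri, t_labels in target.items():
            # Best similarity over all label pairs; 0.0 if either concept has no labels.
            score = max(
                (SequenceMatcher(None, normalize(a), normalize(b)).ratio()
                 for a in s_labels for b in t_labels),
                default=0.0,
            )
            if score >= threshold:
                suggestions.append((s_uri, t_uri, round(score, 2)))
    return sorted(suggestions, key=lambda item: -item[2])


# Invented sample thesauri for illustration.
thesaurus_a = {"ex:a/solar": ["Solar energy", "Solar power"]}
thesaurus_b = {"ex:b/pv": ["Photovoltaics"], "ex:b/solar": ["solar  energy"]}
print(suggest_links(thesaurus_a, thesaurus_b))
```

A confirmed suggestion would then be stored as a SKOS mapping such as `skos:exactMatch` or `skos:closeMatch`, depending on how close the reviewer judges the two concepts to be.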
31. User Management & Roles
User Management is based on user
accounts, roles, and groups.
User authentication can be
integrated with LDAP.
PoolParty’s security layer is based
on Spring Security.
PoolParty’s API is fully integrated
with the security layer.
32. Workflows
Approval (or rejection) of changes
on a thesaurus can be governed by
workflows.
Several roles in the PoolParty
system have different rights to
apply changes, reject or approve
those.
A clearly structured dashboard
helps taxonomists not to lose
track of all the tasks that need to
be performed.
35. Entity Extraction
PoolParty’s API provides a rich set
of methods for text mining and
entity extraction.
This ultra-fast service makes use of
your controlled vocabularies,
therefore it is highly accurate for
your specific domain.
The service will improve over time
and learns from reference text
corpora. It supports over 40
languages and comes with a
powerful disambiguation algorithm.
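The core idea of vocabulary-driven extraction can be illustrated with a small dictionary matcher: every label in the controlled vocabulary, including synonyms, points to a concept, and the text is scanned for those labels. This is a deliberately simplified sketch with an invented vocabulary; it does no disambiguation or learning and does not call PoolParty's API.

```python
import re

# Hypothetical vocabulary: label (preferred or synonym) -> concept URI.
VOCABULARY = {
    "solar energy": "http://example.org/vocab/solar-energy",
    "solar power": "http://example.org/vocab/solar-energy",
    "wind energy": "http://example.org/vocab/wind-energy",
    "energy": "http://example.org/vocab/energy",
}


def extract_concepts(text, vocabulary):
    """Return (matched text, concept URI, start offset) for each label found in text."""
    # Longer labels go first in the alternation so "wind energy" wins over "energy".
    labels = sorted(vocabulary, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(label) for label in labels) + r")\b",
        re.IGNORECASE,
    )
    return [
        (match.group(0), vocabulary[match.group(0).lower()], match.start())
        for match in pattern.finditer(text)
    ]


hits = extract_concepts("Solar power and wind energy reduce energy imports.", VOCABULARY)
for surface, uri, offset in hits:
    print(offset, surface, uri)
```

Because "Solar power" is registered as a synonym, it resolves to the same concept URI as "solar energy", which is what makes the controlled vocabulary useful for consistent tagging.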
36. Custom Schemes & Ontologies
SKOS is based on a simple schema.
This can be expanded by
additional custom schemes.
Custom schemes can be created
with help of PoolParty’s ontology &
schema editor.
For an increased interoperability,
PoolParty provides a rich set of
preconfigured ontologies like
schema.org or FOAF.
37. Quality Management
Data quality and especially the
quality of metadata is key to a
more efficient information
management.
PoolParty Server provides
several built-in quality checks
(e.g. to avoid circularities).
Checks can be executed at run-
time or at any time to generate a
quality report.
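As an illustration of one such check, the sketch below looks for circularities in the broader relation of a taxonomy using a depth-first search. The function name and the dictionary representation are assumptions chosen for brevity; PoolParty's own checks are not shown in the slides.

```python
from typing import Optional


def find_broader_cycle(broader: dict[str, list[str]]) -> Optional[list[str]]:
    """Return one cycle in the broader relation as a list of concepts, or None.

    `broader` maps each concept to the concepts it declares as broader.
    """
    UNSEEN, IN_PROGRESS, DONE = 0, 1, 2
    state: dict[str, int] = {}
    path: list[str] = []

    def visit(node: str) -> Optional[list[str]]:
        state[node] = IN_PROGRESS
        path.append(node)
        for parent in broader.get(node, []):
            parent_state = state.get(parent, UNSEEN)
            if parent_state == IN_PROGRESS:
                # The parent is already on the current path, so the path loops back.
                return path[path.index(parent):] + [parent]
            if parent_state == UNSEEN:
                cycle = visit(parent)
                if cycle:
                    return cycle
        path.pop()
        state[node] = DONE
        return None

    for concept in broader:
        if state.get(concept, UNSEEN) == UNSEEN:
            cycle = visit(concept)
            if cycle:
                return cycle
    return None


# Invented hierarchy with a deliberate error: "energy" points back to "solar energy".
hierarchy = {
    "solar energy": ["renewable energy"],
    "renewable energy": ["energy"],
    "energy": ["solar energy"],
}
print(find_broader_cycle(hierarchy))
```

Run at edit time, a check like this can reject the offending relation immediately; run over the whole project, it can feed a quality report.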
38. Corpus Analysis
PoolParty can automatically
analyze reference text corpora.
The calculation of a statistical
model of a ‘typical vocabulary’
of a specific domain helps to
suggest candidate concepts for
the expansion of a taxonomy.
By this means, the quality of
term extraction improves over
time and potential relations
between concepts and terms can
be suggested by the system.
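A very simple version of this idea compares how often a word appears in a domain corpus with how often it appears in a general background corpus, and suggests over-represented words that are not yet in the vocabulary. The scoring below (a smoothed frequency ratio), the function names, and the sample texts are assumptions for illustration; the statistical model PoolParty actually computes is not described in the slides.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def suggest_candidates(domain_docs, background_docs, known_labels, top_n=3):
    """Rank words that are over-represented in the domain corpus and not yet in the vocabulary."""
    domain = Counter(t for doc in domain_docs for t in tokenize(doc))
    background = Counter(t for doc in background_docs for t in tokenize(doc))
    domain_total = sum(domain.values())
    background_total = sum(background.values())
    vocab_size = len(set(domain) | set(background))

    scores = {}
    for term, count in domain.items():
        if term in known_labels:
            continue  # already part of the taxonomy
        # Add-one smoothing avoids division by zero for terms unseen in the background.
        p_domain = (count + 1) / (domain_total + vocab_size)
        p_background = (background[term] + 1) / (background_total + vocab_size)
        scores[term] = p_domain / p_background
    # Highest ratio first; ties are broken alphabetically.
    return sorted(scores, key=lambda term: (-scores[term], term))[:top_n]


# Invented sample corpora.
domain_docs = [
    "Photovoltaic panels convert sunlight. Photovoltaic output depends on sunlight.",
    "The photovoltaic market grows.",
]
background_docs = ["The market grows. The weather depends on the season."]
print(suggest_candidates(domain_docs, background_docs, known_labels={"sunlight"}))
```

A real system would work with multi-word terms and a much larger background corpus, but the principle of ranking terms by how typical they are for the domain is the same.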
39. Linked Data
The use of Linked Data standards
increases interoperability of your
knowledge graphs & metadata.
With PoolParty, each thesaurus
and ontology can be provided as a
Linked Data graph.
In return, every linked data source
can potentially be used to enrich a
thesaurus.
PoolParty supports scenarios like
‘Enterprise Linked Data’ as well as
‘Linked Open Data’.
40. RDF based ETL
Data processing tasks can be
modelled as pipelines: Make use of
the intuitively usable graphical
interface.
Versatile data integration platform:
Link data from internal and
external data sources in a central
NoSQL linked data warehouse.
Custom plugins: Your data
processing pipelines are highly
customizable by creating your own
data processing units (DPUs).
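The pipeline idea can be sketched as a chain of small processing units that each take a set of triples and return a new one. Everything below is a hypothetical Python model for illustration: the function names, the row format, and the predicate names are invented, and the sketch does not use the plugin interface of the actual ETL tool.

```python
from typing import Callable, Iterable

Triple = tuple[str, str, str]
DPU = Callable[[set[Triple]], set[Triple]]  # a data processing unit in this sketch


def extract_from_rows(rows: Iterable[dict]) -> DPU:
    """Build a DPU that adds one triple per non-id field of each row (hypothetical input format)."""
    rows = list(rows)  # materialize once so the DPU can be reused

    def dpu(triples: set[Triple]) -> set[Triple]:
        new = {
            (row["id"], key, str(value))
            for row in rows
            for key, value in row.items()
            if key != "id"
        }
        return triples | new

    return dpu


def rename_predicate(old: str, new: str) -> DPU:
    """Build a DPU that maps one predicate name to another, e.g. onto a standard vocabulary."""
    def dpu(triples: set[Triple]) -> set[Triple]:
        return {(s, new if p == old else p, o) for s, p, o in triples}

    return dpu


def run_pipeline(dpus: list[DPU]) -> set[Triple]:
    """Run the DPUs in order, passing each one's output to the next."""
    triples: set[Triple] = set()
    for dpu in dpus:
        triples = dpu(triples)
    return triples


# Invented sample data.
rows = [{"id": "ex:doc1", "title": "Water reuse", "topic": "ex:water"}]
result = run_pipeline([extract_from_rows(rows), rename_predicate("title", "dct:title")])
for triple in sorted(result):
    print(triple)
```

Keeping each unit as a plain function from triples to triples is what makes a pipeline easy to extend: a custom step only has to honour that one signature.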
41. GraphSearch
Semantic search at the highest
level: PoolParty Graph Search
Server combines the power of
graph databases and SPARQL
engines with features of
‘traditional’ search engines.
Document search and visual
analytics: Benefit from additional
insights through interactive
visualizations of reports and
search results derived from your
data lake by executing
sophisticated SPARQL queries.
45. YOUR BENEFIT
Semantic as a Service
Standards-based technology
Precise document classification
Semantic Middleware for
Enrichment and Linking
+ =
FULL SEMANTICS
STACK
Fast Time to Results
Ask Anything Universal Index
Trusted Data and Transactions
Enterprise-Grade Security
Scale-Out Commodity Hardware
Lightning Fast and Real-Time
Operational and
Transactional Enterprise
NoSQL Database
Data Integration
Intelligent Search
Deep Analytics
Data Enrichment
Data Governance
Graph-based metadata
management
Superior user friendliness
Beyond search
47. YOUR BENEFIT
Semantic as a Service
Standards-based technology
Precise document classification
Semantic Middleware for
Enrichment and Linking
+ =
GRAPH BASED
ANALYTICS
Performant SPARQL engine
Massive Linked Data Graphs
Transactions
Scaling to trillions of triples
Federated environments
Built-in inferencing
Native database capability
and a virtual database
Data Integration
Intelligent Search
Linked Data
Data Enrichment
Data Virtualization
Graph-based metadata
management
Superior user friendliness
Beyond search
48. SPARQL-based Analytics
This application is based on
documents from Pharma
industry, a database about
impact factors of
publications, and several
taxonomies.
It uses PoolParty Semantic
Integrator incl. UnifiedViews
and Virtuoso as its technical
basis.
Demo Application
50. YOUR BENEFIT
~1.2 million active websites
Semantic as a Service
Standards-based technology
Precise document classification
Semantic Middleware for
Enrichment and Linking
D
+ =
FULL SEMANTICS
STACK
Cutting edge CMS (Standards)
~ 10k active maintained modules
Huge community in place &
stable core team (Association)
Stable, robust & performant
Interfaces to several software
Open Source Content
Management System (CMS)
Data Integration
Intelligent Search
Deep Analytics
Data Enrichment
Data Governance
Graph-based metadata
management
Superior user friendliness
Beyond search