SlideShare a Scribd company logo
1 of 31
Download to read offline
BUILDING THE
FASHION
KNOWLEDGE
GRAPH
TALK AT
CONNECTED DATA LONDON
KATARIINA KARI
07-11-2018
2
Zalando at a Glance
Enterprise Knowledge Graph
Definition of the Knowledge Graph
In the Beginning...
What we implemented and added value
BUILDING THE FASHION
KNOWLEDGE GRAPH
3
ZALANDO AT A GLANCE
~ 4.5billion EUR
revenue 2017
> 200
million
visits
per
month
> 15,000
employees in
Europe
> 90
million
orders
> 24
million
active customers
> 300,000
product choices
~ 2,000
brands
17
countries
as at Aug 2018
OUR VISION:
CONNECTING PEOPLE AND FASHION
5
KNOWLEDGE GRAPH
AS WE UNDERSTAND IT
6
A NAMED DIRECTED GRAPH OF CONCEPTS WITH URL-LIKE IDENTIFIERS
https://knowledge.zalando.net/ontology/pumps
IRI
URL
internal
structural
associative
7
UNDERSTANDING AND SPEAKING OUR CUSTOMER’S LANGUAGE
The right kind
of contents
The best
possible view
contents
SEARCH BROWSING
8
FASHION CONCEPTS ARE THE CORE
Zalando
contents
application
ontology
external
vocabulary
schema.org
extension?
?
COMMUNICATING THE KNOWLEDGE GRAPH
TO DIFFERENT PROFESSIONAL
10
COMMUNICATING THE KNOWLEDGE GRAPH
PRODUCT MANAGERS
Does it improve our
customer experience?
Does it make money?
BACKEND ENGINEERS
Open World
Assumption?
Why Graph Databases?
MACHINE LEARNING
EXPERTS
Only see the graph as a data
source like any other data and
complain there is too little of
the data.
11
IN THE BEGINNING...
“Search can be improved with many Machine Learning algorithms.
Most successful search engines also use Knowledge Graphs to
improve the search. We should explore this possibility.”
“Is a static Category Tree the best way
to represent fashion contents?”
14
IN THE BEGINNING...
Little Semantic Web
Knowledge Inside the
Company
“Ontologies were used in
one project I worked on
in another company.
They did not really
work.”
“Machine Learning
works better.”
“So it is manual work?
Will it scale?”
Upper Management
Endorsement
Team of Backend Developers
was put together and some
research engineers
Knowledge
sharing on
Ontologies
RDF,
SPARQL
15
GETTING INTO THE TOPIC OF SEMANTIC WEB
Do you the benefit and added value of
knowledge graphs?
Team skills
Research
Backend
Engineering
Backend &
Frontend
Engineering
Product
16
GETTING INTO THE TOPIC OF SEMANTIC WEB
Modelling
RDF
GraphDB
OntoClean
“Proper
modelling.
How should it
be done?”
SPARQL
RDF
Syntax,likeTurtle
GraphDB
Modelling
OntoClean
HARD TO LEARN
“Enough
high-quality
data”
“Knowledge modelling. Data
has to be correct at all times,
but at the same time simple
and easy to follow”
“Performance and use
of graph databases”
“balance between a clean
graph and use cases”
EASY TO LEARN
17
COOLEST TOPICS OR FEATURES IN SEMANTIC WEB
“Graph
databases in
general”
“Sparql Query
language”
“interlinkedness: how one
part of the system can make
use of another, almost
unrelated, part”
“The power of SPARQL
for querying the graph.”
“SPARQL and what it can
do with normalised data”
“The possibility to connect
different kinds of information
from multiple sources into
one entity”
“There are many features and
use cases still undiscovered,
which I believe, a graph data
structure helps to fulfil.”
“Ability to create human
understanding and
'intelligence' out of relations.”
18
WHAT WE HAVE
IMPLEMENTED
& BIGGEST ADDED VALUE
19
IMPLEMENTATIONS
Zalando
contents
20
WHERE WE SEE THE MOST VALUE OF THE GRAPH
Measured significant
improvement of search
powered by the graph
Fashion concepts do not
only fetch products, they
fetch editorials and other
kinds of content
21
A KNOWLEDGE GRAPH
FOR AN ENTERPRISE
22
GRAPH CONTENTS IS PEER-REVIEWED
application
ontology
fashion
concepts
maintained in a GitHub repository
pull requests
4-eye principle
MODELLING PRINCIPLES
OntoClean adapted
Consistency in content
connections are analysed for
subsuming fashion concepts
Use Case Driven Modelling
23
NO ZALANDO CONTENTS IN THE GRAPH
Zalando
contents
> 300K products cannot
be stored in the graph
MICROSERVICES TO THE RESCUE
We store rules with which products
can be retrieved from other
systems via API calls.
Another service uses those rules to
index our fashion concept
identifiers onto Zalando’s products.
So that other services can
consume it.
24
GRAPH DATABASE – WHAT TO USE?
DATOMIC
Implement our own
triple store
BLAZEGRAPH
Open-source graph
database
AMAZON NEPTUNE
AWS, compliant, supported
25
HOW WE DECREASED LATENCY
COMPLEX
MODELLING
What is a fashion
concept is implied via
modelling
SPARQL
Queries are long and
complex = latency
INFERENCE
We implement our own
inference rules
26
GDPR
The Knowledge Graph at Zalando is….
satisfying use cases
peer-reviewed
and adapted to a micro-service architecture
The Knowledge Graph
adds convenience for our customers
and drives a dynamic shop experience.
THANK YOU
QUESTIONS?
KATARIINA KARI
katariina.kari@zalando.fi
+358 40 513 5700
07-11-2018
RESEARCH ENGINEER
This presentation and its contents are strictly confidential. It may not, in
whole or in part, be reproduced, redistributed, published or passed on to
any other person by the recipient.
The information in this presentation has not been independently verified. No
representation or warranty, express or implied, is made as to the accuracy
or completeness of the presentation and the information contained herein
and no reliance should be placed on such information. No responsibility is
accepted for any liability for any loss howsoever arising, directly or
indirectly, from this presentation or its contents.
DISCLAIMER
31

More Related Content

What's hot

How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data Visualization
Chi-Yi Kuan
 
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
Shawn Jones
 
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr... Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Databricks
 

What's hot (20)

Sustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive AnalyticsSustainability Investment Research Using Cognitive Analytics
Sustainability Investment Research Using Cognitive Analytics
 
AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101
 
Scalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and HowScalable, Fast Analytics with Graph - Why and How
Scalable, Fast Analytics with Graph - Why and How
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics together
 
Creating a Data Distribution Knowledge Base using Neo4j, UBS
Creating a Data Distribution Knowledge Base using Neo4j, UBSCreating a Data Distribution Knowledge Base using Neo4j, UBS
Creating a Data Distribution Knowledge Base using Neo4j, UBS
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
 
Revolutionizing the Legal Industry with Spark, NLP and Azure Databricks at Cl...
Revolutionizing the Legal Industry with Spark, NLP and Azure Databricks at Cl...Revolutionizing the Legal Industry with Spark, NLP and Azure Databricks at Cl...
Revolutionizing the Legal Industry with Spark, NLP and Azure Databricks at Cl...
 
Going Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph AnalyticsGoing Beyond Rows and Columns with Graph Analytics
Going Beyond Rows and Columns with Graph Analytics
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
Modern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in InsuranceModern Data Discovery and Integration in Insurance
Modern Data Discovery and Integration in Insurance
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data Visualization
 
Large Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel ProcessingLarge Scale Graph Analytics with RDF and LPG Parallel Processing
Large Scale Graph Analytics with RDF and LPG Parallel Processing
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
It’s All About The Cards: Sharing on Social Media Encouraged HTML Metadata G...
 
TigerGraph UI Toolkits Financial Crimes
TigerGraph UI Toolkits Financial CrimesTigerGraph UI Toolkits Financial Crimes
TigerGraph UI Toolkits Financial Crimes
 
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr... Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 
2017-01-08-scaling tribalknowledge
2017-01-08-scaling tribalknowledge2017-01-08-scaling tribalknowledge
2017-01-08-scaling tribalknowledge
 
Building a Scalable Data Science Solution to Outperform Sales Execution in Tr...
Building a Scalable Data Science Solution to Outperform Sales Execution in Tr...Building a Scalable Data Science Solution to Outperform Sales Execution in Tr...
Building a Scalable Data Science Solution to Outperform Sales Execution in Tr...
 

Similar to Building, and communicating, a knowledge graph in Zalando

События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...
События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...
События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...
Ontico
 
RedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter Cailliau
Redis Labs
 
Megha_Singh_Resume
Megha_Singh_ResumeMegha_Singh_Resume
Megha_Singh_Resume
Megha Singh
 

Similar to Building, and communicating, a knowledge graph in Zalando (20)

Building a Knowledge Graph at Zalando
Building a Knowledge Graph at ZalandoBuilding a Knowledge Graph at Zalando
Building a Knowledge Graph at Zalando
 
The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...
The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...
The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...
 
Transforming enterprise it with containers, ap is and integration api manage...
Transforming enterprise it with containers, ap is and integration  api manage...Transforming enterprise it with containers, ap is and integration  api manage...
Transforming enterprise it with containers, ap is and integration api manage...
 
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - BrusselsPoolParty Semantic Suite -  LT-Innovate Industry Summit-2016 - Brussels
PoolParty Semantic Suite - LT-Innovate Industry Summit-2016 - Brussels
 
События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...
События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...
События, шины и интеграция данных в непростом мире микросервисов / Валентин Г...
 
RedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter Cailliau
 
Transform the internal it landscape with APIs and integration
Transform the internal it landscape with APIs and integrationTransform the internal it landscape with APIs and integration
Transform the internal it landscape with APIs and integration
 
[CAS4687] going mobile with a hybrid cloud and on premises architecture rrs
[CAS4687] going mobile with a hybrid cloud and on premises architecture rrs[CAS4687] going mobile with a hybrid cloud and on premises architecture rrs
[CAS4687] going mobile with a hybrid cloud and on premises architecture rrs
 
Racing for the Flexibility Integrating Aras into the IT Landscape
Racing for the Flexibility Integrating Aras into the IT LandscapeRacing for the Flexibility Integrating Aras into the IT Landscape
Racing for the Flexibility Integrating Aras into the IT Landscape
 
[CAS4687] Going Mobile with a Hybrid Cloud and On-Premises architecture
[CAS4687] Going Mobile with a Hybrid Cloud and On-Premises architecture[CAS4687] Going Mobile with a Hybrid Cloud and On-Premises architecture
[CAS4687] Going Mobile with a Hybrid Cloud and On-Premises architecture
 
Accelerating a Path to Digital with a Cloud Data Strategy
Accelerating a Path to Digital with a Cloud Data StrategyAccelerating a Path to Digital with a Cloud Data Strategy
Accelerating a Path to Digital with a Cloud Data Strategy
 
Megha_Singh_Resume
Megha_Singh_ResumeMegha_Singh_Resume
Megha_Singh_Resume
 
How We Made our Tech Organization and Architecture Converge Towards Scalability
How We Made our Tech Organization and Architecture Converge Towards ScalabilityHow We Made our Tech Organization and Architecture Converge Towards Scalability
How We Made our Tech Organization and Architecture Converge Towards Scalability
 
Demystifying Decoupled Drupal for Developers & Content Authors
Demystifying Decoupled Drupal for Developers & Content AuthorsDemystifying Decoupled Drupal for Developers & Content Authors
Demystifying Decoupled Drupal for Developers & Content Authors
 
Democratization of Data @Indix
Democratization of Data @IndixDemocratization of Data @Indix
Democratization of Data @Indix
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
Meet Magento 2015 Utrecht - ElasticSearch - Smile
Meet Magento 2015 Utrecht - ElasticSearch - SmileMeet Magento 2015 Utrecht - ElasticSearch - Smile
Meet Magento 2015 Utrecht - ElasticSearch - Smile
 
Digital Personalisation: Growing Revenue Faster with Digital Experiences That...
Digital Personalisation: Growing Revenue Faster with Digital Experiences That...Digital Personalisation: Growing Revenue Faster with Digital Experiences That...
Digital Personalisation: Growing Revenue Faster with Digital Experiences That...
 
Scaling the mirrorworld with knowledge graphs
Scaling the mirrorworld with knowledge graphsScaling the mirrorworld with knowledge graphs
Scaling the mirrorworld with knowledge graphs
 
The 3 pillars of agile integration: Container, Connector and API
The 3 pillars of agile integration:  Container, Connector and APIThe 3 pillars of agile integration:  Container, Connector and API
The 3 pillars of agile integration: Container, Connector and API
 

More from Connected Data World

The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
Connected Data World
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
Connected Data World
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
Connected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
Connected Data World
 

More from Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora Lassila
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 

Recently uploaded

Recently uploaded (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Building, and communicating, a knowledge graph in Zalando

  • 1. BUILDING THE FASHION KNOWLEDGE GRAPH TALK AT CONNECTED DATA LONDON KATARIINA KARI 07-11-2018
  • 2. 2 Zalando at a Glance Enterprise Knowledge Graph Definition of the Knowledge Graph In the Beginning... What we implemented and added value BUILDING THE FASHION KNOWLEDGE GRAPH
  • 3. 3 ZALANDO AT A GLANCE ~ 4.5billion EUR revenue 2017 > 200 million visits per month > 15,000 employees in Europe > 90 million orders > 24 million active customers > 300,000 product choices ~ 2,000 brands 17 countries as at Aug 2018
  • 5. 5 KNOWLEDGE GRAPH AS WE UNDERSTAND IT
  • 6. 6 A NAMED DIRECTED GRAPH OF CONCEPTS WITH URL-LIKE IDENTIFIERS https://knowledge.zalando.net/ontology/pumps IRI URL internal structural associative
  • 7. 7 UNDERSTANDING AND SPEAKING OUR CUSTOMER’S LANGUAGE The right kind of contents The best possible view contents SEARCH BROWSING
  • 8. 8 FASHION CONCEPTS ARE THE CORE Zalando contents application ontology external vocabulary schema.org extension? ?
  • 9. COMMUNICATING THE KNOWLEDGE GRAPH TO DIFFERENT PROFESSIONAL
  • 10. 10 COMMUNICATING THE KNOWLEDGE GRAPH PRODUCT MANAGERS Does it improve our customer experience? Does it make money? BACKEND ENGINEERS Open World Assumption? Why Graph Databases? MACHINE LEARNING EXPERTS Only see the graph as a data source like any other data and complain there is too little of the data.
  • 12. “Search can be improved with many Machine Learning algorithms. Most successful search engines also use Knowledge Graphs to improve the search. We should explore this possibility.”
  • 13. “Is a static Category Tree the best way to represent fashion contents?”
  • 14. 14 IN THE BEGINNING... Little Semantic Web Knowledge Inside the Company “Ontologies were used in one project I worked on in another company. They did not really work.” “Machine Learning works better.” “So it is manual work? Will it scale?” Upper Management Endorsement Team of Backend Developers was put together and some research engineers Knowledge sharing on Ontologies RDF, SPARQL
  • 15. 15 GETTING INTO THE TOPIC OF SEMANTIC WEB Do you the benefit and added value of knowledge graphs? Team skills Research Backend Engineering Backend & Frontend Engineering Product
  • 16. 16 GETTING INTO THE TOPIC OF SEMANTIC WEB Modelling RDF GraphDB OntoClean “Proper modelling. How should it be done?” SPARQL RDF Syntax,likeTurtle GraphDB Modelling OntoClean HARD TO LEARN “Enough high-quality data” “Knowledge modelling. Data has to be correct at all times, but at the same time simple and easy to follow” “Performance and use of graph databases” “balance between a clean graph and use cases” EASY TO LEARN
  • 17. 17 COOLEST TOPICS OR FEATURES IN SEMANTIC WEB “Graph databases in general” “Sparql Query language” “interlinkedness: how one part of the system can make use of another, almost unrelated, part” “The power of SPARQL for querying the graph.” “SPARQL and what it can do with normalised data” “The possibility to connect different kinds of information from multiple sources into one entity” “There are many features and use cases still undiscovered, which I believe, a graph data structure helps to fulfil.” “Ability to create human understanding and 'intelligence' out of relations.”
  • 18. 18 WHAT WE HAVE IMPLEMENTED & BIGGEST ADDED VALUE
  • 20. 20 WHERE WE SEE THE MOST VALUE OF THE GRAPH Measured significant improvement of search powered by the graph Fashion concepts do not only fetch products, they fetch editorials and other kinds of content
  • 21. 21 A KNOWLEDGE GRAPH FOR AN ENTERPRISE
  • 22. 22 GRAPH CONTENTS IS PEER-REVIEWED application ontology fashion concepts maintained in a GitHub repository pull requests 4-eye principle MODELLING PRINCIPLES OntoClean adapted Consistency in content connections are analysed for subsuming fashion concepts Use Case Driven Modelling
  • 23. 23 NO ZALANDO CONTENTS IN THE GRAPH Zalando contents > 300K products cannot be stored in the graph MICROSERVICES TO THE RESCUE We store rules with which products can be retrieved from other systems via API calls. Another service uses those rules to index our fashion concept identifiers onto Zalando’s products. So that other services can consume it.
  • 24. 24 GRAPH DATABASE – WHAT TO USE? DATOMIC Implement our own triple store BLAZEGRAPH Open-source graph database AMAZON NEPTUNE AWS, compliant, supported
  • 25. 25 HOW WE DECREASED LATENCY COMPLEX MODELLING What is a fashion concept is implied via modelling SPARQL Queries are long and complex = latency INFERENCE We implement our own inference rules
  • 27. The Knowledge Graph at Zalando is…. satisfying use cases peer-reviewed and adapted to a micro-service architecture
  • 28. The Knowledge Graph adds convenience for our customers and drives a dynamic shop experience.
  • 30. KATARIINA KARI katariina.kari@zalando.fi +358 40 513 5700 07-11-2018 RESEARCH ENGINEER
  • 31. This presentation and its contents are strictly confidential. It may not, in whole or in part, be reproduced, redistributed, published or passed on to any other person by the recipient. The information in this presentation has not been independently verified. No representation or warranty, express or implied, is made as to the accuracy or completeness of the presentation and the information contained herein and no reliance should be placed on such information. No responsibility is accepted for any liability for any loss howsoever arising, directly or indirectly, from this presentation or its contents. DISCLAIMER 31