SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
Graph Exploration w/ Neo4j
1
https://s3.amazonaws.com/dev.assets.neo4j.com/wp-content/uploads/graph-data-technologies-graph-databases-for-beginners.png
Our Project Partners
2
3GRAPH EXPLORATION
Efficiently extracting knowledge from graph data
even if we do not know exactly what we are looking for
Graph Exploration: From Users to Large Graphs.
CIKM 2016, SIGMOD 2017, KDD 2018
Adaptive Databases
GRAPH EXPLORATION
4
Graph Exploration Stack
Intuitive queries
Interactive algorithms
Users
Graph
Adaptive Databases
Intuitive queries
Interactive algorithms
GRAPH EXPLORATION
5
Graph Exploration Stack
Users
Graph
This project is about ...
6
3,27 × 109
base pairs
Graph Exploration in Biology - Complex Graphs
http://jcs.biologists.org/content/joces/118/21/4947/F3.large.jpg
Graph Exploration in Biology - Status Quo
MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype)
WHERE p1.name = 'foo1‘ AND p2.name = 'foo2p‘ AND a1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snpORDER BY snp.sid
RETURN collect(snp.sid)
7
MATCH
(p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype
)
WHERE p1.name = 'foo1'
AND p2.name = ‘foo2'
AND a1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snp
MATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene)
WHERE l.feature= 'gene'
RETURN collect(DISTINCT g.name)
MATCH
(p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype)
WHEREp1.name = 'foo1'
ANDp2.name = ‘foo2'
ANDa1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snp
MATCH(snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene)
WHERE l.feature= 'gene'
WITH DISTINCT g ORDER BY g.name
MATCH(g)-[:CODES]-(:Transcript)-[:CODES]-(p:Protein)-[:MEMBER]-(go:Goterm)
WHERE go.namespace= 'biological_process'
WITH DISTINCT go,p
RETURN go.name, count(p) ORDER BY count(p)DESC
LIMIT 10
MATCH
(p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype)
WHERE p1.name = 'foo1'
AND p2.name = ‘foo2'AND a1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snpMATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene)
WHERE l.feature= 'gene'
WITH DISTINCT gORDER BY g.name
MATCH (g)-[:CODES]-(:Transcript)-[:IS]-(ps:Probeset)-[:SIG]-(s:Sample)
WHERE s.name= 'mustafavi'
RETURN DISTINCT g.name
Can we do better?
8
Problem
● Given two node sets:
How similar are they in my understanding?
● Example
→ Set of movies I like
→ Set of movies I don’t know
→ Will I like the movies I don’t know?
9
As
Good
As It
Gets
Hell or
High
Water
Pulp
Fiction
The
Matrix
Skyfall
Avatar
?
What is a Knowledge Graph?
● (directed) graph G : ⟨V, E, φ, ψ⟩, where 
○ V is a set of nodes,
○ E ⊆ V × V is a set of edges,
○ φ : V → LV
 is an edge labeling function
and
○ ψ : E → LE
is a node labeling function
We refer to the elements of LV
and LE
as node
labels and edge labels
10
What are Meta-Paths?
Graph Path
Graph Schema Meta-Path
A meta-path for a path ⟨n1
, ..., nt
⟩, ni
∈ V , 1 ≤ i ≤ t is a sequence 
P : ⟨φ(n1
),ψ (n1
, n2
), ..., ψ (nt−1
, nt
), φ(nt
)⟩ that alternates node- and
edge-types along the path. 
11
Motivating Example
Q: How famous is Diane Kruger in America?
MATCH(n:Person)
WHERE n.name = “Diane Kruger”
RETURN n
MATCH(m:Movie)
WHERE m.location = “America”
RETURN m
Diane
Kruger
As
Good
As It
Gets
Stand
By MeTop
Gun
Pulp
Fiction A Few
Good
Men
The
Matrix
Up
12
How similar are they?
● Similarity depends on
○ expert knowledge
○ connections among nodes
13
Individualized
exploration
Extract ratingsCompute Meta-Paths
What does the System do and how?
Overview
✓✗
◎
Learn representation
for meta-paths
Calculate
similarity
14
Approximate Meta-Paths
Problem: How to compute all meta-paths fast?
Approx. Solution: Mine meta-paths using the graph’s schema!
Meta-Paths Computation
Compute schema
15
Approximate Meta-Paths
Problem: How to compute all meta-paths fast?
Approx. Solution: Mine meta-paths using the graph’s schema!
Meta-Paths Computation
Meta-Path
approximation
• Extract real and some
non-existent
meta-paths via DFS
on schema
Classifier
• Real meta-paths as
training data
• Predict probability of
meta-path
Exhaustive Meta-Path
computation
• Extract real
meta-paths via DFS
• Early termination
Training data Inference data
Neo4j Graph Algorithm Procedures
Use procedures for schema and meta-paths computation
16
Learning a Meta-Path Embedding
Problem: Vector representation required for active
learning and preference prediction.
Meta-Paths Embedding
?
(3 5 1)T
17
Learning a Meta-Path Embedding
Problem: Vector representation required for active
learning and preference prediction.
Solution: Embed meta-paths
→ Similar meta-paths should have similar vectors.
Challenges:
- Multiple similarity interpretations for meta-paths
- Large number of meta-paths.
- Shared parts between meta-paths.
Our method: Modified word2vec (and paragraph
vectors) method.
Meta-Paths Embedding
(3 5 1)T
18
Learn the Domain Value of all Meta-Paths
- Problem: Users don’t want to rate all meta-paths
→ too many
→ time-consuming
→ tedious and boring
- Solution: Label only a few, but very informative paths
→ explore the space of all meta-paths
→ sequentially query the most uncertain datapoint
⇒ use active learning strategy
Active Learning
✓✗
◎
19
Active Learning
20
Use Learned Preferences for Graph Exploration
Result Explanation
Icons made by Eucalyp from www.flaticon.com is
licensed by CC 3.0 BY
Graph (with
meta-paths)
Domain Knowledge
What is
important in
the graph?
Personalized Exploration
Tool
Similarity Measure
Related Nodes
Stats
21
Personalized Exploration Tool
Result Explanation
Transform Nodes to
Vectors
(Graph-Embedding)
Adapt Vectors Using
Domain-Knowledge
Personalized Vector
Space
precomputed
22
Personalized Exploration Tool
Result Explanation
23
Problem: Suggest similar nodes with regard to given node sets and
user preferences.
Solution: Learn personalized similarity metric between nodes →
Similar nodes should have similar vector space embeddings.
Challenges:
- Different view of abstraction between meta-path preferences and
node instances
- High computational costs but online adaption of embedding space
needed
Our method: Modified node2vec through bias on already
pretrained embeddings
Personalized Exploration Tool
Result Explanation
What nodes
are close to
my selection?How close are
my sets?
Find clusters!
What are
outliers?Personalized Vector
Space
24
System Architecture - How does it work with Neo4j?
Neo4j Graph Database
Neo4j Graph Algorithm Procedures
Containing Meta-Paths Computation
Python Backend Server
ReactJS Frontend
Meta-Path
Embedding
Active Learning Explanation
Node selection
Meta-Path
ordering
Result
visualization
25
● Easy to get your code running in neo4j.
● Neo4j-graph-algorithms: efficiency vs convenience.
● Sometimes no stack-trace when an error occurs.
● Great support and community. Always available.
● Cypher: Easy to begin with, hard to master.
(hpi)-[:LIKES]->(neo4j)
What about neo4j?
Meta-Paths Computation
26
Outlook
4 research topics:
● Efficient meta-paths computation
● Sensible meta-paths embedding
● Precise active learning of meta-paths ranking
● Explanation and exploration using ranked meta-paths
Open Source!
● Meta-paths computation in neo4j graph algos
● MetaExp Repository
● You can use and improve as well!
https://hpi.de/mueller/metaexp/ 27

Mais conteúdo relacionado

Mais procurados

Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!TigerGraph
 
Training Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryTraining Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryNeo4j
 
Neo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life SciencesNeo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life SciencesNeo4j
 
Training Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL LibraryTraining Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL LibraryNeo4j
 
Graphs and Financial Services Analytics
Graphs and Financial Services AnalyticsGraphs and Financial Services Analytics
Graphs and Financial Services AnalyticsNeo4j
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareTigerGraph
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AITigerGraph
 
MongoDB company and case studies - john hong
MongoDB company and case studies - john hong MongoDB company and case studies - john hong
MongoDB company and case studies - john hong Ha-Yang(White) Moon
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphTigerGraph
 
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...TigerGraph
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing DataDatabricks
 
Predicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph AlgorithmsPredicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph AlgorithmsDatabricks
 
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...TigerGraph
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 TigerGraph
 
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesWebinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesMongoDB
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesNeo4j
 
Plume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryPlume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryTigerGraph
 
Transform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph TechnologiesTransform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph TechnologiesNeo4j
 
The convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopThe convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopDataWorks Summit
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud ServicesJean Ihm
 

Mais procurados (20)

Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!
 
Training Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryTraining Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL Library
 
Neo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life SciencesNeo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life Sciences
 
Training Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL LibraryTraining Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL Library
 
Graphs and Financial Services Analytics
Graphs and Financial Services AnalyticsGraphs and Financial Services Analytics
Graphs and Financial Services Analytics
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA Hardware
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
 
MongoDB company and case studies - john hong
MongoDB company and case studies - john hong MongoDB company and case studies - john hong
MongoDB company and case studies - john hong
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise Graph
 
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing Data
 
Predicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph AlgorithmsPredicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph Algorithms
 
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4
 
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesWebinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
Plume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryPlume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis Library
 
Transform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph TechnologiesTransform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph Technologies
 
The convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopThe convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on Hadoop
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud Services
 

Semelhante a Neo4j MeetUp - Graph Exploration with MetaExp

Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXBenjamin Bengfort
 
Dagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsDagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsArijit Khan
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneDoug Needham
 
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022ArangoDB Database
 
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....ArangoDB Database
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingNesreen K. Ahmed
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningConnected Data World
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning ProjectAdeyemi Fowe
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Eiji Sekiya
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Doug Needham
 
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxImprove ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxNeo4j
 
Generative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural NetworksGenerative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural NetworksDenis Dus
 
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022ArangoDB Database
 
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Introduction to Graph neural networks @  Vienna Deep Learning meetupIntroduction to Graph neural networks @  Vienna Deep Learning meetup
Introduction to Graph neural networks @ Vienna Deep Learning meetupLiad Magen
 
Relational machine-learning
Relational machine-learningRelational machine-learning
Relational machine-learningBhushan Kotnis
 
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GANRecent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GANHao-Wen (Herman) Dong
 

Semelhante a Neo4j MeetUp - Graph Exploration with MetaExp (20)

Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkX
 
Dagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsDagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphs
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZone
 
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
 
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep Learning
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning Project
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
Ghost
GhostGhost
Ghost
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
 
PointNet
PointNetPointNet
PointNet
 
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxImprove ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
 
Generative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural NetworksGenerative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural Networks
 
Yarn spark next_gen_hadoop_8_jan_2014
Yarn spark next_gen_hadoop_8_jan_2014Yarn spark next_gen_hadoop_8_jan_2014
Yarn spark next_gen_hadoop_8_jan_2014
 
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
 
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Introduction to Graph neural networks @  Vienna Deep Learning meetupIntroduction to Graph neural networks @  Vienna Deep Learning meetup
Introduction to Graph neural networks @ Vienna Deep Learning meetup
 
Relational machine-learning
Relational machine-learningRelational machine-learning
Relational machine-learning
 
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GANRecent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
 

Último

英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxAndreas Kunz
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...Akihiro Suda
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
How To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROHow To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROmotivationalword821
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfYashikaSharma391629
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 

Último (20)

英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
How To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTROHow To Manage Restaurant Staff -BTRESTRO
How To Manage Restaurant Staff -BTRESTRO
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdfInnovate and Collaborate- Harnessing the Power of Open Source Software.pdf
Innovate and Collaborate- Harnessing the Power of Open Source Software.pdf
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 

Neo4j MeetUp - Graph Exploration with MetaExp

  • 1. Graph Exploration w/ Neo4j 1 https://s3.amazonaws.com/dev.assets.neo4j.com/wp-content/uploads/graph-data-technologies-graph-databases-for-beginners.png
  • 3. 3GRAPH EXPLORATION Efficiently extracting knowledge from graph data even if we do not know exactly what we are looking for Graph Exploration: From Users to Large Graphs. CIKM 2016, SIGMOD 2017, KDD 2018
  • 4. Adaptive Databases GRAPH EXPLORATION 4 Graph Exploration Stack Intuitive queries Interactive algorithms Users Graph
  • 5. Adaptive Databases Intuitive queries Interactive algorithms GRAPH EXPLORATION 5 Graph Exploration Stack Users Graph This project is about ...
  • 6. 6 3,27 × 109 base pairs Graph Exploration in Biology - Complex Graphs http://jcs.biologists.org/content/joces/118/21/4947/F3.large.jpg
  • 7. Graph Exploration in Biology - Status Quo MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype) WHERE p1.name = 'foo1‘ AND p2.name = 'foo2p‘ AND a1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snpORDER BY snp.sid RETURN collect(snp.sid) 7 MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype ) WHERE p1.name = 'foo1' AND p2.name = ‘foo2' AND a1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snp MATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene) WHERE l.feature= 'gene' RETURN collect(DISTINCT g.name) MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype) WHEREp1.name = 'foo1' ANDp2.name = ‘foo2' ANDa1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snp MATCH(snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene) WHERE l.feature= 'gene' WITH DISTINCT g ORDER BY g.name MATCH(g)-[:CODES]-(:Transcript)-[:CODES]-(p:Protein)-[:MEMBER]-(go:Goterm) WHERE go.namespace= 'biological_process' WITH DISTINCT go,p RETURN go.name, count(p) ORDER BY count(p)DESC LIMIT 10 MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype) WHERE p1.name = 'foo1' AND p2.name = ‘foo2'AND a1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snpMATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene) WHERE l.feature= 'gene' WITH DISTINCT gORDER BY g.name MATCH (g)-[:CODES]-(:Transcript)-[:IS]-(ps:Probeset)-[:SIG]-(s:Sample) WHERE s.name= 'mustafavi' RETURN DISTINCT g.name
  • 8. Can we do better? 8
  • 9. Problem ● Given two node sets: How similar are they in my understanding? ● Example → Set of movies I like → Set of movies I don’t know → Will I like the movies I don’t know? 9 As Good As It Gets Hell or High Water Pulp Fiction The Matrix Skyfall Avatar ?
  • 10. What is a Knowledge Graph? ● (directed) graph G : ⟨V, E, φ, ψ⟩, where  ○ V is a set of nodes, ○ E ⊆ V × V is a set of edges, ○ φ : V → LV  is an edge labeling function and ○ ψ : E → LE is a node labeling function We refer to the elements of LV and LE as node labels and edge labels 10
  • 11. What are Meta-Paths? Graph Path Graph Schema Meta-Path A meta-path for a path ⟨n1 , ..., nt ⟩, ni ∈ V , 1 ≤ i ≤ t is a sequence  P : ⟨φ(n1 ),ψ (n1 , n2 ), ..., ψ (nt−1 , nt ), φ(nt )⟩ that alternates node- and edge-types along the path.  11
  • 12. Motivating Example Q: How famous is Diane Kruger in America? MATCH(n:Person) WHERE n.name = “Diane Kruger” RETURN n MATCH(m:Movie) WHERE m.location = “America” RETURN m Diane Kruger As Good As It Gets Stand By MeTop Gun Pulp Fiction A Few Good Men The Matrix Up 12
  • 13. How similar are they? ● Similarity depends on ○ expert knowledge ○ connections among nodes 13
  • 14. Individualized exploration Extract ratingsCompute Meta-Paths What does the System do and how? Overview ✓✗ ◎ Learn representation for meta-paths Calculate similarity 14
  • 15. Approximate Meta-Paths Problem: How to compute all meta-paths fast? Approx. Solution: Mine meta-paths using the graph’s schema! Meta-Paths Computation Compute schema 15
  • 16. Approximate Meta-Paths Problem: How to compute all meta-paths fast? Approx. Solution: Mine meta-paths using the graph’s schema! Meta-Paths Computation Meta-Path approximation • Extract real and some non-existent meta-paths via DFS on schema Classifier • Real meta-paths as training data • Predict probability of meta-path Exhaustive Meta-Path computation • Extract real meta-paths via DFS • Early termination Training data Inference data Neo4j Graph Algorithm Procedures Use procedures for schema and meta-paths computation 16
  • 17. Learning a Meta-Path Embedding Problem: Vector representation required for active learning and preference prediction. Meta-Paths Embedding ? (3 5 1)T 17
  • 18. Learning a Meta-Path Embedding Problem: Vector representation required for active learning and preference prediction. Solution: Embed meta-paths → Similar meta-paths should have similar vectors. Challenges: - Multiple similarity interpretations for meta-paths - Large number of meta-paths. - Shared parts between meta-paths. Our method: Modified word2vec (and paragraph vectors) method. Meta-Paths Embedding (3 5 1)T 18
  • 19. Learn the Domain Value of all Meta-Paths - Problem: Users don’t want to rate all meta-paths → too many → time-consuming → tedious and boring - Solution: Label only a few, but very informative paths → explore the space of all meta-paths → sequentially query the most uncertain datapoint ⇒ use active learning strategy Active Learning ✓✗ ◎ 19
  • 21. Use Learned Preferences for Graph Exploration Result Explanation Icons made by Eucalyp from www.flaticon.com is licensed by CC 3.0 BY Graph (with meta-paths) Domain Knowledge What is important in the graph? Personalized Exploration Tool Similarity Measure Related Nodes Stats 21
  • 22. Personalized Exploration Tool Result Explanation Transform Nodes to Vectors (Graph-Embedding) Adapt Vectors Using Domain-Knowledge Personalized Vector Space precomputed 22
  • 23. Personalized Exploration Tool Result Explanation 23 Problem: Suggest similar nodes with regard to given node sets and user preferences. Solution: Learn personalized similarity metric between nodes → Similar nodes should have similar vector space embeddings. Challenges: - Different view of abstraction between meta-path preferences and node instances - High computational costs but online adaption of embedding space needed Our method: Modified node2vec through bias on already pretrained embeddings
  • 24. Personalized Exploration Tool Result Explanation What nodes are close to my selection?How close are my sets? Find clusters! What are outliers?Personalized Vector Space 24
  • 25. System Architecture - How does it work with Neo4j? Neo4j Graph Database Neo4j Graph Algorithm Procedures Containing Meta-Paths Computation Python Backend Server ReactJS Frontend Meta-Path Embedding Active Learning Explanation Node selection Meta-Path ordering Result visualization 25
  • 26. ● Easy to get your code running in neo4j. ● Neo4j-graph-algorithms: efficiency vs convenience. ● Sometimes no stack-trace when an error occurs. ● Great support and community. Always available. ● Cypher: Easy to begin with, hard to master. (hpi)-[:LIKES]->(neo4j) What about neo4j? Meta-Paths Computation 26
  • 27. Outlook 4 research topics: ● Efficient meta-paths computation ● Sensible meta-paths embedding ● Precise active learning of meta-paths ranking ● Explanation and exploration using ranked meta-paths Open Source! ● Meta-paths computation in neo4j graph algos ● MetaExp Repository ● You can use and improve as well! https://hpi.de/mueller/metaexp/ 27