Natural Language
Search
...using Neo4j
We’ll be covering...
What is natural language search?
What do brains and graphs have in common?
How do you model time as a graph?
How do you model time-based events on a graph?
How do you anticipate natural language queries and
map those to results?
How do you transform answers into questions?
What is Natural
Language Search?
Natural language search means querying a
database in your own everyday language.
In a way, it is like programming a
person with words (teaching, evangelism,
sales pitches, planning, etc.)
What do brains and
graphs have in common?
Networks condense a lot of information into
small points.
These small points help us understand or
interpret a lot of information by exploring the
world from many different small points.
Graphs, like brains, help us explore a lot of
information from relative points.
But what is a network?
A network is a representation or model of the
interconnectedness of information.
A graph is the mathematical structure
that describes how the points in a
network are connected.
A graph database merges these two concepts
into a persistent storage medium.

Networks (Information) + Graph (Mathematics) = Neo4j
Graph of people meeting people

Anne met Pam
Pam met Sally
Sally met Anne
John met Sally
Path Finding = Searching
The key component when using a graph
database is traversals.
Traversals explore the pathways in a network
by enumerating the possible paths.
Paths that meet the query's criteria are
returned as results.
(Neo4j’s Cypher Query Language)
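A minimal Python sketch of "path finding = searching", using the four meetings from the people graph. It is not how Neo4j traverses a graph internally; it only illustrates enumerating candidate paths and keeping the ones that meet a criterion. Treating "met" as an undirected relationship, and the names `build_graph` and `simple_paths`, are assumptions made for illustration.

```python
from collections import defaultdict

# The four "met" facts from the slide, treated as undirected edges (an assumption).
MEETINGS = [("Anne", "Pam"), ("Pam", "Sally"), ("Sally", "Anne"), ("John", "Sally")]

def build_graph(edges):
    """Build an adjacency map: person -> set of people they met."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph

def simple_paths(graph, start, goal, path=None):
    """Yield every path from start to goal that visits no person twice."""
    path = (path or []) + [start]
    if start == goal:
        yield path
        return
    for neighbor in sorted(graph[start]):
        if neighbor not in path:
            yield from simple_paths(graph, neighbor, goal, path)

graph = build_graph(MEETINGS)
all_paths = list(simple_paths(graph, "John", "Anne"))  # enumerate the possibilities
shortest = min(all_paths, key=len)                     # criterion: fewest hops
```

Here the "query" is the criterion applied to the enumerated paths: John reaches Anne either directly through Sally or by way of Sally and Pam, and the shortest of the two is returned.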
Time-based traversals
Time is a hierarchical method of categorizing
the linearity of global events.
Hours, minutes, seconds...
“Neo4j Meetup is at 6:00 PM on October 29th”
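One way to picture time as a hierarchy is to break a timestamp into levels, coarsest first, and attach the event at the end of that path. The sketch below uses nested dictionaries as a stand-in for graph nodes; it is not the Time Scale Event Meta Model itself. The level names, the helper names `time_path` and `add_event`, and the year 2013 (the slide gives only the day and hour) are assumptions.

```python
from datetime import datetime

def time_path(moment):
    """Break a timestamp into hierarchy levels, coarsest first."""
    return [
        ("Year", moment.year),
        ("Month", moment.month),
        ("Day", moment.day),
        ("Hour", moment.hour),
    ]

def add_event(tree, moment, event):
    """Walk (and create) one branch per level, then attach the event at the leaf."""
    node = tree
    for level in time_path(moment):
        node = node.setdefault(level, {})
    node.setdefault("events", []).append(event)
    return tree

# The slide gives no year; 2013 is assumed here purely for illustration.
meetup = datetime(2013, 10, 29, 18, 0)
tree = add_event({}, meetup, "Neo4j Meetup")
```

Events that share a year, month, or day share the upper part of the branch, so "everything on October 29th" becomes a walk down to one `Day` entry.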
Time Scale Event Meta Model

Modeling events over time is easy in Neo4j
Let’s go over the GraphGist for the Time Scale
Event Meta Model
http://gist.neo4j.org/?github-kbastani%2Fgists%2F%2Fmeta%2FTimeScaleEventMetaModel.adoc
Neo4j allows you to store information as a
series of paths, and that is really valuable for
giving a user options when it comes to search.
It starts with something I call
“Search Cache”
Search Cache
A search cache is a repository of all relevant paths condensed into
a hierarchical data store.
A hierarchical data store works like folder paths: it flattens a storage
collection into linear paths (a form of dimensionality reduction).
An address is a hierarchy, revealing a path.
ex. http://www.neo4j.com/download
ex. > root\neo4j-community\bin\neo4j.sh
Natural language path:
> what is the matrix?
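A natural language query can be turned into a hierarchical path in the same spirit as a URL or folder path. The sketch below is one possible normalization (lowercase, strip punctuation, join words with `/`); the function name `query_to_path` and the exact rules are assumptions, not the method used in the talk's project.

```python
import re

def query_to_path(query):
    """Lowercase the query, drop punctuation, and join the words with '/'."""
    words = re.findall(r"[a-z0-9']+", query.lower())
    return "/".join(words)

path = query_to_path("What is the matrix?")
```

With this normalization, "What is the matrix?" and "what is the matrix" resolve to the same cache address, `what/is/the/matrix`.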
Type Ahead / Autocomplete
For search it comes down to enumerating over
all possibilities and then mapping those paths
to an action.
http://kbastani.github.io/predictive-autocomplete
Never do real-time processing for natural
language search (it is a hard problem, which means it will take time)
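To illustrate doing the work ahead of time instead of at query time, the sketch below precomputes every prefix of a few known phrases so that a type-ahead lookup is a single dictionary read. It is not the implementation behind the linked predictive-autocomplete project; the phrase list (apart from "what is the matrix"), the `limit` parameter, and the name `build_prefix_index` are hypothetical.

```python
from collections import defaultdict

def build_prefix_index(phrases, limit=5):
    """Precompute prefix -> up to `limit` completions, in alphabetical order."""
    index = defaultdict(list)
    for phrase in sorted(phrases):
        for end in range(1, len(phrase) + 1):
            bucket = index[phrase[:end]]
            if len(bucket) < limit:
                bucket.append(phrase)
    return dict(index)

# Hypothetical phrases; in practice these would come from enumerating graph paths.
PHRASES = ["what is the matrix", "what is neo4j", "who is neo"]
index = build_prefix_index(PHRASES)

# At query time a suggestion lookup does no processing beyond a dictionary read.
suggestions = index.get("what is", [])
```

The trade-off is storage for latency: the index grows with the total length of all phrases, but each keystroke costs one lookup.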
Distributed Caching Frameworks
Take a distributed approach to building out
your search cache.
Use Neo4j to model your network and then
enumerate over all possibilities as a query and
add each possibility to a search cache.
Distribute the load across a network of compute
instances, MapReduce-style.
In C# at
http://kbastani.github.io/predictive-autocomplete
How do I build a search cache?
The best way to do this is using blob storage.
I use Windows Azure, but you can use any
data store, as long as each path maps to a JSON
file retrievable via an HTTP GET request.
ex. HTTP GET
../natural/language/search/is/cool

... Working on an open source project using C#
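The idea of one JSON file per path can be sketched with the local filesystem standing in for blob storage. Nothing below uses Azure or HTTP; the temporary directory, the `.json` suffix, the payload shape, and the helper names `write_cache_entry` and `read_cache_entry` are assumptions chosen to keep the example self-contained.

```python
import json
import tempfile
from pathlib import Path

def entry_file(root, query_path):
    """Map a path such as 'a/b/c' to the file root/a/b/c.json."""
    return Path(root).joinpath(*query_path.split("/")).with_suffix(".json")

def write_cache_entry(root, query_path, payload):
    """Store payload as JSON at the mapped file, creating folders as needed."""
    target = entry_file(root, query_path)
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(json.dumps(payload), encoding="utf-8")
    return target

def read_cache_entry(root, query_path):
    """Stand-in for the HTTP GET: return the stored JSON, or None on a miss."""
    target = entry_file(root, query_path)
    if not target.exists():
        return None
    return json.loads(target.read_text(encoding="utf-8"))

cache_root = tempfile.mkdtemp()  # local stand-in for a blob container
write_cache_entry(cache_root, "natural/language/search/is/cool",
                  {"results": ["hypothetical result"]})
hit = read_cache_entry(cache_root, "natural/language/search/is/cool")
miss = read_cache_entry(cache_root, "natural/language/search/is/hard")
```

A lookup either finds a file at the path or it does not, which is what makes the cache easy to serve from any static store.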
How to transform answers into
questions?
You have a bunch of answers already in natural
language.
Each language has a specific template that allows you
to transform an answer into a question.
“X is Y” -> “What is X?”
Is X a Person? Then “Who is X?”
Add “What is X?” to the search cache.
Example: http://www.arktera.com/
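The "X is Y" template can be sketched as a small function. This covers only that single English template; the way a person is detected here (membership in a caller-supplied set), the example sentences, and the name `answer_to_question` are assumptions for illustration.

```python
import re

def answer_to_question(answer, people=frozenset()):
    """Turn 'X is Y' into 'What is X?', or 'Who is X?' when X is a known person."""
    match = re.match(r"^(?P<subject>.+?)\s+is\s+.+$", answer.strip())
    if match is None:
        return None  # the sentence does not fit the "X is Y" template
    subject = match.group("subject")
    pronoun = "Who" if subject in people else "What"
    return f"{pronoun} is {subject}?"

# Hypothetical answers, used only to exercise the template.
q1 = answer_to_question("Neo4j is a graph database")
q2 = answer_to_question("Kenny is a speaker", people={"Kenny"})
q3 = answer_to_question("Graphs everywhere")
```

Each generated question would then be added to the search cache alongside the answer it came from.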
Questions?

MATCH (questions)-[:without]->(answers)
RETURN *
0 Results Found
Neo4j Events
http://www.graphconnect.com
New York: November 5-6
London: November 18-19
http://www.graphconnect.com/videos
Watch the videos! Very valuable insights
from our community
Neo4j Trainings

Interested in Neo4j training?
Talk to me after!
Thanks!
Follow me on Twitter!
@kennybastani
Connect with me on LinkedIn
/in/kennybastani

Natural Language Search with Neo4j - Kenny Bastani @ GraphConnect NY 2013

  • 2. We’ll be covering... What is natural language search? What do brains and graphs have in common? How do you model time as a graph? How do you model time-based events on a graph? How do you anticipate natural language queries and map those to results? How do you transform answers into questions?
  • 3. What is Natural Language Search? Natural language search is like querying a database using your own natural language. In a way, it is kind of like programming a person with words (Teaching, Evangelism, Sales Pitches, Planning, etc.)
  • 5. What do brains and graphs have in common? Networks condense a lot of information into small points. These small points help us understand or interpret a lot of information by exploring the world from many different small points. Graphs, like brains, help us explore a lot of information from relative points.
  • 6. But what is a network? A network is a representation or model of the interconnectedness of information. A graph is the de facto mathematical component that defines the level of interconnectivity in a network. A graph database merges these two concepts into a persistent storage medium. Networks (Information) + Graph (Mathematics) = Neo4j
  • 7. Graph of people meeting people: Anne met Pam; Pam met Sally; Sally met Anne; John met Sally.
  • 8. Path Finding = Searching Traversals are the key component when using a graph database. Traversals model the pathways in a network by enumerating over all possibilities. Possibilities that meet the query's criteria are returned by the query. (Neo4j’s Cypher Query Language)
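The idea on slides 7 and 8 can be sketched in a few lines of Python. This is only an illustration of "enumerate every path, keep the ones that meet a criterion"; it is not Cypher and not how Neo4j implements traversals. Treating "met" as an undirected relationship and the helper names `build_graph` and `simple_paths` are assumptions made for the example.

```python
from typing import Dict, Iterator, List, Optional, Set, Tuple

# The "met" relationships from slide 7.
# Assumption: "met" is symmetric, so each edge is stored in both directions.
MET: List[Tuple[str, str]] = [
    ("Anne", "Pam"),
    ("Pam", "Sally"),
    ("Sally", "Anne"),
    ("John", "Sally"),
]


def build_graph(edges: List[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Build an adjacency map from a list of undirected edges."""
    graph: Dict[str, Set[str]] = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def simple_paths(
    graph: Dict[str, Set[str]],
    start: str,
    goal: str,
    path: Optional[List[str]] = None,
) -> Iterator[List[str]]:
    """Enumerate every path from start to goal that never revisits a person."""
    path = (path or []) + [start]
    if start == goal:
        yield path
        return
    for neighbor in sorted(graph[start]):  # sorted for a stable order
        if neighbor not in path:
            yield from simple_paths(graph, neighbor, goal, path)


graph = build_graph(MET)

# Enumerate all possibilities...
all_paths = list(simple_paths(graph, "John", "Pam"))

# ...then keep only the ones that meet a criterion (here: at most two hops).
short_paths = [p for p in all_paths if len(p) - 1 <= 2]
```

Here `all_paths` holds both routes from John to Pam (directly through Sally, or through Sally and Anne), and the criterion keeps only the shorter one.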
  • 10. Time-based traversals Time is a hierarchical method of categorizing the linearity of global events: hours, minutes, seconds... Example: “Neo4j Meetup is at 6:00 PM on October 29th”
  • 11. Time Scale Event Meta Model Modeling events over time is easy in Neo4j. Let’s go over the GraphGist for the Time Scale Event Meta Model: http://gist.neo4j.org/?github-kbastani%2Fgists%2F%2Fmeta%2FTimeScaleEventMetaModel.adoc
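A rough way to picture time as a hierarchy is a tree whose levels are year, month, day and hour, with events hanging off the leaves. The sketch below uses nested Python dictionaries as a stand-in for graph nodes; it is not the model from the GraphGist linked above, and the level names, the helper functions, the year 2013 and the 9:00 start time for the second event are all assumptions made for illustration.

```python
from datetime import datetime
from typing import Dict, List, Tuple

Level = Tuple[str, int]  # e.g. ("Month", 10)


def time_path(moment: datetime) -> List[Level]:
    """Decompose a timestamp into a root-to-leaf path through the time tree."""
    return [
        ("Year", moment.year),
        ("Month", moment.month),
        ("Day", moment.day),
        ("Hour", moment.hour),
    ]


def attach_event(tree: Dict, moment: datetime, event: str) -> None:
    """Walk (and create) the time path, then attach the event at the hour node."""
    node = tree
    for level in time_path(moment):
        node = node.setdefault(level, {})
    node.setdefault("events", []).append(event)


def events_under(tree: Dict, prefix: List[Level]) -> List[str]:
    """Return every event below a partial time path, e.g. a whole month."""
    node = tree
    for level in prefix:
        node = node.get(level)
        if node is None:
            return []
    found: List[str] = []
    stack = [node]
    while stack:
        current = stack.pop()
        for key, value in current.items():
            if key == "events":
                found.extend(value)
            else:
                stack.append(value)
    return sorted(found)


tree: Dict = {}
# "Neo4j Meetup is at 6:00 PM on October 29th" (year assumed to be 2013).
attach_event(tree, datetime(2013, 10, 29, 18), "Neo4j Meetup")
# Hypothetical second event with an assumed start time.
attach_event(tree, datetime(2013, 11, 5, 9), "GraphConnect NY")

october_events = events_under(tree, [("Year", 2013), ("Month", 10)])
```

Asking for everything under a partial path, such as a month, is the dictionary equivalent of a time-based traversal: start at a coarse time node and follow the hierarchy down to the events.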
  • 16. Neo4j allows you to store information as a series of paths, and that is really valuable for giving a user options when it comes to search. It starts with something I call a “Search Cache”.
  • 17. Search Cache A search cache is a repository of all relevant paths condensed into a hierarchical data store. A hierarchical data store is like folder paths that model a storage collection as a linear path. (Dimensionality Reduction) An address is a hierarchy, revealing a path. ex. http://www.neo4j.com/download ex. > root\neo4j-community\bin\neo4j.sh Natural language path: > what is the matrix?
  • 19. Type Ahead / Autocomplete For search, it comes down to enumerating over all possibilities and then mapping those paths to an action. http://kbastani.github.io/predictive-autocomplete Never do real-time processing for natural language search (it is a hard problem, which means it will take time*)
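One way to read "enumerate all possibilities ahead of time" is to precompute, for every prefix a user might type, the phrases that complete it, so that serving a suggestion is a plain lookup. The sketch below is a minimal in-memory version of that idea; it is not the code behind the predictive-autocomplete project linked above, and the function names and sample phrases are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def build_prefix_index(phrases: Iterable[str]) -> Dict[str, List[str]]:
    """Precompute every prefix of every phrase (the offline, slow step)."""
    index: Dict[str, List[str]] = defaultdict(list)
    for phrase in sorted(phrases):
        normalized = phrase.lower()
        for end in range(1, len(normalized) + 1):
            index[normalized[:end]].append(phrase)
    return dict(index)


def suggest(index: Dict[str, List[str]], typed: str, limit: int = 5) -> List[str]:
    """Answer a keystroke with a dictionary lookup (the online, fast step)."""
    return index.get(typed.lower(), [])[:limit]


# Hypothetical phrases that an earlier enumeration step might have produced.
PHRASES = ["what is the matrix?", "what is neo4j?", "who is anne?"]

index = build_prefix_index(PHRASES)
what_suggestions = suggest(index, "what is ")
```

The index trades storage for latency: building it is the expensive part and happens before any user types, which is the point of avoiding real-time processing.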
  • 20. Distributed Caching Frameworks Take a distributed approach to building out your search cache. Use Neo4j to model your network, then enumerate over all possibilities as a query and add each possibility to a search cache. Distribute the load across a network of compute instances, as in MapReduce. In C# at http://kbastani.github.io/predictive-autocomplete
  • 21. How do I build a search cache? The best way to do this is using blob storage. I use Windows Azure, but you can use any data storage as long as it maps to a JSON file via an HTTP GET request. ex. HTTP GET ../natural/language/search/is/cool .. Working on an open source project using C#
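The mapping from a phrase to a hierarchical address can be sketched as follows. A plain dictionary stands in for blob storage here, so nothing touches the network; the `.json` suffix, the helper names and the placeholder payload are assumptions for illustration, not details from the talk.

```python
import json
import re
from typing import Any, Dict, Optional

# Stand-in for blob storage: path -> serialized JSON document.
# Assumption: a real deployment would serve these paths over HTTP GET.
BLOB_STORE: Dict[str, str] = {}


def phrase_to_path(phrase: str) -> str:
    """Turn a natural language phrase into a folder-like path, one word per level."""
    words = re.findall(r"[a-z0-9]+", phrase.lower())
    return "/".join(words)


def put_answer(phrase: str, answer: Any) -> None:
    """Store a JSON document at the path derived from the phrase."""
    BLOB_STORE[phrase_to_path(phrase) + ".json"] = json.dumps(answer)


def get_answer(phrase: str) -> Optional[Any]:
    """Look up the document for a phrase; None if the phrase was never cached."""
    blob = BLOB_STORE.get(phrase_to_path(phrase) + ".json")
    return json.loads(blob) if blob is not None else None


put_answer("What is the matrix?", {"result": "placeholder answer"})

cool_path = phrase_to_path("Natural language search is cool")
cached = get_answer("what is the MATRIX")
```

Because punctuation and case are normalized away before the path is built, different spellings of the same question resolve to the same cached document.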
  • 23. How to transform answers into questions? You have a bunch of answers already in natural language. Each language has a specific template that allows you to transform an answer into a question. “X is Y” -> “What is X?” If X is a person, then “Who is X?” Add “What is X?” to the search cache. Example: http://www.arktera.com/
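The “X is Y” template can be sketched with a regular expression. This is a deliberately naive version that only handles that one English pattern; the function name and the `people` set used to decide between “Who” and “What” are hypothetical, and in practice that set would have to come from your own data.

```python
import re
from typing import FrozenSet, Optional

# Matches "X is Y" with an optional trailing period; X is the shortest
# possible subject before the first " is ".
X_IS_Y = re.compile(r"^(?P<x>.+?) is (?P<y>.+?)\.?$")


def answer_to_question(
    answer: str, people: FrozenSet[str] = frozenset()
) -> Optional[str]:
    """Turn an "X is Y" answer into "What is X?" or, for a person, "Who is X?"."""
    match = X_IS_Y.match(answer.strip())
    if match is None:
        return None  # the answer does not fit the template
    subject = match.group("x")
    pronoun = "Who" if subject in people else "What"
    return f"{pronoun} is {subject}?"


# Assumption: the application knows which subjects are people.
PEOPLE = frozenset({"Anne"})

thing_question = answer_to_question("Neo4j is a graph database.", PEOPLE)
person_question = answer_to_question("Anne is a person Pam met", PEOPLE)
```

Each generated question would then be added to the search cache as a path pointing back at the answer it came from.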
  • 25. Neo4j Events http://www.graphconnect.com New York: November 5-6 London: November 18-19 http://www.graphconnect.com/videos Watch the videos! Very valuable insights from our community
  • 26. Neo4j Trainings Interested in Neo4j training? Talk to me after!
  • 27. Thanks! Follow me on Twitter! @kennybastani Connect with me on LinkedIn /in/kennybastani