SlideShare uma empresa Scribd logo
1 de 18
Baixar para ler offline
PERSONAL KNOWLEDGE GRAPHS
Krisztian Balog
University of Stavanger

@krisztianbalog
Dagstuhl Seminar on Conversa>onal Search | Dagstuhl, Germany, November 2019
KNOWLEDGE GRAPHS ARE POWERFUL ASSETS
FOR A BROAD RANGE OF SEARCH,
RECOMMENDATION, AND MINING SCENARIOS
What is the capital of
Norway?
What is its popula>on?
KNOWLEDGE GRAPHS ARE POWERFUL ASSETS
FOR A BROAD RANGE OF SEARCH,
RECOMMENDATION, AND MINING SCENARIOS
They tend to focus on prominent, globally important en@@es
This rules out many en@@es we interact with on a daily basis!
MOTIVATIONAL SCENARIO
I would like to get some new strings
for my guitar
AIOK, would that be your electric guitar or
the acoustic one?
The electric one.
AIAlright. I can repeat your Amazon order of
3 months ago, or you can go by a music
store on Elm street on the way to your
dentist appointment this afternoon.
MOTIVATIONAL SCENARIO #2
AISince you're running a half marathon at
Hackney in May, may I suggest you
undertake a 10k run this weekend?
Yes, that sounds like a good idea. Any
suggestions for a not too popular route 

that I haven't done before?
AISure thing. I'll upload some routes to the
running app on your phone.
Cheers mate!
IT IS DIFFICULT TO IMAGINE A TRULY
PERSONAL CONVERSATIONAL ASSISTANT
WITHOUT IT HAVING ACCESS TO
STRUCTURED PERSONAL INFORMATION
PERSONAL KNOWLEDGE GRAPHS
A personal knowledge graph (PKG) is
a resource of structured information
about entities that are of personal
interest to the user
Key differences from general KGs:
• Entities of personal interest to the user
• Distinctive shape (“spiderweb” layout)
• Links between a PKG and external
sources are inherent to its nature
ASSOCIATED PROBLEMS
• Knowledge representation
• Semantic annotation of text
• Population and maintenance
• Integration with external sources
KNOWLEDGE REPRESENTATION
Task: representing entities and their properties
• KGs are organized according to a knowledge model (schema)
Peculiarities/challenges
• Entities need to be (directly/indirectly) connected to the user
• Not all attributes have to be filled; the focus is on personal relevance
• Information about entities can be very sparse
• Some entities may not have any digital presence
• Relations can be highly temporal in nature
RQ1
How should en@@es and their proper@es and rela@ons be represented,
considering the vast but sparse set of possible predicates and their short-lived
nature?
SEMANTIC ANNOTATION OF TEXT
Task: annotating text with respect to a knowledge repository (commonly
known as entity linking)
• Usually involves mention detection, entity disambiguation, and NIL-
detection
Challenges
• Entities might have little to no digital presence
• Entities are not necessarily proper nouns
• Linking, NIL-detection, and KG population are intertwined
RQ2a How can en@ty linking be performed against a personal knowledge graph,
where structured en@ty informa@on to rely on is poten@ally absent?
When should en@ty linking be performed against a personal knowledge graph
as opposed to a general-purpose KG?
RQ2b
POPULATION AND MAINTENANCE
Task: extending a KG from external sources (KB acceleration/
population) or via internal inferencing
• Verification of facts in the KG
Challenges
• Single curator => More automation is desired than for KGs, but the
user should still be in control
• The first mention of an entity should trigger population
• Properties may be inferred from the context
RQ3 How can personal knowledge graphs be automa@cally populated and reliably
maintained?
INTEGRATION WITH EXTERNAL SOURCES
Task: recognizing the same entity across multiple data sources
• Also known as: object resolution, record linkage
Challenges
• One-to-many, as opposed to one-to-one linkage
• Continuous process, not a one-off effort
• Two-way synchronization would be desired
• Conflicting facts or relations need resolving by the user
RQ4 How should external knowledge sources con@nuously be integrated with, in a
two-way process, poten@ally involving the user?
THERE IS MORE…
Evaluation
• This would require an environment where users can interact with a
PKG
Implementation
• Storage (cloud vs. device), security, privacy, access control, ...
Utilization
• From specific apps (calendar, health/wellbeing app, etc.) to a truly
personal assistant
TAKING CONTROL OF OUR DATA
Present Alternative
service #1
service #2
service #3
service #1
service #2
service #3
Different service providers each have some (possibly
overlapping) portion of the person’s PKG
The person has full control over her PKG and may grant access
to different service providers to specific parts of the PKG
http://bit.ly/ictir2019-pkg

Mais conteúdo relacionado

Mais procurados

Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
Lei Guo
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
Liang Xiang
 
Zipline—Airbnb’s Declarative Feature Engineering Framework
Zipline—Airbnb’s Declarative Feature Engineering FrameworkZipline—Airbnb’s Declarative Feature Engineering Framework
Zipline—Airbnb’s Declarative Feature Engineering Framework
Databricks
 
When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...
When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...
When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...
confluent
 

Mais procurados (20)

Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it! Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it!
 
Spark + S3 + R3를 이용한 데이터 분석 시스템 만들기
Spark + S3 + R3를 이용한 데이터 분석 시스템 만들기Spark + S3 + R3를 이용한 데이터 분석 시스템 만들기
Spark + S3 + R3를 이용한 데이터 분석 시스템 만들기
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Ontogeny of Unit Tests in Test Driven Development
Ontogeny of Unit Tests in Test Driven DevelopmentOntogeny of Unit Tests in Test Driven Development
Ontogeny of Unit Tests in Test Driven Development
 
Shallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender SystemShallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender System
 
Context-aware Recommendation: A Quick View
Context-aware Recommendation: A Quick ViewContext-aware Recommendation: A Quick View
Context-aware Recommendation: A Quick View
 
The architecture of search engines in Booking.com
The architecture of search engines in Booking.comThe architecture of search engines in Booking.com
The architecture of search engines in Booking.com
 
How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019
 
Context Aware Recommendations at Netflix
Context Aware Recommendations at NetflixContext Aware Recommendations at Netflix
Context Aware Recommendations at Netflix
 
Zipline—Airbnb’s Declarative Feature Engineering Framework
Zipline—Airbnb’s Declarative Feature Engineering FrameworkZipline—Airbnb’s Declarative Feature Engineering Framework
Zipline—Airbnb’s Declarative Feature Engineering Framework
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing Recommendations
 
When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...
When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...
When Kafka Meets the Scaling and Reliability needs of World's Largest Retaile...
 
로그 기깔나게 잘 디자인하는 법
로그 기깔나게 잘 디자인하는 법로그 기깔나게 잘 디자인하는 법
로그 기깔나게 잘 디자인하는 법
 
데이터 기반 이커머스 개인화 추천 기획 | 마켓컬리 Market Kurly
데이터 기반 이커머스 개인화 추천 기획 | 마켓컬리 Market Kurly데이터 기반 이커머스 개인화 추천 기획 | 마켓컬리 Market Kurly
데이터 기반 이커머스 개인화 추천 기획 | 마켓컬리 Market Kurly
 
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
 
Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart
Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, FlipkartNear Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart
Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart
 
Virtual Flink Forward 2020: Netflix Data Mesh: Composable Data Processing - J...
Virtual Flink Forward 2020: Netflix Data Mesh: Composable Data Processing - J...Virtual Flink Forward 2020: Netflix Data Mesh: Composable Data Processing - J...
Virtual Flink Forward 2020: Netflix Data Mesh: Composable Data Processing - J...
 
Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017Deep Learning in Recommender Systems - RecSys Summer School 2017
Deep Learning in Recommender Systems - RecSys Summer School 2017
 

Semelhante a Personal Knowledge Graphs

Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Spark Summit
 
Authority of assertion in repository contributions to the PID graph / George ...
Authority of assertion in repository contributions to the PID graph / George ...Authority of assertion in repository contributions to the PID graph / George ...
Authority of assertion in repository contributions to the PID graph / George ...
CILIP MDG
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Databricks
 

Semelhante a Personal Knowledge Graphs (20)

On Entities and Evaluation
On Entities and EvaluationOn Entities and Evaluation
On Entities and Evaluation
 
Odsc 2019 entity_reputation_knowledge_graph
Odsc 2019 entity_reputation_knowledge_graphOdsc 2019 entity_reputation_knowledge_graph
Odsc 2019 entity_reputation_knowledge_graph
 
Serendipity
SerendipitySerendipity
Serendipity
 
Serendipity-neo4j
Serendipity-neo4jSerendipity-neo4j
Serendipity-neo4j
 
Applications for Social Networking Strategies in an Agency Context
Applications for Social Networking Strategies in an Agency ContextApplications for Social Networking Strategies in an Agency Context
Applications for Social Networking Strategies in an Agency Context
 
Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...Applications for Social Networking Strategies in an Agency Context: Exploitin...
Applications for Social Networking Strategies in an Agency Context: Exploitin...
 
Open IE tutorial 2018
Open IE tutorial 2018Open IE tutorial 2018
Open IE tutorial 2018
 
AI Beyond Deep Learning
AI Beyond Deep LearningAI Beyond Deep Learning
AI Beyond Deep Learning
 
SCRY @ ISWC'15, Diversity++
SCRY @ ISWC'15, Diversity++SCRY @ ISWC'15, Diversity++
SCRY @ ISWC'15, Diversity++
 
Resource Description Pres and Paper
Resource Description Pres and PaperResource Description Pres and Paper
Resource Description Pres and Paper
 
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
 
Cataloging Presentation
Cataloging PresentationCataloging Presentation
Cataloging Presentation
 
Authority of assertion in repository contributions to the PID graph / George ...
Authority of assertion in repository contributions to the PID graph / George ...Authority of assertion in repository contributions to the PID graph / George ...
Authority of assertion in repository contributions to the PID graph / George ...
 
An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4j
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
 
Ngsp
NgspNgsp
Ngsp
 
Human Machine Collaboration for Improved Analytical Processes
Human Machine Collaboration for Improved Analytical ProcessesHuman Machine Collaboration for Improved Analytical Processes
Human Machine Collaboration for Improved Analytical Processes
 
What Is GDS and Neo4j’s GDS Library
What Is GDS and Neo4j’s GDS LibraryWhat Is GDS and Neo4j’s GDS Library
What Is GDS and Neo4j’s GDS Library
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
 

Mais de krisztianbalog

What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?
krisztianbalog
 

Mais de krisztianbalog (18)

Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
Towards Filling the Gap in Conversational Search: From Passage Retrieval to C...
 
Conversational AI from an Information Retrieval Perspective: Remaining Challe...
Conversational AI from an Information Retrieval Perspective: Remaining Challe...Conversational AI from an Information Retrieval Perspective: Remaining Challe...
Conversational AI from an Information Retrieval Perspective: Remaining Challe...
 
What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?What Does Conversational Information Access Exactly Mean and How to Evaluate It?
What Does Conversational Information Access Exactly Mean and How to Evaluate It?
 
Entities for Augmented Intelligence
Entities for Augmented IntelligenceEntities for Augmented Intelligence
Entities for Augmented Intelligence
 
Table Retrieval and Generation
Table Retrieval and GenerationTable Retrieval and Generation
Table Retrieval and Generation
 
Entity Search: The Last Decade and the Next
Entity Search: The Last Decade and the NextEntity Search: The Last Decade and the Next
Entity Search: The Last Decade and the Next
 
Overview of the TREC 2016 Open Search track: Academic Search Edition
Overview of the TREC 2016 Open Search track: Academic Search EditionOverview of the TREC 2016 Open Search track: Academic Search Edition
Overview of the TREC 2016 Open Search track: Academic Search Edition
 
Overview of the Living Labs for IR Evaluation (LL4IR) CLEF Lab
Overview of the Living Labs for IR Evaluation (LL4IR) CLEF LabOverview of the Living Labs for IR Evaluation (LL4IR) CLEF Lab
Overview of the Living Labs for IR Evaluation (LL4IR) CLEF Lab
 
Entity Linking
Entity LinkingEntity Linking
Entity Linking
 
Evaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented SearchEvaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented Search
 
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
 
Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WSDM 2014 tutorial)
 
Time-aware Evaluation of Cumulative Citation Recommendation Systems
Time-aware Evaluation of Cumulative Citation Recommendation SystemsTime-aware Evaluation of Cumulative Citation Recommendation Systems
Time-aware Evaluation of Cumulative Citation Recommendation Systems
 
Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (SIGIR 2013 tutorial)Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (SIGIR 2013 tutorial)
 
Multi-step Classification Approaches to Cumulative Citation Recommendation
Multi-step Classification Approaches to Cumulative Citation RecommendationMulti-step Classification Approaches to Cumulative Citation Recommendation
Multi-step Classification Approaches to Cumulative Citation Recommendation
 
Entity Retrieval (WWW 2013 tutorial)
Entity Retrieval (WWW 2013 tutorial)Entity Retrieval (WWW 2013 tutorial)
Entity Retrieval (WWW 2013 tutorial)
 
Semistructured Data Seach
Semistructured Data SeachSemistructured Data Seach
Semistructured Data Seach
 
Collection Ranking and Selection for Federated Entity Search
Collection Ranking and Selection for Federated Entity SearchCollection Ranking and Selection for Federated Entity Search
Collection Ranking and Selection for Federated Entity Search
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

Personal Knowledge Graphs

  • 1. PERSONAL KNOWLEDGE GRAPHS Krisztian Balog University of Stavanger
 @krisztianbalog Dagstuhl Seminar on Conversa>onal Search | Dagstuhl, Germany, November 2019
  • 2. KNOWLEDGE GRAPHS ARE POWERFUL ASSETS FOR A BROAD RANGE OF SEARCH, RECOMMENDATION, AND MINING SCENARIOS
  • 3.
  • 4. What is the capital of Norway?
  • 5. What is its popula>on?
  • 6. KNOWLEDGE GRAPHS ARE POWERFUL ASSETS FOR A BROAD RANGE OF SEARCH, RECOMMENDATION, AND MINING SCENARIOS They tend to focus on prominent, globally important en@@es This rules out many en@@es we interact with on a daily basis!
  • 7. MOTIVATIONAL SCENARIO I would like to get some new strings for my guitar AIOK, would that be your electric guitar or the acoustic one? The electric one. AIAlright. I can repeat your Amazon order of 3 months ago, or you can go by a music store on Elm street on the way to your dentist appointment this afternoon.
  • 8. MOTIVATIONAL SCENARIO #2 AISince you're running a half marathon at Hackney in May, may I suggest you undertake a 10k run this weekend? Yes, that sounds like a good idea. Any suggestions for a not too popular route 
 that I haven't done before? AISure thing. I'll upload some routes to the running app on your phone. Cheers mate!
  • 9. IT IS DIFFICULT TO IMAGINE A TRULY PERSONAL CONVERSATIONAL ASSISTANT WITHOUT IT HAVING ACCESS TO STRUCTURED PERSONAL INFORMATION
  • 10. PERSONAL KNOWLEDGE GRAPHS A personal knowledge graph (PKG) is a resource of structured information about entities that are of personal interest to the user Key differences from general KGs: • Entities of personal interest to the user • Distinctive shape (“spiderweb” layout) • Links between a PKG and external sources are inherent to its nature
  • 11. ASSOCIATED PROBLEMS • Knowledge representation • Semantic annotation of text • Population and maintenance • Integration with external sources
  • 12. KNOWLEDGE REPRESENTATION Task: representing entities and their properties • KGs are organized according to a knowledge model (schema) Peculiarities/challenges • Entities need to be (directly/indirectly) connected to the user • Not all attributes have to be filled; the focus is on personal relevance • Information about entities can be very sparse • Some entities may not have any digital presence • Relations can be highly temporal in nature RQ1 How should en@@es and their proper@es and rela@ons be represented, considering the vast but sparse set of possible predicates and their short-lived nature?
  • 13. SEMANTIC ANNOTATION OF TEXT Task: annotating text with respect to a knowledge repository (commonly known as entity linking) • Usually involves mention detection, entity disambiguation, and NIL- detection Challenges • Entities might have little to no digital presence • Entities are not necessarily proper nouns • Linking, NIL-detection, and KG population are intertwined RQ2a How can en@ty linking be performed against a personal knowledge graph, where structured en@ty informa@on to rely on is poten@ally absent? When should en@ty linking be performed against a personal knowledge graph as opposed to a general-purpose KG? RQ2b
  • 14. POPULATION AND MAINTENANCE Task: extending a KG from external sources (KB acceleration/ population) or via internal inferencing • Verification of facts in the KG Challenges • Single curator => More automation is desired than for KGs, but the user should still be in control • The first mention of an entity should trigger population • Properties may be inferred from the context RQ3 How can personal knowledge graphs be automa@cally populated and reliably maintained?
  • 15. INTEGRATION WITH EXTERNAL SOURCES Task: recognizing the same entity across multiple data sources • Also known as: object resolution, record linkage Challenges • One-to-many, as opposed to one-to-one linkage • Continuous process, not a one-off effort • Two-way synchronization would be desired • Conflicting facts or relations need resolving by the user RQ4 How should external knowledge sources con@nuously be integrated with, in a two-way process, poten@ally involving the user?
  • 16. THERE IS MORE… Evaluation • This would require an environment where users can interact with a PKG Implementation • Storage (cloud vs. device), security, privacy, access control, ... Utilization • From specific apps (calendar, health/wellbeing app, etc.) to a truly personal assistant
  • 17. TAKING CONTROL OF OUR DATA Present Alternative service #1 service #2 service #3 service #1 service #2 service #3 Different service providers each have some (possibly overlapping) portion of the person’s PKG The person has full control over her PKG and may grant access to different service providers to specific parts of the PKG