SlideShare uma empresa Scribd logo
1 de 38
Baixar para ler offline
Julien Plu
julien.plu@eurecom.fr
@julienplu
Can Deep Learning Techniques
Improve Entity Linking?
Me, Myself and I
§ Master in artificial intelligence from UM2 in 2012
§ Research engineer at Orange for 2 years
§ PhD Student at EURECOM since July 2014
§ Lead the Semantic Web section at Developpez.com
§ Co-author of the book: Web de données Méthodes et
outils pour les données liées
§ Areas of expertise: Semantic Web, Natural
Language Processing and Machine Learning
2016/06/21 - Invited talk to the LIRMM - Montpellier - 2
Use Case: Bringing Context to Documents
NEWSWIRES
TWEETS
SEARCH
QUERIES
SUBTITLES
2016/06/21 - Invited talk to the LIRMM - Montpellier - 3
Example: Recognize and link entities in Tweets
https://www.youtube.com/watch?v=Rmug-PUyIzI
2016/06/21 - Invited talk to the LIRMM - Montpellier - 4
Example: Recognize and link entities in Tweets
https://www.youtube.com/watch?v=Rmug-PUyIzI
2016/06/21 - Invited talk to the LIRMM - Montpellier - 5
Part-of-Speech Tagging on Tweets
Tampa NNP
Bay NNP
Lightning NNP
vs CC
Canadiens NNP
in IN
Montreal NNP
tonight NN
with IN
@erikmannens USR
#hockey HT
#NHL HT
https://gate.ac.uk/wiki/twitter-postagger.html
(N)ER: What is NHL?
(N)EL: Which Montreal
are we talking about?
2016/06/21 - Invited talk to the LIRMM - Montpellier - 6
What is NHL? Type Ambiguity
ORGANIZATION
PLACE
RAILWAY LINE
2016/06/21 - Invited talk to the LIRMM - Montpellier - 7
(Named) Entity Recognition
Tampa NNP ORG
Bay NNP ORG
Lightning NNP ORG
vs CC O
Canadiens NNP ORG
in IN O
Montreal NNP LOC
tonight NN O
with IN O
@erikmannens USR PER
#hockey HT THG
#NHL HT ORG
2016/06/21 - Invited talk to the LIRMM - Montpellier - 8
What is Montreal? Name Ambiguity
Montréal, Ardèche Montréal, Aude Montréal, Gers
Montréal,
Québec
Montreal,
Wisconsin
2016/06/21 - Invited talk to the LIRMM - Montpellier - 9
Popular Knowledge Bases
2016/06/21 - Invited talk to the LIRMM - Montpellier - 10
(Named) Entity Linking
Tampa NNP ORG http://dbpedia.org/resource/Tampa_Bay_Lightning
Bay NNP ORG http://dbpedia.org/resource/Tampa_Bay_Lightning
Lightning NNP ORG http://dbpedia.org/resource/Tampa_Bay_Lightning
vs CC O
Canadiens NNP ORG http://dbpedia.org/resource/Canadiens
in IN O
Montreal NNP LOC http://dbpedia.org/resource/Montreal
tonight NN O
with IN O
@erikmannens USR PER NIL
#hockey HT THG http://dbpedia.org/resource/Hockey
#NHL HT ORG http://dbpedia.org/resource/National_Hockey_League
2016/06/21 - Invited talk to the LIRMM - Montpellier - 11
Test with Babelfy, TagMe, Spotlight, AIDA and ADEL
§ http://babelfy.org/
§ https://tagme.d4science.org/tagme/
§ https://dbpedia-spotlight.github.io/demo/
§ https://gate.d5.mpi-inf.mpg.de/webaida/
2016/06/21 - Invited talk to the LIRMM - Montpellier - 12
Different Approaches
E2E approaches:
A dictionary of mentions
and links is built from a
referent KB. A text is split in
n-grams that are used to
look up candidate links
from the dictionary. A
selection function is used
to pick up the best match
Linguistic-based
approaches:
A text is parsed by a NER
classifier. Entity mentions
are used to look up
resources in a referent KB.
A ranking function is used
to select the best match
ADEL is a combination of both to
make a hybrid approach
2016/06/21 - Invited talk to the LIRMM - Montpellier - 13
ADEL from 30,000 foots
ADEL
Entity Extraction
Entity Linking Index
2016/06/21 - Invited talk to the LIRMM - Montpellier - 14
§ POS Tagger:
Ø bidirectional
CMM (left to right and
right to left)
§ NER Combiner:
Ø Use a combination of CRF with Gibbs sampling (Monte Carlo as graph inference method)
models. A simple CRF model could be:
PER PER PERO OOO
X X X X XX XXXX
X set of features for the current word: word capitalized, previous word is “de”, next word is a
NNP, … Suppose P(PER | X, PER, O, LOC) = P(PER | X, neighbors(PER)) then X with PER is a CRF
Jimmy Page , knowing the professionalism of John Paul Jones
Entity Extraction: Extractors Module
PER PERO
2016/06/21 - Invited talk to the LIRMM - Montpellier - 15
CRF Models Combination in details
§ Apply multiple CRF models over the same piece of
text
§ Merge the results into one single output
2016/06/21 - Invited talk to the LIRMM - Montpellier - 16
Entity Extraction: Overlap Resolution
§ Detect overlaps among
boundaries of entities
coming from the
extractors
§ Different heuristics can be applied:
Ø Merge: (“United States” and “States of America” => “United States of
America”) default behavior
Ø Simple Substring: (“Florence” and “Florence May Harding” => ”Florence”
and “May Harding”)
Ø Smart Substring: (”Giants of New York” and “New York” => “Giants” and
“New York”)
2016/06/21 - Invited talk to the LIRMM - Montpellier - 17
Index: Indexing
§ Use DBpedia and Wikipedia
as knowledge bases
§ Integrate external data such
as PageRank scores from
Hasso Platner Institute
§ Backend sytem with
Elasticsearch and Couchbase
§ Turn DBpedia and Wikipedia
into a CSV-based generic
format
2016/06/21 - Invited talk to the LIRMM - Montpellier - 18
Entity Linking: Linking tasks
§ Generate candidate links for all
extracted mentions:
Ø If any, they go to the linking method
Ø If not, they are linked to NIL via NIL
Clustering module
§ Linking method:
Ø Filter out candidates that have different
types than the one given by NER
Ø ADEL linear formula:
𝑟 𝑙 = 𝑎. 𝐿 𝑚, 𝑡𝑖𝑡𝑙𝑒 + 𝑏. max 𝐿 𝑚, 𝑅 + 𝑐. max 𝐿 𝑚, 𝐷 . 𝑃𝑅(𝑙)
r(l): the score of the candidate l
L: the Levenshtein distance
m:	the extracted mention
title: the title of the candidate l
R: the set of redirect pages associated to the candidate l
D: the set of disambiguation pages associated to the candidate l
PR: Pagerank associated to the candidate l	
a,	b	and c are weights
following the properties:
a	>	b	>	c	 and a	+	b	+	c	=	1
2016/06/21 - Invited talk to the LIRMM - Montpellier - 19
Results
§ ADEL over OKE2015
§ ADEL over OKE2016
§ ADEL over NEEL2016
Precision Recall F-measure
extraction 85.1 89.7 87.3
recognition 75.3 59 66.2
linking 85.4 42.7 57
Precision Recall F-measure
extraction 81.5 72.4 76.6
recognition 74.8 66.5 70.4
linking 52.8 45.8 49.1
Precision Recall F-measure
extraction 80.6 91.0 85.5
recognition 57.5 64.9 61.0
linking 49.9 58.3 53.8
2016/06/21 - Invited talk to the LIRMM - Montpellier - 20
Issues with current methods
§ Supervised methods
Ø Efficient but needs a training set for every dataset
Ø Not robust enough if the type of text or entities change
Ø Mostly associated with an E2E approach
Ø Inappropriate to detect NIL entities
§ Unsupervised methods
Ø Difficult to compute the relatedness among the candidates of
each entity
Ø Graph-based of linear formula are sometimes long to compute
Ø Difficult to manipulate emerging entities in case of graph-based
approach
2016/06/21 - Invited talk to the LIRMM - Montpellier - 21
Deep Learning for Textual Content
http://y2u.be/cJIILew6l28
https://youtu.be/mp6UsuRteNw?t=1h17m50s
2016/06/21 - Invited talk to the LIRMM - Montpellier - 22
From Machine Learning to Deep Learning: Logistic
Classifier
§ Logistic Classifier => Linear Classifier
W . X + b = Y
weights bias
trained
input data scores
(logits)
Soft
max
S(Y)
probabilities
D(S(Y),L)
L
Cross-Entropy
Evaluate how much the model
deviates from the GS
1-Hot
Labels
Multinomial Logistic Classification
2016/06/21 - Invited talk to the LIRMM - Montpellier - 23
From Machine Learning to Deep Learning: Deep
Neural Network
§ Neural Network => Non Linear Classifier
X x W +b Y S(Y) L
RELU
H
X W +b X W +b Y S(Y) LX
RELU: Rectified Linear Units (Activation Function)
Hidden Layers
2016/06/21 - Invited talk to the LIRMM - Montpellier - 24
Why Understanding Language is Difficult?
§ Human language has great variability
Ø Similar concepts are expressed in different ways, (e.g. kitty vs
cat)
§ Human language has great ambiguity
Ø Similar expressions mean different concepts, (e.g. New York
vs New York Times)
§ The meaning of text is usually vague and latent
Ø No clear supervision signal to learn from
§ Learning semantic meaning of texts is a key
challenge in NLP
2016/06/21 - Invited talk to the LIRMM - Montpellier - 25
Word Embeddings
§ Find a way to represent and measure how much two
different words have same/similar meaning
§ Need a huge amount of labelled data then better
using an unsupervised approach
cat kitty
?
2016/06/21 - Invited talk to the LIRMM - Montpellier - 26
Word Embeddings
The purrs
This hunts mice
Cat-like behaving
the same way
§ Context gives a
good idea that
words are
similar
§ Goal is to predict
words context in
order to treat
cat-like words
similarly
2016/06/21 - Invited talk to the LIRMM - Montpellier - 27
Word Embeddings
kittycat
dog
car
lion
pet
tiger
§ Map words to small
vectors (embeddings)
§ Embeddings are
close to each other in
the words space
when they have
similar meaning
2016/06/21 - Invited talk to the LIRMM - Montpellier - 28
Word2Vec
§ Developed by Google in 2013
§ Produce word embeddings
§ Takes a large corpus of text as input and produce a
vector space as output
§ Word vectors are positioned in the vector space
such that words that share common contexts in the
corpus are located in close proximity to one another
in the vector space
§ The goal is to provide semantically similar words to a
given word
2016/06/21 - Invited talk to the LIRMM - Montpellier - 29
Word2Vec: How it works?
The quick brown fox jumps over the lazy dog
window
Vfox
Logistic classifiers
§ Map every word to an embedding
§ Use a window around a selected word
§ Use the embedding of the selected word to predict
the context of the word
2016/06/21 - Invited talk to the LIRMM - Montpellier - 30
Word2Vec: How it works?
§ Measure the closeness of two word embeddings
with cosine similarity is better than with L2 because
the length is not relevant for the classification
§ Normalize all embeddings to get them in unit norm
form
cat
kittyL2
Vkitty
Vcat
cosine
𝑉GHI 	−	 𝑉KLIIM N
N
𝑉GHI. 𝑉KLIIM
𝑉GHI . 𝑉KLIIM
L2 =
cosine =
2016/06/21 - Invited talk to the LIRMM - Montpellier - 31
Word2Vec schema
§ Compares a target from the context of the input word
§ Compute softmax over a huge vocabulary vector can be very
inefficient
§ To solve this issue, use sampled softmax
cat W Vcat + b
Vcat
.
.
0
0
0
1
0
0
.
.
r
a
n
d
o
m
s
a
m
p
l
e
cross
entropy
softmax
logistic classifier
purr
2016/06/21 - Invited talk to the LIRMM - Montpellier - 32
DSSM: Deep Structured Semantic Model
§ Developed by Microsoft in 2013
§ Compute similarity between vectors
§ Generic enough for being applied to many more
cases than what Word2Vec can do (Web search,
ads, question answering, machine translation, word
embeddings…)
§ Training made with backpropagation
§ The layers can be either: DNN, CNN or RNN
2016/06/21 - Invited talk to the LIRMM - Montpellier - 33
DSSM schema with CNN
DSSM
sim(X,Y)
Word sequence xt
Relevance measured by
cosine similarity
Learning: maximize
the similarity
between X (source)
and Y (target)
128 128
X Y
w1,w2,…,wTX w1,w2,…,wTY
2016/06/21 - Invited talk to the LIRMM - Montpellier - 34
DSSM schema with CNN
X Y
128 128
sim(X,Y)
w1,w2,…,wTX w1,w2,…,wTY
Relevance measured by
cosine similarity
Learning: maximize
the similarity
between X (source)
and Y (target)
f(.) g(.)
Representation: use
DNN to extract
abstract semantic
representations
Word sequence xt
2016/06/21 - Invited talk to the LIRMM - Montpellier - 35
DSSM schema with CNN
sim(X,Y)
Word sequence xt
Relevance measured by
cosine similarity
Learning: maximize
the similarity
between X (source)
and Y (target)
Representation: use
DNN to extract
abstract semantic
representations
128 128
X Y
w1,w2,…,wTX w1,w2,…,wTY
Word hashing layer ft
Convolutional layer ct
Max pooling layer v
Semantic layer h
Convolutional and
max pooling layer:
identify keywords
(concepts) in X and Y
Word hashing: use
letter-trigram as raw
input to handle very
large vocabulary
2016/06/21 - Invited talk to the LIRMM - Montpellier - 36
Conclusion
§ Current methods for entity linking do not exploit
enough semantics
§ Deep Learning technics might be used to better take
into account the semantic
§ Using Word2Vec to rank the entity candidates from
the most semantically similar to the less one
§ Using DSSM to measure the relatedness between
the candidates of each extracted mention
2016/06/21 - Invited talk to the LIRMM - Montpellier - 37
Questions?
Thank you for listening!
http://multimediasemantics.github.io/adel
http://jplu.github.io
julien.plu@eurecom.fr
@julienplu
http://www.slideshare.net/julienplu
2016/06/21 - Invited talk to the LIRMM - Montpellier - 38

Mais conteúdo relacionado

Destaque

Web sémantique : données liées et sémantique des schémas
Web sémantique : données liées et sémantique des schémasWeb sémantique : données liées et sémantique des schémas
Web sémantique : données liées et sémantique des schémasFabien Gandon
 
Le Web sémantique ? Kézako ?!!
Le Web sémantique ? Kézako ?!! Le Web sémantique ? Kézako ?!!
Le Web sémantique ? Kézako ?!! Julien PLU
 
Referendum Oosterweelverbinding - Doe mee!
Referendum Oosterweelverbinding - Doe mee!Referendum Oosterweelverbinding - Doe mee!
Referendum Oosterweelverbinding - Doe mee!Stedelijke Jeugdraad
 
Quick overview of the use of linked data in sociallearn
Quick overview of the use of linked data in sociallearnQuick overview of the use of linked data in sociallearn
Quick overview of the use of linked data in sociallearnMathieu d'Aquin
 
Les technologies du Web appliquées aux données structurées (2ème partie : Rel...
Les technologies du Web appliquées aux données structurées (2ème partie : Rel...Les technologies du Web appliquées aux données structurées (2ème partie : Rel...
Les technologies du Web appliquées aux données structurées (2ème partie : Rel...Gautier Poupeau
 
Deep Learning vs Multidimensional Classification in Human-Guided Text Mining
Deep Learning vs Multidimensional Classification in Human-Guided Text MiningDeep Learning vs Multidimensional Classification in Human-Guided Text Mining
Deep Learning vs Multidimensional Classification in Human-Guided Text MiningTokyo University of Science
 
SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?
SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?
SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?Antidot
 
Representing Texts as contextualized Entity Centric Linked Data Graphs
Representing Texts as contextualized Entity Centric Linked Data GraphsRepresenting Texts as contextualized Entity Centric Linked Data Graphs
Representing Texts as contextualized Entity Centric Linked Data GraphsAndre Freitas
 
WiSS Challenge - Day 2
WiSS Challenge - Day 2WiSS Challenge - Day 2
WiSS Challenge - Day 2Andre Freitas
 
On the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study
On the Semantic Mapping of Schema-agnostic Queries: A Preliminary StudyOn the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study
On the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study Andre Freitas
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataAndre Freitas
 
Schema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web ChallengeSchema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web ChallengeAndre Freitas
 
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...Andre Freitas
 
Disambiguating Polysemous Queries For Document Retrieval
Disambiguating Polysemous Queries For Document RetrievalDisambiguating Polysemous Queries For Document Retrieval
Disambiguating Polysemous Queries For Document RetrievalMadhusudan Daad
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataAndre Freitas
 
Comparative study of different ranking algorithms adopted by search engine
Comparative study of  different ranking algorithms adopted by search engineComparative study of  different ranking algorithms adopted by search engine
Comparative study of different ranking algorithms adopted by search engineEchelon Institute of Technology
 
Semantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked DataSemantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked DataSaeedeh Shekarpour
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLPGVS Chaitanya
 
Ontology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific LiteratureOntology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific LiteratureeXascale Infolab
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementAndre Freitas
 

Destaque (20)

Web sémantique : données liées et sémantique des schémas
Web sémantique : données liées et sémantique des schémasWeb sémantique : données liées et sémantique des schémas
Web sémantique : données liées et sémantique des schémas
 
Le Web sémantique ? Kézako ?!!
Le Web sémantique ? Kézako ?!! Le Web sémantique ? Kézako ?!!
Le Web sémantique ? Kézako ?!!
 
Referendum Oosterweelverbinding - Doe mee!
Referendum Oosterweelverbinding - Doe mee!Referendum Oosterweelverbinding - Doe mee!
Referendum Oosterweelverbinding - Doe mee!
 
Quick overview of the use of linked data in sociallearn
Quick overview of the use of linked data in sociallearnQuick overview of the use of linked data in sociallearn
Quick overview of the use of linked data in sociallearn
 
Les technologies du Web appliquées aux données structurées (2ème partie : Rel...
Les technologies du Web appliquées aux données structurées (2ème partie : Rel...Les technologies du Web appliquées aux données structurées (2ème partie : Rel...
Les technologies du Web appliquées aux données structurées (2ème partie : Rel...
 
Deep Learning vs Multidimensional Classification in Human-Guided Text Mining
Deep Learning vs Multidimensional Classification in Human-Guided Text MiningDeep Learning vs Multidimensional Classification in Human-Guided Text Mining
Deep Learning vs Multidimensional Classification in Human-Guided Text Mining
 
SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?
SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?
SPARQL, comment illuminer vos mashups en consommant les données du Linked Data ?
 
Representing Texts as contextualized Entity Centric Linked Data Graphs
Representing Texts as contextualized Entity Centric Linked Data GraphsRepresenting Texts as contextualized Entity Centric Linked Data Graphs
Representing Texts as contextualized Entity Centric Linked Data Graphs
 
WiSS Challenge - Day 2
WiSS Challenge - Day 2WiSS Challenge - Day 2
WiSS Challenge - Day 2
 
On the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study
On the Semantic Mapping of Schema-agnostic Queries: A Preliminary StudyOn the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study
On the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked Data
 
Schema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web ChallengeSchema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
Schema-Agnostic Queries (SAQ-2015): Semantic Web Challenge
 
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributio...
 
Disambiguating Polysemous Queries For Document Retrieval
Disambiguating Polysemous Queries For Document RetrievalDisambiguating Polysemous Queries For Document Retrieval
Disambiguating Polysemous Queries For Document Retrieval
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big data
 
Comparative study of different ranking algorithms adopted by search engine
Comparative study of  different ranking algorithms adopted by search engineComparative study of  different ranking algorithms adopted by search engine
Comparative study of different ranking algorithms adopted by search engine
 
Semantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked DataSemantic Interpretation of User Query for Question Answering on Interlinked Data
Semantic Interpretation of User Query for Question Answering on Interlinked Data
 
Open domain Question Answering System - Research project in NLP
Open domain  Question Answering System - Research project in NLPOpen domain  Question Answering System - Research project in NLP
Open domain Question Answering System - Research project in NLP
 
Ontology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific LiteratureOntology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific Literature
 
Semantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and RefinementSemantic Relation Classification: Task Formalisation and Refinement
Semantic Relation Classification: Task Formalisation and Refinement
 

Semelhante a Can Deep Learning Techniques Improve Entity Linking?

Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Webhala Skaf
 
Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...Shenghui Wang
 
Context Determines Content - An Approach to Resource Recommendation in Folkso...
Context Determines Content - An Approach to Resource Recommendation in Folkso...Context Determines Content - An Approach to Resource Recommendation in Folkso...
Context Determines Content - An Approach to Resource Recommendation in Folkso...Mojisola Erdt née Anjorin
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word ExpressionsLifeng (Aaron) Han
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking Mohamed BEN ELLEFI
 
PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008
PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008
PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008Yishay Mor
 
Context Semantic Analysis: a knowledge-based technique for computing inter-do...
Context Semantic Analysis: a knowledge-based technique for computing inter-do...Context Semantic Analysis: a knowledge-based technique for computing inter-do...
Context Semantic Analysis: a knowledge-based technique for computing inter-do...Fabio Benedetti
 
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.PhiloWeb
 
Modeling for Sustainability
Modeling for SustainabilityModeling for Sustainability
Modeling for SustainabilityBenoit Combemale
 
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD CloudAnalyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD CloudMOVING Project
 
Platforms and the Semantic Web
Platforms and the Semantic WebPlatforms and the Semantic Web
Platforms and the Semantic WebDanny Ayers
 
Machine Learning & Embeddings for Large Knowledge Graphs
Machine Learning & Embeddings  for Large Knowledge GraphsMachine Learning & Embeddings  for Large Knowledge Graphs
Machine Learning & Embeddings for Large Knowledge GraphsHeiko Paulheim
 
Publishing Linked Open Data on the Web & the Role of Ontologies
Publishing Linked Open Data on the Web & the Role of OntologiesPublishing Linked Open Data on the Web & the Role of Ontologies
Publishing Linked Open Data on the Web & the Role of OntologiesMaría Poveda Villalón
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfHeiko Paulheim
 
Mathematical Modeling for Practical Problems
Mathematical Modeling for Practical ProblemsMathematical Modeling for Practical Problems
Mathematical Modeling for Practical ProblemsLiwei Ren任力偉
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationEnrico Palumbo
 

Semelhante a Can Deep Learning Techniques Improve Entity Linking? (20)

Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Web
 
Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...Ariadne's Thread -- Exploring a world of networked information built from fre...
Ariadne's Thread -- Exploring a world of networked information built from fre...
 
Rs web context_content__v4.0__20120908_ma
Rs web context_content__v4.0__20120908_maRs web context_content__v4.0__20120908_ma
Rs web context_content__v4.0__20120908_ma
 
Context Determines Content - An Approach to Resource Recommendation in Folkso...
Context Determines Content - An Approach to Resource Recommendation in Folkso...Context Determines Content - An Approach to Resource Recommendation in Folkso...
Context Determines Content - An Approach to Resource Recommendation in Folkso...
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
 
PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008
PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008
PLaNet talk @ LKL Knowledge Seminar, 30 Jan, 2008
 
Context Semantic Analysis: a knowledge-based technique for computing inter-do...
Context Semantic Analysis: a knowledge-based technique for computing inter-do...Context Semantic Analysis: a knowledge-based technique for computing inter-do...
Context Semantic Analysis: a knowledge-based technique for computing inter-do...
 
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
Freddy Limpens: From folksonomies to ontologies: a socio-technical solution.
 
Digital repertoires of poetry metrics: towards a Linked Open Data ecosystem
Digital repertoires of poetry metrics: towards a Linked Open Data ecosystemDigital repertoires of poetry metrics: towards a Linked Open Data ecosystem
Digital repertoires of poetry metrics: towards a Linked Open Data ecosystem
 
Modeling for Sustainability
Modeling for SustainabilityModeling for Sustainability
Modeling for Sustainability
 
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD CloudAnalyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud
 
Platforms and the Semantic Web
Platforms and the Semantic WebPlatforms and the Semantic Web
Platforms and the Semantic Web
 
OpenFOAM Training v5-1-en
OpenFOAM Training v5-1-enOpenFOAM Training v5-1-en
OpenFOAM Training v5-1-en
 
Machine Learning & Embeddings for Large Knowledge Graphs
Machine Learning & Embeddings  for Large Knowledge GraphsMachine Learning & Embeddings  for Large Knowledge Graphs
Machine Learning & Embeddings for Large Knowledge Graphs
 
Publishing Linked Open Data on the Web & the Role of Ontologies
Publishing Linked Open Data on the Web & the Role of OntologiesPublishing Linked Open Data on the Web & the Role of Ontologies
Publishing Linked Open Data on the Web & the Role of Ontologies
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
 
Mathematical Modeling for Practical Problems
Mathematical Modeling for Practical ProblemsMathematical Modeling for Practical Problems
Mathematical Modeling for Practical Problems
 
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item RecommendationAn Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
 

Último

Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLionel Briand
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...OnePlan Solutions
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorTier1 app
 
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdfPros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdfkalichargn70th171
 
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptxThe Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptxRTS corp
 
[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf
[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf
[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdfSteve Caron
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingShane Coughlan
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptxVinzoCenzo
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonApplitools
 
Advantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptxAdvantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptxRTS corp
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?Alexandre Beguel
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolsosttopstonverter
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesVictoriaMetrics
 
Zer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfZer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfmaor17
 
Mastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptxMastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptxAS Design & AST.
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...OnePlan Solutions
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecturerahul_net
 

Último (20)

Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryError
 
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdfPros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
 
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptxThe Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
 
[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf
[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf
[ CNCF Q1 2024 ] Intro to Continuous Profiling and Grafana Pyroscope.pdf
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptx
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
 
Advantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptxAdvantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptx
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration tools
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 Updates
 
Zer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfZer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdf
 
Mastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptxMastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptx
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecture
 

Can Deep Learning Techniques Improve Entity Linking?

  • 1. Julien Plu julien.plu@eurecom.fr @julienplu Can Deep Learning Techniques Improve Entity Linking?
  • 2. Me, Myself and I § Master in artificial intelligence from UM2 in 2012 § Research engineer at Orange for 2 years § PhD Student at EURECOM since July 2014 § Lead the Semantic Web section at Developpez.com § Co-author of the book: Web de données Méthodes et outils pour les données liées § Areas of expertise: Semantic Web, Natural Language Processing and Machine Learning 2016/06/21 - Invited talk to the LIRMM - Montpellier - 2
  • 3. Use Case: Bringing Context to Documents NEWSWIRES TWEETS SEARCH QUERIES SUBTITLES 2016/06/21 - Invited talk to the LIRMM - Montpellier - 3
  • 4. Example: Recognize and link entities in Tweets https://www.youtube.com/watch?v=Rmug-PUyIzI 2016/06/21 - Invited talk to the LIRMM - Montpellier - 4
  • 5. Example: Recognize and link entities in Tweets https://www.youtube.com/watch?v=Rmug-PUyIzI 2016/06/21 - Invited talk to the LIRMM - Montpellier - 5
  • 6. Part-of-Speech Tagging on Tweets Tampa NNP Bay NNP Lightning NNP vs CC Canadiens NNP in IN Montreal NNP tonight NN with IN @erikmannens USR #hockey HT #NHL HT https://gate.ac.uk/wiki/twitter-postagger.html (N)ER: What is NHL? (N)EL: Which Montreal are we talking about? 2016/06/21 - Invited talk to the LIRMM - Montpellier - 6
  • 7. What is NHL? Type Ambiguity ORGANIZATION PLACE RAILWAY LINE 2016/06/21 - Invited talk to the LIRMM - Montpellier - 7
  • 8. (Named) Entity Recognition Tampa NNP ORG Bay NNP ORG Lightning NNP ORG vs CC O Canadiens NNP ORG in IN O Montreal NNP LOC tonight NN O with IN O @erikmannens USR PER #hockey HT THG #NHL HT ORG 2016/06/21 - Invited talk to the LIRMM - Montpellier - 8
  • 9. What is Montreal? Name Ambiguity Montréal, Ardèche Montréal, Aude Montréal, Gers Montréal, Québec Montreal, Wisconsin 2016/06/21 - Invited talk to the LIRMM - Montpellier - 9
  • 10. Popular Knowledge Bases 2016/06/21 - Invited talk to the LIRMM - Montpellier - 10
  • 11. (Named) Entity Linking Tampa NNP ORG http://dbpedia.org/resource/Tampa_Bay_Lightning Bay NNP ORG http://dbpedia.org/resource/Tampa_Bay_Lightning Lightning NNP ORG http://dbpedia.org/resource/Tampa_Bay_Lightning vs CC O Canadiens NNP ORG http://dbpedia.org/resource/Canadiens in IN O Montreal NNP LOC http://dbpedia.org/resource/Montreal tonight NN O with IN O @erikmannens USR PER NIL #hockey HT THG http://dbpedia.org/resource/Hockey #NHL HT ORG http://dbpedia.org/resource/National_Hockey_League 2016/06/21 - Invited talk to the LIRMM - Montpellier - 11
  • 12. Test with Babelfy, TagMe, Spotlight, AIDA and ADEL § http://babelfy.org/ § https://tagme.d4science.org/tagme/ § https://dbpedia-spotlight.github.io/demo/ § https://gate.d5.mpi-inf.mpg.de/webaida/ 2016/06/21 - Invited talk to the LIRMM - Montpellier - 12
  • 13. Different Approaches E2E approaches: A dictionary of mentions and links is built from a referent KB. A text is split in n-grams that are used to look up candidate links from the dictionary. A selection function is used to pick up the best match Linguistic-based approaches: A text is parsed by a NER classifier. Entity mentions are used to look up resources in a referent KB. A ranking function is used to select the best match ADEL is a combination of both to make a hybrid approach 2016/06/21 - Invited talk to the LIRMM - Montpellier - 13
  • 14. ADEL from 30,000 foots ADEL Entity Extraction Entity Linking Index 2016/06/21 - Invited talk to the LIRMM - Montpellier - 14
  • 15. § POS Tagger: Ø bidirectional CMM (left to right and right to left) § NER Combiner: Ø Use a combination of CRF with Gibbs sampling (Monte Carlo as graph inference method) models. A simple CRF model could be: PER PER PERO OOO X X X X XX XXXX X set of features for the current word: word capitalized, previous word is “de”, next word is a NNP, … Suppose P(PER | X, PER, O, LOC) = P(PER | X, neighbors(PER)) then X with PER is a CRF Jimmy Page , knowing the professionalism of John Paul Jones Entity Extraction: Extractors Module PER PERO 2016/06/21 - Invited talk to the LIRMM - Montpellier - 15
  • 16. CRF Models Combination in details § Apply multiple CRF models over the same piece of text § Merge the results into one single output 2016/06/21 - Invited talk to the LIRMM - Montpellier - 16
  • 17. Entity Extraction: Overlap Resolution § Detect overlaps among boundaries of entities coming from the extractors § Different heuristics can be applied: Ø Merge: (“United States” and “States of America” => “United States of America”) default behavior Ø Simple Substring: (“Florence” and “Florence May Harding” => ”Florence” and “May Harding”) Ø Smart Substring: (”Giants of New York” and “New York” => “Giants” and “New York”) 2016/06/21 - Invited talk to the LIRMM - Montpellier - 17
  • 18. Index: Indexing § Use DBpedia and Wikipedia as knowledge bases § Integrate external data such as PageRank scores from Hasso Platner Institute § Backend sytem with Elasticsearch and Couchbase § Turn DBpedia and Wikipedia into a CSV-based generic format 2016/06/21 - Invited talk to the LIRMM - Montpellier - 18
  • 19. Entity Linking: Linking tasks § Generate candidate links for all extracted mentions: Ø If any, they go to the linking method Ø If not, they are linked to NIL via NIL Clustering module § Linking method: Ø Filter out candidates that have different types than the one given by NER Ø ADEL linear formula: 𝑟 𝑙 = 𝑎. 𝐿 𝑚, 𝑡𝑖𝑡𝑙𝑒 + 𝑏. max 𝐿 𝑚, 𝑅 + 𝑐. max 𝐿 𝑚, 𝐷 . 𝑃𝑅(𝑙) r(l): the score of the candidate l L: the Levenshtein distance m: the extracted mention title: the title of the candidate l R: the set of redirect pages associated to the candidate l D: the set of disambiguation pages associated to the candidate l PR: Pagerank associated to the candidate l a, b and c are weights following the properties: a > b > c and a + b + c = 1 2016/06/21 - Invited talk to the LIRMM - Montpellier - 19
  • 20. Results § ADEL over OKE2015 § ADEL over OKE2016 § ADEL over NEEL2016 Precision Recall F-measure extraction 85.1 89.7 87.3 recognition 75.3 59 66.2 linking 85.4 42.7 57 Precision Recall F-measure extraction 81.5 72.4 76.6 recognition 74.8 66.5 70.4 linking 52.8 45.8 49.1 Precision Recall F-measure extraction 80.6 91.0 85.5 recognition 57.5 64.9 61.0 linking 49.9 58.3 53.8 2016/06/21 - Invited talk to the LIRMM - Montpellier - 20
  • 21. Issues with current methods § Supervised methods Ø Efficient but needs a training set for every dataset Ø Not robust enough if the type of text or entities change Ø Mostly associated with an E2E approach Ø Inappropriate to detect NIL entities § Unsupervised methods Ø Difficult to compute the relatedness among the candidates of each entity Ø Graph-based of linear formula are sometimes long to compute Ø Difficult to manipulate emerging entities in case of graph-based approach 2016/06/21 - Invited talk to the LIRMM - Montpellier - 21
  • 22. Deep Learning for Textual Content http://y2u.be/cJIILew6l28 https://youtu.be/mp6UsuRteNw?t=1h17m50s 2016/06/21 - Invited talk to the LIRMM - Montpellier - 22
  • 23. From Machine Learning to Deep Learning: Logistic Classifier § Logistic Classifier => Linear Classifier W . X + b = Y weights bias trained input data scores (logits) Soft max S(Y) probabilities D(S(Y),L) L Cross-Entropy Evaluate how much the model deviates from the GS 1-Hot Labels Multinomial Logistic Classification 2016/06/21 - Invited talk to the LIRMM - Montpellier - 23
  • 24. From Machine Learning to Deep Learning: Deep Neural Network § Neural Network => Non Linear Classifier X x W +b Y S(Y) L RELU H X W +b X W +b Y S(Y) LX RELU: Rectified Linear Units (Activation Function) Hidden Layers 2016/06/21 - Invited talk to the LIRMM - Montpellier - 24
  • 25. Why Understanding Language is Difficult? § Human language has great variability Ø Similar concepts are expressed in different ways, (e.g. kitty vs cat) § Human language has great ambiguity Ø Similar expressions mean different concepts, (e.g. New York vs New York Times) § The meaning of text is usually vague and latent Ø No clear supervision signal to learn from § Learning semantic meaning of texts is a key challenge in NLP 2016/06/21 - Invited talk to the LIRMM - Montpellier - 25
  • 26. Word Embeddings § Find a way to represent and measure how much two different words have same/similar meaning § Need a huge amount of labelled data then better using an unsupervised approach cat kitty ? 2016/06/21 - Invited talk to the LIRMM - Montpellier - 26
  • 27. Word Embeddings The purrs This hunts mice Cat-like behaving the same way § Context gives a good idea that words are similar § Goal is to predict words context in order to treat cat-like words similarly 2016/06/21 - Invited talk to the LIRMM - Montpellier - 27
  • 28. Word Embeddings kittycat dog car lion pet tiger § Map words to small vectors (embeddings) § Embeddings are close to each other in the words space when they have similar meaning 2016/06/21 - Invited talk to the LIRMM - Montpellier - 28
  • 29. Word2Vec § Developed by Google in 2013 § Produce word embeddings § Takes a large corpus of text as input and produce a vector space as output § Word vectors are positioned in the vector space such that words that share common contexts in the corpus are located in close proximity to one another in the vector space § The goal is to provide semantically similar words to a given word 2016/06/21 - Invited talk to the LIRMM - Montpellier - 29
  • 30. Word2Vec: How it works? The quick brown fox jumps over the lazy dog window Vfox Logistic classifiers § Map every word to an embedding § Use a window around a selected word § Use the embedding of the selected word to predict the context of the word 2016/06/21 - Invited talk to the LIRMM - Montpellier - 30
  • 31. Word2Vec: How it works? § Measure the closeness of two word embeddings with cosine similarity is better than with L2 because the length is not relevant for the classification § Normalize all embeddings to get them in unit norm form cat kittyL2 Vkitty Vcat cosine 𝑉GHI − 𝑉KLIIM N N 𝑉GHI. 𝑉KLIIM 𝑉GHI . 𝑉KLIIM L2 = cosine = 2016/06/21 - Invited talk to the LIRMM - Montpellier - 31
  • 32. Word2Vec schema § Compares a target from the context of the input word § Compute softmax over a huge vocabulary vector can be very inefficient § To solve this issue, use sampled softmax cat W Vcat + b Vcat . . 0 0 0 1 0 0 . . r a n d o m s a m p l e cross entropy softmax logistic classifier purr 2016/06/21 - Invited talk to the LIRMM - Montpellier - 32
  • 33. DSSM: Deep Structured Semantic Model § Developed by Microsoft in 2013 § Compute similarity between vectors § Generic enough for being applied to many more cases than what Word2Vec can do (Web search, ads, question answering, machine translation, word embeddings…) § Training made with backpropagation § The layers can be either: DNN, CNN or RNN 2016/06/21 - Invited talk to the LIRMM - Montpellier - 33
  • 34. DSSM schema with CNN DSSM sim(X,Y) Word sequence xt Relevance measured by cosine similarity Learning: maximize the similarity between X (source) and Y (target) 128 128 X Y w1,w2,…,wTX w1,w2,…,wTY 2016/06/21 - Invited talk to the LIRMM - Montpellier - 34
  • 35. DSSM schema with CNN X Y 128 128 sim(X,Y) w1,w2,…,wTX w1,w2,…,wTY Relevance measured by cosine similarity Learning: maximize the similarity between X (source) and Y (target) f(.) g(.) Representation: use DNN to extract abstract semantic representations Word sequence xt 2016/06/21 - Invited talk to the LIRMM - Montpellier - 35
  • 36. DSSM schema with CNN sim(X,Y) Word sequence xt Relevance measured by cosine similarity Learning: maximize the similarity between X (source) and Y (target) Representation: use DNN to extract abstract semantic representations 128 128 X Y w1,w2,…,wTX w1,w2,…,wTY Word hashing layer ft Convolutional layer ct Max pooling layer v Semantic layer h Convolutional and max pooling layer: identify keywords (concepts) in X and Y Word hashing: use letter-trigram as raw input to handle very large vocabulary 2016/06/21 - Invited talk to the LIRMM - Montpellier - 36
  • 37. Conclusion § Current methods for entity linking do not exploit enough semantics § Deep Learning technics might be used to better take into account the semantic § Using Word2Vec to rank the entity candidates from the most semantically similar to the less one § Using DSSM to measure the relatedness between the candidates of each extracted mention 2016/06/21 - Invited talk to the LIRMM - Montpellier - 37
  • 38. Questions? Thank you for listening! http://multimediasemantics.github.io/adel http://jplu.github.io julien.plu@eurecom.fr @julienplu http://www.slideshare.net/julienplu 2016/06/21 - Invited talk to the LIRMM - Montpellier - 38