SlideShare uma empresa Scribd logo
1 de 36
Baixar para ler offline
Entity Linking meets Word Sense
Disambiguation: a Unified Approach
Paper by: Andrea Moro, Alessandro Raganato, Roberto Navigli
Dipartimento di Informatica,Sapienza Universita di Roma
Presentation by: Antonio Quirós
Grupo LaBDA (Laboratorio de Bases de Datos Avanzadas)
Universidad Carlos III de Madrid
Babelfy is a unified, multilingual, graph-based approach to Entity
Linking and Word Sense Disambiguation based on a loose
identification of candidate meanings coupled with a densest subgraph
heuristic which selects high-coherence semantic interpretations.
Babelfy is based on the BabelNet 3.0 multilingual semantic network
and jointly performs disambiguation and entity linking.
Entity Linking: Discovering mentions of entities within a text and
linking them in a Knowledge Base.
Word Sense Disambiguation: Assigning meanings to word
occurrencies within a text.
Babelfy combine Entity Linking and Word Sense Disambiguation.
EL & WSD
- Unlike WSD, Babelfy allows overlapping fragments of text
ie: “Major League Baseball”
It identifies and disambiguate several nominal and entity mentions:
“Major League Baseball” - “Major League” - “League” - “Baseball”
- Unlike EL, it links not only Named Entity Mentions (“Major League
Baseball”) but also nominal mentions (“Major League”) to their
corresponding meaning in the Knowledge Base.
Babelfy approach in three steps:
One: Associate each vertex of the Semantic Network with a Semantic
Signature.
Two: Given an input text, extract all the linkable fragments and for
each fragment list the possible meanings according to the Semantic
Network.
Three: Create a graph-based semantic interpretation of the whole text
by linking the candidate meanings of the fragments using the Semantic
Signatures created in the first step, and then, extract a dense subgraph
of this representation and select the best candidate meaning for each
fragment.
Highly related
verticesPerformed only once
Either concept or named entity
Novel approach !!
Step One: (Creating the Semantic Signatures)
Assign higher weight to edges which are involved in more densely
connected areas.
This is accomplished by using “Directed Triangles” (Cycles of lenght 3)
and weight by the number of triangles they occur in.
Step One: (Creating the Semantic Signatures)
Football
weight(v, v') := |{(v, v', v'') : (v, v'), (v', v''), (v'', v) ∈ E}|+1
Ball
Basketall
Field
Sports
Court
Step One: (Creating the Semantic Signatures)
weight(Football, Sports) = | ( (Football, Sports) , (Football, Ball) , (Sports, Ball) ) , ( (Football,
Sports) , (Football, Field) , (Sports, Field) ) | = 2 + 1 = 3
Football
Ball
Field
Sports
Court
Basketall
Step One: (Creating the Semantic Signatures)
2
Football
Ball
Basketall
Field
Sports
Court
2
2 2
2
2
3
3 3
Step One: (Creating the Semantic Signatures)
After assigning weights to each edge, perform a Random Walk with
Restart to create the Semantic Signature: a set of highly related
vertices.
For a fixed number of steps, run a RWR from every vertex v of the
Semantic Network, keep track of the encountered vertices; eliminate
weakly related vertices, keeping only those items that were hit at least
η times.
Finally return the remaining vertices as SemSignv
: the Semantic
Signature of v.
Step One: (Creating the Semantic Signatures)
1: input: v, the starting vertex; , the restart probability;α
n, the number of steps to be executed; P, the transition probabilities;
, the frequency threshold.η
2: output: semSignv, set of related vertices for v.
3: function RWR(v, , n,P, )α η
4: v' := v
5: counts := newMap < Synset, Integer >
6: while n > 0 do
7: if random() > α then
8: given the transition probabilities P(·|v')
9: of v', choose a random neighbor v''
10: v' := v''
11: counts[v']++
12: else
13: restart the walk
14: v' := v
15: n := n 1−
16: for each v' in counts.keys() do
17: if counts[v'] < η then
18: remove v' from counts.keys()
19: return semSignv = counts.keys()
P(v' | v) = weight(v, v')
∑ weight(v, v'')
v'' V∈
Step Two: (Candidate Identification)
Using part-of-speech tagging, identify the set F of all textual fragments
which contains at least one noun and are substring of lexicalizations in
BabelNet.
For each f F look for candidates meanings -∈ cand(f)-: vertices
containing f or, only for named entities, a superstring of f as their
lexicalization.
Babelfy uses a loose candidate identification based on superstring
matching, instead of exact matching.
Step Two: (Candidate Identification)
example:
Word:
Sports
Candidates:
Sports
Water sports
...
Skateboarding {…, Extreme Sports, …}
...
Vertices containing f
Vertices having a superstring of f as one of its
lexicalization (Senses)
Step Three: (Candidate Disambiguation)
Create a directed graph GI
= (VI
, EI
) of the Semantic Interpretations of
the input text.
VI
: Contains all candidate meanings of all fragments
VI
:= {(v, f) : v ∈ cand(f), f F}∈
EI
: Connect two candidate meanings of different fragments if one is in
the semantic signature of the other.
Add an edge from (v, f) to (v', f') iff f ≠ f' and v' semSign∈ v
Step Three: (Candidate Disambiguation)
Once created GI
(The graph representation of all the possible
interpretations) then apply densest subgraph heuristics.
After that, the result is a sub-graph which contains those semantic
interpretations that are most coherent to each other. But this sub-graph
might still containt multiple interpretations for the same fragment.
So, the final step is to select the most suitable candidate meaning for
each fragment f given a threshold to discard semantically unrelated
candidate meanings.
Step Three: (Candidate Disambiguation)
1: input: F, the fragments in the input text; semSign, the semantic signatures;
µ, ambiguity level to be reached; cand, fragments to candidate meanings.
2: output: selected, disambiguated fragments.
3: function DISAMB(F,semSign, µ, cand)
4: VI := ;EI :=∅ ∅
5: GI := (VI,EI)
6: for each fragment f F∈ do
7: for each candidate v cand(f)∈ do
8: VI := VI {(v, f)}∪
9: for each ((v, f), (v', f')) VI × VI∈ do
10: if f ≠ f' and v' semSignv∈ then
11: EI := EI {((v, f), (v', f'))}∪
12: G*I := DENSSUB(F, cand, GI, µ)
13: selected := newMap < String,Synset >
14: for each f F s.t. (v, f) V*I∈ ∃ ∈ do
15: cand*(f) := {v : (v, f) V*I }∈
16: v* := argmaxv cand*(f)∈
score((v, f))
17: if score((v*, f)) ≥ θ then
18: selected(f) := v*
19: return selected
Function with the novel approach!!
Step Three: (Candidate Disambiguation)
Let's see an example:
“The leaf is falling from the tree on my head”
- Leaf has many candidate meanings.
- falling also has many candidate meanings.
- tree also has many candidate meanings.
And, as you might have guessed...
- Head also has many candidate meanings.
Step Three: “The leaf is falling from the tree on my head”
Music, Disc, Record, Rock
( Tree (Álbum), tree )
Thoughts, Feelings, Reason
( Mind, head )
Body, Anatomy, Falling (Accident)
( Head, head )
Guide, Group, Team, Boss
( Leader, head )
Book, Text, Paragraph, Novel
( Header, head )
cand(f)
SemSignv
Physics, Descend, Sky, High
( Fall, falling )
Music, Alicia Keys, Album
( Falling (Song), falling )
Pain, Hit, Push, Trauma
( Falling (Accident), falling )
Action, Hollywood, Cinema
( Falling (Movie), falling )
Nature, Fall, Earth, Oxygen, Leaf
( Tree, tree )
Leaf, Storage, Father, Son, Binary
( Tree (Data Structure), tree )
Node, Euler, Binary, Math, Path
( Tree (Graph Theory), tree )
Fall, Woods, Tree, Forest, Flora, Fall
( Leaf, leaf )
Text, Side, Right, Left, Book, Novel
( Leaf (Book), leaf )
Car, Motor, Vehicle, Japan, Tree
( Nissan Leaf, leaf )
Games, Visual Novel, Publisher
( Leaf (Japanese Co.), leaf )
Music, Pop, Dutch, Falling (Song)
( Leaf (Band), leaf )
(Generate a graph representation with
all possible meanings)
Step Three: (Candidate Disambiguation)
Following the algorithm, create an edge between two vertex if and only
if they do not belong to the same frangment and one is part of the
Semantic Signature of the other.
Step Three: “The leaf is falling from the tree on my head”
Music, Disc, Record, Rock
( Tree (Álbum), tree )
Thoughts, Feelings, Reason
( Mind, head )
Body, Anatomy, Falling (Accident)
( Head, head )
Guide, Group, Team, Boss
( Leader, head )
Book, Text, Paragraph, Novel
( Header, head )
cand(f)
SemSignv
Physics, Descend, Sky, High
( Fall, falling )
Music, Alicia Keys, Album
( Falling (Song), falling )
Pain, Hit, Push, Trauma
( Falling (Accident), falling )
Action, Hollywood, Cinema
( Falling (Movie), falling )
Nature, Fall, Earth, Oxygen, Leaf
( Tree, tree )
Leaf, Storage, Father, Son, Binary
( Tree (Data Structure), tree )
Node, Euler, Binary, Math, Path
( Tree (Graph Theory), tree )
Fall, Woods, Tree, Forest, Flora, Fall
( Leaf, leaf )
Text, Side, Right, Left, Book, Novel
( Leaf (Book), leaf )
Car, Motor, Vehicle, Japan, Tree
( Nissan Leaf, leaf )
Games, Visual Novel, Publisher
( Leaf (Japanese Co.), leaf )
Music, Pop, Dutch, Falling (Song)
( Leaf (Band), leaf )
(Generate a graph representation with
all possible meanings)
Step Three:
Apply densest sub-graph heuristics to obtain a sub-graph which contains those
semantic interpretations that are most coherent to each other
DENSSUB(F, cand, GI
, µ)
We'll come back to it later...
Step Three: “The leaf is falling from the tree on my head”
Body, Anatomy, Falling (Accident)
( Head, head )
Book, Text, Paragraph, Novel
( Header, head )
cand(f)
SemSignv
Physics, Descend, Sky, High
( Fall, falling )
Pain, Hit, Push, Trauma, Tree
( Falling (Accident), falling )
Nature, Root, Earth, Oxygen, Fall
( Tree, tree )
Leaf, Storage, Father, Son, Binary
( Tree (Data Structure), tree )
Fall, Woods, Tree, Forest, Flora, Fall
( Leaf, leaf )
Music, Pop, Dutch, Falling (Song)
( Leaf (Band), leaf )
(Generate a graph representation with
all possible meanings)
Let's assume this is the
output of the blackbox
Step Three:
Then we have to select the most suitable candidate meaning for each fragment f.
We use a given threshold θ to discard semantically unrealted candidates.
For each fragment f, we compute the score of each candidate for that fragment and
keep those candidates which score is higher than θ.
score((v, f)) = w(v,f) · deg((v, f))
∑ w(v',f) · deg((v', f))
v' cand(f)∈
w(v,f) := |{f' F :∈ v' s.t. ((v, f),(v', f')) or ((v', f'),(v, f)) E∃ ∈ I
}|
|F| 1−
deg(v) is the overall number of incoming and outgoing edges
deg(v) := deg+(v)+deg (v)−
Step Three:
In other words: We compute the score for each meaning by calculating it's normalized
weighted degree.
Calculate the weight for the meaning, multiply it by it's degree and divide it by the
sumatory of all scores of the candidates for that fragment.
The weight is calculated as the fraction of fragments the candidate meaning v connects
to. In other words, count the number of fragments the vertex v connects to and divide it
by the number of fragments minus one.
Fragments, not vertex. In other words, if the
vertex v connects to v' and v'' and they both
belong to the same fragment, they count as
one
Step Three: “The leaf is falling from the tree on my head”
Body, Anatomy, Falling (Accident)
( Head, head )
Book, Text, Paragraph, Novel
( Header, head )
cand(f)
SemSignv
Physics, Descend, Sky, High
( Fall, falling )
Pain, Hit, Push, Trauma, Tree
( Falling (Accident), falling )
Nature, Root, Earth, Oxygen, Fall
( Tree, tree )
Leaf, Storage, Father, Son, Binary
( Tree (Data Structure), tree )
Fall, Woods, Tree, Forest, Flora, Fall
( Leaf, leaf )
Music, Pop, Dutch, Falling (Song)
( Leaf (Band), leaf )
(Generate a graph representation with
all possible meanings)
Let's compute the weight of
(Leaf, leaf)
The number of fragments
“(Leafl, leaf)” is linked to, divided by
the number of fragments minus one:
w((Leaf, leaf)) = |{Fall, Tree}| = 2
4 – 1 3
Step Three: “The leaf is falling from the tree on my head”
Body, Anatomy, Falling (Accident)
( Head, head )
Book, Text, Paragraph, Novel
( Header, head )
cand(f)
SemSignv
Physics, Descend, Sky, High
( Fall, falling )
Pain, Hit, Push, Trauma, Tree
( Falling (Accident), falling )
Nature, Root, Earth, Oxygen, Fall
( Tree, tree )
Leaf, Storage, Father, Son, Binary
( Tree (Data Structure), tree )
Fall, Woods, Tree, Forest, Flora, Fall
( Leaf, leaf )
Music, Pop, Dutch, Falling (Song)
( Leaf (Band), leaf )
(Generate a graph representation with
all possible meanings)
And the degree of (Leaf, leaf) is
the number of incomming and
outgoing edges:
deg((Leaf, leaf)) = 3
Step Three:
For our example the computed weights and degrees are in the next table:
Step Three:
Now we can calculate the score for every candidate meaning:
For each candidate multiply it's weight by it's degree (w*d)
Then again for each candidate, divide w*d by the sum of all w*d for that fragment.
For example (Leaf, leaf)
weight((Leaf, leaf)) = 2/3
degree((Leaf, leaf)) = 4
w*d = 8/3
Sum of all others w*d for that
specific fragment (leaf) = 8/3
score((Leaf, leaf)) = 1,000
8
3 = 1
8
3
8
3
Step Three:
For our example the computed scores are in the next table:
Step Three:
Finally, we link each fragment with the highest ranking candidate meaning v* if it's score
is higher than the fixed threshold.
Four our example, for a threshold of 0,7
We keep:
Leaf (plant)
Fall
Tree
Head (as body part)
Which is correct.
Densest Sub-Graph
DENSSUB(F, cand, GI
, µ)
Back to the blackbox !!
Densest Sub-Graph
This is an approach to drastically reduce the level of ambiguity of the initial semantic
interpretation graph.
It is based on the assumption that the most suitable meanings of each text fragment will
belong to the densest area of the graph.
Identify the densest sub-graph of size at least k is NP-Hard. So Babelfy uses a heuristic
for k-partite graphs inspired by a 2-approximation greedy algorithm for arbitrary graphs.
Babelfy strategy is based on the iterative removal of low-coherence vertices.
Densest Sub-Graph
First, start with the initial semantic interpretation graph GI
(0)
at step 0.
For each step, identify the most ambiguous fragment fmax (The one with the maxumum
number of candidate meanings).
Then, discard the weakest interpretation of the current fragment fmax. This is done by
determining the lexical and semantic coherence of each candidate meaning using the
score formula showed before.
The vertex with the minimum score is removed from the graph.
Densest Sub-Graph
Then, in the next step, repeat the low-coherence removal step. And stop when the
number of remaining candidates for each fragment is below a threshold.
During each iteration, compute the average degree of the current step graph, and keep
the densest subgraph of the initial semantic interpretation graph, which is the one that
maximizes the average degree.
Densest Sub-Graph
1: input: F, the set of all fragments in the input text;
cand, from fragments to candidate meanings;
G(0)
I , the full semantic interpretation graph; µ, ambiguity level to be reached.
2: output: G*I, a dense subgraph.
3: function DENSSUB(F, cand, G(0)
I ,µ)
4: t := 0
5: G*I := G(0)
I
6: while true do
7: fmax := argmaxf F∈ |{v : (v, f) V∃ ∈ (t)
I}|
8: if |{v : (v, fmax) V∃ ∈ (t)
I }| µ≤ then
9: break;
10: vmin:= argmin score((v, fmax))
v cand(fmax)∈
11: V(t+1)
I := V(t)
I  {(vmin, fmax)}
12: E(t+1)
I := E(t)
I V∩ (t+1)
I × V(t+1)
I
13: G(t+1)
I := (V(t+1)
I, E(t+1)
I)
14: if avgdeg(G(t+1)
I) > avgdeg(G*I) then
15: G*I := G(t+1)
I
16: t := t+1
17: return G*I
Links
Reference paper about Babelfy:
A. Moro, A. Raganato, R. Navigli. Entity Linking meets Word Sense
Disambiguation: a Unified Approach. Transactions of the Association for
Computational Linguistics (TACL), 2, pp. 231-244, 2014.
http://wwwusers.di.uniroma1.it/~navigli/pubs/TACL_2014_Babelfy.pdf
Babelfy website
http://babelfy.org/
Babelnet website
http://babelnet.org/
Grupo LaBDA
http://labda.inf.uc3m.es/

Mais conteúdo relacionado

Destaque

Error analysis of Word Sense Disambiguation
Error analysis of Word Sense DisambiguationError analysis of Word Sense Disambiguation
Error analysis of Word Sense DisambiguationRubén Izquierdo Beviá
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksLeonardo Di Donato
 
Word Sense Disambiguation and Induction
Word Sense Disambiguation and InductionWord Sense Disambiguation and Induction
Word Sense Disambiguation and InductionLeon Derczynski
 
Ontology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific LiteratureOntology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific LiteratureeXascale Infolab
 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationMarina Santini
 
The Citizen Card - Improving Voluntary Tax Compliance in Ghana
The Citizen Card - Improving Voluntary Tax Compliance in GhanaThe Citizen Card - Improving Voluntary Tax Compliance in Ghana
The Citizen Card - Improving Voluntary Tax Compliance in GhanaYaw Osei-Tutu
 
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Grupo HULAT
 
Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...
Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...
Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...Babloo Sharma, Ph.D.
 
Detecting Drugs and Adverse Events from Spanish Health Social Media Streams
Detecting Drugs and Adverse Events from Spanish Health Social Media StreamsDetecting Drugs and Adverse Events from Spanish Health Social Media Streams
Detecting Drugs and Adverse Events from Spanish Health Social Media StreamsGrupo HULAT
 
Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...
Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...
Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...Babloo Sharma, Ph.D.
 
Perfluorocyclohexenyl aryl ether polymers via polycondensation
Perfluorocyclohexenyl aryl ether polymers via   polycondensationPerfluorocyclohexenyl aryl ether polymers via   polycondensation
Perfluorocyclohexenyl aryl ether polymers via polycondensationBabloo Sharma, Ph.D.
 
An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent Grupo HULAT
 
An introduction to html
An introduction to htmlAn introduction to html
An introduction to htmlkashifareed
 
Application of Information Extraction techniques to pharmacological domain: E...
Application of Information Extraction techniques to pharmacological domain: E...Application of Information Extraction techniques to pharmacological domain: E...
Application of Information Extraction techniques to pharmacological domain: E...Grupo HULAT
 

Destaque (18)

Error analysis of Word Sense Disambiguation
Error analysis of Word Sense DisambiguationError analysis of Word Sense Disambiguation
Error analysis of Word Sense Disambiguation
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
 
Word Sense Disambiguation and Induction
Word Sense Disambiguation and InductionWord Sense Disambiguation and Induction
Word Sense Disambiguation and Induction
 
Ontology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific LiteratureOntology-Based Word Sense Disambiguation for Scientific Literature
Ontology-Based Word Sense Disambiguation for Scientific Literature
 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense Disambiguation
 
Protecting the brand
Protecting the brandProtecting the brand
Protecting the brand
 
The Citizen Card - Improving Voluntary Tax Compliance in Ghana
The Citizen Card - Improving Voluntary Tax Compliance in GhanaThe Citizen Card - Improving Voluntary Tax Compliance in Ghana
The Citizen Card - Improving Voluntary Tax Compliance in Ghana
 
RERA
RERARERA
RERA
 
Max Magazine
Max MagazineMax Magazine
Max Magazine
 
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
 
Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...
Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...
Perfluorocyclopentenyl (PFCP) Aryl Ether Polymers via Polycondensation of Oct...
 
Detecting Drugs and Adverse Events from Spanish Health Social Media Streams
Detecting Drugs and Adverse Events from Spanish Health Social Media StreamsDetecting Drugs and Adverse Events from Spanish Health Social Media Streams
Detecting Drugs and Adverse Events from Spanish Health Social Media Streams
 
Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...
Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...
Bis-perfluorocycloalkenyl (PFCA) aryl ether monomers towards a versatile clas...
 
Perfluorocyclohexenyl aryl ether polymers via polycondensation
Perfluorocyclohexenyl aryl ether polymers via   polycondensationPerfluorocyclohexenyl aryl ether polymers via   polycondensation
Perfluorocyclohexenyl aryl ether polymers via polycondensation
 
Compensation claim nsw
Compensation claim nswCompensation claim nsw
Compensation claim nsw
 
An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent
 
An introduction to html
An introduction to htmlAn introduction to html
An introduction to html
 
Application of Information Extraction techniques to pharmacological domain: E...
Application of Information Extraction techniques to pharmacological domain: E...Application of Information Extraction techniques to pharmacological domain: E...
Application of Information Extraction techniques to pharmacological domain: E...
 

Semelhante a Babelfy: Entity Linking meets Word Sense Disambiguation.

Semantic Data Management in Graph Databases
Semantic Data Management in Graph DatabasesSemantic Data Management in Graph Databases
Semantic Data Management in Graph DatabasesMaribel Acosta Deibe
 
Declarative Thinking, Declarative Practice
Declarative Thinking, Declarative PracticeDeclarative Thinking, Declarative Practice
Declarative Thinking, Declarative PracticeKevlin Henney
 
Short Reference Card for R users.
Short Reference Card for R users.Short Reference Card for R users.
Short Reference Card for R users.Dr. Volkan OBAN
 
R command cheatsheet.pdf
R command cheatsheet.pdfR command cheatsheet.pdf
R command cheatsheet.pdfNgcnh947953
 
Semantic Data Management in Graph Databases: ESWC 2014 Tutorial
Semantic Data Management in Graph Databases: ESWC 2014 TutorialSemantic Data Management in Graph Databases: ESWC 2014 Tutorial
Semantic Data Management in Graph Databases: ESWC 2014 TutorialMaribel Acosta Deibe
 
Everything is composable
Everything is composableEverything is composable
Everything is composableVictor Igor
 
Real World Haskell: Lecture 2
Real World Haskell: Lecture 2Real World Haskell: Lecture 2
Real World Haskell: Lecture 2Bryan O'Sullivan
 
R Programming Reference Card
R Programming Reference CardR Programming Reference Card
R Programming Reference CardMaurice Dawson
 
Deduplication on large amounts of code
Deduplication on large amounts of codeDeduplication on large amounts of code
Deduplication on large amounts of codesource{d}
 
Cs6660 compiler design november december 2016 Answer key
Cs6660 compiler design november december 2016 Answer keyCs6660 compiler design november december 2016 Answer key
Cs6660 compiler design november december 2016 Answer keyappasami
 
Class 31: Deanonymizing
Class 31: DeanonymizingClass 31: Deanonymizing
Class 31: DeanonymizingDavid Evans
 
Fp in scala part 2
Fp in scala part 2Fp in scala part 2
Fp in scala part 2Hang Zhao
 
Sequence and Traverse - Part 3
Sequence and Traverse - Part 3Sequence and Traverse - Part 3
Sequence and Traverse - Part 3Philip Schwarz
 

Semelhante a Babelfy: Entity Linking meets Word Sense Disambiguation. (20)

Semantic Data Management in Graph Databases
Semantic Data Management in Graph DatabasesSemantic Data Management in Graph Databases
Semantic Data Management in Graph Databases
 
20170509 rand db_lesugent
20170509 rand db_lesugent20170509 rand db_lesugent
20170509 rand db_lesugent
 
Declarative Thinking, Declarative Practice
Declarative Thinking, Declarative PracticeDeclarative Thinking, Declarative Practice
Declarative Thinking, Declarative Practice
 
Reference card for R
Reference card for RReference card for R
Reference card for R
 
Short Reference Card for R users.
Short Reference Card for R users.Short Reference Card for R users.
Short Reference Card for R users.
 
R command cheatsheet.pdf
R command cheatsheet.pdfR command cheatsheet.pdf
R command cheatsheet.pdf
 
@ R reference
@ R reference@ R reference
@ R reference
 
Semantic Data Management in Graph Databases: ESWC 2014 Tutorial
Semantic Data Management in Graph Databases: ESWC 2014 TutorialSemantic Data Management in Graph Databases: ESWC 2014 Tutorial
Semantic Data Management in Graph Databases: ESWC 2014 Tutorial
 
Everything is composable
Everything is composableEverything is composable
Everything is composable
 
Real World Haskell: Lecture 2
Real World Haskell: Lecture 2Real World Haskell: Lecture 2
Real World Haskell: Lecture 2
 
R Programming Reference Card
R Programming Reference CardR Programming Reference Card
R Programming Reference Card
 
Deduplication on large amounts of code
Deduplication on large amounts of codeDeduplication on large amounts of code
Deduplication on large amounts of code
 
Cs6660 compiler design november december 2016 Answer key
Cs6660 compiler design november december 2016 Answer keyCs6660 compiler design november december 2016 Answer key
Cs6660 compiler design november december 2016 Answer key
 
Invitation to Scala
Invitation to ScalaInvitation to Scala
Invitation to Scala
 
Frp2016 3
Frp2016 3Frp2016 3
Frp2016 3
 
09-graphs.ppt
09-graphs.ppt09-graphs.ppt
09-graphs.ppt
 
Class 31: Deanonymizing
Class 31: DeanonymizingClass 31: Deanonymizing
Class 31: Deanonymizing
 
Fp in scala part 2
Fp in scala part 2Fp in scala part 2
Fp in scala part 2
 
Sequence and Traverse - Part 3
Sequence and Traverse - Part 3Sequence and Traverse - Part 3
Sequence and Traverse - Part 3
 
Introduction to Scala
Introduction to ScalaIntroduction to Scala
Introduction to Scala
 

Mais de Grupo HULAT

Interaccion 2019 lourdes moreno
Interaccion 2019 lourdes morenoInteraccion 2019 lourdes moreno
Interaccion 2019 lourdes morenoGrupo HULAT
 
Low vision interaccion2018v4
Low vision interaccion2018v4Low vision interaccion2018v4
Low vision interaccion2018v4Grupo HULAT
 
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...Grupo HULAT
 
Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...Grupo HULAT
 
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...Grupo HULAT
 
Presentation "Spanish Resources in Trendminer Project"
Presentation "Spanish Resources in Trendminer Project"Presentation "Spanish Resources in Trendminer Project"
Presentation "Spanish Resources in Trendminer Project"Grupo HULAT
 
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...Grupo HULAT
 
Extraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical TextsExtraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical TextsGrupo HULAT
 
Lessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction TaskLessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction TaskGrupo HULAT
 
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...Grupo HULAT
 
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Grupo HULAT
 
Accessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older peopleAccessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older peopleGrupo HULAT
 
Toward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processesToward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processesGrupo HULAT
 
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...Grupo HULAT
 
Formación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la UniversidadFormación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la UniversidadGrupo HULAT
 
Requisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimediaRequisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimediaGrupo HULAT
 
Integrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approachIntegrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approachGrupo HULAT
 
A MDD approach for modelling web accessibility
A MDD approach for modelling web accessibilityA MDD approach for modelling web accessibility
A MDD approach for modelling web accessibilityGrupo HULAT
 
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...Grupo HULAT
 
Adaptation Rules for Accessible Media Player Interface
Adaptation Rules for Accessible Media Player Interface Adaptation Rules for Accessible Media Player Interface
Adaptation Rules for Accessible Media Player Interface Grupo HULAT
 

Mais de Grupo HULAT (20)

Interaccion 2019 lourdes moreno
Interaccion 2019 lourdes morenoInteraccion 2019 lourdes moreno
Interaccion 2019 lourdes moreno
 
Low vision interaccion2018v4
Low vision interaccion2018v4Low vision interaccion2018v4
Low vision interaccion2018v4
 
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
 
Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...
 
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
 
Presentation "Spanish Resources in Trendminer Project"
Presentation "Spanish Resources in Trendminer Project"Presentation "Spanish Resources in Trendminer Project"
Presentation "Spanish Resources in Trendminer Project"
 
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
 
Extraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical TextsExtraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical Texts
 
Lessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction TaskLessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction Task
 
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
 
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
 
Accessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older peopleAccessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older people
 
Toward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processesToward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processes
 
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
 
Formación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la UniversidadFormación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la Universidad
 
Requisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimediaRequisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimedia
 
Integrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approachIntegrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approach
 
A MDD approach for modelling web accessibility
A MDD approach for modelling web accessibilityA MDD approach for modelling web accessibility
A MDD approach for modelling web accessibility
 
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
 
Adaptation Rules for Accessible Media Player Interface
Adaptation Rules for Accessible Media Player Interface Adaptation Rules for Accessible Media Player Interface
Adaptation Rules for Accessible Media Player Interface
 

Último

React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineeringssuserb3a23b
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 

Último (20)

React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineering
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 

Babelfy: Entity Linking meets Word Sense Disambiguation.

  • 1. Entity Linking meets Word Sense Disambiguation: a Unified Approach Paper by: Andrea Moro, Alessandro Raganato, Roberto Navigli Dipartimento di Informatica,Sapienza Universita di Roma Presentation by: Antonio Quirós Grupo LaBDA (Laboratorio de Bases de Datos Avanzadas) Universidad Carlos III de Madrid
  • 2. Babelfy is a unified, multilingual, graph-based approach to Entity Linking and Word Sense Disambiguation based on a loose identification of candidate meanings coupled with a densest subgraph heuristic which selects high-coherence semantic interpretations. Babelfy is based on the BabelNet 3.0 multilingual semantic network and jointly performs disambiguation and entity linking.
  • 3. Entity Linking: Discovering mentions of entities within a text and linking them in a Knowledge Base. Word Sense Disambiguation: Assigning meanings to word occurrencies within a text. Babelfy combine Entity Linking and Word Sense Disambiguation. EL & WSD
  • 4. - Unlike WSD, Babelfy allows overlapping fragments of text ie: “Major League Baseball” It identifies and disambiguate several nominal and entity mentions: “Major League Baseball” - “Major League” - “League” - “Baseball” - Unlike EL, it links not only Named Entity Mentions (“Major League Baseball”) but also nominal mentions (“Major League”) to their corresponding meaning in the Knowledge Base.
  • 5. Babelfy approach in three steps: One: Associate each vertex of the Semantic Network with a Semantic Signature. Two: Given an input text, extract all the linkable fragments and for each fragment list the possible meanings according to the Semantic Network. Three: Create a graph-based semantic interpretation of the whole text by linking the candidate meanings of the fragments using the Semantic Signatures created in the first step, and then, extract a dense subgraph of this representation and select the best candidate meaning for each fragment. Highly related verticesPerformed only once Either concept or named entity Novel approach !!
  • 6. Step One: (Creating the Semantic Signatures) Assign higher weight to edges which are involved in more densely connected areas. This is accomplished by using “Directed Triangles” (Cycles of lenght 3) and weight by the number of triangles they occur in.
  • 7. Step One: (Creating the Semantic Signatures) Football weight(v, v') := |{(v, v', v'') : (v, v'), (v', v''), (v'', v) ∈ E}|+1 Ball Basketall Field Sports Court
  • 8. Step One: (Creating the Semantic Signatures) weight(Football, Sports) = | ( (Football, Sports) , (Football, Ball) , (Sports, Ball) ) , ( (Football, Sports) , (Football, Field) , (Sports, Field) ) | = 2 + 1 = 3 Football Ball Field Sports Court Basketall
  • 9. Step One: (Creating the Semantic Signatures) 2 Football Ball Basketall Field Sports Court 2 2 2 2 2 3 3 3
  • 10. Step One: (Creating the Semantic Signatures) After assigning weights to each edge, perform a Random Walk with Restart to create the Semantic Signature: a set of highly related vertices. For a fixed number of steps, run a RWR from every vertex v of the Semantic Network, keep track of the encountered vertices; eliminate weakly related vertices, keeping only those items that were hit at least η times. Finally return the remaining vertices as SemSignv : the Semantic Signature of v.
  • 11. Step One: (Creating the Semantic Signatures) 1: input: v, the starting vertex; , the restart probability;α n, the number of steps to be executed; P, the transition probabilities; , the frequency threshold.η 2: output: semSignv, set of related vertices for v. 3: function RWR(v, , n,P, )α η 4: v' := v 5: counts := newMap < Synset, Integer > 6: while n > 0 do 7: if random() > α then 8: given the transition probabilities P(·|v') 9: of v', choose a random neighbor v'' 10: v' := v'' 11: counts[v']++ 12: else 13: restart the walk 14: v' := v 15: n := n 1− 16: for each v' in counts.keys() do 17: if counts[v'] < η then 18: remove v' from counts.keys() 19: return semSignv = counts.keys() P(v' | v) = weight(v, v') ∑ weight(v, v'') v'' V∈
  • 12. Step Two: (Candidate Identification) Using part-of-speech tagging, identify the set F of all textual fragments which contains at least one noun and are substring of lexicalizations in BabelNet. For each f F look for candidates meanings -∈ cand(f)-: vertices containing f or, only for named entities, a superstring of f as their lexicalization. Babelfy uses a loose candidate identification based on superstring matching, instead of exact matching.
  • 13. Step Two: (Candidate Identification) example: Word: Sports Candidates: Sports Water sports ... Skateboarding {…, Extreme Sports, …} ... Vertices containing f Vertices having a superstring of f as one of its lexicalization (Senses)
  • 14. Step Three: (Candidate Disambiguation) Create a directed graph GI = (VI , EI ) of the Semantic Interpretations of the input text. VI : Contains all candidate meanings of all fragments VI := {(v, f) : v ∈ cand(f), f F}∈ EI : Connect two candidate meanings of different fragments if one is in the semantic signature of the other. Add an edge from (v, f) to (v', f') iff f ≠ f' and v' semSign∈ v
  • 15. Step Three: (Candidate Disambiguation) Once created GI (The graph representation of all the possible interpretations) then apply densest subgraph heuristics. After that, the result is a sub-graph which contains those semantic interpretations that are most coherent to each other. But this sub-graph might still containt multiple interpretations for the same fragment. So, the final step is to select the most suitable candidate meaning for each fragment f given a threshold to discard semantically unrelated candidate meanings.
  • 16. Step Three: (Candidate Disambiguation) 1: input: F, the fragments in the input text; semSign, the semantic signatures; µ, ambiguity level to be reached; cand, fragments to candidate meanings. 2: output: selected, disambiguated fragments. 3: function DISAMB(F,semSign, µ, cand) 4: VI := ;EI :=∅ ∅ 5: GI := (VI,EI) 6: for each fragment f F∈ do 7: for each candidate v cand(f)∈ do 8: VI := VI {(v, f)}∪ 9: for each ((v, f), (v', f')) VI × VI∈ do 10: if f ≠ f' and v' semSignv∈ then 11: EI := EI {((v, f), (v', f'))}∪ 12: G*I := DENSSUB(F, cand, GI, µ) 13: selected := newMap < String,Synset > 14: for each f F s.t. (v, f) V*I∈ ∃ ∈ do 15: cand*(f) := {v : (v, f) V*I }∈ 16: v* := argmaxv cand*(f)∈ score((v, f)) 17: if score((v*, f)) ≥ θ then 18: selected(f) := v* 19: return selected Function with the novel approach!!
  • 17. Step Three: (Candidate Disambiguation) Let's see an example: “The leaf is falling from the tree on my head” - Leaf has many candidate meanings. - falling also has many candidate meanings. - tree also has many candidate meanings. And, as you might have guessed... - Head also has many candidate meanings.
  • 18. Step Three: “The leaf is falling from the tree on my head” Music, Disc, Record, Rock ( Tree (Álbum), tree ) Thoughts, Feelings, Reason ( Mind, head ) Body, Anatomy, Falling (Accident) ( Head, head ) Guide, Group, Team, Boss ( Leader, head ) Book, Text, Paragraph, Novel ( Header, head ) cand(f) SemSignv Physics, Descend, Sky, High ( Fall, falling ) Music, Alicia Keys, Album ( Falling (Song), falling ) Pain, Hit, Push, Trauma ( Falling (Accident), falling ) Action, Hollywood, Cinema ( Falling (Movie), falling ) Nature, Fall, Earth, Oxygen, Leaf ( Tree, tree ) Leaf, Storage, Father, Son, Binary ( Tree (Data Structure), tree ) Node, Euler, Binary, Math, Path ( Tree (Graph Theory), tree ) Fall, Woods, Tree, Forest, Flora, Fall ( Leaf, leaf ) Text, Side, Right, Left, Book, Novel ( Leaf (Book), leaf ) Car, Motor, Vehicle, Japan, Tree ( Nissan Leaf, leaf ) Games, Visual Novel, Publisher ( Leaf (Japanese Co.), leaf ) Music, Pop, Dutch, Falling (Song) ( Leaf (Band), leaf ) (Generate a graph representation with all possible meanings)
  • 19. Step Three: (Candidate Disambiguation) Following the algorithm, create an edge between two vertex if and only if they do not belong to the same frangment and one is part of the Semantic Signature of the other.
  • 20. Step Three: “The leaf is falling from the tree on my head” Music, Disc, Record, Rock ( Tree (Álbum), tree ) Thoughts, Feelings, Reason ( Mind, head ) Body, Anatomy, Falling (Accident) ( Head, head ) Guide, Group, Team, Boss ( Leader, head ) Book, Text, Paragraph, Novel ( Header, head ) cand(f) SemSignv Physics, Descend, Sky, High ( Fall, falling ) Music, Alicia Keys, Album ( Falling (Song), falling ) Pain, Hit, Push, Trauma ( Falling (Accident), falling ) Action, Hollywood, Cinema ( Falling (Movie), falling ) Nature, Fall, Earth, Oxygen, Leaf ( Tree, tree ) Leaf, Storage, Father, Son, Binary ( Tree (Data Structure), tree ) Node, Euler, Binary, Math, Path ( Tree (Graph Theory), tree ) Fall, Woods, Tree, Forest, Flora, Fall ( Leaf, leaf ) Text, Side, Right, Left, Book, Novel ( Leaf (Book), leaf ) Car, Motor, Vehicle, Japan, Tree ( Nissan Leaf, leaf ) Games, Visual Novel, Publisher ( Leaf (Japanese Co.), leaf ) Music, Pop, Dutch, Falling (Song) ( Leaf (Band), leaf ) (Generate a graph representation with all possible meanings)
  • 21. Step Three: Apply densest sub-graph heuristics to obtain a sub-graph which contains those semantic interpretations that are most coherent to each other DENSSUB(F, cand, GI , µ) We'll come back to it later...
  • 22. Step Three: “The leaf is falling from the tree on my head” Body, Anatomy, Falling (Accident) ( Head, head ) Book, Text, Paragraph, Novel ( Header, head ) cand(f) SemSignv Physics, Descend, Sky, High ( Fall, falling ) Pain, Hit, Push, Trauma, Tree ( Falling (Accident), falling ) Nature, Root, Earth, Oxygen, Fall ( Tree, tree ) Leaf, Storage, Father, Son, Binary ( Tree (Data Structure), tree ) Fall, Woods, Tree, Forest, Flora, Fall ( Leaf, leaf ) Music, Pop, Dutch, Falling (Song) ( Leaf (Band), leaf ) (Generate a graph representation with all possible meanings) Let's assume this is the output of the blackbox
  • 23. Step Three: Then we have to select the most suitable candidate meaning for each fragment f. We use a given threshold θ to discard semantically unrealted candidates. For each fragment f, we compute the score of each candidate for that fragment and keep those candidates which score is higher than θ. score((v, f)) = w(v,f) · deg((v, f)) ∑ w(v',f) · deg((v', f)) v' cand(f)∈ w(v,f) := |{f' F :∈ v' s.t. ((v, f),(v', f')) or ((v', f'),(v, f)) E∃ ∈ I }| |F| 1− deg(v) is the overall number of incoming and outgoing edges deg(v) := deg+(v)+deg (v)−
  • 24. Step Three: In other words: We compute the score for each meaning by calculating it's normalized weighted degree. Calculate the weight for the meaning, multiply it by it's degree and divide it by the sumatory of all scores of the candidates for that fragment. The weight is calculated as the fraction of fragments the candidate meaning v connects to. In other words, count the number of fragments the vertex v connects to and divide it by the number of fragments minus one. Fragments, not vertex. In other words, if the vertex v connects to v' and v'' and they both belong to the same fragment, they count as one
  • 25. Step Three: “The leaf is falling from the tree on my head” Body, Anatomy, Falling (Accident) ( Head, head ) Book, Text, Paragraph, Novel ( Header, head ) cand(f) SemSignv Physics, Descend, Sky, High ( Fall, falling ) Pain, Hit, Push, Trauma, Tree ( Falling (Accident), falling ) Nature, Root, Earth, Oxygen, Fall ( Tree, tree ) Leaf, Storage, Father, Son, Binary ( Tree (Data Structure), tree ) Fall, Woods, Tree, Forest, Flora, Fall ( Leaf, leaf ) Music, Pop, Dutch, Falling (Song) ( Leaf (Band), leaf ) (Generate a graph representation with all possible meanings) Let's compute the weight of (Leaf, leaf) The number of fragments “(Leafl, leaf)” is linked to, divided by the number of fragments minus one: w((Leaf, leaf)) = |{Fall, Tree}| = 2 4 – 1 3
  • 26. Step Three: “The leaf is falling from the tree on my head” Body, Anatomy, Falling (Accident) ( Head, head ) Book, Text, Paragraph, Novel ( Header, head ) cand(f) SemSignv Physics, Descend, Sky, High ( Fall, falling ) Pain, Hit, Push, Trauma, Tree ( Falling (Accident), falling ) Nature, Root, Earth, Oxygen, Fall ( Tree, tree ) Leaf, Storage, Father, Son, Binary ( Tree (Data Structure), tree ) Fall, Woods, Tree, Forest, Flora, Fall ( Leaf, leaf ) Music, Pop, Dutch, Falling (Song) ( Leaf (Band), leaf ) (Generate a graph representation with all possible meanings) And the degree of (Leaf, leaf) is the number of incomming and outgoing edges: deg((Leaf, leaf)) = 3
  • 27. Step Three: For our example the computed weights and degrees are in the next table:
  • 28. Step Three: Now we can calculate the score for every candidate meaning: For each candidate multiply it's weight by it's degree (w*d) Then again for each candidate, divide w*d by the sum of all w*d for that fragment. For example (Leaf, leaf) weight((Leaf, leaf)) = 2/3 degree((Leaf, leaf)) = 4 w*d = 8/3 Sum of all others w*d for that specific fragment (leaf) = 8/3 score((Leaf, leaf)) = 1,000 8 3 = 1 8 3 8 3
  • 29. Step Three: For our example the computed scores are in the next table:
  • 30. Step Three: Finally, we link each fragment with the highest ranking candidate meaning v* if it's score is higher than the fixed threshold. Four our example, for a threshold of 0,7 We keep: Leaf (plant) Fall Tree Head (as body part) Which is correct.
  • 31. Densest Sub-Graph DENSSUB(F, cand, GI , µ) Back to the blackbox !!
  • 32. Densest Sub-Graph This is an approach to drastically reduce the level of ambiguity of the initial semantic interpretation graph. It is based on the assumption that the most suitable meanings of each text fragment will belong to the densest area of the graph. Identify the densest sub-graph of size at least k is NP-Hard. So Babelfy uses a heuristic for k-partite graphs inspired by a 2-approximation greedy algorithm for arbitrary graphs. Babelfy strategy is based on the iterative removal of low-coherence vertices.
  • 33. Densest Sub-Graph First, start with the initial semantic interpretation graph GI (0) at step 0. For each step, identify the most ambiguous fragment fmax (The one with the maxumum number of candidate meanings). Then, discard the weakest interpretation of the current fragment fmax. This is done by determining the lexical and semantic coherence of each candidate meaning using the score formula showed before. The vertex with the minimum score is removed from the graph.
  • 34. Densest Sub-Graph Then, in the next step, repeat the low-coherence removal step. And stop when the number of remaining candidates for each fragment is below a threshold. During each iteration, compute the average degree of the current step graph, and keep the densest subgraph of the initial semantic interpretation graph, which is the one that maximizes the average degree.
  • 35. Densest Sub-Graph 1: input: F, the set of all fragments in the input text; cand, from fragments to candidate meanings; G(0) I , the full semantic interpretation graph; µ, ambiguity level to be reached. 2: output: G*I, a dense subgraph. 3: function DENSSUB(F, cand, G(0) I ,µ) 4: t := 0 5: G*I := G(0) I 6: while true do 7: fmax := argmaxf F∈ |{v : (v, f) V∃ ∈ (t) I}| 8: if |{v : (v, fmax) V∃ ∈ (t) I }| µ≤ then 9: break; 10: vmin:= argmin score((v, fmax)) v cand(fmax)∈ 11: V(t+1) I := V(t) I {(vmin, fmax)} 12: E(t+1) I := E(t) I V∩ (t+1) I × V(t+1) I 13: G(t+1) I := (V(t+1) I, E(t+1) I) 14: if avgdeg(G(t+1) I) > avgdeg(G*I) then 15: G*I := G(t+1) I 16: t := t+1 17: return G*I
  • 36. Links Reference paper about Babelfy: A. Moro, A. Raganato, R. Navigli. Entity Linking meets Word Sense Disambiguation: a Unified Approach. Transactions of the Association for Computational Linguistics (TACL), 2, pp. 231-244, 2014. http://wwwusers.di.uniroma1.it/~navigli/pubs/TACL_2014_Babelfy.pdf Babelfy website http://babelfy.org/ Babelnet website http://babelnet.org/ Grupo LaBDA http://labda.inf.uc3m.es/