SlideShare uma empresa Scribd logo
1 de 54
Baixar para ler offline
Gaza War

Britches
World War II

Berlin Wall

Woodstock
1950

1900

1910

1970

1920

9/11

Gulf War

1930

1980

1940

1950

1990

1960

BET Hiphop Awards
2000

1970

1980

2010

1990

2000

David Graus, Maria-Hendrike Peetz,
Daan Odijk, Maarten de Rijke, Ork de Rooij

2010
Entity Linking for a personalized timeline of historic events

•

Motivation

•

Method
•
•

Part II: Generate User Profile

•

Part III: Matching Events to User Profile

•
•

Part I: Fetch Candidate Historic Events

Part IV: Scoring & Ranking Events
Future Work
•

[…] To design and build innovative and robust prototypes and
demos for tools that analyse and/or integrate open web data for
educational purposes.
History education
Personalized historic timeline

Gaza War

Britches
World War II

Berlin Wall

Woodstock
1950

1900

1910

1970

1920

9/11

Gulf War

1930

1980

1940

1950

1990

1960

BET Hiphop Awards
2000

1970

1980

2010

1990

2000

2010
Part I: Candidate Historic Events
Part I: Candidate Historic Events

select	
  ?concept	
  	
  
where	
  {	
  	
  
	
   ?concept	
  rdf:type	
  dbpedia-­‐owl:Event	
  	
  
	
   }
concept	
  	
  
	
  
ept	
  rdf:type	
  dbpedia-­‐owl:Event	
  	
  
concept	
  	
  
	
  
ept	
  rdf:type	
  dbpedia-­‐owl:Event	
  	
  
Part II: User Profile

MY FACEBOOK
PROFILE

BIO

POST
POST

LIKES

POST
Extract Information from Facebook profile

MY FACEBOOK
PROFILE

BIO

POST
POST

LIKES

POST
Access Facebook profile

MY FACEBOOK
PROFILE

BIO

POST
POST

LIKES

POST

{	
  
"id":	
  "1183880085",	
  
"likes":	
  {	
  
	
  	
  	
  	
  "data":	
  [	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Musician/band",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2013-­‐10-­‐27T11:37:51+0
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "NAS",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "113591595350795"	
  
	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Company",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2013-­‐10-­‐17T07:45:36+0
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "Infinibase",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "573216229380347"	
  
	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Magazine",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2013-­‐10-­‐04T13:55:10+0
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "New	
  Scientist	
  NL",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "369158433181445"	
  
	
  	
  	
  	
  	
  	
  },	
  
Extract text
attributes

•
•
•
•
•
•

{	
  
"id":	
  "1183880085",	
  
"likes":	
  {	
  
	
  	
  	
  	
  "data":	
  [	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Musician/band",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2013-­‐10-­‐27T11:37:51+0000",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "NAS",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "113591595350795"	
  
	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Company",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2013-­‐10-­‐17T07:45:36+0000",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "Infinibase",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "573216229380347"	
  
	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Magazine",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2013-­‐10-­‐04T13:55:10+0000",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "New	
  Scientist	
  NL",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "369158433181445"	
  
	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  "category":	
  "Tv	
  show",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "created_time":	
  "2010-­‐05-­‐09T01:06:27+0000",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "name":	
  "The	
  Wire",	
  
	
  	
  	
  	
  	
  	
  	
  	
  "id":	
  "5991693871"	
  
	
  	
  	
  	
  	
  	
  }	
  ]	
  
}

•
•
•
•
•
•
•
•
•
•
•
•
•

•

Story	
  
Omroep	
  Maxim	
  
Gamer01	
  
Breaking	
  Bad	
  
AT5	
  
Mad	
  Men	
  
The	
  Wire	
  
Monty	
  Python's	
  
Flying	
  Circus	
  
Flight	
  of	
  the	
  
Conchords	
  
Donnie	
  Darko	
  
Flevopark	
  Film	
  
Festival	
  
Do	
  The	
  Right	
  
Thing	
  
A	
  Clockwork	
  
Orange	
  
Wild	
  Style	
  
Princess	
  
Mononoke	
  
The	
  Fountain	
  
Pi	
  
Northfork	
  
La	
  Haine	
  
Zen	
  and	
  the	
  Art	
  
of	
  Motorcycle	
  
Maintenance	
  
Moon	
  Palace	
  

•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

Fountainhead	
  
The	
  Wind-­‐Up	
  
Bird	
  Chronicle	
  
Wu-­‐Tang	
  
J.Cole	
  
NAS	
  
Pusha	
  T	
  
ASAP	
  Rocky	
  
Ab-­‐Soul	
  
Chance	
  The	
  
Rapper	
  
Cannibal	
  Ox	
  
Bonobo	
  
Aesop	
  Rock	
  
Boards	
  Of	
  
Canada	
  
Jurassic	
  5	
  
GREMS	
  
Quasimoto	
  
Strange	
  Journey	
  
Volume	
  Three	
  
Drop	
  Velvet	
  
MODESELEKTOR	
  
IAM	
  
Derek	
  
The	
  Onion	
  
Imgur	
  
De	
  Speld	
  
Wu-­‐Tang	
  
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

ASAP	
  Rocky	
  
Ab-­‐Soul	
  
Chance	
  The	
  Rapper	
  
Cannibal	
  Ox	
  
Bonobo	
  
Aesop	
  Rock	
  
Boards	
  Of	
  Canada	
  
Jurassic	
  5	
  
GREMS	
  
Quasimoto	
  
Strange	
  Journey	
  Volume	
  Three	
  
Drop	
  Velvet	
  
MODESELEKTOR	
  
IAM	
  
Derek	
  
The	
  Onion	
  
Imgur	
  
De	
  Speld	
  
Wu-­‐Tang	
  
J.Cole	
  
I	
  Am	
  Fucking	
  Ambivalent	
  About	
  
Science	
  
NAS	
  
Pusha	
  T	
  
ASAP	
  Rocky	
  
Chrietitie	
  
Infinibase	
  
Marktplaatspoxc3xabzie	
  
Jeannette	
  Span	
  :	
  Spelen	
  
Entity Linking
•

Given a Knowledge Base

•

Link mentions of entities (or concepts) to their referent entities
Entity Linking
•

From Wikipedia:
•

Extract anchor texts (words used to link to Wikipedia pages)
!
!
!
!
!
!

•

For each n-gram n ↔ Wikipedia page W estimate:
•

Probability of using n-gram n to refer to Wikipedia page W
Entity Linking Example
Link Probability
“Nas” occurs 2475x in Wikipedia

!

is anchor

1.723x

is no anchor

752x
Entity Linking Example
Link Probability
“Nas” occurs 2475x in Wikipedia

!

is anchor

1723/2475

=

69,6%

is no anchor

752/2475

=

30.4%
Entity Linking Example
Commonness
•

Nas is used to refer to:
•

http://en.wikipedia.org/wiki/Nas

•

http://en.wikipedia.org/wiki/Naas

•

http://en.wikipedia.org/wiki/Nås

•

http://en.wikipedia.org/wiki/Nas (Ikaria)

•

http://en.wikipedia.org/wiki/Untitled Nas album
Entity Linking Example
Commonness
•

Nas is used to refer to:
•

http://en.wikipedia.org/wiki/Nas

14x

•

http://en.wikipedia.org/wiki/Naas

4x

•

http://en.wikipedia.org/wiki/Nås

3x

•

http://en.wikipedia.org/wiki/Nas (Ikaria)

2x

•

http://en.wikipedia.org/wiki/Untitled Nas album

2x
Entity Linking Example
Commonness
•

Nas is used to refer to:
•

http://en.wikipedia.org/wiki/Nas

14/25 =

56%

•

http://en.wikipedia.org/wiki/Naas

4/25 =

1.6%

•

http://en.wikipedia.org/wiki/Nås

3/25 =

1.2%

•

http://en.wikipedia.org/wiki/Nas (Ikaria)

2/25 =

0.8%

•

http://en.wikipedia.org/wiki/Untitled Nas album

2/25 =

0.8%
{	
  
	
  	
  	
  	
  "text":	
  "Nas",	
  
	
  	
  	
  	
  "links":	
  [	
  
	
  	
  	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "senseProbability":	
  0.726027397260274,	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "title":	
  "Nas",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "url":	
  "http://en.wikipedia.org/wiki/Nas"	
  
	
  	
  	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "senseProbability":	
  0.125,	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "title":	
  "Naas",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "url":	
  "http://en.wikipedia.org/wiki/Naas"	
  
	
  	
  	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "senseProbability":	
  0.1111111111111111,	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "title":	
  "Nås",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "url":	
  "http://en.wikipedia.org/wiki/N%C3%A5s"	
  
	
  	
  	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "senseProbability":	
  0.0006523157208088715,	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "title":	
  "Nas	
  (Ikaria)",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "url":	
  "http://en.wikipedia.org/wiki/Nas%20%28Ikaria%29"	
  
	
  	
  	
  	
  	
  	
  	
  	
  },	
  
	
  	
  	
  	
  	
  	
  	
  	
  {	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "senseProbability":	
  0.0006523157208088715,	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "title":	
  "Untitled	
  Nas	
  album",	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  "url":	
  "http://en.wikipedia.org/wiki/Untitled%20Nas%20album"	
  
	
  	
  	
  	
  	
  	
  	
  	
  }	
  
}
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•

AT5	
  
Mad	
  Men	
  
The	
  Wire	
  
Monty	
  Python's	
  Flying	
  
Circus	
  
Flight	
  of	
  the	
  Conchords	
  
Donnie	
  Darko	
  
Flevopark	
  Film	
  Festival	
  
Do	
  The	
  Right	
  Thing	
  
A	
  Clockwork	
  Orange	
  
Wild	
  Style	
  
Princess	
  Mononoke	
  
The	
  Fountain	
  
Pi	
  
Northfork	
  
La	
  Haine	
  
Zen	
  and	
  the	
  Art	
  of	
  
Motorcycle	
  Maintenance	
  
Moon	
  Palace	
  
The	
  Fountainhead	
  
The	
  Wind-­‐Up	
  Bird	
  
Chronicle	
  
Wu-­‐Tang	
  
J.Cole	
  
Match Events to Profile Entities
Match Events to Profile Entities
Map Events to Wikipedia Entities
Match Events to Profile Entities
Matching metric #1: link overlap
Matching metric #1: link overlap
U.S.

Hiphop

NAS

Kanye!
West

Jay-Z
Damian!
Marley
Global!
War

U.S.
U.S.
Allies
Hiphop

Axis

NAS

Kanye!
West

Jay-Z
Damian!
Marley

World!
War II
Global!
War

U.S.
U.S.
Allies
Hiphop

Axis

NAS

Kanye!
West

Jay-Z
Damian!
Marley

1
World!
War II
Global!
War

1

U.S.

World!
War II

U.S.
Allies
Hiphop

Axis

NAS

Kanye!
West

Jay-Z
Damian!
Marley

Jay-Z

Hiphop

Kanye!
West

Link
#4

51st!
Grammy!
Awards
Global!
War

1

U.S.

World!
War II

U.S.
Allies
Hiphop

Axis

NAS

Kanye!
West

Jay-Z
Damian!
Marley

Jay-Z

Hiphop

Kanye!
West

Link
#4

3
51st!
Grammy!
Awards
Matching metric #2: direct link

U.S.

Hiphop

NAS

Kanye!
West

Jay-Z
Damian!
Marley

Jay-Z

Hiphop

Kanye!
West

51st!
Grammy!
Awards
Matching metric #3: textual similarity
NAS

51st!
Grammy!
Awards
Matching metric #3: textual similarity
NAS

51st!
Grammy!
Awards
Matching metric #3: textual similarity
NAS

51st!
Grammy!
Awards
51st!
Grammy!
Awards

World!
War II

Score: 0.74

Score: 0.35
Combine scores & rank events
	
  	
  	
  	
  "5043324":	
  {	
  
	
  	
  	
  	
  	
  	
  "event_title":	
  "Iraq	
  War",	
  
	
  	
  	
  	
  	
  	
  "related_entity_title":	
  "The	
  Wire",	
  
	
  	
  	
  	
  	
  	
  "score":	
  1.0,	
  
	
  	
  	
  	
  	
  	
  "event_date":	
  "2003-­‐03-­‐20"	
  
	
  	
  	
  	
  },	
  
	
  	
  	
  	
  "1376628":	
  {	
  
	
  	
  	
  	
  	
  	
  "event_title":	
  "Blankets	
  (comics)",	
  
	
  	
  	
  	
  	
  	
  "related_entity_title":	
  "Princess	
  Mononoke",	
  
	
  	
  	
  	
  	
  	
  "score":	
  0.11465851113504691,	
  
	
  	
  	
  	
  	
  	
  "event_date":	
  "2003-­‐07-­‐23"	
  
	
  	
  	
  	
  },	
  
	
  	
  	
  	
  "15694206":	
  {	
  
	
  	
  	
  	
  	
  	
  "event_title":	
  "2006	
  LG	
  Hockey	
  Games",	
  
	
  	
  	
  	
  	
  	
  "related_entity_title":	
  "Reimersholme",	
  
	
  	
  	
  	
  	
  	
  "score":	
  0.3467068139664613,	
  
	
  	
  	
  	
  	
  	
  "event_date":	
  "2006-­‐04-­‐29"	
  
	
  	
  	
  	
  },	
  
	
  	
  	
  	
  "4861876":	
  {	
  
	
  	
  	
  	
  	
  	
  "event_title":	
  "2005	
  UEFA	
  Champions	
  League	
  Final",	
  
	
  	
  	
  	
  	
  	
  "related_entity_title":	
  "Istanbul",	
  
	
  	
  	
  	
  	
  	
  "score":	
  1.0,	
  
	
  	
  	
  	
  	
  	
  "event_date":	
  "2005-­‐05-­‐25"	
  
	
  	
  	
  	
  },	
  
	
  	
  	
  	
  "31966809":	
  {	
  
	
  	
  	
  	
  	
  	
  "event_title":	
  "63rd	
  Primetime	
  Emmy	
  Awards",	
  
	
  	
  	
  	
  	
  	
  "related_entity_title":	
  "Mad	
  Men",	
  
	
  	
  	
  	
  	
  	
  "score":	
  0.04039278737569369,	
  
	
  	
  	
  	
  	
  	
  "event_date":	
  "2011-­‐09-­‐18"	
  
	
  	
  	
  	
  },
Future Work
•

Log interactions

•

Interpret clicks as (implicit) feedback:
•

Click on Event: user is interested

•

No click on Event: user is not

•

Learn scoring & ranking functions
Thank you! Questions?
Try yourHistory:
See our poster:

http://apps.facebook.com/yourHistory

#98

!
!
!
!






David Graus


d.p.graus@uva.nl
@dvdgrs

Mais conteúdo relacionado

Semelhante a yourHistory - entity linking for a personalized timeline of historic events

Civil War Data 150 at DLF Fall Forum 2011
Civil War Data 150 at DLF Fall Forum 2011Civil War Data 150 at DLF Fall Forum 2011
Civil War Data 150 at DLF Fall Forum 2011
Jon Voss
 
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...
MODUL Technology GmbH
 
Remixable Media Week 2 Seminar
Remixable Media Week 2 SeminarRemixable Media Week 2 Seminar
Remixable Media Week 2 Seminar
Michela Ledwidge
 

Semelhante a yourHistory - entity linking for a personalized timeline of historic events (20)

Linked Data Progress - IFLA 2013
Linked Data Progress - IFLA 2013Linked Data Progress - IFLA 2013
Linked Data Progress - IFLA 2013
 
The Web of Data is Our Oyster
The Web of Data is Our OysterThe Web of Data is Our Oyster
The Web of Data is Our Oyster
 
ServerSide Javascript on Freebase - SF JavaScript meetup #9
ServerSide Javascript on Freebase - SF JavaScript meetup #9ServerSide Javascript on Freebase - SF JavaScript meetup #9
ServerSide Javascript on Freebase - SF JavaScript meetup #9
 
Hacking The Newsroom
Hacking The NewsroomHacking The Newsroom
Hacking The Newsroom
 
Civil War Data 150 at DLF Fall Forum 2011
Civil War Data 150 at DLF Fall Forum 2011Civil War Data 150 at DLF Fall Forum 2011
Civil War Data 150 at DLF Fall Forum 2011
 
The Commons on Filckr: a primer
The Commons on Filckr: a primerThe Commons on Filckr: a primer
The Commons on Filckr: a primer
 
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...
The NoTube BeanCounter: Aggregating User Data for Television Programme Recomm...
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
 
Data Science - The Most Profitable Movie Characteristic
Data Science -  The Most Profitable Movie CharacteristicData Science -  The Most Profitable Movie Characteristic
Data Science - The Most Profitable Movie Characteristic
 
The ARK Identifier Scheme at Ten Years Old
The ARK Identifier Scheme at Ten Years OldThe ARK Identifier Scheme at Ten Years Old
The ARK Identifier Scheme at Ten Years Old
 
Looking at Content Recommendations through a Search Lens - Extended Version
Looking at Content Recommendations through a Search Lens - Extended VersionLooking at Content Recommendations through a Search Lens - Extended Version
Looking at Content Recommendations through a Search Lens - Extended Version
 
Remixable Media Week 2 Seminar
Remixable Media Week 2 SeminarRemixable Media Week 2 Seminar
Remixable Media Week 2 Seminar
 
Maps and Math CwiC
Maps and Math CwiCMaps and Math CwiC
Maps and Math CwiC
 
Event stream processing using Kafka streams
Event stream processing using Kafka streamsEvent stream processing using Kafka streams
Event stream processing using Kafka streams
 
GRASS GIS, Star Trek and old Video Tape – a reference case on audiovisual pre...
GRASS GIS, Star Trek and old Video Tape – a reference case on audiovisual pre...GRASS GIS, Star Trek and old Video Tape – a reference case on audiovisual pre...
GRASS GIS, Star Trek and old Video Tape – a reference case on audiovisual pre...
 
HackMIT Lightning Talk
HackMIT Lightning TalkHackMIT Lightning Talk
HackMIT Lightning Talk
 
Intellectualproperty
IntellectualpropertyIntellectualproperty
Intellectualproperty
 
Freebase - Semantic Technologies 2010 Code Camp
Freebase - Semantic Technologies 2010 Code CampFreebase - Semantic Technologies 2010 Code Camp
Freebase - Semantic Technologies 2010 Code Camp
 
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
 
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
 

Mais de David Graus

Dynamic Collective Entity Representations for Entity Ranking
Dynamic Collective Entity Representations for Entity RankingDynamic Collective Entity Representations for Entity Ranking
Dynamic Collective Entity Representations for Entity Ranking
David Graus
 
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27thDavid Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th
David Graus
 
Semantic Search in E-Discovery
Semantic Search in E-DiscoverySemantic Search in E-Discovery
Semantic Search in E-Discovery
David Graus
 

Mais de David Graus (18)

Pragmatic ethical and fair AI for data scientists
Pragmatic ethical and fair AI for data scientistsPragmatic ethical and fair AI for data scientists
Pragmatic ethical and fair AI for data scientists
 
Bias in Recommendations
Bias in RecommendationsBias in Recommendations
Bias in Recommendations
 
RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity.
RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity.RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity.
RecSys in the Media Industry: Relevance, Recency, Popularity, and Diversity.
 
CAT/AI: Computer Assisted Translation 
Assessment for Impact
CAT/AI: Computer Assisted Translation 
Assessment for ImpactCAT/AI: Computer Assisted Translation 
Assessment for Impact
CAT/AI: Computer Assisted Translation 
Assessment for Impact
 
Opening the Black Box of User Profiles in Content-based Recommender Systems
Opening the Black Box of User Profiles in Content-based Recommender SystemsOpening the Black Box of User Profiles in Content-based Recommender Systems
Opening the Black Box of User Profiles in Content-based Recommender Systems
 
Zoeken, vinden, en aanbevelen: personalisatie vs. privacy
Zoeken, vinden, en aanbevelen: personalisatie vs. privacyZoeken, vinden, en aanbevelen: personalisatie vs. privacy
Zoeken, vinden, en aanbevelen: personalisatie vs. privacy
 
Layman's Talk: Entities of Interest --- Discovery in Digital Traces
Layman's Talk: Entities of Interest --- Discovery in Digital TracesLayman's Talk: Entities of Interest --- Discovery in Digital Traces
Layman's Talk: Entities of Interest --- Discovery in Digital Traces
 
Financial News Mining @ PyData Amsterdam
Financial News Mining @ PyData AmsterdamFinancial News Mining @ PyData Amsterdam
Financial News Mining @ PyData Amsterdam
 
De Macht van Data --- Hoe algoritmen ons leven vormgeven
De Macht van Data --- Hoe algoritmen ons leven vormgevenDe Macht van Data --- Hoe algoritmen ons leven vormgeven
De Macht van Data --- Hoe algoritmen ons leven vormgeven
 
Financial News Mining @ FD Mediagroep/Company.info
Financial News Mining @ FD Mediagroep/Company.infoFinancial News Mining @ FD Mediagroep/Company.info
Financial News Mining @ FD Mediagroep/Company.info
 
Analyzing and Predicting Task Reminders
Analyzing and Predicting Task RemindersAnalyzing and Predicting Task Reminders
Analyzing and Predicting Task Reminders
 
Dynamic Collective Entity Representations for Entity Ranking
Dynamic Collective Entity Representations for Entity RankingDynamic Collective Entity Representations for Entity Ranking
Dynamic Collective Entity Representations for Entity Ranking
 
Dynamic Collective Entity Representations for Entity Ranking
Dynamic Collective Entity Representations for Entity RankingDynamic Collective Entity Representations for Entity Ranking
Dynamic Collective Entity Representations for Entity Ranking
 
Understanding Email Traffic
Understanding Email TrafficUnderstanding Email Traffic
Understanding Email Traffic
 
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27thDavid Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th
David Graus - Entity Linking (at SEA), Search Engines Amsterdam, Fri June 27th
 
Semantic Search in E-Discovery
Semantic Search in E-DiscoverySemantic Search in E-Discovery
Semantic Search in E-Discovery
 
Semantic Annotation of the Cyttron Database
Semantic Annotation of the Cyttron DatabaseSemantic Annotation of the Cyttron Database
Semantic Annotation of the Cyttron Database
 
Semantic annotation, clustering and visualization
Semantic annotation, clustering and visualizationSemantic annotation, clustering and visualization
Semantic annotation, clustering and visualization
 

Último

Capstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdfCapstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdf
eliklein8
 
Capstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdfCapstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdf
eliklein8
 

Último (20)

College & House wife Call Girls in Paharganj 9634446618 -Best Escort call gi...
College & House wife  Call Girls in Paharganj 9634446618 -Best Escort call gi...College & House wife  Call Girls in Paharganj 9634446618 -Best Escort call gi...
College & House wife Call Girls in Paharganj 9634446618 -Best Escort call gi...
 
Production diary Film the city powerpoint
Production diary Film the city powerpointProduction diary Film the city powerpoint
Production diary Film the city powerpoint
 
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
Hire↠Young Call Girls in Hari Nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esco...
 
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Palam Vihar Gurgaon >༒8448380779 Escort Service
 
Film show pre-production powerpoint for site
Film show pre-production powerpoint for siteFilm show pre-production powerpoint for site
Film show pre-production powerpoint for site
 
SEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdf
SEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdfSEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdf
SEO Expert in USA - 5 Ways to Improve Your Local Ranking - Macaw Digital.pdf
 
Capstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdfCapstone slidedeck for my capstone final edition.pdf
Capstone slidedeck for my capstone final edition.pdf
 
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 8617697112 Top Class Pondicherry Escort Servi...
 
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic HappensIgnite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
 
Call Girls In South Ex. Delhi O9654467111 Women Seeking Men
Call Girls In South Ex. Delhi O9654467111 Women Seeking MenCall Girls In South Ex. Delhi O9654467111 Women Seeking Men
Call Girls In South Ex. Delhi O9654467111 Women Seeking Men
 
Ready to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with SociocosmosReady to get noticed? Partner with Sociocosmos
Ready to get noticed? Partner with Sociocosmos
 
BDSM⚡Call Girls in Sector 76 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 76 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 76 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 76 Noida Escorts >༒8448380779 Escort Service
 
VIP Call Girls Morena 9332606886 Free Home Delivery 5500 Only
VIP Call Girls Morena 9332606886 Free Home Delivery 5500 OnlyVIP Call Girls Morena 9332606886 Free Home Delivery 5500 Only
VIP Call Girls Morena 9332606886 Free Home Delivery 5500 Only
 
Social media marketing/Seo expert and digital marketing
Social media marketing/Seo expert and digital marketingSocial media marketing/Seo expert and digital marketing
Social media marketing/Seo expert and digital marketing
 
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFECASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
CASH PAYMENT ON GIRL HAND TO HAND HOUSEWIFE
 
Capstone slide deck on the TikTok revolution
Capstone slide deck on the TikTok revolutionCapstone slide deck on the TikTok revolution
Capstone slide deck on the TikTok revolution
 
Capstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdfCapstone slidedeck for my capstone project part 2.pdf
Capstone slidedeck for my capstone project part 2.pdf
 
Interpreting the brief for the media IDY
Interpreting the brief for the media IDYInterpreting the brief for the media IDY
Interpreting the brief for the media IDY
 
Film the city investagation powerpoint :)
Film the city investagation powerpoint :)Film the city investagation powerpoint :)
Film the city investagation powerpoint :)
 
Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"
Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"
Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"
 

yourHistory - entity linking for a personalized timeline of historic events

  • 1. Gaza War Britches World War II Berlin Wall Woodstock 1950 1900 1910 1970 1920 9/11 Gulf War 1930 1980 1940 1950 1990 1960 BET Hiphop Awards 2000 1970 1980 2010 1990 2000 David Graus, Maria-Hendrike Peetz, Daan Odijk, Maarten de Rijke, Ork de Rooij 2010
  • 2. Entity Linking for a personalized timeline of historic events • Motivation • Method • • Part II: Generate User Profile • Part III: Matching Events to User Profile • • Part I: Fetch Candidate Historic Events Part IV: Scoring & Ranking Events Future Work
  • 3. • […] To design and build innovative and robust prototypes and demos for tools that analyse and/or integrate open web data for educational purposes.
  • 5.
  • 6. Personalized historic timeline Gaza War Britches World War II Berlin Wall Woodstock 1950 1900 1910 1970 1920 9/11 Gulf War 1930 1980 1940 1950 1990 1960 BET Hiphop Awards 2000 1970 1980 2010 1990 2000 2010
  • 7. Part I: Candidate Historic Events
  • 8. Part I: Candidate Historic Events select  ?concept     where  {       ?concept  rdf:type  dbpedia-­‐owl:Event       }
  • 9. concept       ept  rdf:type  dbpedia-­‐owl:Event    
  • 10. concept       ept  rdf:type  dbpedia-­‐owl:Event    
  • 11.
  • 12.
  • 13.
  • 14. Part II: User Profile MY FACEBOOK PROFILE BIO POST POST LIKES POST
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Extract Information from Facebook profile MY FACEBOOK PROFILE BIO POST POST LIKES POST
  • 21. Access Facebook profile MY FACEBOOK PROFILE BIO POST POST LIKES POST {   "id":  "1183880085",   "likes":  {          "data":  [              {                  "category":  "Musician/band",                  "created_time":  "2013-­‐10-­‐27T11:37:51+0                "name":  "NAS",                  "id":  "113591595350795"              },              {                  "category":  "Company",                  "created_time":  "2013-­‐10-­‐17T07:45:36+0                "name":  "Infinibase",                  "id":  "573216229380347"              },              {                  "category":  "Magazine",                  "created_time":  "2013-­‐10-­‐04T13:55:10+0                "name":  "New  Scientist  NL",                  "id":  "369158433181445"              },  
  • 22. Extract text attributes • • • • • • {   "id":  "1183880085",   "likes":  {          "data":  [              {                  "category":  "Musician/band",                  "created_time":  "2013-­‐10-­‐27T11:37:51+0000",                  "name":  "NAS",                  "id":  "113591595350795"              },              {                  "category":  "Company",                  "created_time":  "2013-­‐10-­‐17T07:45:36+0000",                  "name":  "Infinibase",                  "id":  "573216229380347"              },              {                  "category":  "Magazine",                  "created_time":  "2013-­‐10-­‐04T13:55:10+0000",                  "name":  "New  Scientist  NL",                  "id":  "369158433181445"              },              {                  "category":  "Tv  show",                  "created_time":  "2010-­‐05-­‐09T01:06:27+0000",                  "name":  "The  Wire",                  "id":  "5991693871"              }  ]   } • • • • • • • • • • • • • • Story   Omroep  Maxim   Gamer01   Breaking  Bad   AT5   Mad  Men   The  Wire   Monty  Python's   Flying  Circus   Flight  of  the   Conchords   Donnie  Darko   Flevopark  Film   Festival   Do  The  Right   Thing   A  Clockwork   Orange   Wild  Style   Princess   Mononoke   The  Fountain   Pi   Northfork   La  Haine   Zen  and  the  Art   of  Motorcycle   Maintenance   Moon  Palace   • • • • • • • • • • • • • • • • • • • • • • • • Fountainhead   The  Wind-­‐Up   Bird  Chronicle   Wu-­‐Tang   J.Cole   NAS   Pusha  T   ASAP  Rocky   Ab-­‐Soul   Chance  The   Rapper   Cannibal  Ox   Bonobo   Aesop  Rock   Boards  Of   Canada   Jurassic  5   GREMS   Quasimoto   Strange  Journey   Volume  Three   Drop  Velvet   MODESELEKTOR   IAM   Derek   The  Onion   Imgur   De  Speld   Wu-­‐Tang  
  • 23. • • • • • • • • • • • • • • • • • • • • • • • • • • • • ASAP  Rocky   Ab-­‐Soul   Chance  The  Rapper   Cannibal  Ox   Bonobo   Aesop  Rock   Boards  Of  Canada   Jurassic  5   GREMS   Quasimoto   Strange  Journey  Volume  Three   Drop  Velvet   MODESELEKTOR   IAM   Derek   The  Onion   Imgur   De  Speld   Wu-­‐Tang   J.Cole   I  Am  Fucking  Ambivalent  About   Science   NAS   Pusha  T   ASAP  Rocky   Chrietitie   Infinibase   Marktplaatspoxc3xabzie   Jeannette  Span  :  Spelen  
  • 24. Entity Linking • Given a Knowledge Base • Link mentions of entities (or concepts) to their referent entities
  • 25. Entity Linking • From Wikipedia: • Extract anchor texts (words used to link to Wikipedia pages) ! ! ! ! ! ! • For each n-gram n ↔ Wikipedia page W estimate: • Probability of using n-gram n to refer to Wikipedia page W
  • 26. Entity Linking Example Link Probability “Nas” occurs 2475x in Wikipedia ! is anchor 1.723x is no anchor 752x
  • 27. Entity Linking Example Link Probability “Nas” occurs 2475x in Wikipedia ! is anchor 1723/2475 = 69,6% is no anchor 752/2475 = 30.4%
  • 28. Entity Linking Example Commonness • Nas is used to refer to: • http://en.wikipedia.org/wiki/Nas • http://en.wikipedia.org/wiki/Naas • http://en.wikipedia.org/wiki/Nås • http://en.wikipedia.org/wiki/Nas (Ikaria) • http://en.wikipedia.org/wiki/Untitled Nas album
  • 29. Entity Linking Example Commonness • Nas is used to refer to: • http://en.wikipedia.org/wiki/Nas 14x • http://en.wikipedia.org/wiki/Naas 4x • http://en.wikipedia.org/wiki/Nås 3x • http://en.wikipedia.org/wiki/Nas (Ikaria) 2x • http://en.wikipedia.org/wiki/Untitled Nas album 2x
  • 30. Entity Linking Example Commonness • Nas is used to refer to: • http://en.wikipedia.org/wiki/Nas 14/25 = 56% • http://en.wikipedia.org/wiki/Naas 4/25 = 1.6% • http://en.wikipedia.org/wiki/Nås 3/25 = 1.2% • http://en.wikipedia.org/wiki/Nas (Ikaria) 2/25 = 0.8% • http://en.wikipedia.org/wiki/Untitled Nas album 2/25 = 0.8%
  • 31. {          "text":  "Nas",          "links":  [                  {                          "senseProbability":  0.726027397260274,                          "title":  "Nas",                          "url":  "http://en.wikipedia.org/wiki/Nas"                  },                  {                          "senseProbability":  0.125,                          "title":  "Naas",                          "url":  "http://en.wikipedia.org/wiki/Naas"                  },                  {                          "senseProbability":  0.1111111111111111,                          "title":  "Nås",                          "url":  "http://en.wikipedia.org/wiki/N%C3%A5s"                  },                  {                          "senseProbability":  0.0006523157208088715,                          "title":  "Nas  (Ikaria)",                          "url":  "http://en.wikipedia.org/wiki/Nas%20%28Ikaria%29"                  },                  {                          "senseProbability":  0.0006523157208088715,                          "title":  "Untitled  Nas  album",                          "url":  "http://en.wikipedia.org/wiki/Untitled%20Nas%20album"                  }   }
  • 32. • • • • • • • • • • • • • • • • • • • • • AT5   Mad  Men   The  Wire   Monty  Python's  Flying   Circus   Flight  of  the  Conchords   Donnie  Darko   Flevopark  Film  Festival   Do  The  Right  Thing   A  Clockwork  Orange   Wild  Style   Princess  Mononoke   The  Fountain   Pi   Northfork   La  Haine   Zen  and  the  Art  of   Motorcycle  Maintenance   Moon  Palace   The  Fountainhead   The  Wind-­‐Up  Bird   Chronicle   Wu-­‐Tang   J.Cole  
  • 33.
  • 34. Match Events to Profile Entities
  • 35. Match Events to Profile Entities
  • 36. Map Events to Wikipedia Entities
  • 37. Match Events to Profile Entities
  • 38. Matching metric #1: link overlap
  • 39. Matching metric #1: link overlap
  • 45. Matching metric #2: direct link U.S. Hiphop NAS Kanye! West Jay-Z Damian! Marley Jay-Z Hiphop Kanye! West 51st! Grammy! Awards
  • 46. Matching metric #3: textual similarity NAS 51st! Grammy! Awards
  • 47. Matching metric #3: textual similarity NAS 51st! Grammy! Awards
  • 48. Matching metric #3: textual similarity NAS 51st! Grammy! Awards
  • 50. Combine scores & rank events        "5043324":  {              "event_title":  "Iraq  War",              "related_entity_title":  "The  Wire",              "score":  1.0,              "event_date":  "2003-­‐03-­‐20"          },          "1376628":  {              "event_title":  "Blankets  (comics)",              "related_entity_title":  "Princess  Mononoke",              "score":  0.11465851113504691,              "event_date":  "2003-­‐07-­‐23"          },          "15694206":  {              "event_title":  "2006  LG  Hockey  Games",              "related_entity_title":  "Reimersholme",              "score":  0.3467068139664613,              "event_date":  "2006-­‐04-­‐29"          },          "4861876":  {              "event_title":  "2005  UEFA  Champions  League  Final",              "related_entity_title":  "Istanbul",              "score":  1.0,              "event_date":  "2005-­‐05-­‐25"          },          "31966809":  {              "event_title":  "63rd  Primetime  Emmy  Awards",              "related_entity_title":  "Mad  Men",              "score":  0.04039278737569369,              "event_date":  "2011-­‐09-­‐18"          },
  • 51.
  • 52.
  • 53. Future Work • Log interactions • Interpret clicks as (implicit) feedback: • Click on Event: user is interested • No click on Event: user is not • Learn scoring & ranking functions
  • 54. Thank you! Questions? Try yourHistory: See our poster: http://apps.facebook.com/yourHistory
 #98 ! ! ! ! 

 

 David Graus

 d.p.graus@uva.nl @dvdgrs