SlideShare uma empresa Scribd logo
1 de 38
Baixar para ler offline
filannim@cs.man.ac.uk 
School of Computer Science 
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 
Mining temporal 
footprints from Wikipedia 
Michele Filannino, Goran Nenadic
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
introduction 
■ Temporal information is crucial for organising 
structured and unstructured data 
■ Several temporal information extraction (TIE) 
systems are nowadays available 
● thanks to TempEval challenge series 
2
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
ManTIME 
URL: http://www.cs.man.ac.uk/~filannim/mantime.html 
3
Test with long text 4 / 23
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
temporal footprint 
A temporal footprint is a 
continuous period on the time-line 
that temporally defines the 
existence of a particular concept. 
Immanuel Kant, Paul Guyer, and Allen W Wood. 1998. Critique of pure reason. Cambridge 
University Press. 
5
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
problem 
Can we predict temporal footprints from 
encyclopaedic descriptions of concepts? 
■ input: textual description of a concept 
■ output: prediction of a temporal 
interval
Web 
Cellphone 
Computer 
Car 
Richard Feynman 
Bicycle 
Carl Friedrich Gauss 
French revolution 
Age of Enlightenment 
Galileo Galilei 
Leonardo Da Vinci 
Christopher Columbus 
Renaissance 
Arming sword 
High Middle Ages 
Gengis Khan 
1000 1100 1200 1300 1400 1500 1600 1700 1800 1900 2000 
Object Person Historical period 
Examples of temporal footprints 7 / 23
8 / 23
8 / 23
8 / 23
8 / 23
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
methodology 
1. date mention extraction 
2. outlier filtering 
3. normal distribution fitting 
4. prediction 
9
presentation 1st AHA! Workshop, COLING 2014 
1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 
Dublin, 23/08/2014 / 25 
date mentions extraction 
0.050 
0.038 
freq 
0.025 
0.013 
0.000 
time (in years) 
10
presentation 1st AHA! Workshop, COLING 2014 
outlier filtering 
γ param. 
1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 
Dublin, 23/08/2014 / 25 
freq 
0.050 
0.038 
0.025 
0.013 
0.000 
time (in years) 
Gamma parameter controls the outlier region’s boundaries. 
11
presentation 1st AHA! Workshop, COLING 2014 
normal distribution fitting 
0.050 
0.038 
freq 
1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 
12 Dublin, 23/08/2014 / 25 
0.025 
0.013 
0.000 
time (in years) 
Alpha and Beta parameters control the size and offset of the gaussian bell. 
α param.
presentation 1st AHA! Workshop, COLING 2014 
normal distribution fitting 
0.050 
0.038 
freq 
1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 
12 Dublin, 23/08/2014 / 25 
0.025 
0.013 
0.000 
time (in years) 
Alpha and Beta parameters control the size and offset of the gaussian bell. 
α param.
presentation 1st AHA! Workshop, COLING 2014 
normal distribution fitting 
β param. 
1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 
Dublin, 23/08/2014 / 25 
freq 
0.050 
0.038 
0.025 
0.013 
0.000 
time (in years) 
Alpha and Beta parameters control the size and offset of the gaussian bell. 
13
presentation 1st AHA! Workshop, COLING 2014 
normal distribution fitting 
β param. 
1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 
Dublin, 23/08/2014 / 25 
freq 
0.050 
0.038 
0.025 
0.013 
0.000 
time (in years) 
Alpha and Beta parameters control the size and offset of the gaussian bell. 
13
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
error measure 
gold 
prediction 
Fatima De Carvalho. 1996. Histogrammes et indices de proximite ́en analyse donne és 
symboliques. Acyes de l’e ćole d’e t́e ́sur l’analyse des donne és symboliques. LISE-CEREMADE, 
Universite ́de Paris IX Dauphine, pages 101–127. 
14 
union overlap
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
error measure 
Fatima De Carvalho. 1996. Histogrammes et indices de proximite ́en analyse donne és 
symboliques. Acyes de l’e ćole d’e t́e ́sur l’analyse des donne és symboliques. LISE-CEREMADE, 
Universite ́de Paris IX Dauphine, pages 101–127. 
15 
union 
gold 
prediction
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
strategies 
A. RegEx 
B. RegEx + Filtering 
C. RegEx + Filtering + Gaussian fitting 
D. HeidelTime + Filtering + Gaussian fitting 
16
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
evaluation 
■ subject: people 
■ lived from 1000 AD to 2014 
● text from Wikipedia web pages 
● year of birth and death from DBpedia 
■ 228,824 people collected 
■ simple definition of temporal footprint 
● birth and death dates 
17
#people 
500 
400 
300 
200 
100 
0 
0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 
#words 
People per textual length 1 8 / 23
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 / 25 
aggregate results 
19 
Strategy 
Mean 
Distance 
Error 
Standard 
Deviation 
RegEx 0.2636 0.3409 
RegEx + Filtering 0.2596 0.3090 
RegEx + Filtering + Gaussian fitting 0.3503 0.2430 
HeidelTime + Filtering + Gaussian fitting 0.5980 0.2470
presentation 1st AHA! Workshop, COLING 2014 
1112 3336 5560 7785 10009 12233 14458 16682 18906 21131 23355 25579 27804 
Dublin, 23/08/2014 / 25 
results 
1.0 
0.8 
0.6 
MDE 
0.4 
0.2 
0.0 
#words 
20 RegEx RegEx + Filtering 
HeidelTime + Filtering + Gaussian fitting RegEx + Filtering + Gaussian fitting
presentation 1st AHA! Workshop, COLING 2014 
1112 3336 5560 7785 10009 12233 14458 16682 18906 21131 23355 25579 27804 
Dublin, 23/08/2014 / 25 
results 
1.0 
0.8 
0.6 
MDE 
0.4 
0.2 
0.0 
#words 
20 RegEx RegEx + Filtering 
HeidelTime + Filtering + Gaussian fitting RegEx + Filtering + Gaussian fitting
presentation 1st AHA! Workshop, COLING 2014 
results 
■ Galileo Galilei (1564-1642), prediction: 1556-1654 
Dublin, 23/08/2014 / 25 
E: 0.204 
21
presentation 1st AHA! Workshop, COLING 2014 
results 
■ Robin Williams (1951 - 2014), prediction: 1953-2006 
Dublin, 23/08/2014 / 25 
E: 0.159 
22
presentation 1st AHA! Workshop, COLING 2014 
other types of temporal footprint? 
■ Christopher Columbus will die in 2057 ?! 
Dublin, 23/08/2014 / 25 
Prediction: 1366-2057 (1451-1506), E: 0.92 
23
presentation 1st AHA! Workshop, COLING 2014 
other types of temporal footprint? 
■ Christopher Columbus will die in 2057 ?! 
Dublin, 23/08/2014 / 25 
Prediction: 1366-2057 (1451-1506), E: 0.92 
23
presentation 1st AHA! Workshop, COLING 2014 
other types of temporal footprint? 
■ Christopher Columbus will die in 2057 ?! 
Dublin, 23/08/2014 / 25 
Prediction: 1366-2057 (1451-1506), E: 0.92 
23
presentation 1st AHA! Workshop, COLING 2014 
other types of temporal footprint? 
■ Christopher Columbus will die in 2057 ?! 
Dublin, 23/08/2014 / 25 
Prediction: 1366-2057 (1451-1506), E: 0.92 
23 
AHA!
presentation 1st AHA! Workshop, COLING 2014 
physical existence vs. social coverage 
■ Anne Frank’s footprint is shifted in the future 
24 
Dublin, 23/08/2014 / 25
presentation 1st AHA! Workshop, COLING 2014 
physical existence vs. social coverage 
■ Anne Frank’s footprint is shifted in the future 
24 
Dublin, 23/08/2014 / 25
presentation 1st AHA! Workshop, COLING 2014 
physical existence vs. social coverage 
■ Anne Frank’s footprint is shifted in the future 
24 
Dublin, 23/08/2014 / 25
presentation 1st AHA! Workshop, COLING 2014 
conclusions 
■ how the methodology behaves on different 
Dublin, 23/08/2014 / 25 
languages? how on different sources? 
■ oracle-like side-effect behaviour: 
• Apple Inc. will be closed down this year 
• Stanford University will be closed down in 2029 
■ Future works 
• mixture of normal distributions 
25
Thank you.
? QUESTIONS 
Contact: 
filannim@cs.man.ac.uk 
! 
Visit: 
tinyurl.com/temporal-footprints

Mais conteúdo relacionado

Destaque

Using machine learning to predict temporal orientation of search engines’ que...
Using machine learning to predict temporal orientation of search engines’ que...Using machine learning to predict temporal orientation of search engines’ que...
Using machine learning to predict temporal orientation of search engines’ que...
Michele Filannino
 
Algoritmo di text-similarity per l'annotazione semantica di Web Service
Algoritmo di text-similarity per l'annotazione semantica di Web ServiceAlgoritmo di text-similarity per l'annotazione semantica di Web Service
Algoritmo di text-similarity per l'annotazione semantica di Web Service
Michele Filannino
 
Femmes M D
Femmes M DFemmes M D
Femmes M D
klaus124
 
Semantic Web Service Annotation
Semantic Web Service AnnotationSemantic Web Service Annotation
Semantic Web Service Annotation
Michele Filannino
 
Modulo di serendipità in un Item Recommender System
Modulo di serendipità in un Item Recommender SystemModulo di serendipità in un Item Recommender System
Modulo di serendipità in un Item Recommender System
Michele Filannino
 
Funny Management Stories
Funny Management StoriesFunny Management Stories
Funny Management Stories
sajithmarakar
 

Destaque (11)

Using machine learning to predict temporal orientation of search engines’ que...
Using machine learning to predict temporal orientation of search engines’ que...Using machine learning to predict temporal orientation of search engines’ que...
Using machine learning to predict temporal orientation of search engines’ que...
 
My research taster project
My research taster projectMy research taster project
My research taster project
 
Tecniche fuzzy per l'elaborazione del linguaggio naturale
Tecniche fuzzy per l'elaborazione del linguaggio naturaleTecniche fuzzy per l'elaborazione del linguaggio naturale
Tecniche fuzzy per l'elaborazione del linguaggio naturale
 
Algoritmo di text-similarity per l'annotazione semantica di Web Service
Algoritmo di text-similarity per l'annotazione semantica di Web ServiceAlgoritmo di text-similarity per l'annotazione semantica di Web Service
Algoritmo di text-similarity per l'annotazione semantica di Web Service
 
Femmes M D
Femmes M DFemmes M D
Femmes M D
 
Sviluppo di un algoritmo di similarità a supporto dell'annotazione semantica ...
Sviluppo di un algoritmo di similarità a supporto dell'annotazione semantica ...Sviluppo di un algoritmo di similarità a supporto dell'annotazione semantica ...
Sviluppo di un algoritmo di similarità a supporto dell'annotazione semantica ...
 
Temporal information extraction in the general and clinical domain
Temporal information extraction in the general and clinical domainTemporal information extraction in the general and clinical domain
Temporal information extraction in the general and clinical domain
 
Can computers understand time?
Can computers understand time?Can computers understand time?
Can computers understand time?
 
Semantic Web Service Annotation
Semantic Web Service AnnotationSemantic Web Service Annotation
Semantic Web Service Annotation
 
Modulo di serendipità in un Item Recommender System
Modulo di serendipità in un Item Recommender SystemModulo di serendipità in un Item Recommender System
Modulo di serendipità in un Item Recommender System
 
Funny Management Stories
Funny Management StoriesFunny Management Stories
Funny Management Stories
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 

Último (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 

Mining temporal footprints from Wikipedia

  • 1. filannim@cs.man.ac.uk School of Computer Science presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 Mining temporal footprints from Wikipedia Michele Filannino, Goran Nenadic
  • 2. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 introduction ■ Temporal information is crucial for organising structured and unstructured data ■ Several temporal information extraction (TIE) systems are nowadays available ● thanks to TempEval challenge series 2
  • 3. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 ManTIME URL: http://www.cs.man.ac.uk/~filannim/mantime.html 3
  • 4. Test with long text 4 / 23
  • 5. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 temporal footprint A temporal footprint is a continuous period on the time-line that temporally defines the existence of a particular concept. Immanuel Kant, Paul Guyer, and Allen W Wood. 1998. Critique of pure reason. Cambridge University Press. 5
  • 6. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 problem Can we predict temporal footprints from encyclopaedic descriptions of concepts? ■ input: textual description of a concept ■ output: prediction of a temporal interval
  • 7. Web Cellphone Computer Car Richard Feynman Bicycle Carl Friedrich Gauss French revolution Age of Enlightenment Galileo Galilei Leonardo Da Vinci Christopher Columbus Renaissance Arming sword High Middle Ages Gengis Khan 1000 1100 1200 1300 1400 1500 1600 1700 1800 1900 2000 Object Person Historical period Examples of temporal footprints 7 / 23
  • 12. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 methodology 1. date mention extraction 2. outlier filtering 3. normal distribution fitting 4. prediction 9
  • 13. presentation 1st AHA! Workshop, COLING 2014 1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 Dublin, 23/08/2014 / 25 date mentions extraction 0.050 0.038 freq 0.025 0.013 0.000 time (in years) 10
  • 14. presentation 1st AHA! Workshop, COLING 2014 outlier filtering γ param. 1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 Dublin, 23/08/2014 / 25 freq 0.050 0.038 0.025 0.013 0.000 time (in years) Gamma parameter controls the outlier region’s boundaries. 11
  • 15. presentation 1st AHA! Workshop, COLING 2014 normal distribution fitting 0.050 0.038 freq 1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 12 Dublin, 23/08/2014 / 25 0.025 0.013 0.000 time (in years) Alpha and Beta parameters control the size and offset of the gaussian bell. α param.
  • 16. presentation 1st AHA! Workshop, COLING 2014 normal distribution fitting 0.050 0.038 freq 1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 12 Dublin, 23/08/2014 / 25 0.025 0.013 0.000 time (in years) Alpha and Beta parameters control the size and offset of the gaussian bell. α param.
  • 17. presentation 1st AHA! Workshop, COLING 2014 normal distribution fitting β param. 1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 Dublin, 23/08/2014 / 25 freq 0.050 0.038 0.025 0.013 0.000 time (in years) Alpha and Beta parameters control the size and offset of the gaussian bell. 13
  • 18. presentation 1st AHA! Workshop, COLING 2014 normal distribution fitting β param. 1360 1410 1460 1510 1560 1610 1660 1710 1760 1810 Dublin, 23/08/2014 / 25 freq 0.050 0.038 0.025 0.013 0.000 time (in years) Alpha and Beta parameters control the size and offset of the gaussian bell. 13
  • 19. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 error measure gold prediction Fatima De Carvalho. 1996. Histogrammes et indices de proximite ́en analyse donne és symboliques. Acyes de l’e ćole d’e t́e ́sur l’analyse des donne és symboliques. LISE-CEREMADE, Universite ́de Paris IX Dauphine, pages 101–127. 14 union overlap
  • 20. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 error measure Fatima De Carvalho. 1996. Histogrammes et indices de proximite ́en analyse donne és symboliques. Acyes de l’e ćole d’e t́e ́sur l’analyse des donne és symboliques. LISE-CEREMADE, Universite ́de Paris IX Dauphine, pages 101–127. 15 union gold prediction
  • 21. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 strategies A. RegEx B. RegEx + Filtering C. RegEx + Filtering + Gaussian fitting D. HeidelTime + Filtering + Gaussian fitting 16
  • 22. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 evaluation ■ subject: people ■ lived from 1000 AD to 2014 ● text from Wikipedia web pages ● year of birth and death from DBpedia ■ 228,824 people collected ■ simple definition of temporal footprint ● birth and death dates 17
  • 23. #people 500 400 300 200 100 0 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 #words People per textual length 1 8 / 23
  • 24. presentation 1st AHA! Workshop, COLING 2014 Dublin, 23/08/2014 / 25 aggregate results 19 Strategy Mean Distance Error Standard Deviation RegEx 0.2636 0.3409 RegEx + Filtering 0.2596 0.3090 RegEx + Filtering + Gaussian fitting 0.3503 0.2430 HeidelTime + Filtering + Gaussian fitting 0.5980 0.2470
  • 25. presentation 1st AHA! Workshop, COLING 2014 1112 3336 5560 7785 10009 12233 14458 16682 18906 21131 23355 25579 27804 Dublin, 23/08/2014 / 25 results 1.0 0.8 0.6 MDE 0.4 0.2 0.0 #words 20 RegEx RegEx + Filtering HeidelTime + Filtering + Gaussian fitting RegEx + Filtering + Gaussian fitting
  • 26. presentation 1st AHA! Workshop, COLING 2014 1112 3336 5560 7785 10009 12233 14458 16682 18906 21131 23355 25579 27804 Dublin, 23/08/2014 / 25 results 1.0 0.8 0.6 MDE 0.4 0.2 0.0 #words 20 RegEx RegEx + Filtering HeidelTime + Filtering + Gaussian fitting RegEx + Filtering + Gaussian fitting
  • 27. presentation 1st AHA! Workshop, COLING 2014 results ■ Galileo Galilei (1564-1642), prediction: 1556-1654 Dublin, 23/08/2014 / 25 E: 0.204 21
  • 28. presentation 1st AHA! Workshop, COLING 2014 results ■ Robin Williams (1951 - 2014), prediction: 1953-2006 Dublin, 23/08/2014 / 25 E: 0.159 22
  • 29. presentation 1st AHA! Workshop, COLING 2014 other types of temporal footprint? ■ Christopher Columbus will die in 2057 ?! Dublin, 23/08/2014 / 25 Prediction: 1366-2057 (1451-1506), E: 0.92 23
  • 30. presentation 1st AHA! Workshop, COLING 2014 other types of temporal footprint? ■ Christopher Columbus will die in 2057 ?! Dublin, 23/08/2014 / 25 Prediction: 1366-2057 (1451-1506), E: 0.92 23
  • 31. presentation 1st AHA! Workshop, COLING 2014 other types of temporal footprint? ■ Christopher Columbus will die in 2057 ?! Dublin, 23/08/2014 / 25 Prediction: 1366-2057 (1451-1506), E: 0.92 23
  • 32. presentation 1st AHA! Workshop, COLING 2014 other types of temporal footprint? ■ Christopher Columbus will die in 2057 ?! Dublin, 23/08/2014 / 25 Prediction: 1366-2057 (1451-1506), E: 0.92 23 AHA!
  • 33. presentation 1st AHA! Workshop, COLING 2014 physical existence vs. social coverage ■ Anne Frank’s footprint is shifted in the future 24 Dublin, 23/08/2014 / 25
  • 34. presentation 1st AHA! Workshop, COLING 2014 physical existence vs. social coverage ■ Anne Frank’s footprint is shifted in the future 24 Dublin, 23/08/2014 / 25
  • 35. presentation 1st AHA! Workshop, COLING 2014 physical existence vs. social coverage ■ Anne Frank’s footprint is shifted in the future 24 Dublin, 23/08/2014 / 25
  • 36. presentation 1st AHA! Workshop, COLING 2014 conclusions ■ how the methodology behaves on different Dublin, 23/08/2014 / 25 languages? how on different sources? ■ oracle-like side-effect behaviour: • Apple Inc. will be closed down this year • Stanford University will be closed down in 2029 ■ Future works • mixture of normal distributions 25
  • 38. ? QUESTIONS Contact: filannim@cs.man.ac.uk ! Visit: tinyurl.com/temporal-footprints