SlideShare uma empresa Scribd logo
1 de 11
Baixar para ler offline
Counting Events and Participants within
Highly Ambiguous Data covering a very long tail
SemEval-2018 Task 5
Marten Postma, Filip Ilievski, Piek Vossen
{m.c.postma, f.ilievski, piek.vossen}@vu.nl
Input: Event properties
"2-5109": {
"event_types": "injuring",
"location": {“state” :
"http://dbpedia.org/resource/Iowa"},
"subtask": 2,
"time": {“year”: "2017"},
"verbose_question": "How many
'injuring' events happened in 2017 (year)
in ('Iowa') (state) ?"
}
Input: Event types
"2-5109": {
"event_types": "injuring",
"location": {“state” :
"http://dbpedia.org/resource/Iowa"},
"subtask": 2,
"time": {“year”: "2017"},
"verbose_question": "How many
'injuring' events happened in 2017 (year)
in ('Iowa') (state) ?"
}
Subtasks
● Subtask S1: Event Questions with Answer=1
○ Which killing incident happened in 2014 in Columbus, OH?
● Subtask S2 Event Questions with Answer=any number
○ How many killing incidents happened in 2016 in Columbus, MS?
● Subtask S3 Participant Questions with Answer=any number
○ How many people were killed in 2016 in Columbus, MS?
Optionally, participants could also provide the text mentions of events in the
documents.
Data
3 Domains: gun violence, fire
disasters, business.
The data consists of local news
articles and reports on small-world
events and participants.
The data is split into trial and
test parts.
Evaluation
1. Incident-level evaluation – Did you get the question right?
a. Accuracy (exact answer matching)
b. RMSE
2. Document-level evaluation – How many of the gold documents did you
retrieve?
a. Precision, recall and F1-value
3. Mention-level evaluation – Did you extract the correct coreference chains?
a. BLANC, CEAF_E, CEAF_M, MUC, BCUB
Participating systems
Task 5 had four participating systems.
NewsReader and ID-DE built a knowledge graph of incidents, which was then
queried for each question based on its constraints.
NAI-SEA performed clustering on a document level. FEUP applied a supervised
approach to address participants and locations separately.
All systems aided their clustering by extracting temporal expressions, word
senses, participants, locations, semantic roles, ...
Results Subtask 2
Team Incident-level accuracy norm Document-level accuracy norm
FEUP 26.38 (1) 30.51 (4)
*NewsReader 21.87 (2) 36.91 (3)
NAI-SEA 17.35 (4) 50.52 (1)
ID-DE 13.74 (5) 37.24 (2)
Baseline 18.25 (3) 26.38 (5)
See the full paper for more analysis of system results
Thank you for your attention

Mais conteúdo relacionado

Semelhante a SemEval-2018 task 5: Counting events and participants in the long tail

DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
Michele Pasin
 
Event-based MultiMedia Search and Retrieval for Question Answering
Event-based MultiMedia Search and Retrieval for Question AnsweringEvent-based MultiMedia Search and Retrieval for Question Answering
Event-based MultiMedia Search and Retrieval for Question Answering
Benoit HUET
 

Semelhante a SemEval-2018 task 5: Counting events and participants in the long tail (15)

DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
 
Graph-based multimodal clustering for social event detection in large collect...
Graph-based multimodal clustering for social event detection in large collect...Graph-based multimodal clustering for social event detection in large collect...
Graph-based multimodal clustering for social event detection in large collect...
 
Crowdsourcing the Quality of Knowledge Graphs: A DBpedia Study
Crowdsourcing the Quality of Knowledge Graphs:A DBpedia StudyCrowdsourcing the Quality of Knowledge Graphs:A DBpedia Study
Crowdsourcing the Quality of Knowledge Graphs: A DBpedia Study
 
Cross-Platform File System Activity Monitoring and Forensics - A Semantic App...
Cross-Platform File System Activity Monitoring and Forensics - A Semantic App...Cross-Platform File System Activity Monitoring and Forensics - A Semantic App...
Cross-Platform File System Activity Monitoring and Forensics - A Semantic App...
 
ACT Talk, Giuseppe Totaro: High Performance Computing for Distributed Indexin...
ACT Talk, Giuseppe Totaro: High Performance Computing for Distributed Indexin...ACT Talk, Giuseppe Totaro: High Performance Computing for Distributed Indexin...
ACT Talk, Giuseppe Totaro: High Performance Computing for Distributed Indexin...
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
InSTEDD HISA Conference
InSTEDD HISA ConferenceInSTEDD HISA Conference
InSTEDD HISA Conference
 
InSTEDD: Collaboration in Disease Surveillance & Response
InSTEDD: Collaboration in Disease Surveillance & ResponseInSTEDD: Collaboration in Disease Surveillance & Response
InSTEDD: Collaboration in Disease Surveillance & Response
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media Data
 
Yuntech present
Yuntech presentYuntech present
Yuntech present
 
Meliorating usable document density for online event detection
Meliorating usable document density for online event detectionMeliorating usable document density for online event detection
Meliorating usable document density for online event detection
 
Event-based MultiMedia Search and Retrieval for Question Answering
Event-based MultiMedia Search and Retrieval for Question AnsweringEvent-based MultiMedia Search and Retrieval for Question Answering
Event-based MultiMedia Search and Retrieval for Question Answering
 
CIS - GeoMemes Research - June 2012 Update
CIS  - GeoMemes Research - June 2012 UpdateCIS  - GeoMemes Research - June 2012 Update
CIS - GeoMemes Research - June 2012 Update
 
Rule-based Information Extraction from Disease Outbreak Reports
Rule-based Information Extraction from Disease Outbreak ReportsRule-based Information Extraction from Disease Outbreak Reports
Rule-based Information Extraction from Disease Outbreak Reports
 
Handling crowdsourced geographic information
Handling crowdsourced geographic informationHandling crowdsourced geographic information
Handling crowdsourced geographic information
 

Mais de Filip Ilievski

Mais de Filip Ilievski (11)

The Commonsense Knowledge Graph
The Commonsense Knowledge GraphThe Commonsense Knowledge Graph
The Commonsense Knowledge Graph
 
Commonsense knowledge in Wikidata
Commonsense knowledge in WikidataCommonsense knowledge in Wikidata
Commonsense knowledge in Wikidata
 
A look inside Babelfy: Examining the bubble
A look inside Babelfy: Examining the bubbleA look inside Babelfy: Examining the bubble
A look inside Babelfy: Examining the bubble
 
2nd Spinoza workshop: Looking at the Long Tail - introductory slides
2nd Spinoza workshop: Looking at the Long Tail - introductory slides2nd Spinoza workshop: Looking at the Long Tail - introductory slides
2nd Spinoza workshop: Looking at the Long Tail - introductory slides
 
Systematic Study of Long Tail Phenomena in Entity Linking
Systematic Study of Long Tail Phenomena in Entity LinkingSystematic Study of Long Tail Phenomena in Entity Linking
Systematic Study of Long Tail Phenomena in Entity Linking
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
LOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked DataLOTUS: Adaptive Text Search for Big Linked Data
LOTUS: Adaptive Text Search for Big Linked Data
 
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
Lotus: Linked Open Text UnleaShed - ISWC COLD '15Lotus: Linked Open Text UnleaShed - ISWC COLD '15
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
 
NAF2SEM and cross-document Event Coreference
NAF2SEM and cross-document Event CoreferenceNAF2SEM and cross-document Event Coreference
NAF2SEM and cross-document Event Coreference
 
Mini seminar presentation on context-based NED optimization
Mini seminar presentation on context-based NED optimizationMini seminar presentation on context-based NED optimization
Mini seminar presentation on context-based NED optimization
 
CLiN 25: NED with two-stage coherence optimization
CLiN 25: NED with two-stage coherence optimizationCLiN 25: NED with two-stage coherence optimization
CLiN 25: NED with two-stage coherence optimization
 

Último

SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
Sérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
ssuser79fe74
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 

Último (20)

9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 

SemEval-2018 task 5: Counting events and participants in the long tail

  • 1. Counting Events and Participants within Highly Ambiguous Data covering a very long tail SemEval-2018 Task 5 Marten Postma, Filip Ilievski, Piek Vossen {m.c.postma, f.ilievski, piek.vossen}@vu.nl
  • 2.
  • 3. Input: Event properties "2-5109": { "event_types": "injuring", "location": {“state” : "http://dbpedia.org/resource/Iowa"}, "subtask": 2, "time": {“year”: "2017"}, "verbose_question": "How many 'injuring' events happened in 2017 (year) in ('Iowa') (state) ?" }
  • 4. Input: Event types "2-5109": { "event_types": "injuring", "location": {“state” : "http://dbpedia.org/resource/Iowa"}, "subtask": 2, "time": {“year”: "2017"}, "verbose_question": "How many 'injuring' events happened in 2017 (year) in ('Iowa') (state) ?" }
  • 5. Subtasks ● Subtask S1: Event Questions with Answer=1 ○ Which killing incident happened in 2014 in Columbus, OH? ● Subtask S2 Event Questions with Answer=any number ○ How many killing incidents happened in 2016 in Columbus, MS? ● Subtask S3 Participant Questions with Answer=any number ○ How many people were killed in 2016 in Columbus, MS? Optionally, participants could also provide the text mentions of events in the documents.
  • 6. Data 3 Domains: gun violence, fire disasters, business. The data consists of local news articles and reports on small-world events and participants. The data is split into trial and test parts.
  • 7. Evaluation 1. Incident-level evaluation – Did you get the question right? a. Accuracy (exact answer matching) b. RMSE 2. Document-level evaluation – How many of the gold documents did you retrieve? a. Precision, recall and F1-value 3. Mention-level evaluation – Did you extract the correct coreference chains? a. BLANC, CEAF_E, CEAF_M, MUC, BCUB
  • 8. Participating systems Task 5 had four participating systems. NewsReader and ID-DE built a knowledge graph of incidents, which was then queried for each question based on its constraints. NAI-SEA performed clustering on a document level. FEUP applied a supervised approach to address participants and locations separately. All systems aided their clustering by extracting temporal expressions, word senses, participants, locations, semantic roles, ...
  • 9. Results Subtask 2 Team Incident-level accuracy norm Document-level accuracy norm FEUP 26.38 (1) 30.51 (4) *NewsReader 21.87 (2) 36.91 (3) NAI-SEA 17.35 (4) 50.52 (1) ID-DE 13.74 (5) 37.24 (2) Baseline 18.25 (3) 26.38 (5)
  • 10. See the full paper for more analysis of system results
  • 11. Thank you for your attention