SlideShare uma empresa Scribd logo
1 de 33
Baixar para ler offline
A Crowd-Annotated Spanish Corpus for Humor
Analysis
Santiago Castro, Luis Chiruzzo, Aiala Rosá, Diego Garat and Guillermo
Moncecchi
July 20th
, 2018
Grupo de Procesamiento de Lenguaje Natural,
Universidad de la República — Uruguay
1
Outline
Background
Extraction
Annotation
Dataset
Analysis
Conclusion
HAHA Task
2
Background
Background i
• Humor Detection is about telling if a text is humorous
(e. g., a joke).
My grandpa came to America looking for freedom, but it
didn’t work out, in the next flight my grandma was coming.
IT’S REALLY HOT
3
Background ii
• Some previous work, such as Barbieri and Saggion (2014),
Mihalcea and Strapparava (2005), and Sjöbergh and Araki
(2007), created binary Humor Classifiers for short texts
written in English.
• They extracted one-liners from the Internet and from
Twitter, such as:
Beauty is in the eye of the beer holder.
• Castro et al. (2016) worked on Spanish tweets since our
group is interested in leveraging tools for Spanish.
• Back then, we conceived the first and only Spanish dataset
to study Humor.
4
Background iii
• Castro et al. (2016) corpus provided 40k tweets from 18
accounts, with 34k annotations. The annotators decided if
the tweets were humorous or not, and if so they rated
them from 1 to 5.
• However, the dataset has some issues:
1. low inter-annotator agreement (Fleiss’ κ = 0.3654)
2. limited variety of sources (humorous: 9 Twitter accounts,
non-humorous: 3 about news accounts, 3 about
inspirational thoughts and 3 about curious facts)
3. very few annotations per tweet (less than 2 in average,
around 500 with ≥ 5 annotations)
4. only 6k were considered humorous by the crowd
5
Background iv
6
Related work
Potash, Romanov, and Rumshisky (2017) built a corpus based
on tweets in English that aims to distinguish the degree of
funniness in a given tweet. They used the tweet set issued in
response to a TV game show, labeling which tweets were
considered humorous by the show. Used in SemEval 2017 Task
6 — #HashtagWars.
7
Extraction
Extraction i
1. We wanted to have at least 20k tweets as balanced as
possible, at least 5 annotations each.
2. We fetched tweets from 50 humorous accounts from
Spanish speaking countries, taking 12k at random.
3. We fetched tweet samples written in Spanish throughout
February 2018, taking 12k at random.
8
Extraction ii
4. As expected, both sources contained a mix of humorous
and non-humorous tweets.
9
Annotation
Annotation i
We built a web page, similar to the one used by Castro et al.
(2016):
10
Annotation ii
clasificahumor.com
11
Annotation iii
• Tweets were randomly shown to annotators, but avoiding
duplicates (by using web cookies).
• We wanted UI to be the more intuitive and
self-explanatory as possible, trying not to induce any bias
on users and letting them come up with their own
definition of humor.
• The simple and friendly interface is meant to keep the
users engaged and having fun while classifying tweets.
12
Annotation iv
• People annotated from March 8th to 27th, 2018.
• The first tweets shown to every session were the same: 3
tweets for which we know a clear answer.
• During the annotation process, we added around 4,500
tweets coming from humorous accounts to help the
balance.
13
Dataset
Dataset i
• The dataset consists of two CSV files: tweets and
annotations.
tweet ID origin
24 humorous account
tweet ID session ID date value
24 YOH113F…C4R 2018-03-15 19:30:34 2
14
Dataset ii
• 27,282 tweets
• 117,800 annotations (including 2,959 skips)
• 107,634 “high quality” annotations (excluding skips)
15
Analysis
Annotation Distribution
0 2 4 6 8 10 12
0
2,000
4,000
6,000
8,000
10,000
12,000
Number of annotations
Tweets
16
Class Distribution
1%3.2%
7%
10.3%13.3%
65.2%
Excellent
Good
Regular
Little Funny
Not Funny
Not Humorous
17
Annotators Distribution
1 10 100 1000
0
20k
40k
60k
80k
100k
Annotators
Annotations
18
Agreement
• Krippendorff’s α = 0.5710 (vs. 0.3654)
• If we include the “low quality”, α = 0.5512
• Funniness: α = 0.1625
• If we only consider the 11 annotators who tagged more
than a 1,000 times (who tagged 50,939 times in total), the
humor and funniness agreement are respectively 0.6345
and 0.2635.
19
Conclusion
Conclusion
• We created a better version of a dataset to study Humor in
Spanish. 27,282 tweets coming from multiple sources, with
107,634 annotations “high quality” annotations.
• Significant inter-annotator agreement value.
• It is also a first step to study subjectivity. Although more
annotations per tweet would be appropriate, there is a
subset of a thousand tweets with at least six annotations
that could be used to study people’s opinion on the same
instances.
20
HAHA Task
HAHA Task
• An IberEval 2018 task.
• Two subtasks: Humor Classification and Funniness
Average Prediction.
• Subset of 20k tweets.
• 3 participants,
• 7 and 2 submissions respectively.
21
Analysis
Category Votes Hits
Humorous
3/5 52.25%
4/5 75.33%
5/5 85.04%
Not humorous
3/5 68.54%
4/5 80.83%
5/5 82.42%
22
References i
References
Barbieri, Francesco and Horacio Saggion (2014). “Automatic
Detection of Irony and Humour in Twitter”. In: ICCC,
pp. 155–162.
Castro, Santiago et al. (2016). “Is This a Joke? Detecting Humor
in Spanish Tweets”. In: Ibero-American Conference on
Artificial Intelligence. Springer, pp. 139–150. doi:
10.1007/978-3-319-47955-2_12.
23
References ii
Fleiss, Joseph L (1971). “Measuring nominal scale agreement
among many raters”. In: Psychological bulletin 76.5, p. 378.
doi: 10.1037/h0031619.
Krippendorff, Klaus (2012). Content analysis: An introduction to
its methodology. Sage. doi:
10.1111/j.1468-4446.2007.00153_10.x.
Mihalcea, Rada and Carlo Strapparava (2005). “Making
Computers Laugh: Investigations in Automatic Humor
Recognition”. In: Proceedings of the Conference on Human
Language Technology and Empirical Methods in Natural
Language Processing. HLT ’05. Vancouver, British Columbia,
Canada: Association for Computational Linguistics,
pp. 531–538. doi: 10.3115/1220575.1220642.
24
References iii
Potash, Peter, Alexey Romanov, and Anna Rumshisky (2017).
“SemEval-2017 Task 6:# HashtagWars: Learning a sense of
humor”. In: Proceedings of the 11th International Workshop
on Semantic Evaluation (SemEval-2017), pp. 49–57. doi:
10.18653/v1/s17-2004.
Sjöbergh, Jonas and Kenji Araki (2007). “Recognizing Humor
Without Recognizing Meaning”. In: WILF. Ed. by
Francesco Masulli, Sushmita Mitra, and Gabriella Pasi.
Vol. 4578. Lecture Notes in Computer Science. Springer,
pp. 469–476. isbn: 978-3-540-73399-7. doi:
10.1007/978-3-540-73400-0_59.
25
Questions?
https://pln-fing-udelar.github.io/humor/
26

Mais conteúdo relacionado

Mais de Association for Computational Linguistics

Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Association for Computational Linguistics
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopAssociation for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Association for Computational Linguistics
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Association for Computational Linguistics
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Association for Computational Linguistics
 
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian TranslationToshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian TranslationAssociation for Computational Linguistics
 
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemHua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemAssociation for Computational Linguistics
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Association for Computational Linguistics
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Association for Computational Linguistics
 
Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...
Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...
Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...Association for Computational Linguistics
 
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Association for Computational Linguistics
 

Mais de Association for Computational Linguistics (20)

Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
Venkatesh Duppada - 2017 - SeerNet at EmoInt-2017: Tweet Emotion Intensity Es...
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
John Richardson - 2015 - KyotoEBMT System Description for the 2nd Workshop on...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
 
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
 
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 WorkshopSatoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
Satoshi Sonoh - 2015 - Toshiba MT System Description for the WAT2015 Workshop
 
Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015Chenchen Ding - 2015 - NICT at WAT 2015
Chenchen Ding - 2015 - NICT at WAT 2015
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
Graham Neubig - 2015 - Neural Reranking Improves Subjective Quality of Machin...
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
 
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
Terumasa Ehara - 2015 - System Combination of RBMT plus SPE and Preordering p...
 
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian TranslationToshiaki Nakazawa  - 2015 - Overview of the 2nd Workshop on Asian Translation
Toshiaki Nakazawa - 2015 - Overview of the 2nd Workshop on Asian Translation
 
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT SystemHua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
Hua Shan - 2015 - A Dependency-to-String Model for Chinese-Japanese SMT System
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
 
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
Wei Yang - 2015 - Sampling-based Alignment and Hierarchical Sub-sentential Al...
 
Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...
Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...
Katsuhito Sudoh - 2015 Chinese-to-Japanese Patent Machine Translation based o...
 
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
 

Último

Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxDhatriParmar
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 

Último (20)

Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 

Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis

  • 1. A Crowd-Annotated Spanish Corpus for Humor Analysis Santiago Castro, Luis Chiruzzo, Aiala Rosá, Diego Garat and Guillermo Moncecchi July 20th , 2018 Grupo de Procesamiento de Lenguaje Natural, Universidad de la República — Uruguay 1
  • 4. Background i • Humor Detection is about telling if a text is humorous (e. g., a joke). My grandpa came to America looking for freedom, but it didn’t work out, in the next flight my grandma was coming. IT’S REALLY HOT 3
  • 5. Background ii • Some previous work, such as Barbieri and Saggion (2014), Mihalcea and Strapparava (2005), and Sjöbergh and Araki (2007), created binary Humor Classifiers for short texts written in English. • They extracted one-liners from the Internet and from Twitter, such as: Beauty is in the eye of the beer holder. • Castro et al. (2016) worked on Spanish tweets since our group is interested in leveraging tools for Spanish. • Back then, we conceived the first and only Spanish dataset to study Humor. 4
  • 6. Background iii • Castro et al. (2016) corpus provided 40k tweets from 18 accounts, with 34k annotations. The annotators decided if the tweets were humorous or not, and if so they rated them from 1 to 5. • However, the dataset has some issues: 1. low inter-annotator agreement (Fleiss’ κ = 0.3654) 2. limited variety of sources (humorous: 9 Twitter accounts, non-humorous: 3 about news accounts, 3 about inspirational thoughts and 3 about curious facts) 3. very few annotations per tweet (less than 2 in average, around 500 with ≥ 5 annotations) 4. only 6k were considered humorous by the crowd 5
  • 8. Related work Potash, Romanov, and Rumshisky (2017) built a corpus based on tweets in English that aims to distinguish the degree of funniness in a given tweet. They used the tweet set issued in response to a TV game show, labeling which tweets were considered humorous by the show. Used in SemEval 2017 Task 6 — #HashtagWars. 7
  • 10. Extraction i 1. We wanted to have at least 20k tweets as balanced as possible, at least 5 annotations each. 2. We fetched tweets from 50 humorous accounts from Spanish speaking countries, taking 12k at random. 3. We fetched tweet samples written in Spanish throughout February 2018, taking 12k at random. 8
  • 11. Extraction ii 4. As expected, both sources contained a mix of humorous and non-humorous tweets. 9
  • 13. Annotation i We built a web page, similar to the one used by Castro et al. (2016): 10
  • 15. Annotation iii • Tweets were randomly shown to annotators, but avoiding duplicates (by using web cookies). • We wanted UI to be the more intuitive and self-explanatory as possible, trying not to induce any bias on users and letting them come up with their own definition of humor. • The simple and friendly interface is meant to keep the users engaged and having fun while classifying tweets. 12
  • 16. Annotation iv • People annotated from March 8th to 27th, 2018. • The first tweets shown to every session were the same: 3 tweets for which we know a clear answer. • During the annotation process, we added around 4,500 tweets coming from humorous accounts to help the balance. 13
  • 18. Dataset i • The dataset consists of two CSV files: tweets and annotations. tweet ID origin 24 humorous account tweet ID session ID date value 24 YOH113F…C4R 2018-03-15 19:30:34 2 14
  • 19. Dataset ii • 27,282 tweets • 117,800 annotations (including 2,959 skips) • 107,634 “high quality” annotations (excluding skips) 15
  • 21. Annotation Distribution 0 2 4 6 8 10 12 0 2,000 4,000 6,000 8,000 10,000 12,000 Number of annotations Tweets 16
  • 23. Annotators Distribution 1 10 100 1000 0 20k 40k 60k 80k 100k Annotators Annotations 18
  • 24. Agreement • Krippendorff’s α = 0.5710 (vs. 0.3654) • If we include the “low quality”, α = 0.5512 • Funniness: α = 0.1625 • If we only consider the 11 annotators who tagged more than a 1,000 times (who tagged 50,939 times in total), the humor and funniness agreement are respectively 0.6345 and 0.2635. 19
  • 26. Conclusion • We created a better version of a dataset to study Humor in Spanish. 27,282 tweets coming from multiple sources, with 107,634 annotations “high quality” annotations. • Significant inter-annotator agreement value. • It is also a first step to study subjectivity. Although more annotations per tweet would be appropriate, there is a subset of a thousand tweets with at least six annotations that could be used to study people’s opinion on the same instances. 20
  • 28. HAHA Task • An IberEval 2018 task. • Two subtasks: Humor Classification and Funniness Average Prediction. • Subset of 20k tweets. • 3 participants, • 7 and 2 submissions respectively. 21
  • 29. Analysis Category Votes Hits Humorous 3/5 52.25% 4/5 75.33% 5/5 85.04% Not humorous 3/5 68.54% 4/5 80.83% 5/5 82.42% 22
  • 30. References i References Barbieri, Francesco and Horacio Saggion (2014). “Automatic Detection of Irony and Humour in Twitter”. In: ICCC, pp. 155–162. Castro, Santiago et al. (2016). “Is This a Joke? Detecting Humor in Spanish Tweets”. In: Ibero-American Conference on Artificial Intelligence. Springer, pp. 139–150. doi: 10.1007/978-3-319-47955-2_12. 23
  • 31. References ii Fleiss, Joseph L (1971). “Measuring nominal scale agreement among many raters”. In: Psychological bulletin 76.5, p. 378. doi: 10.1037/h0031619. Krippendorff, Klaus (2012). Content analysis: An introduction to its methodology. Sage. doi: 10.1111/j.1468-4446.2007.00153_10.x. Mihalcea, Rada and Carlo Strapparava (2005). “Making Computers Laugh: Investigations in Automatic Humor Recognition”. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. HLT ’05. Vancouver, British Columbia, Canada: Association for Computational Linguistics, pp. 531–538. doi: 10.3115/1220575.1220642. 24
  • 32. References iii Potash, Peter, Alexey Romanov, and Anna Rumshisky (2017). “SemEval-2017 Task 6:# HashtagWars: Learning a sense of humor”. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 49–57. doi: 10.18653/v1/s17-2004. Sjöbergh, Jonas and Kenji Araki (2007). “Recognizing Humor Without Recognizing Meaning”. In: WILF. Ed. by Francesco Masulli, Sushmita Mitra, and Gabriella Pasi. Vol. 4578. Lecture Notes in Computer Science. Springer, pp. 469–476. isbn: 978-3-540-73399-7. doi: 10.1007/978-3-540-73400-0_59. 25