SlideShare uma empresa Scribd logo
1 de 12
A Fine-Grained Sentiment Analysis Approach for
Detecting Crisis Related Microposts
Axel Schulz, Tung Dang Thanh,
Dr. Heiko Paulheim, Dr. Immanuel Schweizer
May, 14 2013
The work is partly funded by a grant of the German Federal Ministry for Education and
Research
Telecooperation Lab
Technische Universität Darmstadt
Motivation
Problem: Fragmented Situational Picture
?
Decision making based on
information from:
• Onsite rescue squads
• Traditional data sources
Bystanders report
additional information
about current situation
Decision makers must be aware of all relevant information in their environment
Valuable information from user-generated content is not usable for decision
makers because of:
• Heterogeneous and unstructured nature of the data
• Lack of time to analyze flood of data
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 2
Vision
 Vision: Increasing the situational picture by making user-generated content usable for
decision makers
 Sentiment analysis can help to differentiate important information from unimportant
one
 Current approaches focus on a three-class problem (Negative, Positive, Neutral)
 A more fine-grained differentiation could help detecting relevant tweets
 7 classes [Ekman]: Anger, Disgust, Fear, Sadness, Surprise, Happiness, Neutral
Enhanced Situational Picture
!
Decision making based on
information from:
• Onsite rescue squads
• Traditional data sources
Bystanders report
additional information
about current situation
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 3
Approach: Reference Pipeline
Feature Extraction
•Unigram Extraction
•Extraction of Part-of-speech
Features
•Character Trigram/Fourgram
•Syntactic Features
•Sentiment Features
Preprocessing
•Extraction of irrelevant
words, links, and user
mentions
•Handling Negations
•Abbreviation Resolution
•Category Extraction
Tweets
Classification
•Naïve Bayes Binary Model
•Naïve Bayes Multinomial
Model
•Support Vector Machine
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 4
Testing all combinations of
 Machine learning methods
 and features
Metrics for performance evaluation
 Accuracy, Precision, Recall
Results calculated using stratified 10-fold cross validation
Evaluation: Approach
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 6
7 Classes: 6 basic emotions + neutral
 SET1: 114 English tweets (Seattle)
 Labeled by at least eight persons and more than 50% agreement
 SET2: 1951 English tweets (Seattle)
 Surprise: positive surprise, negative surprise
 Each tweet labeled by one person using MTurk
3 Classes: Negative, Positive, Neutral
 SET2_GP: grouping SET2
 “Disgust”, “Fear”, “Sadness”, “Surprise with negative meaning” into the negative
class,
 “Happiness”, “Surprise with positive meaning” into the positive class
 “None” into the neutral class
 872 positive tweets, 598 negative tweets, and 481 neutral tweets
Evaluation: Datasets
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 7
Evaluation: Results
7-classes
SET 1 SET2
3-classes
SET2_GP
Accuracy 0.658 0.605 0.657 0.564 0.503 0.535 0.641 0.566 0.626
Avg. Precision 0.615 0.519 0.597 0.482 0.45 0.489 0.645 0.565 0.625
Avg. Recall 0.658 0.605 0.658 0.564 0.504 0.535 0.641 0.566 0.625
F-Measure 0.61 0.525 0.598 0.492 0.394 0.505 0.64 0.564 0.624
Class. Method NBB NBM SVM NBM NBB SVM NBM NBB SVM
Unigram x x x x x x x x
Syntactic Features x x x
Sentiment Features x x x x
POS Tagging x x x x x
Character tri-gram x
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 8
Evaluation: Crisis Related Results
 Tweets collected during “hurricane” Sandy in October 2012
 60 situational awareness tweets, 140 random, non-contributing tweets
 Results: using “Fear” tweets outperforms “Negative” and random baseline (30%
Accuracy)
 Found with 7-classes, but not with 3-classes
"Day 3. No power. Limited Food. Limited shelter. Must survive. #Sandy" [7-classes:
Fear, 3-classes: Neutral]
Detected Contributing to SA Accuracy Recall
7-classes
Fear 96 38 0.395 0.633
Disgust 41 10 0.243 0.166
Fear & Disgust 137 48 0.35 0.80
3-classes Negative 41 12 0.292 0.20
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 9
Example Application and Outlook
• For detecting tweets contributing to situational awareness
• Combination with geolocalization approaches, filtering etc.
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 10
Contribution
• Novel sentiment analysis approach for detecting seven sentiment
classes
• Preliminary evaluation: shows promising results towards detecting
crisis related tweets
Future Work
Larger training set needed
Combination of different means necessary for more valuable
pipeline
Conclusion & Outlook
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 11
THANK YOU!
Questions?
Can also be addressed to:
aschulz@tk.informatik.tu-darmstadt.de
12A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts
 Agarwal, A., Xie, B., Vovsha, I., Rambow, O., & Passonneau, R. (2011). Sentiment analysis of Twitter data.
Proceedings of the Workshop on Languages in Social Media. Portland, Oregon.
 Barbosa, L., & Feng, J. (2010). Robust sentiment detection on Twitter from biased and noisy data.
Proceedings of the 23rd International Conference on Computational Linguistics. Beijing, China.
 Ekman, P. (1992) An argument for basic emotions. Cognition & Emotion, 6, 3-4, 169-200.
 Esuli, A., & Sebastiani, F. (2006). SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining.
Proceedings of the 5th Conference on Language Resources and Evaluation. Genova, IT.
 Jiang, L., Yu, M., Zhou, M., Liu, X., & Zhao, T. (2011). Target-dependent Twitter Sentiment Classification.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics.
Portland, Oregon.
 Nagy, A. and Stamberger, J (2012) Proceedings of the 9th International ISCRAM
Conference, Vancouver, CA.
 Nielsen, A. (2011). A new ANEW: Evaluation of a word list for sentiment analysis in microposts. Journal of
the International Linguistic Association, 93-98.
 Pang, B., & Lee, L. (2006). Opinion Mining and Sentiment Analysis. Foundations and Trends in Information
Retrieval, 91-231.
 Vieweg, S., Hughes, A. L., Starbird, K., Palen, L. (2010) Micropostging during two natural hazards events.
Proceedings of the 28th international conference on Human factors in computing systems, Atlanta, GA.
 Witten, I. H. and Frank, E. (2005). Data Mining: Practice Machine Learning Tools and Techniques, 2nd
Edition, San Francisco, Morgan Kaufmann Publishers.
A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 13
Bibliography

Mais conteúdo relacionado

Mais procurados

Inferential statistics hand out (2)
Inferential statistics hand out (2)Inferential statistics hand out (2)
Inferential statistics hand out (2)
Kimberly Ann Yabut
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-Making
Penn State University
 
Chapter 3 part1-Design of Experiments
Chapter 3 part1-Design of ExperimentsChapter 3 part1-Design of Experiments
Chapter 3 part1-Design of Experiments
nszakir
 
Mat 540 week 1 quiz 1 2nd set
Mat 540 week 1  quiz 1  2nd set Mat 540 week 1  quiz 1  2nd set
Mat 540 week 1 quiz 1 2nd set
getwisdom
 
Carma internet research module n-bias
Carma internet research module   n-biasCarma internet research module   n-bias
Carma internet research module n-bias
Syracuse University
 
research on NUI
research on NUIresearch on NUI
research on NUI
Jay Gou
 
Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011
Sandra Nicks
 

Mais procurados (20)

Fuzzy soft set theory
Fuzzy soft set theoryFuzzy soft set theory
Fuzzy soft set theory
 
beyond objectivity and subjectivity; a discussion paper
beyond objectivity and subjectivity; a discussion paperbeyond objectivity and subjectivity; a discussion paper
beyond objectivity and subjectivity; a discussion paper
 
P-values in crisis
P-values in crisisP-values in crisis
P-values in crisis
 
Discussion a 4th BFFF Harvard
Discussion a 4th BFFF HarvardDiscussion a 4th BFFF Harvard
Discussion a 4th BFFF Harvard
 
Network meta-analysis with integrated nested Laplace approximations
Network meta-analysis with integrated nested Laplace approximationsNetwork meta-analysis with integrated nested Laplace approximations
Network meta-analysis with integrated nested Laplace approximations
 
Inferential statistics hand out (2)
Inferential statistics hand out (2)Inferential statistics hand out (2)
Inferential statistics hand out (2)
 
Thompson Sampling for Machine Learning - Ruben Mak
Thompson Sampling for Machine Learning - Ruben MakThompson Sampling for Machine Learning - Ruben Mak
Thompson Sampling for Machine Learning - Ruben Mak
 
Causal Inference from Pragmatic Trials.
Causal Inference from Pragmatic Trials.Causal Inference from Pragmatic Trials.
Causal Inference from Pragmatic Trials.
 
Basic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-MakingBasic Statistical Concepts & Decision-Making
Basic Statistical Concepts & Decision-Making
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
COVID and Causal Inference -- CogX 6/2020
COVID and Causal Inference -- CogX 6/2020COVID and Causal Inference -- CogX 6/2020
COVID and Causal Inference -- CogX 6/2020
 
Chapter 3 part1-Design of Experiments
Chapter 3 part1-Design of ExperimentsChapter 3 part1-Design of Experiments
Chapter 3 part1-Design of Experiments
 
6.hypothesistesting.06
6.hypothesistesting.066.hypothesistesting.06
6.hypothesistesting.06
 
Mat 540 week 1 quiz 1 2nd set
Mat 540 week 1  quiz 1  2nd set Mat 540 week 1  quiz 1  2nd set
Mat 540 week 1 quiz 1 2nd set
 
Carma internet research module n-bias
Carma internet research module   n-biasCarma internet research module   n-bias
Carma internet research module n-bias
 
A Cartoon Guide to Causal Inference
A Cartoon Guide to Causal InferenceA Cartoon Guide to Causal Inference
A Cartoon Guide to Causal Inference
 
Business Basic Statistics
Business Basic StatisticsBusiness Basic Statistics
Business Basic Statistics
 
research on NUI
research on NUIresearch on NUI
research on NUI
 
Replication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden ControversiesReplication Crises and the Statistics Wars: Hidden Controversies
Replication Crises and the Statistics Wars: Hidden Controversies
 
Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011Aron chpt 7 ed effect size f2011
Aron chpt 7 ed effect size f2011
 

Semelhante a ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts

PowerpointPresentationFrench
PowerpointPresentationFrenchPowerpointPresentationFrench
PowerpointPresentationFrench
Nicholas Roy
 

Semelhante a ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts (20)

Depression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression ModelDepression Detection in Tweets using Logistic Regression Model
Depression Detection in Tweets using Logistic Regression Model
 
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
Social Media Sentiment Analysis fro Depression detection Using Machine Learni...
 
A data mining tool for the detection of suicide in social networks
A data mining tool for the detection of suicide in social networksA data mining tool for the detection of suicide in social networks
A data mining tool for the detection of suicide in social networks
 
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
Sentiment Analysis of Social Media Content: A multi-tool for listening to you...
 
Biosurveillance 2.0: Lecture at Emory University
Biosurveillance 2.0: Lecture at Emory UniversityBiosurveillance 2.0: Lecture at Emory University
Biosurveillance 2.0: Lecture at Emory University
 
Depression prognosis using natural language processing and machine learning ...
Depression prognosis using natural language processing and  machine learning ...Depression prognosis using natural language processing and  machine learning ...
Depression prognosis using natural language processing and machine learning ...
 
Qualitative methods in Psychology Research
Qualitative methods in Psychology ResearchQualitative methods in Psychology Research
Qualitative methods in Psychology Research
 
Subconscious Crowdsourcing: A Feasible Data Collection Mechanism for Mental D...
Subconscious Crowdsourcing: A Feasible Data Collection Mechanism for Mental D...Subconscious Crowdsourcing: A Feasible Data Collection Mechanism for Mental D...
Subconscious Crowdsourcing: A Feasible Data Collection Mechanism for Mental D...
 
Ppr r101-session-3-smart
Ppr r101-session-3-smartPpr r101-session-3-smart
Ppr r101-session-3-smart
 
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...Computational Social Science:The Collaborative Futures of Big Data, Computer ...
Computational Social Science:The Collaborative Futures of Big Data, Computer ...
 
chapter 1.pptx
chapter 1.pptxchapter 1.pptx
chapter 1.pptx
 
Observational studies in social media
Observational studies in social mediaObservational studies in social media
Observational studies in social media
 
Asking Better Questions How Presentation Formats Influence Information Search
Asking Better Questions  How Presentation Formats Influence Information SearchAsking Better Questions  How Presentation Formats Influence Information Search
Asking Better Questions How Presentation Formats Influence Information Search
 
Emotion Detection
Emotion DetectionEmotion Detection
Emotion Detection
 
IRJET- Deep Neural Network based Mechanism to Compute Depression in Socia...
IRJET-  	  Deep Neural Network based Mechanism to Compute Depression in Socia...IRJET-  	  Deep Neural Network based Mechanism to Compute Depression in Socia...
IRJET- Deep Neural Network based Mechanism to Compute Depression in Socia...
 
Methodology
MethodologyMethodology
Methodology
 
Brm unit iii - cheet sheet
Brm   unit iii - cheet sheetBrm   unit iii - cheet sheet
Brm unit iii - cheet sheet
 
A review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptxA review on sentiment analysis and emotion detection.pptx
A review on sentiment analysis and emotion detection.pptx
 
Relevance of statistics sgd-slideshare
Relevance of statistics sgd-slideshareRelevance of statistics sgd-slideshare
Relevance of statistics sgd-slideshare
 
PowerpointPresentationFrench
PowerpointPresentationFrenchPowerpointPresentationFrench
PowerpointPresentationFrench
 

Mais de ISCRAM Events

ISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobile
ISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobileISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobile
ISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobile
ISCRAM Events
 

Mais de ISCRAM Events (20)

ISCRAM 2013: Supporting multi-level situation awareness in crisis management
ISCRAM 2013: Supporting multi-level situation awareness in crisis managementISCRAM 2013: Supporting multi-level situation awareness in crisis management
ISCRAM 2013: Supporting multi-level situation awareness in crisis management
 
ISCRAM 2013: Smartphones as an Alerting, Command and Control System for the P...
ISCRAM 2013: Smartphones as an Alerting, Command and Control System for the P...ISCRAM 2013: Smartphones as an Alerting, Command and Control System for the P...
ISCRAM 2013: Smartphones as an Alerting, Command and Control System for the P...
 
ISCRAM 2013: Social media in C2 Proof-of-principle experiment
ISCRAM 2013: Social media in C2 Proof-of-principle experimentISCRAM 2013: Social media in C2 Proof-of-principle experiment
ISCRAM 2013: Social media in C2 Proof-of-principle experiment
 
ISCRAM 2013: Leading Cats: How to Effectively Command Collectives
ISCRAM 2013: Leading Cats: How to Effectively Command CollectivesISCRAM 2013: Leading Cats: How to Effectively Command Collectives
ISCRAM 2013: Leading Cats: How to Effectively Command Collectives
 
ISCRAM 2013: Community-based Comprehensive Recovery Closing collaboration gap...
ISCRAM 2013: Community-based Comprehensive Recovery Closing collaboration gap...ISCRAM 2013: Community-based Comprehensive Recovery Closing collaboration gap...
ISCRAM 2013: Community-based Comprehensive Recovery Closing collaboration gap...
 
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
ISCRAM 2013: Designing towards an impact evaluation framework for a collabora...
 
ISCRAM 2013: Context Ontology for Humanitarian Assistance in Crisis Response
ISCRAM 2013: Context Ontology for Humanitarian Assistance in Crisis ResponseISCRAM 2013: Context Ontology for Humanitarian Assistance in Crisis Response
ISCRAM 2013: Context Ontology for Humanitarian Assistance in Crisis Response
 
ISCRAM 2013: Meeting the Sphere Standards a case analysis of earthquake respo...
ISCRAM 2013: Meeting the Sphere Standards a case analysis of earthquake respo...ISCRAM 2013: Meeting the Sphere Standards a case analysis of earthquake respo...
ISCRAM 2013: Meeting the Sphere Standards a case analysis of earthquake respo...
 
ISCRAM 2013: Building robust supply networks for effective and efficient disa...
ISCRAM 2013: Building robust supply networks for effective and efficient disa...ISCRAM 2013: Building robust supply networks for effective and efficient disa...
ISCRAM 2013: Building robust supply networks for effective and efficient disa...
 
ISCRAM 2013: A multi-objective optimization model for relocating relief goods...
ISCRAM 2013: A multi-objective optimization model for relocating relief goods...ISCRAM 2013: A multi-objective optimization model for relocating relief goods...
ISCRAM 2013: A multi-objective optimization model for relocating relief goods...
 
ISCRAM 2013: Kimberly Roberson - UNHCR
ISCRAM 2013: Kimberly Roberson - UNHCRISCRAM 2013: Kimberly Roberson - UNHCR
ISCRAM 2013: Kimberly Roberson - UNHCR
 
ISCRAM 2013: A CBR Detection Framework Using Fuzzy Logic- WIP
ISCRAM 2013: A CBR Detection Framework Using Fuzzy Logic- WIPISCRAM 2013: A CBR Detection Framework Using Fuzzy Logic- WIP
ISCRAM 2013: A CBR Detection Framework Using Fuzzy Logic- WIP
 
ISCRAM 2013: A decision support system for effective use of probability forec...
ISCRAM 2013: A decision support system for effective use of probability forec...ISCRAM 2013: A decision support system for effective use of probability forec...
ISCRAM 2013: A decision support system for effective use of probability forec...
 
ISCRAM 2013: Extracting Information Nuggets from Disaster-Related Messages i...
ISCRAM 2013: Extracting Information Nuggets from  Disaster-Related Messages i...ISCRAM 2013: Extracting Information Nuggets from  Disaster-Related Messages i...
ISCRAM 2013: Extracting Information Nuggets from Disaster-Related Messages i...
 
ISCRAM 2013: Social Media-based Event Detection for Crisis Management in the ...
ISCRAM 2013: Social Media-based Event Detection for Crisis Management in the ...ISCRAM 2013: Social Media-based Event Detection for Crisis Management in the ...
ISCRAM 2013: Social Media-based Event Detection for Crisis Management in the ...
 
ISCRAM 2013: Interoperability during a Cross-Border Firefighting Operation at...
ISCRAM 2013: Interoperability during a Cross-Border Firefighting Operation at...ISCRAM 2013: Interoperability during a Cross-Border Firefighting Operation at...
ISCRAM 2013: Interoperability during a Cross-Border Firefighting Operation at...
 
ISCRAM 2013: Lessons Learnt from the 2011 Great East Japan Earthquake and Tsu...
ISCRAM 2013: Lessons Learnt from the 2011 Great East Japan Earthquake and Tsu...ISCRAM 2013: Lessons Learnt from the 2011 Great East Japan Earthquake and Tsu...
ISCRAM 2013: Lessons Learnt from the 2011 Great East Japan Earthquake and Tsu...
 
ISCRAM 2013: Analysis of a German First Responder Exercise: Requirements for ...
ISCRAM 2013: Analysis of a German First Responder Exercise: Requirements for ...ISCRAM 2013: Analysis of a German First Responder Exercise: Requirements for ...
ISCRAM 2013: Analysis of a German First Responder Exercise: Requirements for ...
 
ISCRAM 2013: Impact of the distribution and enrichment of information on the ...
ISCRAM 2013: Impact of the distribution and enrichment of information on the ...ISCRAM 2013: Impact of the distribution and enrichment of information on the ...
ISCRAM 2013: Impact of the distribution and enrichment of information on the ...
 
ISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobile
ISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobileISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobile
ISCRAM 2013: Twitter Integration and Content Moderation in GDACSmobile
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Último (20)

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 

ISCRAM 2013: A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts

  • 1. A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts Axel Schulz, Tung Dang Thanh, Dr. Heiko Paulheim, Dr. Immanuel Schweizer May, 14 2013 The work is partly funded by a grant of the German Federal Ministry for Education and Research Telecooperation Lab Technische Universität Darmstadt
  • 2. Motivation Problem: Fragmented Situational Picture ? Decision making based on information from: • Onsite rescue squads • Traditional data sources Bystanders report additional information about current situation Decision makers must be aware of all relevant information in their environment Valuable information from user-generated content is not usable for decision makers because of: • Heterogeneous and unstructured nature of the data • Lack of time to analyze flood of data A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 2
  • 3. Vision  Vision: Increasing the situational picture by making user-generated content usable for decision makers  Sentiment analysis can help to differentiate important information from unimportant one  Current approaches focus on a three-class problem (Negative, Positive, Neutral)  A more fine-grained differentiation could help detecting relevant tweets  7 classes [Ekman]: Anger, Disgust, Fear, Sadness, Surprise, Happiness, Neutral Enhanced Situational Picture ! Decision making based on information from: • Onsite rescue squads • Traditional data sources Bystanders report additional information about current situation A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 3
  • 4. Approach: Reference Pipeline Feature Extraction •Unigram Extraction •Extraction of Part-of-speech Features •Character Trigram/Fourgram •Syntactic Features •Sentiment Features Preprocessing •Extraction of irrelevant words, links, and user mentions •Handling Negations •Abbreviation Resolution •Category Extraction Tweets Classification •Naïve Bayes Binary Model •Naïve Bayes Multinomial Model •Support Vector Machine A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 4
  • 5. Testing all combinations of  Machine learning methods  and features Metrics for performance evaluation  Accuracy, Precision, Recall Results calculated using stratified 10-fold cross validation Evaluation: Approach A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 6
  • 6. 7 Classes: 6 basic emotions + neutral  SET1: 114 English tweets (Seattle)  Labeled by at least eight persons and more than 50% agreement  SET2: 1951 English tweets (Seattle)  Surprise: positive surprise, negative surprise  Each tweet labeled by one person using MTurk 3 Classes: Negative, Positive, Neutral  SET2_GP: grouping SET2  “Disgust”, “Fear”, “Sadness”, “Surprise with negative meaning” into the negative class,  “Happiness”, “Surprise with positive meaning” into the positive class  “None” into the neutral class  872 positive tweets, 598 negative tweets, and 481 neutral tweets Evaluation: Datasets A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 7
  • 7. Evaluation: Results 7-classes SET 1 SET2 3-classes SET2_GP Accuracy 0.658 0.605 0.657 0.564 0.503 0.535 0.641 0.566 0.626 Avg. Precision 0.615 0.519 0.597 0.482 0.45 0.489 0.645 0.565 0.625 Avg. Recall 0.658 0.605 0.658 0.564 0.504 0.535 0.641 0.566 0.625 F-Measure 0.61 0.525 0.598 0.492 0.394 0.505 0.64 0.564 0.624 Class. Method NBB NBM SVM NBM NBB SVM NBM NBB SVM Unigram x x x x x x x x Syntactic Features x x x Sentiment Features x x x x POS Tagging x x x x x Character tri-gram x A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 8
  • 8. Evaluation: Crisis Related Results  Tweets collected during “hurricane” Sandy in October 2012  60 situational awareness tweets, 140 random, non-contributing tweets  Results: using “Fear” tweets outperforms “Negative” and random baseline (30% Accuracy)  Found with 7-classes, but not with 3-classes "Day 3. No power. Limited Food. Limited shelter. Must survive. #Sandy" [7-classes: Fear, 3-classes: Neutral] Detected Contributing to SA Accuracy Recall 7-classes Fear 96 38 0.395 0.633 Disgust 41 10 0.243 0.166 Fear & Disgust 137 48 0.35 0.80 3-classes Negative 41 12 0.292 0.20 A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 9
  • 9. Example Application and Outlook • For detecting tweets contributing to situational awareness • Combination with geolocalization approaches, filtering etc. A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 10
  • 10. Contribution • Novel sentiment analysis approach for detecting seven sentiment classes • Preliminary evaluation: shows promising results towards detecting crisis related tweets Future Work Larger training set needed Combination of different means necessary for more valuable pipeline Conclusion & Outlook A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 11
  • 11. THANK YOU! Questions? Can also be addressed to: aschulz@tk.informatik.tu-darmstadt.de 12A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts
  • 12.  Agarwal, A., Xie, B., Vovsha, I., Rambow, O., & Passonneau, R. (2011). Sentiment analysis of Twitter data. Proceedings of the Workshop on Languages in Social Media. Portland, Oregon.  Barbosa, L., & Feng, J. (2010). Robust sentiment detection on Twitter from biased and noisy data. Proceedings of the 23rd International Conference on Computational Linguistics. Beijing, China.  Ekman, P. (1992) An argument for basic emotions. Cognition & Emotion, 6, 3-4, 169-200.  Esuli, A., & Sebastiani, F. (2006). SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining. Proceedings of the 5th Conference on Language Resources and Evaluation. Genova, IT.  Jiang, L., Yu, M., Zhou, M., Liu, X., & Zhao, T. (2011). Target-dependent Twitter Sentiment Classification. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics. Portland, Oregon.  Nagy, A. and Stamberger, J (2012) Proceedings of the 9th International ISCRAM Conference, Vancouver, CA.  Nielsen, A. (2011). A new ANEW: Evaluation of a word list for sentiment analysis in microposts. Journal of the International Linguistic Association, 93-98.  Pang, B., & Lee, L. (2006). Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval, 91-231.  Vieweg, S., Hughes, A. L., Starbird, K., Palen, L. (2010) Micropostging during two natural hazards events. Proceedings of the 28th international conference on Human factors in computing systems, Atlanta, GA.  Witten, I. H. and Frank, E. (2005). Data Mining: Practice Machine Learning Tools and Techniques, 2nd Edition, San Francisco, Morgan Kaufmann Publishers. A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts 13 Bibliography