SlideShare uma empresa Scribd logo
1 de 18
Xiaowen Ding, Bing Liu and Philip Yu




Presenter: Quang Nguyen
Date: 2010.10.18
Saltlux Vietnam Development Center
   Featured-based Opinion Mining Tasks
    Task 1: Identify and extract object features F that have been
      commented on by an opinion holder (e.g., a reviewer).
    Task 2: Determine whether the opinions on the features F are
      positive, negative or neutral.
    Task 3: Group feature synonyms.
    • Produce a feature-based opinion summary of multiple reviews.


   This paper focuses on Task 2 assuming that features
    have been discovered


                                                                     2
 Opinion Words
  • Positive: beautiful, wonderful, good, amazing,
  • Negative: bad, poor, terrible, cost someone an
    arm and a leg (idiom).

 One effective approach is to use opinion lexicon,
 opinion words.
  • Identify all opinion words in a sentence
  • Aggregate these words to give the final opinion to
    each feature.



                                                         3
 Dictionary-based    approaches
  • Start from a seed opinion words
  • Use Wordnet’s hierarchy and synsets to acquire
   more opinion words
 Corpus-based approaches: extract opinion
 words from large corpora using syntactic
 rules and co-occurrence patterns

  Do not deal well with context dependent
  words!
                                                     4
 Improve lexicon-based approaches using
 context dependent opinion words
  • Negative: “The bedroom is very small”
  • Positive: “The Nokia N3100 is so small as to be
   put in any pockets”
 Propose  a function for aggregating multiple
  opinion words in the same sentence
 Consider explicit and implicit opinions



                                                      5
 Intra-sentence conjunction rule
 Pseudo intra-sentence conjunction
 Inter-sentence conjunction rule




                                      6
 Opinion
        on both sides of “and” should be
 the same
  • E.g., “This camera takes great pictures and has a
   long battery life”.

 Not   likely to say:
  • “This camera takes great pictures and has a short
   battery life.”



                                                        7
 Sometimes,  one may not use an explicit
 conjunction “and”.
  • Same opinion in same sentence, unless there is a
    “but”-like clause
  • E.g., “The camera has a long battery life, which is
    great”




                                                          8
 Peopleusually express the same opinion
 across sentences
  • unless there is an indication of opinion change
    using words such as “but” and “however”
  • E.g., “The picture quality is amazing. The battery life is
   long”
 Not   so natural to say:
  • “The picture quality is amazing. The battery life is
   short”



                                                                 9
 Opinion lexicon is far from sufficient. It needs
 special handling:
  • Negation/But Rule




  • Non-negation contains negative word, e.g., “I like this camera
    not just because it is beautiful”
  • Not contrary, but has a “but”, e.g., ““I not only like the picture
    quality of this camera, but also its size”
  • …



                                                                         10
 Implicit
         Feature is determined through
 adjectives (implicit feature indicator)
  • E.g., “This camera is very small”
  “small” is indicator for “size”
  • E.g., “This camera is very heavy”
  • “heavy” is indicator for “weight”




                                           11
 An object O is an entity which can be a product,
  person, event, organization, or topic
 An object O is represented with a finite set of features,
  F = {f1, f2, …, fn}.
    • Each feature fi in F can be expressed with a finite set of words
      or phrases Wi, which are synonyms.

   Model of a review: An opinion holder j comments on a
    subset of the features Sj F of object O.
    • For each feature fk   Sj that j comments on, he/she
       chooses a word or phrase from Wk to describe the
        feature, and
       expresses a positive, negative or neutral opinion on fk.

                                                                         12
   Input: a pair (f, s), where f is a product feature and s is a
    sentence that contains f.
   Output: whether the opinion on f in s is pos, neg, or neut.



                 wi: opinion word
                 V: set of all opinion words
                 dis(wi, f): distance between wi and f
                 SO: semantic orientation of wi (+1, -1, 0)




                                                                    13
14
15
Precision Recall F-Score
FBS
(M. Hu and B. Liu. Mining and         0.93    0.76    0.83
summarizing customer
reviews. KDD’04, 2004)

OPINE
(A-M. Popescu and O. Etzioni.
Extracting Product Features
                                      0.86    0.89    0.87
and Opinions from Reviews. EMNLP-
05, 2005.)
Opinion Observer                      0.92    0.91    0.91
(this paper)




                                                               16
   Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic
    Lexicon-Based Approach to Opinion Mining, Proceedings
    of the international conference on Web search and web
    data mining, USA, 2008




                                                            17
18

Mais conteúdo relacionado

Semelhante a A holistic lexicon based approach to opinion mining

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1Eddie Hollon
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2ddelargy
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assessiain bruce
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5Dave Eddy
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Machel Sweeney
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyDr Nur Suhaili Ramli
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Kira
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5kimpalmore
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringCS, NcState
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWholeeducation
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012Regina Navejar
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion miningSumit Sony
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summaryjschultz
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012Regina Navejar
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviewsfirzhan naqash
 

Semelhante a A holistic lexicon based approach to opinion mining (20)

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assess
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research Methodology
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software Engineering
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacy
 
Bagwords
BagwordsBagwords
Bagwords
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012
 
Assignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransmanAssignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransman
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion mining
 
Reading Street
Reading StreetReading Street
Reading Street
 
Reading Street
Reading StreetReading Street
Reading Street
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summary
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviews
 

Mais de Nguyen Quang

Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement LearningNguyen Quang
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System ReviewNguyen Quang
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksNguyen Quang
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandraNguyen Quang
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architectureNguyen Quang
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginnerNguyen Quang
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrumNguyen Quang
 
Text categorization
Text categorizationText categorization
Text categorizationNguyen Quang
 

Mais de Nguyen Quang (13)

Apache Zookeeper
Apache ZookeeperApache Zookeeper
Apache Zookeeper
 
Apache Storm
Apache StormApache Storm
Apache Storm
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement Learning
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandra
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architecture
 
Eclipse orion
Eclipse orionEclipse orion
Eclipse orion
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginner
 
Html 5
Html 5Html 5
Html 5
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrum
 
Text categorization
Text categorizationText categorization
Text categorization
 
Overview of NoSQL
Overview of NoSQLOverview of NoSQL
Overview of NoSQL
 

Último

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Último (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

A holistic lexicon based approach to opinion mining

  • 1. Xiaowen Ding, Bing Liu and Philip Yu Presenter: Quang Nguyen Date: 2010.10.18 Saltlux Vietnam Development Center
  • 2. Featured-based Opinion Mining Tasks Task 1: Identify and extract object features F that have been commented on by an opinion holder (e.g., a reviewer). Task 2: Determine whether the opinions on the features F are positive, negative or neutral. Task 3: Group feature synonyms. • Produce a feature-based opinion summary of multiple reviews.  This paper focuses on Task 2 assuming that features have been discovered 2
  • 3.  Opinion Words • Positive: beautiful, wonderful, good, amazing, • Negative: bad, poor, terrible, cost someone an arm and a leg (idiom).  One effective approach is to use opinion lexicon, opinion words. • Identify all opinion words in a sentence • Aggregate these words to give the final opinion to each feature. 3
  • 4.  Dictionary-based approaches • Start from a seed opinion words • Use Wordnet’s hierarchy and synsets to acquire more opinion words  Corpus-based approaches: extract opinion words from large corpora using syntactic rules and co-occurrence patterns Do not deal well with context dependent words! 4
  • 5.  Improve lexicon-based approaches using context dependent opinion words • Negative: “The bedroom is very small” • Positive: “The Nokia N3100 is so small as to be put in any pockets”  Propose a function for aggregating multiple opinion words in the same sentence  Consider explicit and implicit opinions 5
  • 6.  Intra-sentence conjunction rule  Pseudo intra-sentence conjunction  Inter-sentence conjunction rule 6
  • 7.  Opinion on both sides of “and” should be the same • E.g., “This camera takes great pictures and has a long battery life”.  Not likely to say: • “This camera takes great pictures and has a short battery life.” 7
  • 8.  Sometimes, one may not use an explicit conjunction “and”. • Same opinion in same sentence, unless there is a “but”-like clause • E.g., “The camera has a long battery life, which is great” 8
  • 9.  Peopleusually express the same opinion across sentences • unless there is an indication of opinion change using words such as “but” and “however” • E.g., “The picture quality is amazing. The battery life is long”  Not so natural to say: • “The picture quality is amazing. The battery life is short” 9
  • 10.  Opinion lexicon is far from sufficient. It needs special handling: • Negation/But Rule • Non-negation contains negative word, e.g., “I like this camera not just because it is beautiful” • Not contrary, but has a “but”, e.g., ““I not only like the picture quality of this camera, but also its size” • … 10
  • 11.  Implicit Feature is determined through adjectives (implicit feature indicator) • E.g., “This camera is very small” “small” is indicator for “size” • E.g., “This camera is very heavy” • “heavy” is indicator for “weight” 11
  • 12.  An object O is an entity which can be a product, person, event, organization, or topic  An object O is represented with a finite set of features, F = {f1, f2, …, fn}. • Each feature fi in F can be expressed with a finite set of words or phrases Wi, which are synonyms.  Model of a review: An opinion holder j comments on a subset of the features Sj F of object O. • For each feature fk Sj that j comments on, he/she  chooses a word or phrase from Wk to describe the feature, and  expresses a positive, negative or neutral opinion on fk. 12
  • 13. Input: a pair (f, s), where f is a product feature and s is a sentence that contains f.  Output: whether the opinion on f in s is pos, neg, or neut. wi: opinion word V: set of all opinion words dis(wi, f): distance between wi and f SO: semantic orientation of wi (+1, -1, 0) 13
  • 14. 14
  • 15. 15
  • 16. Precision Recall F-Score FBS (M. Hu and B. Liu. Mining and 0.93 0.76 0.83 summarizing customer reviews. KDD’04, 2004) OPINE (A-M. Popescu and O. Etzioni. Extracting Product Features 0.86 0.89 0.87 and Opinions from Reviews. EMNLP- 05, 2005.) Opinion Observer 0.92 0.91 0.91 (this paper) 16
  • 17. Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic Lexicon-Based Approach to Opinion Mining, Proceedings of the international conference on Web search and web data mining, USA, 2008 17
  • 18. 18