SlideShare uma empresa Scribd logo
1 de 18
Xiaowen Ding, Bing Liu and Philip Yu




Presenter: Quang Nguyen
Date: 2010.10.18
Saltlux Vietnam Development Center
   Featured-based Opinion Mining Tasks
    Task 1: Identify and extract object features F that have been
      commented on by an opinion holder (e.g., a reviewer).
    Task 2: Determine whether the opinions on the features F are
      positive, negative or neutral.
    Task 3: Group feature synonyms.
    • Produce a feature-based opinion summary of multiple reviews.


   This paper focuses on Task 2 assuming that features
    have been discovered


                                                                     2
 Opinion Words
  • Positive: beautiful, wonderful, good, amazing,
  • Negative: bad, poor, terrible, cost someone an
    arm and a leg (idiom).

 One effective approach is to use opinion lexicon,
 opinion words.
  • Identify all opinion words in a sentence
  • Aggregate these words to give the final opinion to
    each feature.



                                                         3
 Dictionary-based    approaches
  • Start from a seed opinion words
  • Use Wordnet’s hierarchy and synsets to acquire
   more opinion words
 Corpus-based approaches: extract opinion
 words from large corpora using syntactic
 rules and co-occurrence patterns

  Do not deal well with context dependent
  words!
                                                     4
 Improve lexicon-based approaches using
 context dependent opinion words
  • Negative: “The bedroom is very small”
  • Positive: “The Nokia N3100 is so small as to be
   put in any pockets”
 Propose  a function for aggregating multiple
  opinion words in the same sentence
 Consider explicit and implicit opinions



                                                      5
 Intra-sentence conjunction rule
 Pseudo intra-sentence conjunction
 Inter-sentence conjunction rule




                                      6
 Opinion
        on both sides of “and” should be
 the same
  • E.g., “This camera takes great pictures and has a
   long battery life”.

 Not   likely to say:
  • “This camera takes great pictures and has a short
   battery life.”



                                                        7
 Sometimes,  one may not use an explicit
 conjunction “and”.
  • Same opinion in same sentence, unless there is a
    “but”-like clause
  • E.g., “The camera has a long battery life, which is
    great”




                                                          8
 Peopleusually express the same opinion
 across sentences
  • unless there is an indication of opinion change
    using words such as “but” and “however”
  • E.g., “The picture quality is amazing. The battery life is
   long”
 Not   so natural to say:
  • “The picture quality is amazing. The battery life is
   short”



                                                                 9
 Opinion lexicon is far from sufficient. It needs
 special handling:
  • Negation/But Rule




  • Non-negation contains negative word, e.g., “I like this camera
    not just because it is beautiful”
  • Not contrary, but has a “but”, e.g., ““I not only like the picture
    quality of this camera, but also its size”
  • …



                                                                         10
 Implicit
         Feature is determined through
 adjectives (implicit feature indicator)
  • E.g., “This camera is very small”
  “small” is indicator for “size”
  • E.g., “This camera is very heavy”
  • “heavy” is indicator for “weight”




                                           11
 An object O is an entity which can be a product,
  person, event, organization, or topic
 An object O is represented with a finite set of features,
  F = {f1, f2, …, fn}.
    • Each feature fi in F can be expressed with a finite set of words
      or phrases Wi, which are synonyms.

   Model of a review: An opinion holder j comments on a
    subset of the features Sj F of object O.
    • For each feature fk   Sj that j comments on, he/she
       chooses a word or phrase from Wk to describe the
        feature, and
       expresses a positive, negative or neutral opinion on fk.

                                                                         12
   Input: a pair (f, s), where f is a product feature and s is a
    sentence that contains f.
   Output: whether the opinion on f in s is pos, neg, or neut.



                 wi: opinion word
                 V: set of all opinion words
                 dis(wi, f): distance between wi and f
                 SO: semantic orientation of wi (+1, -1, 0)




                                                                    13
14
15
Precision Recall F-Score
FBS
(M. Hu and B. Liu. Mining and         0.93    0.76    0.83
summarizing customer
reviews. KDD’04, 2004)

OPINE
(A-M. Popescu and O. Etzioni.
Extracting Product Features
                                      0.86    0.89    0.87
and Opinions from Reviews. EMNLP-
05, 2005.)
Opinion Observer                      0.92    0.91    0.91
(this paper)




                                                               16
   Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic
    Lexicon-Based Approach to Opinion Mining, Proceedings
    of the international conference on Web search and web
    data mining, USA, 2008




                                                            17
18

Mais conteúdo relacionado

Semelhante a Xiaowen Ding, Bing Liu and Philip Yu: A Holistic Lexicon-Based Approach to Opinion Mining

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1Eddie Hollon
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2ddelargy
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assessiain bruce
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5Dave Eddy
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Machel Sweeney
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyDr Nur Suhaili Ramli
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Kira
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5kimpalmore
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringCS, NcState
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWholeeducation
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012Regina Navejar
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion miningSumit Sony
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summaryjschultz
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012Regina Navejar
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviewsfirzhan naqash
 

Semelhante a Xiaowen Ding, Bing Liu and Philip Yu: A Holistic Lexicon-Based Approach to Opinion Mining (20)

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assess
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research Methodology
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software Engineering
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacy
 
Bagwords
BagwordsBagwords
Bagwords
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012
 
Assignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransmanAssignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransman
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion mining
 
Reading Street
Reading StreetReading Street
Reading Street
 
Reading Street
Reading StreetReading Street
Reading Street
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summary
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviews
 

Mais de Nguyen Quang

Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement LearningNguyen Quang
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System ReviewNguyen Quang
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksNguyen Quang
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandraNguyen Quang
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architectureNguyen Quang
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginnerNguyen Quang
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrumNguyen Quang
 
Text categorization
Text categorizationText categorization
Text categorizationNguyen Quang
 

Mais de Nguyen Quang (13)

Apache Zookeeper
Apache ZookeeperApache Zookeeper
Apache Zookeeper
 
Apache Storm
Apache StormApache Storm
Apache Storm
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement Learning
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandra
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architecture
 
Eclipse orion
Eclipse orionEclipse orion
Eclipse orion
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginner
 
Html 5
Html 5Html 5
Html 5
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrum
 
Text categorization
Text categorizationText categorization
Text categorization
 
Overview of NoSQL
Overview of NoSQLOverview of NoSQL
Overview of NoSQL
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central BankingThe Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central BankingSelcen Ozturkcan
 

Último (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central BankingThe Evolution of Money: Digital Transformation and CBDCs in Central Banking
The Evolution of Money: Digital Transformation and CBDCs in Central Banking
 

Xiaowen Ding, Bing Liu and Philip Yu: A Holistic Lexicon-Based Approach to Opinion Mining

  • 1. Xiaowen Ding, Bing Liu and Philip Yu Presenter: Quang Nguyen Date: 2010.10.18 Saltlux Vietnam Development Center
  • 2. Featured-based Opinion Mining Tasks Task 1: Identify and extract object features F that have been commented on by an opinion holder (e.g., a reviewer). Task 2: Determine whether the opinions on the features F are positive, negative or neutral. Task 3: Group feature synonyms. • Produce a feature-based opinion summary of multiple reviews.  This paper focuses on Task 2 assuming that features have been discovered 2
  • 3.  Opinion Words • Positive: beautiful, wonderful, good, amazing, • Negative: bad, poor, terrible, cost someone an arm and a leg (idiom).  One effective approach is to use opinion lexicon, opinion words. • Identify all opinion words in a sentence • Aggregate these words to give the final opinion to each feature. 3
  • 4.  Dictionary-based approaches • Start from a seed opinion words • Use Wordnet’s hierarchy and synsets to acquire more opinion words  Corpus-based approaches: extract opinion words from large corpora using syntactic rules and co-occurrence patterns Do not deal well with context dependent words! 4
  • 5.  Improve lexicon-based approaches using context dependent opinion words • Negative: “The bedroom is very small” • Positive: “The Nokia N3100 is so small as to be put in any pockets”  Propose a function for aggregating multiple opinion words in the same sentence  Consider explicit and implicit opinions 5
  • 6.  Intra-sentence conjunction rule  Pseudo intra-sentence conjunction  Inter-sentence conjunction rule 6
  • 7.  Opinion on both sides of “and” should be the same • E.g., “This camera takes great pictures and has a long battery life”.  Not likely to say: • “This camera takes great pictures and has a short battery life.” 7
  • 8.  Sometimes, one may not use an explicit conjunction “and”. • Same opinion in same sentence, unless there is a “but”-like clause • E.g., “The camera has a long battery life, which is great” 8
  • 9.  Peopleusually express the same opinion across sentences • unless there is an indication of opinion change using words such as “but” and “however” • E.g., “The picture quality is amazing. The battery life is long”  Not so natural to say: • “The picture quality is amazing. The battery life is short” 9
  • 10.  Opinion lexicon is far from sufficient. It needs special handling: • Negation/But Rule • Non-negation contains negative word, e.g., “I like this camera not just because it is beautiful” • Not contrary, but has a “but”, e.g., ““I not only like the picture quality of this camera, but also its size” • … 10
  • 11.  Implicit Feature is determined through adjectives (implicit feature indicator) • E.g., “This camera is very small” “small” is indicator for “size” • E.g., “This camera is very heavy” • “heavy” is indicator for “weight” 11
  • 12.  An object O is an entity which can be a product, person, event, organization, or topic  An object O is represented with a finite set of features, F = {f1, f2, …, fn}. • Each feature fi in F can be expressed with a finite set of words or phrases Wi, which are synonyms.  Model of a review: An opinion holder j comments on a subset of the features Sj F of object O. • For each feature fk Sj that j comments on, he/she  chooses a word or phrase from Wk to describe the feature, and  expresses a positive, negative or neutral opinion on fk. 12
  • 13. Input: a pair (f, s), where f is a product feature and s is a sentence that contains f.  Output: whether the opinion on f in s is pos, neg, or neut. wi: opinion word V: set of all opinion words dis(wi, f): distance between wi and f SO: semantic orientation of wi (+1, -1, 0) 13
  • 14. 14
  • 15. 15
  • 16. Precision Recall F-Score FBS (M. Hu and B. Liu. Mining and 0.93 0.76 0.83 summarizing customer reviews. KDD’04, 2004) OPINE (A-M. Popescu and O. Etzioni. Extracting Product Features 0.86 0.89 0.87 and Opinions from Reviews. EMNLP- 05, 2005.) Opinion Observer 0.92 0.91 0.91 (this paper) 16
  • 17. Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic Lexicon-Based Approach to Opinion Mining, Proceedings of the international conference on Web search and web data mining, USA, 2008 17
  • 18. 18