O SlideShare utiliza cookies para otimizar a funcionalidade e o desempenho do site, assim como para apresentar publicidade mais relevante aos nossos usuários. Se você continuar a navegar o site, você aceita o uso de cookies. Leia nosso Contrato do Usuário e nossa Política de Privacidade.
O SlideShare utiliza cookies para otimizar a funcionalidade e o desempenho do site, assim como para apresentar publicidade mais relevante aos nossos usuários. Se você continuar a utilizar o site, você aceita o uso de cookies. Leia nossa Política de Privacidade e nosso Contrato do Usuário para obter mais detalhes.
In recent times, research activities in the areas of Opinion and Sentiment analysis in natural language texts and other media are gaining ground under the umbrella of subjectivity analysis. The reason may be the huge amount of available text data in the Social Web in the forms of news, reviews, blogs, chats and even twitter. Though Sentiment analysis from natural lan-guage text is a multifaceted and multidisciplinary problem, in general, the term “sentiment” is used in reference to the automatic analysis of evaluative text.
SENTIMENT ANALYZER AND OPINION
MINING ON TWITTER DATA STREAM
Name of supervisor -Ms. Akanksha Bhardwaj
Name of Student -Saumya Chaturvedi
Modules in the Project wrt Design
Control Flow Diagram
Stream twitter data on a particular topic on real time basis
Checking for semantic orientation of this data efficiently using
supervised learning approach.
Modules in the Project wrt Design
PYTHON PROGRAMMING LANGUAGE TOOLS
Python programming language was chosen because the
existing systems are programmed in Python, therefore using the
same language is natural. The availability of modules that
helped in the development process contributed to choosing the
language. Python offers good portability across different
environments. The tools used include NeuroLab library which
provides pre-existing neural network algorithms implemented in
Python, and Natural Language Toolkit that was used for Part-of-
Natural Language Toolkit
Natural Language Toolkit (NTLK) is an open source library,
distributed under Apache 2.0 license, multiplatform collection of
tools and modules for natural language processing and text
analysis, available in Python programming language. It provides a
large set of tools and algorithms for text tokenization, stemming,
tagging, chunking, parsing, classification, clustering,
measurements and semantic interpretation. It also provides an
access to many text corpora and lexicon. In this work, NLTK is
used part-of-speech tagging from documents.
Tweepy is one of python’s open source
libraries which play an important role in this
project. It is used to stream twitter data stream
on a real time basis. It is hosted on GitHub and
enables Python to communicate with Twitter
platform and use its API
Our review-oriented website would serve as an efficient
tool for media analysis on various products.
This idea can also be extended in a way to serve as an
effective review-summarizer that would not only produce
the poll result of the reviews but would, also summarize
the text of various review in simple words for the user.
This project can directly affect the industry’s time and
performance, in terms of the customer relationship.
It is possible to know what user wants to express by
their reviews. That is what is the concept of –