SlideShare uma empresa Scribd logo
1 de 3
Baixar para ler offline
International Association of Scientific Innovation and Research (IASIR)
(An Association Unifying the Sciences, Engineering, and Applied Research)
International Journal of Engineering, Business and Enterprise
Applications (IJEBEA)
www.iasir.net
IJEBEA 14-271; © 2014, IJEBEA All Rights Reserved Page 119
ISSN (Print): 2279-0020
ISSN (Online): 2279-0039
An Empirical Study of Extracting information for Business Intelligence
V.Jayaraj 1
V.Mahalakshmi2*
1
Associate Professor 2
Research Scholar
1,2
School of Computer Science & Engineering, Bharathidasan University, Tiruchirappalli-24, Tamilnadu, India
__________________________________________________________________________________________
Abstract: Sentimental/opinion analysis is an emerging area of research in text mining. Sentimental analysis or opinion
mining refers to identify and extract subjective information in source materials. As a response to the growing
availability of informal opinionated texts like blog posts and product reviews, comments, forums which is collectively
called as user generated contents. A field of sentimental analysis has sprung up in the past decades to address the
question what do people feel about certain topic? Bringing together researchers in computer science, data mining,
sentimental analysis expand the traditional fact-based text analysis to enable opinion- oriented information systems. This
paper provides an overall survey about sentiment analysis or opinion mining related to Business intelligence.
Keywords: Opinion mining, Opinion analysis, Text Mining, Business Intelligence.
___________________________________________________________________________________________
I. INTRODUCTION
Dealing with the ever growing information in the internet opinion mining plays an essential part in our
information gathering before taking an decision. Opinion mining is the area of research refers to identify
and extract subjective information in source materials. Opinion mining is also referred as sentimental
analysis. Opinion Mining concentrates on classifying documents according to their source materials [1]. The
main goal of an Sentimental analysis is to determine the polarity of comments (positive, negative or neutral)
by extracting features and components of the object that have been commented on in each document .A
basic task in sentiment analysis is classifying the polarity of a given text at the document, sentence, or
feature/aspect level whether the expressed opinion in a document, a sentence or an entity feature/aspect is
positive, negative, or neutral[2]. As a response to the growing availability of informal opinionated texts like
blog posts and product review websites, a field of sentimental analysis has sprung up in the past decades to
address the question What do people feel about certain topic? Sentiment classification classifies whether an
opinionated document as positive or negative [3]. A text document is classified using a machine learning
techniques (Naive Bayes, Maximum Entropy, support vector machines)[4]. A piece of text can be used as an
feature or object in opinion mining. The opinion expressed in every document is either direct opinion or
comparative opinion. Direct opinion express a target, a person etc. (e.g) I bought an Nokia x2 mobile.
Comparative opinion express e.g. laptop x is cheaper than laptop y. Opinion mining task is carried out in the
sentence and document levels. Subjectivity/ Sentence level opinion mining is performed by two tasks.
Subjectivity classification identifies whether a sentence is subjective or objective. The research in the field
started with sentiment and subjectivity classification, which treated the problem as text classification
problem. Sentiment classification classifies whether an opinionated document as positive or negative.
Subjectivity classification identifies whether a sentence is subjective or objective. Many applications
required more detailed analysis because the user wants to know the opinion of others. Let us consider the
following example, (1)I bought an galaxy mobile 4 days ago. (2) It was an beautiful phone.(3)The touch screen
was really superb. (4)The Voice excellence was also good. (5)However, my father was fight with me as I
didn’t inform him before I bought it.(6)He felt that the mobile was too costly, and wanted me to return it to
the shop.
The question is: what we want to know from this review? There are several opinions in this review
(2),(3),(4) express positive opinion, while (5) and (6) express negative opinion. The opinion in sentence (2)
is on galaxy mobile, (3) is on touch screen and (4) voice excellence are the features of galaxy mobile.
Sentence (6) is on the cost of a galaxy mobile .This is an important place to understand the users are
interested on other opinions, but not on all. With this example in mind, we can define opinion mining ,an
opinion can be expressed as target, opinion holder, opinion and orientation, direct opinion, comparative
opinions.Finding the relevant information about companies from the multiple sources on the web has become
increasingly important for business analysts. To get an accurate result of a business entity, text mining tools
have been used. With the appropriate tools, company analyst would have to read thousands of reports, news
articles etc.This paper is organized as follows: In section 2, various research works has to be analyzed in order to
enhance our work. In section 3, our discussion has been described in details. Finally, the paper is concluded by
summarizing the work
V.Jayaraj et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 119-121
IJEBEA 14-271; © 2014, IJEBEA All Rights Reserved Page 120
II. RELATED WORK
Wenhg Zhang et al., [7] identified the weekness of the product by using weakness finder algorithm. The
algorithm extract the implicit and Explicit features using morpheme based method and hownet based method
to determine the polarity of each sentence. The weakness of the product has to be identified because to
know the unsatisfication of the customers and compared with the competitors product reviews to improve
their product weakness. Guang Qiu et al.,[8] proposed an advertising strategy DASA to promote
advertisement and then to identify the negative review of the customers. These approaches uses pre-set rules,
and also design an prototype system for the users.
Shumin Zhou, et al.,[9] proposed an architecture to connect the government and the people .The customers
may post their opinion by mobile or internet named as information collection channels. The architecture is
named as people opinion collection processing. the dataflow process starts and then collect and processing
pocp This POCP promotes to build the harmony society.
To evaluate the extraction system, we use traditional metrics for information extraction Chinchor et al., [10]
calculate the precision, recall, and F-measure values . Precision measures the number of correctly identified
items as a percentage of the number of items identified. It measures how many of the items that the system
identified were actually correct, regardless of whether it also failed to retrieve correct items. The higher the
precision, the better the system is at ensuring that what is identified is correct. Recall measures the number of
correctly identified items as a percentage of the total number of correct items measuring how many of the
items that should have been identified actually were identified. The higher the recall rate, the better the
system is at not missing correct items. The F-measure is often used in conjunction with Precision and Recall,
as a weighted average of the two usually an application requires a balance between Precision and Recall.
Horacio Saggion et al., [11] finding the relevant information about companies from the multiple sources on
the web has become increasingly important for business analysts. To get an accurate results of an business
entity, text mining tools have been used. With the appropriate tools, company analyst would have to read
thousands of reports, news articles etc. M.Rushdi Saleh et al.,[12] Opinion mining is receiving more attention
due to the increase of blogs, forums, websites etc, support vector machine has been used for testing the
dataset and using several weighted schemes.
In this work, Support Vector Machines have been applied in order to classify a set of opinions as positives
or negatives.svm has achieved good results in opinion mining.svm has also been successfully achieved in
many classification tasks. SVM has applied with different features in order to test how the sentiment
classification is affected. Different weighting schemes (TFIDF,BO) and n-grams techniques are used. By
using the svm tool sentiment orientation classification was fulfilled. Symbolic approaches and machine
learning techniques are extended in order to attack the classification of reviews .
Dietmar Gräbner et al., [13] proposed a lexicon based approach to classify the customer reviews based on
sentimental analysis .when the precision and recall values exceeds the given baseline of our approach with the
algorithm for sentimental analysis proved to be successful. Generate a reliable classification approach of
customer reviews by applying lexicon based sentimental analysis. Three steps to be carried out to create an
lexicon 1.build an lexicon with semantic orientation 2. Create an sentimental analysis based lexicon to
generate classification reviews 3.classification results are evaluated with quantitative ratings.
Zhongwu Zhai et al.,[14] proposed an several methods have been proposed to extract product features from
the reviews. very limited work has been done in the clustering. Lexical similarity can be used in clustering
but it was not still accurate because with very high similarities are reliable. so to overcome these problems
proposed an semi supervised learning. For semi supervised learning, use the EM algorithm formulated in
which is based on NaïveBayes classification. EM algorithm performs much better when compare to the
other algorithm. Due the poor performance of the unsupervised methods an EM algorithm based on Naïve
Bayes classification is adapted to solve this problem. After a semi supervised method applied then connect
feature expressions using sharing words, and then merge components using lexical similarity and select the
leader components as labeled data. Alexandra Balahur et al.,[15] proposed an method to evaluate an used
generate content. In order get knowledge from user generated content, automatic methods must to be
developed. To multi document summarization of opinions from blogs, forums etc.
Vast different approaches have been used to identify the positive a n d n e g a t i v e opinions and then
summarize the opinions. The aim of the work is to study the manner in which opinion can be summarized, so
that they obtained summary can be used in real-life applications e.g marketing, decision- making.
Business Intelligence (BI) is a process for increasing the competitive advantages of a business by intelligent use
of available information collection for users to make wise decision [16], [17]. It was well known that some
techniques and resources such as data warehouses, multidimensional models, and ad hoc reports are related to
Business Intelligence [18]. Although these techniques and resources have served us well, they do not totally
cover the full scope of business intelligence [19].
V.Jayaraj et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 119-121
IJEBEA 14-271; © 2014, IJEBEA All Rights Reserved Page 121
III. DISCUSSION
Sentimental analysis play a vital role in business intelligence and also organizations. Decision making is big
issue always in many organizations.80%of information in companies are unstructured data .To get the
relevant information from that unstructured information plays an main role for the analyst Information
Retrieval concepts plays an main role in classifying unstructured data. By using this technique our work can be
extended and the meaningful data can be retrieved. People get the others opinion to make some decision
about product or services by this ways.
 Finding opinions while purchasing the product
 Finding the opinion of the competitor products
 Finding opinions on tender result
Finally getting an relevant information about product or services plays an main role in an
organizations. The core objective of the paper is to develop a methodology to mine the useful information from
the unstructured textual content in order to improve the business intelligence. The mining process can be
achieved by new emerging technology, which is variant from data mining. With the help of text mining, the
user can able to discover previously unknown knowledge in text, by automatically extracting information from
different written resources developed in natural languages. It can be now familiar because of its approaches to
information management, research and analysis. Thus, text mining is the extension of data mining and obtains
the goal of extracting meaningful data from different sources of textual documents. In data mining, the
collection of data is stored in the repository known as Data Warehouse. Likely, in text mining, the collection of
documents is stored in the repository known as Document Warehouse. From this Document Warehouse, the
text has to be extracted using text mining.
IV. CONCLUSION AND FUTURE WORK
In this literature survey paper it is observed that opinion mining play a vital role to make decision about
product or services. Finding the relevant opinions expressed on the web, classifying them and filtering only
the positive opinions is not helpful enough for the users. They will still have to sift through thousands of text
snippets, containing relevant, but also much redundant information. Many organizations are carry out more
research in unstructured data. To get the relevant information text mining and information retrieval concepts
has been utilized. The work can be further extended to areas like neural networks, XML data information
retrieval. In XML retrieval by using configuration techniques a data retrieval time can be optimized.
V. REFERENCE
[1] Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan,“Thumbs up? Sentiment classification using machine learning
techniques”, In Proceedings of the Conferenceon Empirical Methods in Natural Language Processing(EMNLP), pages 79–86,
2002.
[2] Dietmar Gräbnera, Markus Zankerb, Günther Fliedlb and Matthias Fuchsc “Classification of Customer Reviews based on
Sentiment Analysis” In 19th Conference on Information and Communication Technologies in Tourism (ENTER), Springer,
Helsingborg, Sweden, 2012.
[3] Turney, P, "Thumbs Up or Thumbs Down? Semantic orientation Applied to Unsupervised Classification of Reviews", ACL‟02,
2002.
[4] Mital K. Dalal,Mukesh A.Zaveri “Automatic Text Classification: A Technical Review” In International Journal of Computer
Applications (0975 – 8887) Volume 28– No.2, August 2011
[5] Kateryna Rybina “ Sentiment analysis of contexts around query terms in documents cin technical universitat Dresden, October
2012
[6] Pang Bo, and Lee Lillian. Opinion Mining and Sentiment Analysis. 2008.
[7] Wenhao Zhang, Hua Xu , Wei Wan “Weakness finder : Find Product Weakness from Chinese reviews by using aspect based
sentimental analysis” in Expert systems with application 2012
[8] Guang Qiu, Xiaofei He, Feng Zhang, Yuan Shi, JiaJun Bu, Chun chen“DASA:Dissatisfaction –oriented Advertising Based on
sentimental Analysis” in Expert Systems with Applications2010
[9] Shumin Zhou, Jumei Ai, Congnian Xu, Bin Tang” The collection and processing platform of the peoples opinion Based on SMS
and Internet” in IEEE 2007
[10] Chinchor, N. (1992). MUC-4 Evaluation Metrics. In Proceedings of the Fourth Message Understanding Conference, pp. 22–29.
[11] Horacio Saggion “Extracting Opinions and Facts for Business Intelligence” http://www.nist.gov/tac/
[12] M. Rushdi Saleh, M.T. Martín-Valdivia “Experiments with SVM to classify opinions in different domains” in Expert Systems
with Applications 38 (2011) 14799–14804
[13] Dietmar Gräbner “Classification of Customer Reviews based on Sentiment Analysis” in 19th Conference on Information and
Communication Technologies in Tourism (ENTER), Springer 2012
[14] Zhongwu Zhai “Clustering Product Features for Opinion Mining” University of Illinois at Chicago.
[15] Alexandra Balahur“ Challenges and solutions in the opinion summarization of user-generated content” © Springer
Science+Business Media, LLC 2012
[16 ] B. de Ville, “Microsoft Data Mining: Integrated Business Intelligence for e-Commerce and Knowledge Management”, Boston:
Digital Press, 2001.
[17] P. Bergeron, C. A. Hiller, “Competitive intelligence”, in B. Cronin, Annual Review of Information Science and Technology,
Medford, N.J.: Information Today, vol. 36, chapter 8, 2002.
[18] Bhujade, Vaishali, “Knowledge Discovery in Text Mining Technique using Association Rules Extraction”, Computational
Intelligence and Communication Networks (CICN), International Conference on oct. 2011.
[19] M. J. A. Berry, G. Linoff, “Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management”, Wiley
Computer Publishing, 2nd edition, 2004.

Mais conteúdo relacionado

Mais procurados

TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
ijistjournal
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276
IJMER
 

Mais procurados (19)

TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
 
Word2Vec model for sentiment analysis of product reviews in Indonesian language
Word2Vec model for sentiment analysis of product reviews in Indonesian languageWord2Vec model for sentiment analysis of product reviews in Indonesian language
Word2Vec model for sentiment analysis of product reviews in Indonesian language
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276
 
Book recommendation system using opinion mining technique
Book recommendation system using opinion mining techniqueBook recommendation system using opinion mining technique
Book recommendation system using opinion mining technique
 
IRJET- A Survey on Graph based Approaches in Sentiment Analysis
IRJET- A Survey on Graph based Approaches in Sentiment AnalysisIRJET- A Survey on Graph based Approaches in Sentiment Analysis
IRJET- A Survey on Graph based Approaches in Sentiment Analysis
 
Opinion mining of customer reviews
Opinion mining of customer reviewsOpinion mining of customer reviews
Opinion mining of customer reviews
 
Empirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion MiningEmpirical Model of Supervised Learning Approach for Opinion Mining
Empirical Model of Supervised Learning Approach for Opinion Mining
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
IRJET- Efficiently Analyzing and Detecting Fake Reviews
IRJET- Efficiently Analyzing and Detecting Fake ReviewsIRJET- Efficiently Analyzing and Detecting Fake Reviews
IRJET- Efficiently Analyzing and Detecting Fake Reviews
 
Opinion mining on newspaper headlines using SVM and NLP
Opinion mining on newspaper headlines using SVM and NLPOpinion mining on newspaper headlines using SVM and NLP
Opinion mining on newspaper headlines using SVM and NLP
 
Hybrid Classifier for Sentiment Analysis using Effective Pipelining
Hybrid Classifier for Sentiment Analysis using Effective PipeliningHybrid Classifier for Sentiment Analysis using Effective Pipelining
Hybrid Classifier for Sentiment Analysis using Effective Pipelining
 
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET-  	  A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...
 
IRJET- Physical Design of Approximate Multiplier for Area and Power Efficiency
IRJET- Physical Design of Approximate Multiplier for Area and Power EfficiencyIRJET- Physical Design of Approximate Multiplier for Area and Power Efficiency
IRJET- Physical Design of Approximate Multiplier for Area and Power Efficiency
 
IRJET- Analysis of Brand Value Prediction based on Social Media Data
IRJET-  	  Analysis of Brand Value Prediction based on Social Media DataIRJET-  	  Analysis of Brand Value Prediction based on Social Media Data
IRJET- Analysis of Brand Value Prediction based on Social Media Data
 
IRJET- Opinion Mining and Sentiment Analysis for Online Review
IRJET-  	  Opinion Mining and Sentiment Analysis for Online ReviewIRJET-  	  Opinion Mining and Sentiment Analysis for Online Review
IRJET- Opinion Mining and Sentiment Analysis for Online Review
 
Sentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A SurveySentiment Analysis Using Hybrid Approach: A Survey
Sentiment Analysis Using Hybrid Approach: A Survey
 
Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...Co extracting opinion targets and opinion words from online reviews based on ...
Co extracting opinion targets and opinion words from online reviews based on ...
 
IRJET- Product Aspect Ranking
IRJET-  	  Product Aspect RankingIRJET-  	  Product Aspect Ranking
IRJET- Product Aspect Ranking
 

Destaque (8)

Ijebea14 287
Ijebea14 287Ijebea14 287
Ijebea14 287
 
Ijebea14 285
Ijebea14 285Ijebea14 285
Ijebea14 285
 
Ijebea14 272
Ijebea14 272Ijebea14 272
Ijebea14 272
 
Ijebea14 270
Ijebea14 270Ijebea14 270
Ijebea14 270
 
Ijebea14 276
Ijebea14 276Ijebea14 276
Ijebea14 276
 
Ijebea14 277
Ijebea14 277Ijebea14 277
Ijebea14 277
 
Ijebea14 278
Ijebea14 278Ijebea14 278
Ijebea14 278
 
Ijebea14 267
Ijebea14 267Ijebea14 267
Ijebea14 267
 

Semelhante a Ijebea14 271

TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
ijistjournal
 
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
ijnlc
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276
IJMER
 

Semelhante a Ijebea14 271 (20)

TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWSTOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
TOWARDS AUTOMATIC DETECTION OF SENTIMENTS IN CUSTOMER REVIEWS
 
Co-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online ReviewsCo-Extracting Opinions from Online Reviews
Co-Extracting Opinions from Online Reviews
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA TechniqueIRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
IRJET- Interpreting Public Sentiments Variation by using FB-LDA Technique
 
Sentiment analysis on unstructured review
Sentiment analysis on unstructured reviewSentiment analysis on unstructured review
Sentiment analysis on unstructured review
 
IRJET- Sentiment Analysis: Algorithmic and Opinion Mining Approach
IRJET- Sentiment Analysis: Algorithmic and Opinion Mining ApproachIRJET- Sentiment Analysis: Algorithmic and Opinion Mining Approach
IRJET- Sentiment Analysis: Algorithmic and Opinion Mining Approach
 
Framework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review DatasetFramework for Product Recommandation for Review Dataset
Framework for Product Recommandation for Review Dataset
 
An Approach To Sentiment Analysis
An Approach To Sentiment AnalysisAn Approach To Sentiment Analysis
An Approach To Sentiment Analysis
 
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
 
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
SENTIMENT ANALYSIS ON PRODUCT FEATURES BASED ON LEXICON APPROACH USING NATURA...
 
Ijmer 46067276
Ijmer 46067276Ijmer 46067276
Ijmer 46067276
 
A Review on Sentimental Analysis of Application Reviews
A Review on Sentimental Analysis of Application ReviewsA Review on Sentimental Analysis of Application Reviews
A Review on Sentimental Analysis of Application Reviews
 
Correlation of feature score to to overall sentiment score for identifying th...
Correlation of feature score to to overall sentiment score for identifying th...Correlation of feature score to to overall sentiment score for identifying th...
Correlation of feature score to to overall sentiment score for identifying th...
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
 
A Study On Sentiment Analysis Methods And Tools
A Study On Sentiment Analysis  Methods And ToolsA Study On Sentiment Analysis  Methods And Tools
A Study On Sentiment Analysis Methods And Tools
 
2
22
2
 
OPINION MINING AND ANALYSIS: A SURVEY
OPINION MINING AND ANALYSIS: A SURVEYOPINION MINING AND ANALYSIS: A SURVEY
OPINION MINING AND ANALYSIS: A SURVEY
 
Sentiment analysis on_unstructured_review-1
Sentiment analysis on_unstructured_review-1Sentiment analysis on_unstructured_review-1
Sentiment analysis on_unstructured_review-1
 
An Opinion Mining and Sentiment Analysis Techniques: A Survey
An Opinion Mining and Sentiment Analysis Techniques: A SurveyAn Opinion Mining and Sentiment Analysis Techniques: A Survey
An Opinion Mining and Sentiment Analysis Techniques: A Survey
 
Analyzing sentiment system to specify polarity by lexicon-based
Analyzing sentiment system to specify polarity by lexicon-basedAnalyzing sentiment system to specify polarity by lexicon-based
Analyzing sentiment system to specify polarity by lexicon-based
 

Mais de Iasir Journals

Mais de Iasir Journals (20)

ijetcas14 650
ijetcas14 650ijetcas14 650
ijetcas14 650
 
Ijetcas14 648
Ijetcas14 648Ijetcas14 648
Ijetcas14 648
 
Ijetcas14 647
Ijetcas14 647Ijetcas14 647
Ijetcas14 647
 
Ijetcas14 643
Ijetcas14 643Ijetcas14 643
Ijetcas14 643
 
Ijetcas14 641
Ijetcas14 641Ijetcas14 641
Ijetcas14 641
 
Ijetcas14 639
Ijetcas14 639Ijetcas14 639
Ijetcas14 639
 
Ijetcas14 632
Ijetcas14 632Ijetcas14 632
Ijetcas14 632
 
Ijetcas14 624
Ijetcas14 624Ijetcas14 624
Ijetcas14 624
 
Ijetcas14 619
Ijetcas14 619Ijetcas14 619
Ijetcas14 619
 
Ijetcas14 615
Ijetcas14 615Ijetcas14 615
Ijetcas14 615
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
 
Ijetcas14 605
Ijetcas14 605Ijetcas14 605
Ijetcas14 605
 
Ijetcas14 604
Ijetcas14 604Ijetcas14 604
Ijetcas14 604
 
Ijetcas14 598
Ijetcas14 598Ijetcas14 598
Ijetcas14 598
 
Ijetcas14 594
Ijetcas14 594Ijetcas14 594
Ijetcas14 594
 
Ijetcas14 593
Ijetcas14 593Ijetcas14 593
Ijetcas14 593
 
Ijetcas14 591
Ijetcas14 591Ijetcas14 591
Ijetcas14 591
 
Ijetcas14 589
Ijetcas14 589Ijetcas14 589
Ijetcas14 589
 
Ijetcas14 585
Ijetcas14 585Ijetcas14 585
Ijetcas14 585
 
Ijetcas14 584
Ijetcas14 584Ijetcas14 584
Ijetcas14 584
 

Último

VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 

Último (20)

Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 

Ijebea14 271

  • 1. International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Engineering, Business and Enterprise Applications (IJEBEA) www.iasir.net IJEBEA 14-271; © 2014, IJEBEA All Rights Reserved Page 119 ISSN (Print): 2279-0020 ISSN (Online): 2279-0039 An Empirical Study of Extracting information for Business Intelligence V.Jayaraj 1 V.Mahalakshmi2* 1 Associate Professor 2 Research Scholar 1,2 School of Computer Science & Engineering, Bharathidasan University, Tiruchirappalli-24, Tamilnadu, India __________________________________________________________________________________________ Abstract: Sentimental/opinion analysis is an emerging area of research in text mining. Sentimental analysis or opinion mining refers to identify and extract subjective information in source materials. As a response to the growing availability of informal opinionated texts like blog posts and product reviews, comments, forums which is collectively called as user generated contents. A field of sentimental analysis has sprung up in the past decades to address the question what do people feel about certain topic? Bringing together researchers in computer science, data mining, sentimental analysis expand the traditional fact-based text analysis to enable opinion- oriented information systems. This paper provides an overall survey about sentiment analysis or opinion mining related to Business intelligence. Keywords: Opinion mining, Opinion analysis, Text Mining, Business Intelligence. ___________________________________________________________________________________________ I. INTRODUCTION Dealing with the ever growing information in the internet opinion mining plays an essential part in our information gathering before taking an decision. Opinion mining is the area of research refers to identify and extract subjective information in source materials. Opinion mining is also referred as sentimental analysis. Opinion Mining concentrates on classifying documents according to their source materials [1]. The main goal of an Sentimental analysis is to determine the polarity of comments (positive, negative or neutral) by extracting features and components of the object that have been commented on in each document .A basic task in sentiment analysis is classifying the polarity of a given text at the document, sentence, or feature/aspect level whether the expressed opinion in a document, a sentence or an entity feature/aspect is positive, negative, or neutral[2]. As a response to the growing availability of informal opinionated texts like blog posts and product review websites, a field of sentimental analysis has sprung up in the past decades to address the question What do people feel about certain topic? Sentiment classification classifies whether an opinionated document as positive or negative [3]. A text document is classified using a machine learning techniques (Naive Bayes, Maximum Entropy, support vector machines)[4]. A piece of text can be used as an feature or object in opinion mining. The opinion expressed in every document is either direct opinion or comparative opinion. Direct opinion express a target, a person etc. (e.g) I bought an Nokia x2 mobile. Comparative opinion express e.g. laptop x is cheaper than laptop y. Opinion mining task is carried out in the sentence and document levels. Subjectivity/ Sentence level opinion mining is performed by two tasks. Subjectivity classification identifies whether a sentence is subjective or objective. The research in the field started with sentiment and subjectivity classification, which treated the problem as text classification problem. Sentiment classification classifies whether an opinionated document as positive or negative. Subjectivity classification identifies whether a sentence is subjective or objective. Many applications required more detailed analysis because the user wants to know the opinion of others. Let us consider the following example, (1)I bought an galaxy mobile 4 days ago. (2) It was an beautiful phone.(3)The touch screen was really superb. (4)The Voice excellence was also good. (5)However, my father was fight with me as I didn’t inform him before I bought it.(6)He felt that the mobile was too costly, and wanted me to return it to the shop. The question is: what we want to know from this review? There are several opinions in this review (2),(3),(4) express positive opinion, while (5) and (6) express negative opinion. The opinion in sentence (2) is on galaxy mobile, (3) is on touch screen and (4) voice excellence are the features of galaxy mobile. Sentence (6) is on the cost of a galaxy mobile .This is an important place to understand the users are interested on other opinions, but not on all. With this example in mind, we can define opinion mining ,an opinion can be expressed as target, opinion holder, opinion and orientation, direct opinion, comparative opinions.Finding the relevant information about companies from the multiple sources on the web has become increasingly important for business analysts. To get an accurate result of a business entity, text mining tools have been used. With the appropriate tools, company analyst would have to read thousands of reports, news articles etc.This paper is organized as follows: In section 2, various research works has to be analyzed in order to enhance our work. In section 3, our discussion has been described in details. Finally, the paper is concluded by summarizing the work
  • 2. V.Jayaraj et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 119-121 IJEBEA 14-271; © 2014, IJEBEA All Rights Reserved Page 120 II. RELATED WORK Wenhg Zhang et al., [7] identified the weekness of the product by using weakness finder algorithm. The algorithm extract the implicit and Explicit features using morpheme based method and hownet based method to determine the polarity of each sentence. The weakness of the product has to be identified because to know the unsatisfication of the customers and compared with the competitors product reviews to improve their product weakness. Guang Qiu et al.,[8] proposed an advertising strategy DASA to promote advertisement and then to identify the negative review of the customers. These approaches uses pre-set rules, and also design an prototype system for the users. Shumin Zhou, et al.,[9] proposed an architecture to connect the government and the people .The customers may post their opinion by mobile or internet named as information collection channels. The architecture is named as people opinion collection processing. the dataflow process starts and then collect and processing pocp This POCP promotes to build the harmony society. To evaluate the extraction system, we use traditional metrics for information extraction Chinchor et al., [10] calculate the precision, recall, and F-measure values . Precision measures the number of correctly identified items as a percentage of the number of items identified. It measures how many of the items that the system identified were actually correct, regardless of whether it also failed to retrieve correct items. The higher the precision, the better the system is at ensuring that what is identified is correct. Recall measures the number of correctly identified items as a percentage of the total number of correct items measuring how many of the items that should have been identified actually were identified. The higher the recall rate, the better the system is at not missing correct items. The F-measure is often used in conjunction with Precision and Recall, as a weighted average of the two usually an application requires a balance between Precision and Recall. Horacio Saggion et al., [11] finding the relevant information about companies from the multiple sources on the web has become increasingly important for business analysts. To get an accurate results of an business entity, text mining tools have been used. With the appropriate tools, company analyst would have to read thousands of reports, news articles etc. M.Rushdi Saleh et al.,[12] Opinion mining is receiving more attention due to the increase of blogs, forums, websites etc, support vector machine has been used for testing the dataset and using several weighted schemes. In this work, Support Vector Machines have been applied in order to classify a set of opinions as positives or negatives.svm has achieved good results in opinion mining.svm has also been successfully achieved in many classification tasks. SVM has applied with different features in order to test how the sentiment classification is affected. Different weighting schemes (TFIDF,BO) and n-grams techniques are used. By using the svm tool sentiment orientation classification was fulfilled. Symbolic approaches and machine learning techniques are extended in order to attack the classification of reviews . Dietmar Gräbner et al., [13] proposed a lexicon based approach to classify the customer reviews based on sentimental analysis .when the precision and recall values exceeds the given baseline of our approach with the algorithm for sentimental analysis proved to be successful. Generate a reliable classification approach of customer reviews by applying lexicon based sentimental analysis. Three steps to be carried out to create an lexicon 1.build an lexicon with semantic orientation 2. Create an sentimental analysis based lexicon to generate classification reviews 3.classification results are evaluated with quantitative ratings. Zhongwu Zhai et al.,[14] proposed an several methods have been proposed to extract product features from the reviews. very limited work has been done in the clustering. Lexical similarity can be used in clustering but it was not still accurate because with very high similarities are reliable. so to overcome these problems proposed an semi supervised learning. For semi supervised learning, use the EM algorithm formulated in which is based on NaïveBayes classification. EM algorithm performs much better when compare to the other algorithm. Due the poor performance of the unsupervised methods an EM algorithm based on Naïve Bayes classification is adapted to solve this problem. After a semi supervised method applied then connect feature expressions using sharing words, and then merge components using lexical similarity and select the leader components as labeled data. Alexandra Balahur et al.,[15] proposed an method to evaluate an used generate content. In order get knowledge from user generated content, automatic methods must to be developed. To multi document summarization of opinions from blogs, forums etc. Vast different approaches have been used to identify the positive a n d n e g a t i v e opinions and then summarize the opinions. The aim of the work is to study the manner in which opinion can be summarized, so that they obtained summary can be used in real-life applications e.g marketing, decision- making. Business Intelligence (BI) is a process for increasing the competitive advantages of a business by intelligent use of available information collection for users to make wise decision [16], [17]. It was well known that some techniques and resources such as data warehouses, multidimensional models, and ad hoc reports are related to Business Intelligence [18]. Although these techniques and resources have served us well, they do not totally cover the full scope of business intelligence [19].
  • 3. V.Jayaraj et al., International Journal of Engineering, Business and Enterprise Applications, 8(2), March-May., 2014, pp. 119-121 IJEBEA 14-271; © 2014, IJEBEA All Rights Reserved Page 121 III. DISCUSSION Sentimental analysis play a vital role in business intelligence and also organizations. Decision making is big issue always in many organizations.80%of information in companies are unstructured data .To get the relevant information from that unstructured information plays an main role for the analyst Information Retrieval concepts plays an main role in classifying unstructured data. By using this technique our work can be extended and the meaningful data can be retrieved. People get the others opinion to make some decision about product or services by this ways.  Finding opinions while purchasing the product  Finding the opinion of the competitor products  Finding opinions on tender result Finally getting an relevant information about product or services plays an main role in an organizations. The core objective of the paper is to develop a methodology to mine the useful information from the unstructured textual content in order to improve the business intelligence. The mining process can be achieved by new emerging technology, which is variant from data mining. With the help of text mining, the user can able to discover previously unknown knowledge in text, by automatically extracting information from different written resources developed in natural languages. It can be now familiar because of its approaches to information management, research and analysis. Thus, text mining is the extension of data mining and obtains the goal of extracting meaningful data from different sources of textual documents. In data mining, the collection of data is stored in the repository known as Data Warehouse. Likely, in text mining, the collection of documents is stored in the repository known as Document Warehouse. From this Document Warehouse, the text has to be extracted using text mining. IV. CONCLUSION AND FUTURE WORK In this literature survey paper it is observed that opinion mining play a vital role to make decision about product or services. Finding the relevant opinions expressed on the web, classifying them and filtering only the positive opinions is not helpful enough for the users. They will still have to sift through thousands of text snippets, containing relevant, but also much redundant information. Many organizations are carry out more research in unstructured data. To get the relevant information text mining and information retrieval concepts has been utilized. The work can be further extended to areas like neural networks, XML data information retrieval. In XML retrieval by using configuration techniques a data retrieval time can be optimized. V. REFERENCE [1] Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan,“Thumbs up? Sentiment classification using machine learning techniques”, In Proceedings of the Conferenceon Empirical Methods in Natural Language Processing(EMNLP), pages 79–86, 2002. [2] Dietmar Gräbnera, Markus Zankerb, Günther Fliedlb and Matthias Fuchsc “Classification of Customer Reviews based on Sentiment Analysis” In 19th Conference on Information and Communication Technologies in Tourism (ENTER), Springer, Helsingborg, Sweden, 2012. [3] Turney, P, "Thumbs Up or Thumbs Down? Semantic orientation Applied to Unsupervised Classification of Reviews", ACL‟02, 2002. [4] Mital K. Dalal,Mukesh A.Zaveri “Automatic Text Classification: A Technical Review” In International Journal of Computer Applications (0975 – 8887) Volume 28– No.2, August 2011 [5] Kateryna Rybina “ Sentiment analysis of contexts around query terms in documents cin technical universitat Dresden, October 2012 [6] Pang Bo, and Lee Lillian. Opinion Mining and Sentiment Analysis. 2008. [7] Wenhao Zhang, Hua Xu , Wei Wan “Weakness finder : Find Product Weakness from Chinese reviews by using aspect based sentimental analysis” in Expert systems with application 2012 [8] Guang Qiu, Xiaofei He, Feng Zhang, Yuan Shi, JiaJun Bu, Chun chen“DASA:Dissatisfaction –oriented Advertising Based on sentimental Analysis” in Expert Systems with Applications2010 [9] Shumin Zhou, Jumei Ai, Congnian Xu, Bin Tang” The collection and processing platform of the peoples opinion Based on SMS and Internet” in IEEE 2007 [10] Chinchor, N. (1992). MUC-4 Evaluation Metrics. In Proceedings of the Fourth Message Understanding Conference, pp. 22–29. [11] Horacio Saggion “Extracting Opinions and Facts for Business Intelligence” http://www.nist.gov/tac/ [12] M. Rushdi Saleh, M.T. Martín-Valdivia “Experiments with SVM to classify opinions in different domains” in Expert Systems with Applications 38 (2011) 14799–14804 [13] Dietmar Gräbner “Classification of Customer Reviews based on Sentiment Analysis” in 19th Conference on Information and Communication Technologies in Tourism (ENTER), Springer 2012 [14] Zhongwu Zhai “Clustering Product Features for Opinion Mining” University of Illinois at Chicago. [15] Alexandra Balahur“ Challenges and solutions in the opinion summarization of user-generated content” © Springer Science+Business Media, LLC 2012 [16 ] B. de Ville, “Microsoft Data Mining: Integrated Business Intelligence for e-Commerce and Knowledge Management”, Boston: Digital Press, 2001. [17] P. Bergeron, C. A. Hiller, “Competitive intelligence”, in B. Cronin, Annual Review of Information Science and Technology, Medford, N.J.: Information Today, vol. 36, chapter 8, 2002. [18] Bhujade, Vaishali, “Knowledge Discovery in Text Mining Technique using Association Rules Extraction”, Computational Intelligence and Communication Networks (CICN), International Conference on oct. 2011. [19] M. J. A. Berry, G. Linoff, “Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management”, Wiley Computer Publishing, 2nd edition, 2004.