SlideShare uma empresa Scribd logo
1 de 10
Predict the Interesting of an article
Using Twitter
Chitra khatwani
Yashasvi girdhar
Khyati chandu
R.K. Srinivas
The project aims at measuring the
interestingness of articles by analyzing the
tweets related to the entities in the article.
●
Application:
– We can order the articles for a search query
according to their interestingness.
– Suggesting news articles to users on websites
Approach Followed
●
Extract all the named entities from the article
> Two methods can be followed
●
Using NLTK Library
●
Using A list of Wikipedia Titles
We have used the second approach, because the nltk
library misses out many important entities, in some
cases.
Approach Followed
●
Shortlist all the dominant entities from the
extracted entities
– Dominant entities are those, which are most
frequently talked about in the article.
– Methods:
●
Can be decided based on the frequency of entities
●
Entities occurring in the title of the article
Approach Followed
●
Mine all the tweets related to all the dominant
entities
●
Done using Twitter Search API
●
Need to collect the tweets of the entities, around the date
when the article was published.
●
Need to parse the tweets before storing them, to make
thhem ready for the next steps.
Approach Followed
●
Categorize each tweet as +ve , -ve or neutral
– Consider all the unigrams tokens equally
– Score each token using the naive bayes formula
– Sum up the scores of all the tokens to calculate the
score for an entitiy
Approach Followed
●
Predict the interestingness of the article, using
the number of positive and negative tweets
We have followed the below approach :
– Less is the difference between number of positive
tweets and number of negative tweets, more is the
interestingness of the article.
– On the other hand, if the number of positive entities
outweighs the number of negative entities, or vice-
versa, the article is considered less interesting.
Datasets used
●
For Articles
– A set of random news articles taken from the BBC
News Dataset
●
For Sentiment Analysis
– Mejaj Dataset
●
Built on the basis of categorizing tweets on the basis of
predefined list of positive and negative words
– Standford Dataset
Challenges
●
Collecting the right set of articles for testing our
model
●
Finding the Right dataset for twitter and then,
deciding upon the parameters, to categorize the
tweet
●
Deciding upon the appropriate algorithm for
deciding the interestingness of the article,
based on the +ve and -ve tweets
Conclusion
●
Social Media, such as twitter in this case, is a
very common medium for people nowadays, to
express their opinions about something.
This can be leveraged as a very powerful
medium, in predicting the nature of the data
published on the web, specially millions of
articles that are published each day.
This can also be used in suggesting the articles
to the users.
References
●
Mining Sentiments from Tweets, Siel, IIIT-Hyderabad

Mais conteúdo relacionado

Semelhante a Predict Interestingness of An Article Using Twitter

SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
anargha gangadharan
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
Parvathy Devaraj
 
Twitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptxTwitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptx
Krishnesh Pujari
 
Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique
IJERA Editor
 
GeospatialDataAnalysis
GeospatialDataAnalysisGeospatialDataAnalysis
GeospatialDataAnalysis
Taylor Graham
 

Semelhante a Predict Interestingness of An Article Using Twitter (20)

Twitter as a personalizable information service ii
Twitter as a personalizable information service iiTwitter as a personalizable information service ii
Twitter as a personalizable information service ii
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
social network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysissocial network analysis project twitter sentimental analysis
social network analysis project twitter sentimental analysis
 
Twitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdfTwitter Sentiment Analysis.pdf
Twitter Sentiment Analysis.pdf
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATAREAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
REAL TIME SENTIMENT ANALYSIS OF TWITTER DATA
 
SENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATASENTIMENT ANALYSIS OF TWITTER DATA
SENTIMENT ANALYSIS OF TWITTER DATA
 
Twitter Sentiment Analysis
Twitter Sentiment AnalysisTwitter Sentiment Analysis
Twitter Sentiment Analysis
 
Twitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptxTwitter Sentiment Prediction.pptx
Twitter Sentiment Prediction.pptx
 
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionDetection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
 
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
Adressing Volume and Velocity Challenge on the Social Web using Crowd Sourced...
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
Building a Recommender systems by Vivek Murugesan - Technical Architect at Cr...
 
SNATZ Technology
SNATZ TechnologySNATZ Technology
SNATZ Technology
 
Final Year PPT on Twitter App
Final Year PPT on Twitter AppFinal Year PPT on Twitter App
Final Year PPT on Twitter App
 
Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique Sentiment Analysis of Twitter tweets using supervised classification technique
Sentiment Analysis of Twitter tweets using supervised classification technique
 
Complex networks - Update
Complex networks - UpdateComplex networks - Update
Complex networks - Update
 
Social Network Analysis Basics for Social Media Profs - Handout
Social Network Analysis Basics for Social Media Profs - HandoutSocial Network Analysis Basics for Social Media Profs - Handout
Social Network Analysis Basics for Social Media Profs - Handout
 
GeospatialDataAnalysis
GeospatialDataAnalysisGeospatialDataAnalysis
GeospatialDataAnalysis
 
Finding News Curators in Twitter
Finding News Curators in TwitterFinding News Curators in Twitter
Finding News Curators in Twitter
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Predict Interestingness of An Article Using Twitter

  • 1. Predict the Interesting of an article Using Twitter Chitra khatwani Yashasvi girdhar Khyati chandu R.K. Srinivas
  • 2. The project aims at measuring the interestingness of articles by analyzing the tweets related to the entities in the article. ● Application: – We can order the articles for a search query according to their interestingness. – Suggesting news articles to users on websites
  • 3. Approach Followed ● Extract all the named entities from the article > Two methods can be followed ● Using NLTK Library ● Using A list of Wikipedia Titles We have used the second approach, because the nltk library misses out many important entities, in some cases.
  • 4. Approach Followed ● Shortlist all the dominant entities from the extracted entities – Dominant entities are those, which are most frequently talked about in the article. – Methods: ● Can be decided based on the frequency of entities ● Entities occurring in the title of the article
  • 5. Approach Followed ● Mine all the tweets related to all the dominant entities ● Done using Twitter Search API ● Need to collect the tweets of the entities, around the date when the article was published. ● Need to parse the tweets before storing them, to make thhem ready for the next steps.
  • 6. Approach Followed ● Categorize each tweet as +ve , -ve or neutral – Consider all the unigrams tokens equally – Score each token using the naive bayes formula – Sum up the scores of all the tokens to calculate the score for an entitiy
  • 7. Approach Followed ● Predict the interestingness of the article, using the number of positive and negative tweets We have followed the below approach : – Less is the difference between number of positive tweets and number of negative tweets, more is the interestingness of the article. – On the other hand, if the number of positive entities outweighs the number of negative entities, or vice- versa, the article is considered less interesting.
  • 8. Datasets used ● For Articles – A set of random news articles taken from the BBC News Dataset ● For Sentiment Analysis – Mejaj Dataset ● Built on the basis of categorizing tweets on the basis of predefined list of positive and negative words – Standford Dataset
  • 9. Challenges ● Collecting the right set of articles for testing our model ● Finding the Right dataset for twitter and then, deciding upon the parameters, to categorize the tweet ● Deciding upon the appropriate algorithm for deciding the interestingness of the article, based on the +ve and -ve tweets
  • 10. Conclusion ● Social Media, such as twitter in this case, is a very common medium for people nowadays, to express their opinions about something. This can be leveraged as a very powerful medium, in predicting the nature of the data published on the web, specially millions of articles that are published each day. This can also be used in suggesting the articles to the users. References ● Mining Sentiments from Tweets, Siel, IIIT-Hyderabad