© This document contains confidential and proprietary information of Adroitent. It is furnished for evaluation purposes
only. Except with the express prior written permission of Adroitent, this document and the information contained herein
may not be published, disclosed, or used for any other purpose. | www.adroitent.com
Understanding Social Media Analytics: Big Picture
SANDEEP SEERAPU
Social Media: Big Change
• The Web is no longer a static library that people passively browse
• The Web is a place where people:
o Consume and create content
o Interact with other people through:
▪ Internet forums, blogs, social networks, Twitter, wikis, podcasts, slide sharing, bookmark sharing, product reviews, comments, …
• DATA POINT: Facebook traffic tops Google (in the USA)
• March 2010: Facebook accounted for more than 7% of US traffic
http://money.cnn.com/2010/03/16/technology/facebook_most_visited
Social Media: Rich and Big Data
• Rich and big data:
o Billions of users, billions of pieces of content
o Textual and multimedia (images, videos, etc.)
o Billions of connections
o Behaviours, preferences, trends...
• Data is open and easy to access. It is easy to get data from social media through:
o Datasets
o Developer APIs
o Spidering the Web
Social Media: Opportunities
Any user can share and contribute content, express opinions, and link to others.
This means we can data-mine the opinions and behaviours of millions of users to gain insights into:
• Human behaviour
• Marketing analytics
• Product sentiment
What can we do with this data?
Applications: Reputation Management
• Consumer brand analytics
o What are people saying about our brand?
• Marketing communications
o Companies spend significantly on marketing and advertising to position their products
o Brand analytics helps to determine whether such campaigns are effective
• Product reviews
o Automatically mine product reviews for information on product features, new requests, …
o Typical feature phrases: easy to use, comfortable chair, lightweight, sturdy, good price
Applications: Citizen Response
• Citizen response
o Solicit citizen feedback on bills debated in Congress
o What new issues are being raised, and which aspects of a bill are popular or unpopular?
• Political campaigns
o Why do people support a candidate?
• Law enforcement
o Gang members boast about their activities on Facebook
o Protests are planned through Twitter
o NYT: Sending the Police Before There’s a Crime
http://www.nytimes.com/2011/08/16/us/16police.html?_r=1
Applications: Social Media Marketing
• Viral marketing
• Personalized recommendations
Online forum users are brand advocates:
• 79.2% of forum contributors help a friend to make a decision about a product purchase (47.6% of non-contributors)
• 65% of forum contributors share advice (offline and in person) based on information that they’ve read online (35% of non-contributors)
http://www.socialmediaexaminer.com/new-studies-show-value-of-social-media
Information Flow
How do we capture and model
the flow of information?
Text mining and semantic methods
Given that social media generate a wealth of consumer data, how can brands turn raw social media comment data from Twitter, Facebook, blogs, and forums into actionable business insights? The answer lies in applying text-mining and semantic technology to these new sources of unstructured data.
How does it work?
• Text mining is similar to data mining in that it aims to identify interesting patterns in data.
• The first step in any text-mining effort is to identify the text-based sources to be analysed and to gather this material, either through information retrieval or by selecting the corpus, that is, the set of textual files and content of interest.
• Extensive NLP is then applied: tokenizing the text, "part of speech tagging" and text sequencing to parse for syntax, and Named Entity Recognition (that is, identifying mentions of brands, people's names, places, common abbreviations, and so on).
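The tokenizing and entity-identification steps described above can be sketched in a few lines of Python. This is a minimal illustration that uses only the standard library and swaps real Named Entity Recognition for a simple dictionary lookup (a gazetteer). The `GAZETTEER` entries, the function names, and the sample comments are assumptions made for the example, not part of any tool named in this deck.

```python
import re
from collections import Counter

# Hypothetical gazetteer: lower-cased surface form -> entity type.
# A production system would use a trained NER model instead of a lookup table.
GAZETTEER = {"twitter": "BRAND", "facebook": "BRAND", "productx": "PRODUCT"}


def tokenize(text):
    """Split raw text into word tokens, dropping punctuation."""
    return re.findall(r"[A-Za-z0-9']+", text)


def find_entities(tokens):
    """Return (token, entity_type) pairs for tokens found in the gazetteer."""
    return [(tok, GAZETTEER[tok.lower()]) for tok in tokens if tok.lower() in GAZETTEER]


def mention_counts(corpus):
    """Count entity mentions across a corpus (a list of raw comment strings)."""
    counts = Counter()
    for doc in corpus:
        for tok, _entity_type in find_entities(tokenize(doc)):
            counts[tok.lower()] += 1
    return counts


if __name__ == "__main__":
    sample_corpus = [
        "Loving PRODUCTX, saw it on Twitter",
        "Facebook friends recommend ProductX",
    ]
    print(mention_counts(sample_corpus))
```

A lookup table only finds names that are already known. Part-of-speech tagging and statistical NER exist to handle unseen names and ambiguous words, which this sketch does not attempt.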
Social media datamarts and big data
Applying text mining to social media data raises unique challenges. The data that social networking sites, blogs, and forums generate falls into the category commonly referred to as big data. The data is unstructured or semi-structured, petabytes are generated around larger brands on a daily basis, and traditional relational databases cannot scale efficiently to support real-time analytics on it. Big data and NoSQL database solutions are therefore required.
Text mining tools
There are several commercial and open source options for text-mining software and applications.
Of the open source text mining tools, RapidMiner and R appear to be two of the most popular. R has the wider user base. It is a programming language, so analyses must be written as source code, and it offers a large selection of algorithms. However, scalability is an issue with R, so it is not ideal for large datasets without workarounds. RapidMiner has a smaller user base, but it does not require source code and has a powerful user interface (UI).
[Embedded list of other text mining tools]
Who does this text mining?
Dataset Providers
Spinn3r is a web service that provides raw access to posts, articles, tweets, status updates, etc. as they are published, in real or near real time, so that you can focus on building your application, mashup, or search engine. Spinn3r finds the sources, indexes their content, and takes care of the heavy lifting around delivering large amounts of relevant data.
It publishes an API on which companies can build analytic products.
• Spinn3r dataset: http://spinn3r.com
• 30 million articles/day (50GB of data)
• 20,000 news sources plus millions of blogs and forums
• And lots of tweets and public Facebook posts
Gnip and DataSift are among the many others who provide these kinds of datasets.
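To get a feel for what a feed of this size means for an ingestion pipeline, the daily figures quoted for Spinn3r can be converted into per-second rates. The sketch below is simple arithmetic on the two numbers from the slide. It assumes decimal gigabytes and an evenly spread load, neither of which the source states, and the function name is made up for the example.

```python
SECONDS_PER_DAY = 24 * 60 * 60

# Figures quoted on the slide for the Spinn3r dataset.
ARTICLES_PER_DAY = 30_000_000
BYTES_PER_DAY = 50 * 10**9  # assumption: "50GB" means decimal gigabytes


def daily_feed_rates(articles_per_day, bytes_per_day):
    """Convert daily volume into average per-second and per-article figures."""
    return {
        "articles_per_second": articles_per_day / SECONDS_PER_DAY,
        "bytes_per_article": bytes_per_day / articles_per_day,
        "bytes_per_second": bytes_per_day / SECONDS_PER_DAY,
    }


if __name__ == "__main__":
    for name, value in daily_feed_rates(ARTICLES_PER_DAY, BYTES_PER_DAY).items():
        print(f"{name}: {value:,.1f}")
```

Under these assumptions the feed averages roughly 350 articles per second at about 1.7 KB each. Real traffic is bursty, so a consumer would need headroom above the average.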
Now that you have the Datasets,
What Next?
Product Companies
There are many product companies who use these datasets and build analytical products
for organizations:
InsideView
With InsideView CRM+, your marketing, sales, and service teams can:
• Research market, company, contact, and competitor information
• Use real-time news and social network connections to target new leads and engage with
customers
• Enrich leads to help sales move from lead to win
• Update leads and contacts in your CRM through one-click integration
Tealeaf
Tealeaf's Customer Behavior Analysis Suite
• Improving the online customer experience is a top priority for many organizations, and Tealeaf's Customer Behavior Analysis Suite was created with this goal in mind. By using cxImpact, cxResults and cxView in concert, companies get both the quantitative data and the qualitative experience information necessary to understand customers' true experiences.
Similarly, further product companies that provide analytical tools built on these datasets:
www.sprinklr.com
www.leadformix.com
www.xactlycorp.com
www.moxiesoft.com
www.synaptris.com
www.quinstreet.com
www.enirogroup.com/en
www.saama.com
www.mu-sigma.com
And many more.
Conceptually, what do these
tools provide?
Sentiment Analysis: Predictive Analytic Technique
Sentiment analysis depends on an appropriate subjectivity lexicon that captures the relative positive, neutral, or negative polarity of a word or expression. The lexicon is specific to both language and context.
Consider this example:
I find PRODUCTX to be very good and useful, but it is a bit too expensive.
The expression (and therefore PRODUCTX) is rated as positive, since it contains two positive words, “good” and “useful”, and only one negative word, “expensive”. In addition, one of the positive words is strengthened by the word “very”, while the negative word is put into perspective by the qualifier “a bit”. The more advanced the lexica, the more detailed the analysis and the findings can be.
Sentiment analysis is a well-established, stand-alone predictive analytic technique.
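The PRODUCTX example can be reproduced with a very small lexicon-based scorer. This is a sketch under stated assumptions: the polarity values in `LEXICON`, the multipliers in `MODIFIERS` (1.5 for "very", 0.5 for "bit"), and the two-token look-back `WINDOW` are all invented for illustration, and real subjectivity lexica are far larger and context specific.

```python
import re

# Hypothetical subjectivity lexicon: word -> polarity (+1 positive, -1 negative).
LEXICON = {"good": 1.0, "useful": 1.0, "expensive": -1.0}

# Hypothetical modifiers: "very" strengthens, "(a) bit" weakens the next sentiment word.
MODIFIERS = {"very": 1.5, "bit": 0.5}

# How many tokens before a sentiment word are scanned for modifiers.
WINDOW = 2


def score_sentence(text):
    """Sum the modifier-weighted polarities of all lexicon words in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = 0.0
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue
        weight = 1.0
        for prev in tokens[max(0, i - WINDOW):i]:
            weight *= MODIFIERS.get(prev, 1.0)
        total += LEXICON[tok] * weight
    return total


def label(score):
    """Map a numeric score to a coarse sentiment label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    sentence = "I find PRODUCTX to be very good and useful, but it is a bit too expensive."
    score = score_sentence(sentence)
    print(score, label(score))  # 2.0 positive
```

With these weights, "very good" contributes 1.5, "useful" 1.0, and "a bit too expensive" -0.5, so the sentence scores 2.0 and is labelled positive, matching the reasoning in the slide. The scorer ignores negation ("not good") and sarcasm, which is where richer lexica and models come in.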
Social Media Scorecards
These tools are generally cloud-based applications that pull together many different social media data sources (datasets), including communities and blogs. They can do this because they generally incorporate a massive back-end infrastructure that constantly crawls and captures new data from the APIs as it appears.
They all provide an interface for filtering the data and entering selection criteria across a broad range of channels. The results usually take the form of a visual scorecard that combines graphical and tabular techniques for displaying the summarized information. Many allow an interactive “drill down” to see further details, and most of them let you drill right through to the original source of the data.
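The filter, summarize, and drill-down pattern behind such scorecards can be sketched as a small aggregation over already-classified mentions. The record layout, the `MENTIONS` sample data, the example.com URLs, and the function names are all hypothetical; commercial tools will differ in both data model and scale.

```python
from collections import defaultdict

# Hypothetical pre-classified mentions; the URLs are placeholders.
MENTIONS = [
    {"channel": "twitter", "sentiment": "positive", "url": "https://example.com/t/1"},
    {"channel": "twitter", "sentiment": "negative", "url": "https://example.com/t/2"},
    {"channel": "blogs", "sentiment": "positive", "url": "https://example.com/b/1"},
    {"channel": "forums", "sentiment": "neutral", "url": "https://example.com/f/1"},
    {"channel": "twitter", "sentiment": "positive", "url": "https://example.com/t/3"},
]


def scorecard(mentions, channels=None):
    """Count mentions per channel and sentiment, optionally filtered by channel."""
    table = defaultdict(lambda: {"positive": 0, "neutral": 0, "negative": 0})
    for m in mentions:
        if channels is not None and m["channel"] not in channels:
            continue
        table[m["channel"]][m["sentiment"]] += 1
    return dict(table)


def drill_down(mentions, channel, sentiment):
    """Return the source URLs behind one cell of the scorecard."""
    return [
        m["url"]
        for m in mentions
        if m["channel"] == channel and m["sentiment"] == sentiment
    ]


if __name__ == "__main__":
    print(scorecard(MENTIONS, channels={"twitter", "blogs"}))
    print(drill_down(MENTIONS, "twitter", "positive"))
```

Keeping the source URL on every record is what makes the drill-through to the original post possible. The summary table alone would not allow it.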
Technologies Used by These Product Companies
Big data technologies:
• Hadoop ecosystem (HDFS, Pig, Hive, Oozie, HBase, Mahout)
• Cloudera distributions (CDH3 and CDH4)
• PostgreSQL + PostGIS
• Cassandra
Languages:
• Java
• Perl
Cloud computing technologies:
• Amazon Web Services (AWS) / Amazon EC2
• Amazon S3
• Amazon EMR
• Amazon CloudWatch
Understanding Social Media Analytics : Big Picture

  • 1. © This document contains confidential and proprietary information of Adroitent. It is furnished for evaluation purposes only. Except with the express prior written permission of Adroitent, this document and the information contained herein may not be published, disclosed, or used for any other purpose. | www.adroitent.com Understanding Social Media Analytics SANDEEP SEERAPU
  • 2. • Web is no longer a static library that people passively browse • Web is a place where people: o Consume and create content o Interact with other people:  Internet forums, Blogs, Social networks, Twitter, Wikis, Podcasts, Slide sharing, Bookmark sharing, Product reviews, Comments, … • DATA POINT: Facebook traffic tops Google (for USA) • March 2010: FB > 7% of US traffic http://money.cnn.com/2010/03/16/technology/facebook_most_visited Social Media : Big Change
  • 3. • Rich and big data: • Billions users, billions contents • Textual, Multimedia (image, videos, etc.) • Billions of connections • Behaviours, preferences, trends... • Data is open and easy to access • It’s easy to get data from Social Media • Datasets • Developers APIs • Spidering the Web Social Media : Rich and Big data
  • 4. Social Media : Opportunities Any user can share and contribute content, express opinions, link to others This means: Can data-mine opinions and behaviours of millions of users to gain insights into: • Human behaviour • Marketing analytics • Product sentiment
  • 5. What can we do with this data?
  • 6. • Consumer Brand Analytics • What are people saying about our brand? • Marketing Communications • Significant spending on marketing, advertising: • Companies trying to position their products • Brand analytics helps to determine whether such campaigns are effective • Product reviews • Automatically mine product reviews for information on product features, new requests, … • Easy to use, Comfortable chair, Light weight, Sturdy, Good price Applications: Reputation Management
  • 7. • Citizen response • Solicit citizen feedback on bills debated in Congress • What new issues are being raised, what aspects of bill are popular, unpopular • Political Campaigns • Why do people support a candidate? • Law enforcement • Gang members boast about their activities on Facebook • Protests being planned through Twitter • NYT: Sending the Police Before There’s a Crime http://www.nytimes.com/2011/08/16/us/16police.html?_r=1 Applications: Citizen Response
  • 8. • Viral marketing: • Personalized recommendations Online forum users are • Brand advocates: • 79.2% of forum contributors help a friend to make a decision about a product • purchase (47.6% of non-contributors). • 65% of forum contributors share advice (offline and in person) based on information that they’ve read online (35% of non-contributors) http://www.socialmediaexaminer.com/new-studies-show-value-of-social-media Applications: Social Media Marketing
  • 9. Information Flow How do we capture and model the flow of information?
  • 10. Given that social media generate a wealth of consumer data, how can brands turn raw social media comment data from Twitter, Facebook, blogs, and forums into actionable business insights? The answer lies in the application of text-mining and semantic technology to these new sources of unstructured data. How does it work? • Text mining is similar to data mining in that it is aimed at identifying interesting patterns in data • The first step in any text-mining effort is to identify the text-based sources to be analysed and gather this material through information retrieval or selecting the corpus that comprises the set of textual files and content of interest. • Extensive NLP is deployed that invokes "part of speech tagging" and text sequencing to parse for syntax (that is, tokenizing text) and applying Named Entity Recognition (that is, identifying the mention of brands, people's names, places, common abbreviations, and so on). Text mining and semantic methods
  • 11. Unique challenges exist when setting out to apply text mining to social media data. The data that social networking sites, blogs, and forums generate falls in the category of what is commonly referred to as big data. The data is unstructured and semi-structured, petabytes are generated around larger brands on a daily basis, and traditional relational databases cannot efficiently scale to support real-time analytics based on the data. Big data and NoSQL database solutions are therefore required. Social media datamarts and big data
  • 12. There are several commercial and open source options for text-mining software and applications. Of the open source text mining tools, RapidMiner and R appear to be two of the most popular. R has the wider user base and a large selection of algorithms, but it is a programming language, so analyses must be written as source code. Scalability is also an issue with R, so it is not ideal for large datasets without workarounds. RapidMiner has a smaller user base, but it does not require writing source code and has a powerful user interface (UI). The original slide embeds a list of other text mining tools. Text mining tools
  • 13. Who does this text mining?
  • 14. Spinn3r is a web service that provides raw access to posts, articles, tweets, status updates, etc. as they are published, in real or near real time, allowing you to focus on building your application, mashup, or search engine. Spinn3r finds the sources, indexes their content, and takes care of the heavy lifting around delivering large amounts of relevant data. It publishes an API that companies can use to build analytic products on top of this data. • Spinn3r Dataset: http://spinn3r.com • 30 million articles/day (50GB of data) • 20,000 news sources + millions of blogs and forums • And lots of tweets and public Facebook posts Gnip and DataSift are among the many others who provide these kinds of datasets. Dataset Providers
  • 15. Now that you have the Datasets, What Next?
  • 16. Product Companies There are many product companies that use these datasets to build analytical products for organizations: • InsideView: with InsideView CRM+, your marketing, sales, and service teams can: • Research market, company, contact, and competitor information • Use real-time news and social network connections to target new leads and engage with customers • Enrich leads to help sales move from lead to win • Use one-click CRM integration to update leads and contacts in your CRM • Tealeaf: Tealeaf's Customer Behavior Analysis Suite • Improving online customer experience is a top priority for many organizations, and Tealeaf's Customer Behavior Analysis Suite was created with this goal in mind. By using cxImpact, cxResults and cxView in concert, companies have both the quantitative data and the qualitative experience information needed to understand customers' true experiences
  • 17. Similarly, further product companies that provide analytical tools built on these datasets: www.sprinklr.com www.leadformix.com www.xactlycorp.com www.moxiesoft.com www.synaptris.com www.quinstreet.com www.enirogroup.com/en www.saama.com www.mu-sigma.com And many more.
  • 18. Conceptually, what do these tools provide?
  • 19. Sentiment analysis depends on an appropriate subjectivity lexicon that captures the relative positive, neutral or negative sense of a word or expression. It is both language and context specific. An example: “I find PRODUCTX to be very good and useful, but it is a bit too expensive.” The expression (and therefore PRODUCTX) is rated as positive, since it contains two positive words, “good” and “useful”, and only one negative word, “expensive”. In addition, one of the positive words is strengthened by the word “very”, while the negative word is softened by the qualifier “a bit”. The more advanced the lexica, the more detailed the analysis and the findings can be. Sentiment analysis is a well-established, stand-alone predictive analytic technique. Sentiment Analysis: Predictive Analytic Technique
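The PRODUCTX example can be reproduced with a toy lexicon-based scorer. This is a minimal sketch under stated assumptions: the word weights, the modifier multipliers (1.5 for "very", 0.5 for "bit") and the two-token look-back window are illustrative choices, not values from any published subjectivity lexicon.

```python
import re

# Toy subjectivity lexicon: illustrative weights only (an assumption).
LEXICON = {"good": 1.0, "useful": 1.0, "expensive": -1.0}

# Modifiers that scale the next sentiment word: "very" strengthens it,
# "bit" (as in "a bit") softens it. Multipliers are assumptions.
MODIFIERS = {"very": 1.5, "bit": 0.5}

WINDOW = 2  # how many preceding tokens to scan for a modifier


def score(text):
    """Sum the lexicon weights, scaled by any modifier in the look-back window."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = 0.0
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue
        weight = LEXICON[tok]
        for prev in tokens[max(0, i - WINDOW):i]:
            weight *= MODIFIERS.get(prev, 1.0)
        total += weight
    return total


def label(value):
    """Map a numeric score to positive, negative, or neutral."""
    if value > 0:
        return "positive"
    if value < 0:
        return "negative"
    return "neutral"


sentence = "I find PRODUCTX to be very good and useful, but it is a bit too expensive."
result = label(score(sentence))
```

With these weights, "very good" contributes 1.5, "useful" contributes 1.0, and "a bit too expensive" contributes -0.5, so the sentence scores 2.0 and is labeled positive, matching the reasoning on the slide. A richer lexicon would also need to handle negation ("not good") and context-specific meanings, which this sketch ignores.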
  • 20. These tools are generally cloud-based applications that pull together many different social media data sources (datasets), including communities and blogs. They can do this because they generally incorporate a massive back-end infrastructure that constantly crawls the APIs and captures new data as it appears. They all provide an interface to filter the data and enter selection criteria across a broad range of channels. The results usually take the form of a visual scorecard that combines graphical and tabular techniques to display the summarized information. Many allow an interactive “drill down” to see further details, and most let you drill right through to the original source of the data. Social Media Scorecards
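The filter, summarize, and drill-down behaviour of such a scorecard can be sketched as follows. Everything here is hypothetical: the `MENTIONS` records are made-up sample data with placeholder URLs, and the function names are illustrative rather than taken from any of the products mentioned.

```python
from collections import Counter, defaultdict

# Made-up sample records for illustration only; a real tool would
# read these from its crawled and sentiment-scored data store.
MENTIONS = [
    {"channel": "twitter", "sentiment": "positive", "url": "https://example.com/t/1"},
    {"channel": "twitter", "sentiment": "negative", "url": "https://example.com/t/2"},
    {"channel": "blog", "sentiment": "positive", "url": "https://example.com/b/1"},
    {"channel": "forum", "sentiment": "neutral", "url": "https://example.com/f/1"},
    {"channel": "twitter", "sentiment": "positive", "url": "https://example.com/t/3"},
]


def scorecard(mentions, channels=None):
    """Count mentions per channel and sentiment, optionally filtered by channel."""
    summary = defaultdict(Counter)
    for m in mentions:
        if channels is None or m["channel"] in channels:
            summary[m["channel"]][m["sentiment"]] += 1
    return {channel: dict(counts) for channel, counts in summary.items()}


def drill_down(mentions, channel, sentiment):
    """Return the original source URLs behind one cell of the scorecard."""
    return [
        m["url"]
        for m in mentions
        if m["channel"] == channel and m["sentiment"] == sentiment
    ]


summary = scorecard(MENTIONS)
sources = drill_down(MENTIONS, "twitter", "positive")
```

The `summary` dictionary is the tabular core that a dashboard would chart, and `drill_down` returns the source links behind a single cell, which is the "drill right through to the original source" step.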
  • 21. Technologies Used by these Product Companies • Big data technologies: Hadoop frameworks (HDFS, Pig, Hive, Oozie, HBase, Mahout), Cloudera distributions (CDH3 and CDH4), PostgreSQL + PostGIS, Cassandra • Languages: Java, Perl • Cloud computing technologies: Amazon Web Services (AWS) / Amazon EC2, Amazon S3, Amazon EMR, Amazon CloudWatch