SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
Social Data Analytics
using
IBM Big Data technologies

Vijay Bommireddipalli vijayrb@us.ibm.com
Development Manager, Social Data Accelerator
IBM Big Data

October 21, 2013

© 2012 IBM Corporation
Please note
IBM’s statements regarding its plans, directions, and intent are subject to change or
withdrawal without notice at IBM’s sole discretion.
Information regarding potential future products is intended to outline our general
product direction and it should not be relied on in making a purchasing decision.
The information mentioned regarding potential future products is not a commitment,
promise, or legal obligation to deliver any material, code or functionality. Information
about potential future products may not be incorporated into any contract. The
development, release, and timing of any future features or functionality described for
our products remains at our sole discretion.
Performance is based on measurements and projections using standard IBM
benchmarks in a controlled environment. The actual throughput or performance that
any user will experience will vary depending upon many factors, including
considerations such as the amount of multiprogramming in the user’s job stream, the
I/O configuration, the storage configuration, and the workload processed. Therefore,
no assurance can be given that an individual user will achieve results similar to those
stated here.

2

© 2011 IBM Corporation
Before we begin …

3

© 2011 IBM Corporation
Tag ! You’re it !
- Micro-segmentation

4

© 2011 IBM Corporation
Social Data Analytics
- Using social media as a rich source of information
Behavior

Maybe our politicians should take
a playbook out of the rivalry
between duke/unc and take it
to the courts
http://ity.com/wfUsir

I'm at Mickey's Irish Pub Downtown
(206 3rd St, Court Ave, Raleigh) w/ 2
others http://4sq.com/gbsaYR
@silliesylvia good!!! U
Interest
shouldnt! Think about the
Location
important stuff, like ur 43rd
birthday ;)
@silliesylvia I <3 your leather
Consumption
btw happy birthday Sylvia ;)
leggings!! Its so katniss!!
dear redbox please have
kings speech for my new tv
colin firth movie marathon

Age

Intent to consume
@silliesylvia $10 dollars says
matthew & mary get married
next season :)
#downtownabbey
OMG OMG. just
dropped my new ipad3
crappola!!!
Consumption

5

Prediction

Interest

@bamagirl can’t wait to
watch sherlock with you!
Oh, robert downey jr, I still
love you but bbc is so
amazing

Intent to consume

360 degree profile
Personal Attributes
• Sylvia Campbell, Female, In a
Relationship
• 32 years old, birthday on 7/17
• Lives near Raleigh, NC
• College graduate; Income of 80-120k
Buzz/Sentiment
• Retweets BF’s comments
• Interest in BBC shows: Downton Abbey,
Sherlock, Fringe, (P&P?)
• Sherlock Holmes, Robert Downey, Jr.
• Hunger Games, Katniss/J. Lawrence
Interests/Behavior
• Watch movies, tv shows
• Romance plots, “hero types”, strong
women
• Uses iPad 3, Redbox, Hulu
• Shopping , interest in sales/deals
• Duke/ UNC basketball
© 2011 IBM Corporation
Social Data Analytics
- Comprehensive Entity Extraction and Integration
Name: Jane Doe
Id: jaydee
Address: Home of
the Buccaneers
Interests: running,
yoga, football…

Name: Jane Doe
Name: Jane Doe, Cava
Address: Tampa, FL
Address: Tampa, Fl
Twitter: jaydee
Twitter: @maryguida
Blog Topic: food
Blog Topic: politics
Hobbies: running, yoga, …
Hobbies: running, yoga, …
Relationships: Tony C (brother)…
Relationships: Tony C (brother)…

Name: J Doe
Blog Topic: food

Entity
Integration

Name: jane
Address: Tampa, FL
Relationships: Tony C
(brother)., …

All names are fictitious
6

Challenges:

Scale
 1000’s sites, 100s millions users


Complex matching decisions
 Partial, noisy and incomplete profile
attributes
 Only 3% of consumers have sufficient
attribute information in their profiles.
© 2011 IBM Corporation
Consumer Intelligence
Timely Insights
• Intent to buy various products
• Current Location

Personal Attributes
• Identifiers: name, address, age,
gender, occupation…
• Interests: sports, pets, cuisine…
• Life Cycle Status: marital, parental

Social Media based
360-degree
Consumer Profiles

• Personal relationships: family,
friends and roommates…
• Business relationships: co-workers
and work/interest network…

• Life-changing events: relocation,
having a baby, getting married,
getting divorced, buying a house…

What should I buy?? A mini laptop with Windows 7
OR a Apple MacBook!??!

Location announcements
I'm at Starbucks Parque Tezontle
http://4sq.com/fYReSj
7

• Personal preferences of products
• Product Purchase history

Relationships

Life Events

Monetizable intent to buy
I need a new
products digital camera for my food pictures,
any recommendations around 300?

Products Interests

Life Events
College: Off to Stanford for my MBA! Bbye chicago!
Looks like we'll be moving to New Orleans sooner than I
thought.

Intent to buy a house
I'm thinking about buying a home in Buckingham Estates
per a recommendation. Anyone have advice on that area?
#atx #austinrealestate #austin

© 2011 IBM Corporation
Social Data Analytics
- Profile construction

8

© 2011 IBM Corporation
Social Data Analytics
- Profile construction

9

© 2011 IBM Corporation
Big Data Platform and Accelerators - Summary


Software components that
accelerate development and/or
implementation of specific
solutions or use cases on top
of the Big Data platform



Provide business logic, data
processing, and
UI/visualization, tailored for a
given use case



Analytic Applications

Bundled with Big Data platform
components – InfoSphere
BigInsights and InfoSphere
Streams

BI /
Exploration / Functional Industry Predictive Content
Reporting Visualization
App
App
Analytics Analytics

IBM Big Data Platform
Visualization
& Discovery

Applications &
Development

Systems
Management

Accelerators
Hadoop
System

Stream
Computing

Data
Warehouse

Contextual
Search

Key Benefits


Information Integration & Governance

Time to value



Leverage best practices
around implementation of a
given use case.

Cloud | Mobile | Security

10

© 2011 IBM Corporation
Social Media Analytics Architecture
Online flow: Data-in-motion analysis

Real time analytics.
Pre-defined views
and charts

Stream Computing and Analytics

Social Media

Data Ingest
and Prep

Entity
Analytics:
Profile
Resolution

Extract Buzz,
Intent ,
Sentiment

Dashboard

BigInsights System and Analytics

Social Media
Data

Extract Buzz,
Intent ,
Sentiment And
Consumer
Profiles

Entity
Analytics and
Integration

Comprehensive
Social Media
Customer
Profiles

Pre-defined
Workbooks and
Dashboards

Offline flow: Data-at-rest analysis

Data Explorer
Index using
Push API

Ad hoc access

Optional: Indexed Search
11

© 2011 IBM Corporation
SDA 1.2
 Social Media Sources Supported
– Gnip, Boardreader
– Tweets, Boards, Blogs

 Analyze Streaming data as well as data at rest

– Streams for processing of streaming data
– BigInsights/Hadoop for input, output and configuration data

 Key Micro-segmentation Attributes (out-of-box)

– Personal Info: Gender, Location, Parental status, Marital status, Employment
– Interests: Movie interest, Comic book fan, Product interest, Current customer
of, Products owned
– ** Attributes can be added in (requires some development effort)

 Entity resolution across the different social media sources

12

© 2011 IBM Corporation
SDA 1.2

 Outputs/Measures (out-of-box)
–
–
–
–

Buzz
Sentiment
Intent to buy/start service
Intend to attend/see

 Example use cases
–
–
–
–

Retail – Lead generation, Brand management
Financial – Lead generation and Brand management
Media & Entertainment: Brand management
Generic

 Visualization using BigSheets
 Extendable/Customizable Solution

13

© 2011 IBM Corporation
SDA - Acting on the insights
 Metrics based understanding of Feedback in Social Media
– And more importantly Feedback from whom !

 Comprehensive (social media) profiles with microsegmentation
information
 Campaign execution can be done in Social Media
 Entity resolution across the different social media sources
 External (social media) to Internal (CRM) linkage **coming

14

© 2011 IBM Corporation
SDA Outputs
 Pre-defined Workbooks
 Dashboards

 Granular outputs for further slicing and dicing by Data Scientists

15

© 2011 IBM Corporation
SDA Conceptual Flow

16

© 2011 IBM Corporation
BigInsights & Streams Text Analytics
High Performance rule based Information Extraction Engine
 Highly scalable solution available for at-rest and in-motion analytics
 Pre-built extractors, and toolkit to build custom Extractors

• Rich Extractor library supports multiple languages
• Declarative Information Extraction (IE) system based on an algebraic
framework

Sophisticated tooling to help build, test, and refine rules
Developed at IBM Research since 2004
Embedded in several IBM products
• BigInsights, Streams.
• Lotus Notes
• Cognos Consumer Insights
What is TA

17

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
Applications of Text analytics
Broad range of applications in many industries
• CRM Analytics
Voice of customer
Product and Services gap analysis
Customer churn

• Social Media Analytics
Purchase intent
Customer churn prediction
Reputational Risk

• Digital Piracy
Illegal broadcast of streaming and video content

• Log Analytics
Failure analysis and root cause identification
Availability assurance

• Regulatory Compliance
Data Redaction
• Identify and protect sensitive information
18

What is TA

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
Performance Comparison (with ANNIE open source **)
Task: Named Entity Recognition
Dataset : Different document collections from the Enron corpus obtained by randomly sampling 1000 documents for each
size

Throughput (KB/sec)

700
600
500
400
ANNIE
Open Source Entity Tagger

300

>10x faster
< 60% memory

SystemT

200
100
0
0

20

40

60

80

100

Average document size (KB)

** http://dl.acm.org/citation.cfm?id=1858681.1858695

Performance comparison with GATE 5
What is TA

19

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
Text Analytics Development Flow
 Declarative language for extractor logic
 Optimization and deployment to scalable runtime
Extracted
Information

Development Tooling

Extractor

Text Analytics
Optimizer

Compiled
Operator
Graph

Text Analytics
Runtime

Sample Input
Documents
Rule based language
Annotator Query Language - AQL
with familiar SQL-like syntax
Specify annotator semantics
declaratively

Choose an efficient
execution plan

Highly scalable,
embeddable Java runtime

What is TA

20

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
Invoking Text Analytics within BigInsights

Document
encoded as
JSON record.

Jaql runtime coordinates a
multi-stage map-reduce flow.
JAQL Function Wrapper

Input Record
{
label: “http://www.ibm ...”,
text: “<html>n<head> …”
}

AQL
SystemT
Optimizer
Dictionaries
21

Input
Adapter

SystemT
Runtime

Compiled
Plan

Output
Adapter

Output Record
{
label: “http://www.ibm ...”,
text: “<html>n<head> …”
Person:
[
{ firstName: [10, 15],
lastName: [16, 25] },
…
{ firstName: [1042, 1045],
lastName: [1046, 1050] }
],
Hyperlink:
[
{ anchorText: [25, 33] },
…
{ anchorText: [990, 997] }
],
H1: …

Annotations added as
additional attributes to
JSON} record.

© 2011 IBM Corporation
Additional Advantages of IBM Text Analytics
Quality: Drives effectiveness of entire application
• Enables high accuracy and coverage
Performance: Dominant cost is CPU
• Process large documents and large number of documents
with high throughput
Explain-ability
• Determine the cause of errors and fix it without affecting the
remaining correct results

Reusability: easily adaptable for a different domain
• The development platform must enable layers of abstractions to be built and easily reused
in a different domain
Expressivity
• Rule language with a rich set of operators available to enable complex extraction tasks

What is TA

22

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
BigInsights Text Analytics Development

What is TA

23

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
AQL editor with content assist

24

What is TA

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
Understanding the lineage of results

Click to drill down and see
the rules that triggered
inclusion of results

Explain and search
through the results

What is TA

25

Why
Biginsights
TA

How is TA
Deployed
& used

Dev. tools

© 2011 IBM Corporation
IBM Text Analytics for Big Data
High Performance Information Extraction Engine

Analysis can be applied to data at-rest and in-motion
• Build extractor once and use with BigInsights or Streams
Parallel execution scales to Big Data volumes
• Linearly scalable to extremely high volumes
Highly customizable to a variety of domains and languages
• Pre-built extractors available out of the box
Sophisticated tooling enables ease of development and refinement of results

26

© 2011 IBM Corporation
Thank you

27

© 2011 IBM Corporation

Mais conteúdo relacionado

Mais procurados

Buzzient Overview Svb Mar232010 Final
Buzzient Overview Svb Mar232010 FinalBuzzient Overview Svb Mar232010 Final
Buzzient Overview Svb Mar232010 FinalTBJ Investments, LLC
 
Mobile Analytics (Mobile BI) - A Game Changer
Mobile Analytics (Mobile BI) - A Game Changer Mobile Analytics (Mobile BI) - A Game Changer
Mobile Analytics (Mobile BI) - A Game Changer Jitender Aswani
 
App Lifecycle Management for enterprise mobile apps
App Lifecycle Management for enterprise mobile appsApp Lifecycle Management for enterprise mobile apps
App Lifecycle Management for enterprise mobile appsVishal Jain
 
My Research through a BPM Lense
My Research through a BPM LenseMy Research through a BPM Lense
My Research through a BPM LenseHamid Motahari
 
Introduction to Mobile Business Intelligence
Introduction to Mobile Business IntelligenceIntroduction to Mobile Business Intelligence
Introduction to Mobile Business IntelligenceVamshi Vangapally
 
Mobile Computing / Mobile BI Market Study 2014
Mobile Computing / Mobile BI Market Study 2014Mobile Computing / Mobile BI Market Study 2014
Mobile Computing / Mobile BI Market Study 2014BiBoard.Org
 
Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...
Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...
Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...ProductNation/iSPIRT
 
Anuj sharma i labscamp
Anuj sharma   i labscampAnuj sharma   i labscamp
Anuj sharma i labscampssuserc3420c
 
Cognitive Work Assistants - Vision and Open Challenges
Cognitive Work Assistants - Vision and Open ChallengesCognitive Work Assistants - Vision and Open Challenges
Cognitive Work Assistants - Vision and Open ChallengesHamid Motahari
 

Mais procurados (11)

Buzzient Overview Svb Mar232010 Final
Buzzient Overview Svb Mar232010 FinalBuzzient Overview Svb Mar232010 Final
Buzzient Overview Svb Mar232010 Final
 
Mobile Analytics (Mobile BI) - A Game Changer
Mobile Analytics (Mobile BI) - A Game Changer Mobile Analytics (Mobile BI) - A Game Changer
Mobile Analytics (Mobile BI) - A Game Changer
 
App Lifecycle Management for enterprise mobile apps
App Lifecycle Management for enterprise mobile appsApp Lifecycle Management for enterprise mobile apps
App Lifecycle Management for enterprise mobile apps
 
My Research through a BPM Lense
My Research through a BPM LenseMy Research through a BPM Lense
My Research through a BPM Lense
 
Introduction to Mobile Business Intelligence
Introduction to Mobile Business IntelligenceIntroduction to Mobile Business Intelligence
Introduction to Mobile Business Intelligence
 
Mobile Computing / Mobile BI Market Study 2014
Mobile Computing / Mobile BI Market Study 2014Mobile Computing / Mobile BI Market Study 2014
Mobile Computing / Mobile BI Market Study 2014
 
Developing a Mobile Strategy to Reach Our Clients
Developing a Mobile Strategy to Reach Our ClientsDeveloping a Mobile Strategy to Reach Our Clients
Developing a Mobile Strategy to Reach Our Clients
 
MOBILE BI
MOBILE BIMOBILE BI
MOBILE BI
 
Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...
Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...
Presentation by VondaSoft at the Zensar #TechShowcase. An iSPIRT ProductNatio...
 
Anuj sharma i labscamp
Anuj sharma   i labscampAnuj sharma   i labscamp
Anuj sharma i labscamp
 
Cognitive Work Assistants - Vision and Open Challenges
Cognitive Work Assistants - Vision and Open ChallengesCognitive Work Assistants - Vision and Open Challenges
Cognitive Work Assistants - Vision and Open Challenges
 

Destaque

The BrainYard Webcast - Social Business & Social Analytics - 3/1/12
The BrainYard Webcast - Social Business & Social Analytics - 3/1/12The BrainYard Webcast - Social Business & Social Analytics - 3/1/12
The BrainYard Webcast - Social Business & Social Analytics - 3/1/12Mark Heid
 
IBM-Why Big Data?
IBM-Why Big Data?IBM-Why Big Data?
IBM-Why Big Data?Kun Le
 
The web economy in 20 minutes
The web economy in 20 minutesThe web economy in 20 minutes
The web economy in 20 minutesClio Bulgarella
 
The Chronicles of a Mobile-Web Economy
The Chronicles of a Mobile-Web EconomyThe Chronicles of a Mobile-Web Economy
The Chronicles of a Mobile-Web EconomyBernard Leong
 
Enterprise Search in the Big Data Era: Recent Developments and Open Challenges
Enterprise Search in the Big Data Era: Recent Developments and Open ChallengesEnterprise Search in the Big Data Era: Recent Developments and Open Challenges
Enterprise Search in the Big Data Era: Recent Developments and Open ChallengesYunyao Li
 
Hadoop in Data Warehousing
Hadoop in Data WarehousingHadoop in Data Warehousing
Hadoop in Data WarehousingAlexey Grigorev
 
mapReduce for machine learning
mapReduce for machine learning mapReduce for machine learning
mapReduce for machine learning Pranya Prabhakar
 
High-level languages for Big Data Analytics (Presentation)
High-level languages for Big Data Analytics (Presentation)High-level languages for Big Data Analytics (Presentation)
High-level languages for Big Data Analytics (Presentation)Jose Luis Lopez Pino
 
Mobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big PictureMobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big PictureReza Rahimi
 
Rapid KL Transport System
Rapid KL Transport System Rapid KL Transport System
Rapid KL Transport System Yousef Abujubba
 
Internet of Things (IoT) - We Are at the Tip of An Iceberg
Internet of Things (IoT) - We Are at the Tip of An IcebergInternet of Things (IoT) - We Are at the Tip of An Iceberg
Internet of Things (IoT) - We Are at the Tip of An IcebergDr. Mazlan Abbas
 
10 Revealing Statistics About Compensation & Benefits You should Know
10 Revealing Statistics About Compensation & Benefits You should Know10 Revealing Statistics About Compensation & Benefits You should Know
10 Revealing Statistics About Compensation & Benefits You should KnowElodie A.
 
How to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your NicheHow to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your NicheLeslie Samuel
 

Destaque (18)

The BrainYard Webcast - Social Business & Social Analytics - 3/1/12
The BrainYard Webcast - Social Business & Social Analytics - 3/1/12The BrainYard Webcast - Social Business & Social Analytics - 3/1/12
The BrainYard Webcast - Social Business & Social Analytics - 3/1/12
 
IBM-Why Big Data?
IBM-Why Big Data?IBM-Why Big Data?
IBM-Why Big Data?
 
Pig Experience
Pig ExperiencePig Experience
Pig Experience
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Michalis Vafopoulos - Web Economy in perspective
Michalis Vafopoulos - Web Economy in perspectiveMichalis Vafopoulos - Web Economy in perspective
Michalis Vafopoulos - Web Economy in perspective
 
The web economy in 20 minutes
The web economy in 20 minutesThe web economy in 20 minutes
The web economy in 20 minutes
 
The Chronicles of a Mobile-Web Economy
The Chronicles of a Mobile-Web EconomyThe Chronicles of a Mobile-Web Economy
The Chronicles of a Mobile-Web Economy
 
Enterprise Search in the Big Data Era: Recent Developments and Open Challenges
Enterprise Search in the Big Data Era: Recent Developments and Open ChallengesEnterprise Search in the Big Data Era: Recent Developments and Open Challenges
Enterprise Search in the Big Data Era: Recent Developments and Open Challenges
 
Hadoop in Data Warehousing
Hadoop in Data WarehousingHadoop in Data Warehousing
Hadoop in Data Warehousing
 
mapReduce for machine learning
mapReduce for machine learning mapReduce for machine learning
mapReduce for machine learning
 
High-level languages for Big Data Analytics (Presentation)
High-level languages for Big Data Analytics (Presentation)High-level languages for Big Data Analytics (Presentation)
High-level languages for Big Data Analytics (Presentation)
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Mobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big PictureMobile Cloud Computing: Big Picture
Mobile Cloud Computing: Big Picture
 
Rapid KL Transport System
Rapid KL Transport System Rapid KL Transport System
Rapid KL Transport System
 
Data visualization
Data visualizationData visualization
Data visualization
 
Internet of Things (IoT) - We Are at the Tip of An Iceberg
Internet of Things (IoT) - We Are at the Tip of An IcebergInternet of Things (IoT) - We Are at the Tip of An Iceberg
Internet of Things (IoT) - We Are at the Tip of An Iceberg
 
10 Revealing Statistics About Compensation & Benefits You should Know
10 Revealing Statistics About Compensation & Benefits You should Know10 Revealing Statistics About Compensation & Benefits You should Know
10 Revealing Statistics About Compensation & Benefits You should Know
 
How to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your NicheHow to Become a Thought Leader in Your Niche
How to Become a Thought Leader in Your Niche
 

Semelhante a Social Data Analytics using IBM Big Data Technologies

Analyzing your market: How Big Data plus Social Equals Better Engagement
Analyzing your market: How Big Data plus Social Equals Better EngagementAnalyzing your market: How Big Data plus Social Equals Better Engagement
Analyzing your market: How Big Data plus Social Equals Better Engagementcadmef
 
Top 7 BI Trends for 2015
Top 7 BI Trends for 2015Top 7 BI Trends for 2015
Top 7 BI Trends for 2015Logi Analytics
 
What Will You Build with IBM Watson on Bluemix?
What Will You Build with IBM Watson on Bluemix?What Will You Build with IBM Watson on Bluemix?
What Will You Build with IBM Watson on Bluemix?Swami Chandrasekaran
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMBig Data Joe™ Rossi
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMBig Data Joe™ Rossi
 
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Niu Bai
 
Building intelligent APIs - Andy Thurai, IBM
Building intelligent APIs - Andy Thurai, IBMBuilding intelligent APIs - Andy Thurai, IBM
Building intelligent APIs - Andy Thurai, IBMPAPIs.io
 
What’s New and Coming in IBM Connections
What’s New and Coming in IBM ConnectionsWhat’s New and Coming in IBM Connections
What’s New and Coming in IBM ConnectionsLetsConnect
 
Unlock your Big Data with Analytics and BI on Office 365 - OFF103
Unlock your Big Data with Analytics and BI on Office 365 - OFF103Unlock your Big Data with Analytics and BI on Office 365 - OFF103
Unlock your Big Data with Analytics and BI on Office 365 - OFF103Brian Culver
 
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad talk in South Africa:  From BigData to Data ScienceUsama Fayyad talk in South Africa:  From BigData to Data Science
Usama Fayyad talk in South Africa: From BigData to Data ScienceUsama Fayyad
 
Ways Artificial Intelligence Can Improve Your Business with IBM Watson
Ways Artificial Intelligence Can Improve Your Business with IBM WatsonWays Artificial Intelligence Can Improve Your Business with IBM Watson
Ways Artificial Intelligence Can Improve Your Business with IBM WatsonMarkus Van Kempen
 
Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Brian Culver
 
Informix REST API Tutorial
Informix REST API TutorialInformix REST API Tutorial
Informix REST API TutorialBrian Hughes
 
Outside the Comfort Zone: Cross Industry Use Cases in Big Data Analytics
Outside the Comfort Zone: Cross Industry Use Cases in Big Data AnalyticsOutside the Comfort Zone: Cross Industry Use Cases in Big Data Analytics
Outside the Comfort Zone: Cross Industry Use Cases in Big Data AnalyticsRising Media Ltd.
 
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365Brian Culver
 
Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))
Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))
Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))DoubleDutch
 
20131030 hkpasea presentation[1]
20131030 hkpasea presentation[1]20131030 hkpasea presentation[1]
20131030 hkpasea presentation[1]Alex Hung
 
SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365Brian Culver
 
Predicitve analytics for marketing 05 21-2014 Shree Dandekar
Predicitve analytics for marketing 05 21-2014 Shree DandekarPredicitve analytics for marketing 05 21-2014 Shree Dandekar
Predicitve analytics for marketing 05 21-2014 Shree DandekarShree Dandekar
 

Semelhante a Social Data Analytics using IBM Big Data Technologies (20)

Analyzing your market: How Big Data plus Social Equals Better Engagement
Analyzing your market: How Big Data plus Social Equals Better EngagementAnalyzing your market: How Big Data plus Social Equals Better Engagement
Analyzing your market: How Big Data plus Social Equals Better Engagement
 
Top 7 BI Trends for 2015
Top 7 BI Trends for 2015Top 7 BI Trends for 2015
Top 7 BI Trends for 2015
 
What Will You Build with IBM Watson on Bluemix?
What Will You Build with IBM Watson on Bluemix?What Will You Build with IBM Watson on Bluemix?
What Will You Build with IBM Watson on Bluemix?
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
 
Iod 2013 Jackman Schwenger
Iod 2013 Jackman SchwengerIod 2013 Jackman Schwenger
Iod 2013 Jackman Schwenger
 
Building intelligent APIs - Andy Thurai, IBM
Building intelligent APIs - Andy Thurai, IBMBuilding intelligent APIs - Andy Thurai, IBM
Building intelligent APIs - Andy Thurai, IBM
 
What’s New and Coming in IBM Connections
What’s New and Coming in IBM ConnectionsWhat’s New and Coming in IBM Connections
What’s New and Coming in IBM Connections
 
Unlock your Big Data with Analytics and BI on Office 365 - OFF103
Unlock your Big Data with Analytics and BI on Office 365 - OFF103Unlock your Big Data with Analytics and BI on Office 365 - OFF103
Unlock your Big Data with Analytics and BI on Office 365 - OFF103
 
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad talk in South Africa:  From BigData to Data ScienceUsama Fayyad talk in South Africa:  From BigData to Data Science
Usama Fayyad talk in South Africa: From BigData to Data Science
 
Ways Artificial Intelligence Can Improve Your Business with IBM Watson
Ways Artificial Intelligence Can Improve Your Business with IBM WatsonWays Artificial Intelligence Can Improve Your Business with IBM Watson
Ways Artificial Intelligence Can Improve Your Business with IBM Watson
 
Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365
 
Informix REST API Tutorial
Informix REST API TutorialInformix REST API Tutorial
Informix REST API Tutorial
 
Outside the Comfort Zone: Cross Industry Use Cases in Big Data Analytics
Outside the Comfort Zone: Cross Industry Use Cases in Big Data AnalyticsOutside the Comfort Zone: Cross Industry Use Cases in Big Data Analytics
Outside the Comfort Zone: Cross Industry Use Cases in Big Data Analytics
 
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
SPS Utah 2016 - Unlock your big data with analytics and BI on Office 365
 
Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))
Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))
Developing on Mobile vs Desktop (Dreamforce 2011, Lawrence Coburn, DoubleDutch))
 
20131030 hkpasea presentation[1]
20131030 hkpasea presentation[1]20131030 hkpasea presentation[1]
20131030 hkpasea presentation[1]
 
SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365SPT 104 Unlock your big data with analytics and BI on Office 365
SPT 104 Unlock your big data with analytics and BI on Office 365
 
Predicitve analytics for marketing 05 21-2014 Shree Dandekar
Predicitve analytics for marketing 05 21-2014 Shree DandekarPredicitve analytics for marketing 05 21-2014 Shree Dandekar
Predicitve analytics for marketing 05 21-2014 Shree Dandekar
 

Mais de Nicolas Morales

Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?
Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?
Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?Nicolas Morales
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
InfoSphere BigInsights for Hadoop @ IBM Insight 2014
InfoSphere BigInsights for Hadoop @ IBM Insight 2014InfoSphere BigInsights for Hadoop @ IBM Insight 2014
InfoSphere BigInsights for Hadoop @ IBM Insight 2014Nicolas Morales
 
IBM Big SQL @ Insight 2014
IBM Big SQL @ Insight 2014IBM Big SQL @ Insight 2014
IBM Big SQL @ Insight 2014Nicolas Morales
 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeNicolas Morales
 
60 minutes in the cloud: Predictive analytics made easy
60 minutes in the cloud: Predictive analytics made easy60 minutes in the cloud: Predictive analytics made easy
60 minutes in the cloud: Predictive analytics made easyNicolas Morales
 
Challenges of Building a First Class SQL-on-Hadoop Engine
Challenges of Building a First Class SQL-on-Hadoop EngineChallenges of Building a First Class SQL-on-Hadoop Engine
Challenges of Building a First Class SQL-on-Hadoop EngineNicolas Morales
 
Big SQL 3.0 - Toronto Meetup -- May 2014
Big SQL 3.0 - Toronto Meetup -- May 2014Big SQL 3.0 - Toronto Meetup -- May 2014
Big SQL 3.0 - Toronto Meetup -- May 2014Nicolas Morales
 
SQL-on-Hadoop without compromise: Big SQL 3.0
SQL-on-Hadoop without compromise: Big SQL 3.0SQL-on-Hadoop without compromise: Big SQL 3.0
SQL-on-Hadoop without compromise: Big SQL 3.0Nicolas Morales
 
Taming Big Data with Big SQL 3.0
Taming Big Data with Big SQL 3.0Taming Big Data with Big SQL 3.0
Taming Big Data with Big SQL 3.0Nicolas Morales
 
Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!
Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!
Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!Nicolas Morales
 
Security and Audit for Big Data
Security and Audit for Big DataSecurity and Audit for Big Data
Security and Audit for Big DataNicolas Morales
 

Mais de Nicolas Morales (14)

Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?
Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?
Benchmarking SQL-on-Hadoop Systems: TPC or not TPC?
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
InfoSphere BigInsights for Hadoop @ IBM Insight 2014
InfoSphere BigInsights for Hadoop @ IBM Insight 2014InfoSphere BigInsights for Hadoop @ IBM Insight 2014
InfoSphere BigInsights for Hadoop @ IBM Insight 2014
 
IBM Big SQL @ Insight 2014
IBM Big SQL @ Insight 2014IBM Big SQL @ Insight 2014
IBM Big SQL @ Insight 2014
 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor Landscape
 
60 minutes in the cloud: Predictive analytics made easy
60 minutes in the cloud: Predictive analytics made easy60 minutes in the cloud: Predictive analytics made easy
60 minutes in the cloud: Predictive analytics made easy
 
Challenges of Building a First Class SQL-on-Hadoop Engine
Challenges of Building a First Class SQL-on-Hadoop EngineChallenges of Building a First Class SQL-on-Hadoop Engine
Challenges of Building a First Class SQL-on-Hadoop Engine
 
Big SQL 3.0 - Toronto Meetup -- May 2014
Big SQL 3.0 - Toronto Meetup -- May 2014Big SQL 3.0 - Toronto Meetup -- May 2014
Big SQL 3.0 - Toronto Meetup -- May 2014
 
SQL-on-Hadoop without compromise: Big SQL 3.0
SQL-on-Hadoop without compromise: Big SQL 3.0SQL-on-Hadoop without compromise: Big SQL 3.0
SQL-on-Hadoop without compromise: Big SQL 3.0
 
Taming Big Data with Big SQL 3.0
Taming Big Data with Big SQL 3.0Taming Big Data with Big SQL 3.0
Taming Big Data with Big SQL 3.0
 
Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!
Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!
Big SQL 3.0: Datawarehouse-grade Performance on Hadoop - At last!
 
Text Analytics
Text Analytics Text Analytics
Text Analytics
 
Security and Audit for Big Data
Security and Audit for Big DataSecurity and Audit for Big Data
Security and Audit for Big Data
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 

Último

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Último (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

Social Data Analytics using IBM Big Data Technologies

  • 1. Social Data Analytics using IBM Big Data technologies Vijay Bommireddipalli vijayrb@us.ibm.com Development Manager, Social Data Accelerator IBM Big Data October 21, 2013 © 2012 IBM Corporation
  • 2. Please note IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user’s job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here. 2 © 2011 IBM Corporation
  • 3. Before we begin … 3 © 2011 IBM Corporation
  • 4. Tag ! You’re it ! - Micro-segmentation 4 © 2011 IBM Corporation
  • 5. Social Data Analytics - Using social media as a rich source of information Behavior Maybe our politicians should take a playbook out of the rivalry between duke/unc and take it to the courts http://ity.com/wfUsir I'm at Mickey's Irish Pub Downtown (206 3rd St, Court Ave, Raleigh) w/ 2 others http://4sq.com/gbsaYR @silliesylvia good!!! U Interest shouldnt! Think about the Location important stuff, like ur 43rd birthday ;) @silliesylvia I <3 your leather Consumption btw happy birthday Sylvia ;) leggings!! Its so katniss!! dear redbox please have kings speech for my new tv colin firth movie marathon Age Intent to consume @silliesylvia $10 dollars says matthew & mary get married next season :) #downtownabbey OMG OMG. just dropped my new ipad3 crappola!!! Consumption 5 Prediction Interest @bamagirl can’t wait to watch sherlock with you! Oh, robert downey jr, I still love you but bbc is so amazing Intent to consume 360 degree profile Personal Attributes • Sylvia Campbell, Female, In a Relationship • 32 years old, birthday on 7/17 • Lives near Raleigh, NC • College graduate; Income of 80-120k Buzz/Sentiment • Retweets BF’s comments • Interest in BBC shows: Downton Abbey, Sherlock, Fringe, (P&P?) • Sherlock Holmes, Robert Downey, Jr. • Hunger Games, Katniss/J. Lawrence Interests/Behavior • Watch movies, tv shows • Romance plots, “hero types”, strong women • Uses iPad 3, Redbox, Hulu • Shopping , interest in sales/deals • Duke/ UNC basketball © 2011 IBM Corporation
  • 6. Social Data Analytics - Comprehensive Entity Extraction and Integration Name: Jane Doe Id: jaydee Address: Home of the Buccaneers Interests: running, yoga, football… Name: Jane Doe Name: Jane Doe, Cava Address: Tampa, FL Address: Tampa, Fl Twitter: jaydee Twitter: @maryguida Blog Topic: food Blog Topic: politics Hobbies: running, yoga, … Hobbies: running, yoga, … Relationships: Tony C (brother)… Relationships: Tony C (brother)… Name: J Doe Blog Topic: food Entity Integration Name: jane Address: Tampa, FL Relationships: Tony C (brother)., … All names are fictitious 6 Challenges:  Scale  1000’s sites, 100s millions users  Complex matching decisions  Partial, noisy and incomplete profile attributes  Only 3% of consumers have sufficient attribute information in their profiles. © 2011 IBM Corporation
  • 7. Consumer Intelligence Timely Insights • Intent to buy various products • Current Location Personal Attributes • Identifiers: name, address, age, gender, occupation… • Interests: sports, pets, cuisine… • Life Cycle Status: marital, parental Social Media based 360-degree Consumer Profiles • Personal relationships: family, friends and roommates… • Business relationships: co-workers and work/interest network… • Life-changing events: relocation, having a baby, getting married, getting divorced, buying a house… What should I buy?? A mini laptop with Windows 7 OR a Apple MacBook!??! Location announcements I'm at Starbucks Parque Tezontle http://4sq.com/fYReSj 7 • Personal preferences of products • Product Purchase history Relationships Life Events Monetizable intent to buy I need a new products digital camera for my food pictures, any recommendations around 300? Products Interests Life Events College: Off to Stanford for my MBA! Bbye chicago! Looks like we'll be moving to New Orleans sooner than I thought. Intent to buy a house I'm thinking about buying a home in Buckingham Estates per a recommendation. Anyone have advice on that area? #atx #austinrealestate #austin © 2011 IBM Corporation
  • 8. Social Data Analytics - Profile construction 8 © 2011 IBM Corporation
  • 9. Social Data Analytics - Profile construction 9 © 2011 IBM Corporation
  • 10. Big Data Platform and Accelerators - Summary  Software components that accelerate development and/or implementation of specific solutions or use cases on top of the Big Data platform  Provide business logic, data processing, and UI/visualization, tailored for a given use case  Analytic Applications Bundled with Big Data platform components – InfoSphere BigInsights and InfoSphere Streams BI / Exploration / Functional Industry Predictive Content Reporting Visualization App App Analytics Analytics IBM Big Data Platform Visualization & Discovery Applications & Development Systems Management Accelerators Hadoop System Stream Computing Data Warehouse Contextual Search Key Benefits  Information Integration & Governance Time to value  Leverage best practices around implementation of a given use case. Cloud | Mobile | Security 10 © 2011 IBM Corporation
  • 11. Social Media Analytics Architecture Online flow: Data-in-motion analysis Real time analytics. Pre-defined views and charts Stream Computing and Analytics Social Media Data Ingest and Prep Entity Analytics: Profile Resolution Extract Buzz, Intent , Sentiment Dashboard BigInsights System and Analytics Social Media Data Extract Buzz, Intent , Sentiment And Consumer Profiles Entity Analytics and Integration Comprehensive Social Media Customer Profiles Pre-defined Workbooks and Dashboards Offline flow: Data-at-rest analysis Data Explorer Index using Push API Ad hoc access Optional: Indexed Search 11 © 2011 IBM Corporation
  • 12. SDA 1.2  Social Media Sources Supported – Gnip, Boardreader – Tweets, Boards, Blogs  Analyze Streaming data as well as data at rest – Streams for processing of streaming data – BigInsights/Hadoop for input, output and configuration data  Key Micro-segmentation Attributes (out-of-box) – Personal Info: Gender, Location, Parental status, Marital status, Employment – Interests: Movie interest, Comic book fan, Product interest, Current customer of, Products owned – ** Attributes can be added in (requires some development effort)  Entity resolution across the different social media sources 12 © 2011 IBM Corporation
  • 13. SDA 1.2  Outputs/Measures (out-of-box) – – – – Buzz Sentiment Intent to buy/start service Intend to attend/see  Example use cases – – – – Retail – Lead generation, Brand management Financial – Lead generation and Brand management Media & Entertainment: Brand management Generic  Visualization using BigSheets  Extendable/Customizable Solution 13 © 2011 IBM Corporation
  • 14. SDA - Acting on the insights  Metrics based understanding of Feedback in Social Media – And more importantly Feedback from whom !  Comprehensive (social media) profiles with microsegmentation information  Campaign execution can be done in Social Media  Entity resolution across the different social media sources  External (social media) to Internal (CRM) linkage **coming 14 © 2011 IBM Corporation
  • 15. SDA Outputs  Pre-defined Workbooks  Dashboards  Granular outputs for further slicing and dicing by Data Scientists 15 © 2011 IBM Corporation
  • 16. SDA Conceptual Flow 16 © 2011 IBM Corporation
  • 17. BigInsights & Streams Text Analytics High Performance rule based Information Extraction Engine  Highly scalable solution available for at-rest and in-motion analytics  Pre-built extractors, and toolkit to build custom Extractors • Rich Extractor library supports multiple languages • Declarative Information Extraction (IE) system based on an algebraic framework Sophisticated tooling to help build, test, and refine rules Developed at IBM Research since 2004 Embedded in several IBM products • BigInsights, Streams. • Lotus Notes • Cognos Consumer Insights What is TA 17 Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 18. Applications of Text analytics Broad range of applications in many industries • CRM Analytics Voice of customer Product and Services gap analysis Customer churn • Social Media Analytics Purchase intent Customer churn prediction Reputational Risk • Digital Piracy Illegal broadcast of streaming and video content • Log Analytics Failure analysis and root cause identification Availability assurance • Regulatory Compliance Data Redaction • Identify and protect sensitive information 18 What is TA Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 19. Performance Comparison (with ANNIE open source **) Task: Named Entity Recognition Dataset : Different document collections from the Enron corpus obtained by randomly sampling 1000 documents for each size Throughput (KB/sec) 700 600 500 400 ANNIE Open Source Entity Tagger 300 >10x faster < 60% memory SystemT 200 100 0 0 20 40 60 80 100 Average document size (KB) ** http://dl.acm.org/citation.cfm?id=1858681.1858695 Performance comparison with GATE 5 What is TA 19 Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 20. Text Analytics Development Flow  Declarative language for extractor logic  Optimization and deployment to scalable runtime Extracted Information Development Tooling Extractor Text Analytics Optimizer Compiled Operator Graph Text Analytics Runtime Sample Input Documents Rule based language Annotator Query Language - AQL with familiar SQL-like syntax Specify annotator semantics declaratively Choose an efficient execution plan Highly scalable, embeddable Java runtime What is TA 20 Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 21. Invoking Text Analytics within BigInsights Document encoded as JSON record. Jaql runtime coordinates a multi-stage map-reduce flow. JAQL Function Wrapper Input Record { label: “http://www.ibm ...”, text: “<html>n<head> …” } AQL SystemT Optimizer Dictionaries 21 Input Adapter SystemT Runtime Compiled Plan Output Adapter Output Record { label: “http://www.ibm ...”, text: “<html>n<head> …” Person: [ { firstName: [10, 15], lastName: [16, 25] }, … { firstName: [1042, 1045], lastName: [1046, 1050] } ], Hyperlink: [ { anchorText: [25, 33] }, … { anchorText: [990, 997] } ], H1: … Annotations added as additional attributes to JSON} record. © 2011 IBM Corporation
  • 22. Additional Advantages of IBM Text Analytics Quality: Drives effectiveness of entire application • Enables high accuracy and coverage Performance: Dominant cost is CPU • Process large documents and large number of documents with high throughput Explain-ability • Determine the cause of errors and fix it without affecting the remaining correct results Reusability: easily adaptable for a different domain • The development platform must enable layers of abstractions to be built and easily reused in a different domain Expressivity • Rule language with a rich set of operators available to enable complex extraction tasks What is TA 22 Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 23. BigInsights Text Analytics Development What is TA 23 Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 24. AQL editor with content assist 24 What is TA Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 25. Understanding the lineage of results Click to drill down and see the rules that triggered inclusion of results Explain and search through the results What is TA 25 Why Biginsights TA How is TA Deployed & used Dev. tools © 2011 IBM Corporation
  • 26. IBM Text Analytics for Big Data High Performance Information Extraction Engine Analysis can be applied to data at-rest and in-motion • Build extractor once and use with BigInsights or Streams Parallel execution scales to Big Data volumes • Linearly scalable to extremely high volumes Highly customizable to a variety of domains and languages • Pre-built extractors available out of the box Sophisticated tooling enables ease of development and refinement of results 26 © 2011 IBM Corporation
  • 27. Thank you 27 © 2011 IBM Corporation