SlideShare uma empresa Scribd logo
1 de 30
Baixar para ler offline
Large Scale Data Analytics
Shankar Radhakrishnan
shankar.r3@cognizant.com
linkedin.com/in/connect2shankar
Scenario
• Insurer uses meteorological data for pricing model
• At present data from 2000 weather stations are
collected for analysis
• Plan is to use 10,000 weather station data 

( or more )
• Stochastic simulation needs to run to ID pattern in
weather data, to determine pricing
• Volumetric : peta-bytes of information 

( for 1 region )
2
Trends
3
Data Analytics Is Mostly About $$, Customers, Markets
4
How Widespread Is Data Analytics?
5
Expectations On Payback Period ( Aggressive )
6
Large Scale Data Analytics
7
“Involves using different algorithms, 

distributed platforms, tools and techniques
to analyze big data and provide actionable
insights”
Big Data
“ Data sets that are very large in volume and complex “
8
New platforms, tools and techniques

have emerged to manage Big Data
We broke away from traditional

ways to process and analyze them
Data Structures
 
Vector, Matrix,
Or Complex
Structure
Free Text
Image or
Binary Data
Data “bags”
Iterative
Logic Or
Complex
Branching
Advanced
Analytic
Routines
Rapidly
Repeated
Measurements
Extreme
Low
Latency
Access to
all data
required
Search Ranking X X X X X X
Ad Tracking X X X X X X X X  
Location or Proximity Tracking X   X X     X X  
Social CRM X X X X X X      X
Document Similarity Testing X X X X X X   X X
Genomic Analysis X X X X X
Customer Cohort groups X X   X X X     X
Fraud Detection X X X X X X X X X
Smart Utility Metering X X X X X X
Churn Analysis X X X X X X   X  
Satellite Image Analysis X X X X
Game Gesture Analysis X X X X X X X X
Data Bag Exploration X X X X X X
9
Business Interests : Well Informed Customer Executive
10
Speech to Text
Conversion
Voice Data
Unstructured data Analytical System
Customer Persona
• Customer Persona -
Demographics,

Top interactions, 

Channel Preferences,
Dissatisfies
• Customer Lifetime Value
• Recent Contact History
• Customer Sentiment &
Trend during the call
Customer’s state of mind
Sentimental
Analysis
Social media
Depositions
Complaints
Other Channel
information
(ATM, Branch)
Big Data Warehouse
Traditional Warehouse
Decision Engine • Customer Executive Dashboard
presents all intelligence
required to make a decision
• The decision engine also
presents important decisions
to be taken for the particular
customer issue
Well Informed Customer Executive…
Customer calls
BankingCallCenter
Executive
understands the
customerproblemExecutive authenticates
customer and pulls up
CustomerPersona
Executive reviews
risk of attrition
against Customer
LifetimeValue
Executive reviews
Last 5 call center
and banking
transactions
Executiveviews
customer’s state of
mind (riskof
attrition )through a
barometer chart
Analytical Solution -
Converts Speech to
textAnalytical engine
listens to
customer voice
Suggested top 5
Actions required
DecisionEngine
Executive performs below actions based on his analysis and
recommendations from Decision engine
1. Reversal of overdraft fee
2. One time fee waiver on Cheque book (predicting customer
need based on historic usage cycles )
3. Cash back Reward card for a minimum spend of $X through
debitcard
4. Offer interest revision for investment products or mortgage
5. Promote new mutual funds or credit cards based on
customer willingness
Analytical engine
monitors
sentiment
Executive analyzes Customer
Persona (demographic /
Preferences / Satisfiers /
dissatisfiesetc )
11
Business Interests : Fraud Prevention
12
Envisaged Benefits
▪ New fraud patterns can be identified by building ‘analytical models’ to run against historical data
▪ ‘Web crawling’, ‘Contextual text analysis’, ‘Natural Language Processing’ allows fraud behavior
identification from social media. It may increase Fraud detection success rate
▪ ‘Real time’ models to capture behavioral patters and do pattern analysis against History data to
evaluate Fraud case validity. The model learns by self and updates ‘Fraud pattern master sets.
▪ Brings ‘artificial intelligent’ fraud pattern detection and analysis
▪ ‘Real time’ (in the order of .5-1 minute refresh rate) alerts to Fraud analysts about ‘self learned’ fraud
patterns based on new customer behavior patterns
Big Data Usage
▪ Formation of key value groups to the order of XcY (where X no. of attributes that are relevant to Fraud
and Y is no. of attributes that should be combined to identify patterns)
▪ High speed history data loading from source systems
▪ Efficient Real time fraud detection by identifying patterns through customer behavioral events and
processing them over X yrs. of history data – e.g. using HBase
Scenario
Formation of Fraud pattern reference tables using
▪ Real time data coming from different departments like IVR, WEB, Customer profile, Transactions etc
▪ Real time Mining and analysis of history data to form prior patterns (no. of years in range to 50-100 TB)
Fraud Pattern Detection…
13
Legacy Fraud
Data
Customer
Profile Data
IVR Audio
Data
Web / Online
Card
Transaction
Data
Fraud
Pattern
Master Table
Fraud Analyst
History Data
Processing to
determine
Fraud
Patterns over
X years
Real-time
Customer
Behavior
Analysis for
Fraud
Detection
Customer
Behavior Change
Events
Customer
Behavior Change
Events
Customer
Behavior Change
Events
Real time Analysis of
behavior patterns over
historical data
Real time update to
Master Table on New
Fraud Patterns
Real time alert to 

Fraud Analyst
RDBMS RDBMS
(JSON
Files) RDBMS
Customer
Behavior Change
Events
Fraud Prevention…
14
Benefits
15
BenefitsIndustry
Financial services
▪ Customer Insights – Integrating Transactional data (CRM/Payments) and unstructured Social feeds
▪ Regulatory Compliance – Risk exposures across asset classes, LOBs and firms
▪ Fraud Detection in Credit Cards & Financial Crimes (AML) in Banks
Travel, Hospitality & Retail
▪ Customer centricity – Customer behavior analysis from Omni channel retailing & Social feeds
▪ Markdown Optimization – Improve markdown based on actual customer buying patters
▪ Market basket analysis – Narrow down market basket analysis by demographics
Life Science
▪ Improve targeting & predictions – Automatic Detection of Adverse Drug Effects (ADEs)
▪ Patient data analysis – Longitudinal Patient Data (LPD) analysis
▪ Predictive Sciences – Analyze Preclinical Side Effect Profiles of Marketed Drugs
Healthcare (Payers & Providers)
▪ Cost of Care – Drug effectiveness & Cost of Care Analysis based on electronic Health Records (EMR)
▪ Self Service Healthcare – Increase in mHealth & eHealth to allow consumer access to health information
▪ Claims Analytics – Analyze insurance claims data for fraud detection & preferred treatment plans
Communication,
Media & Entertainment
▪ Discover churn patterns based on Call data records (CDRs) and activity in subscribers’ networks
▪ Digital Asset Management (DAM) – Analyze & capitalize digital data assets
Manufacturing
▪ Proactive Maintenance & Recommendation – Sensor Monitoring for automobile, buildings & machinery
▪ Energy Efficiency – Leveraging Smart meters for utility energy consumption
▪ Location or Proximity Tracking – Location based analytics using GPS Data
Hi-Tech
▪ Extend and complement conventional information supply chain with big data path
▪ Predictive analysis and real time decision support
Hadoop
16
Hadoop - HDFS
17
Hadoop - MapReduce
18
Hadoop - MapReduce
19
Apache Spark
20
Spark
Iterative
Processing
Batch
Processing
Machine
Learning
SQL
Stream
Processing
Graph
Processing
Hadoop
21
NoSQL Databases
22
NoSQL Databases
23
Modern Data Architecture
24
Lambda Architecture
25
Lambda Architecture
26
Data Analytics Lifecycle
27
Analytics - Trends
• Big Data Analytics In The Cloud
• AWS, AWS-Redshift
• Hadoop
• Enterprise Data Operating
System
• Data Analytics Platform
• SQL on Hadoop
• NoSQL
• IoT ( Internet of Things )
28
• Multi-polar Analytics
• Predictive Analytics ( Spark )
• In-memory Analytics
• Data Lake
• Deep Learning
• Machine Learning
• Neural Networks
• Data Monetization
Q & A
Thank You !
“Any Sufficiently Advanced Technology Is
Indistinguishable From Magic “
- Arthur C. Clarke

Mais conteúdo relacionado

Mais procurados

Big Data Retail Banking
Big Data Retail Banking Big Data Retail Banking
Big Data Retail Banking
Sandeep Bhagat
 

Mais procurados (20)

Rulex big data and analytics
Rulex big data and analyticsRulex big data and analytics
Rulex big data and analytics
 
Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure Cloud
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
 
Banking Big Data Analytics
Banking Big Data AnalyticsBanking Big Data Analytics
Banking Big Data Analytics
 
Data mining
Data miningData mining
Data mining
 
Data mining on Financial Data
Data mining on Financial DataData mining on Financial Data
Data mining on Financial Data
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sector
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
 
Big Data Retail Banking
Big Data Retail Banking Big Data Retail Banking
Big Data Retail Banking
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
 
Data science in finance industry
Data science in finance industryData science in finance industry
Data science in finance industry
 
Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)
 
How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
 
Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYC
 
Analytics in banking preview deck - june 2013
Analytics in banking   preview deck - june 2013Analytics in banking   preview deck - june 2013
Analytics in banking preview deck - june 2013
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
 

Semelhante a Large Scale Data Analytics

McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
MassTLC
 

Semelhante a Large Scale Data Analytics (20)

Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
 
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERYEVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
 
Cognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServeCognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServe
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Predictive Analytics Overview
Predictive Analytics OverviewPredictive Analytics Overview
Predictive Analytics Overview
 
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services Experience
 
Big Data Done Right by Successful Organizations
Big Data Done Right by Successful OrganizationsBig Data Done Right by Successful Organizations
Big Data Done Right by Successful Organizations
 
Big data analytics in payments
Big data analytics in payments Big data analytics in payments
Big data analytics in payments
 
Aanlytics on Telecom
Aanlytics on TelecomAanlytics on Telecom
Aanlytics on Telecom
 
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
 
Big Data solution for multi-national Bank
Big Data solution for multi-national BankBig Data solution for multi-national Bank
Big Data solution for multi-national Bank
 
Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel
 
Claims
ClaimsClaims
Claims
 
Big Data Forum - Phoenix
Big Data Forum - PhoenixBig Data Forum - Phoenix
Big Data Forum - Phoenix
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Large Scale Data Analytics

  • 1. Large Scale Data Analytics Shankar Radhakrishnan shankar.r3@cognizant.com linkedin.com/in/connect2shankar
  • 2. Scenario • Insurer uses meteorological data for pricing model • At present data from 2000 weather stations are collected for analysis • Plan is to use 10,000 weather station data 
 ( or more ) • Stochastic simulation needs to run to ID pattern in weather data, to determine pricing • Volumetric : peta-bytes of information 
 ( for 1 region ) 2
  • 4. Data Analytics Is Mostly About $$, Customers, Markets 4
  • 5. How Widespread Is Data Analytics? 5
  • 6. Expectations On Payback Period ( Aggressive ) 6
  • 7. Large Scale Data Analytics 7 “Involves using different algorithms, 
 distributed platforms, tools and techniques to analyze big data and provide actionable insights”
  • 8. Big Data “ Data sets that are very large in volume and complex “ 8 New platforms, tools and techniques
 have emerged to manage Big Data We broke away from traditional
 ways to process and analyze them
  • 9. Data Structures   Vector, Matrix, Or Complex Structure Free Text Image or Binary Data Data “bags” Iterative Logic Or Complex Branching Advanced Analytic Routines Rapidly Repeated Measurements Extreme Low Latency Access to all data required Search Ranking X X X X X X Ad Tracking X X X X X X X X   Location or Proximity Tracking X   X X     X X   Social CRM X X X X X X      X Document Similarity Testing X X X X X X   X X Genomic Analysis X X X X X Customer Cohort groups X X   X X X     X Fraud Detection X X X X X X X X X Smart Utility Metering X X X X X X Churn Analysis X X X X X X   X   Satellite Image Analysis X X X X Game Gesture Analysis X X X X X X X X Data Bag Exploration X X X X X X 9
  • 10. Business Interests : Well Informed Customer Executive 10 Speech to Text Conversion Voice Data Unstructured data Analytical System Customer Persona • Customer Persona - Demographics,
 Top interactions, 
 Channel Preferences, Dissatisfies • Customer Lifetime Value • Recent Contact History • Customer Sentiment & Trend during the call Customer’s state of mind Sentimental Analysis Social media Depositions Complaints Other Channel information (ATM, Branch) Big Data Warehouse Traditional Warehouse Decision Engine • Customer Executive Dashboard presents all intelligence required to make a decision • The decision engine also presents important decisions to be taken for the particular customer issue
  • 11. Well Informed Customer Executive… Customer calls BankingCallCenter Executive understands the customerproblemExecutive authenticates customer and pulls up CustomerPersona Executive reviews risk of attrition against Customer LifetimeValue Executive reviews Last 5 call center and banking transactions Executiveviews customer’s state of mind (riskof attrition )through a barometer chart Analytical Solution - Converts Speech to textAnalytical engine listens to customer voice Suggested top 5 Actions required DecisionEngine Executive performs below actions based on his analysis and recommendations from Decision engine 1. Reversal of overdraft fee 2. One time fee waiver on Cheque book (predicting customer need based on historic usage cycles ) 3. Cash back Reward card for a minimum spend of $X through debitcard 4. Offer interest revision for investment products or mortgage 5. Promote new mutual funds or credit cards based on customer willingness Analytical engine monitors sentiment Executive analyzes Customer Persona (demographic / Preferences / Satisfiers / dissatisfiesetc ) 11
  • 12. Business Interests : Fraud Prevention 12 Envisaged Benefits ▪ New fraud patterns can be identified by building ‘analytical models’ to run against historical data ▪ ‘Web crawling’, ‘Contextual text analysis’, ‘Natural Language Processing’ allows fraud behavior identification from social media. It may increase Fraud detection success rate ▪ ‘Real time’ models to capture behavioral patters and do pattern analysis against History data to evaluate Fraud case validity. The model learns by self and updates ‘Fraud pattern master sets. ▪ Brings ‘artificial intelligent’ fraud pattern detection and analysis ▪ ‘Real time’ (in the order of .5-1 minute refresh rate) alerts to Fraud analysts about ‘self learned’ fraud patterns based on new customer behavior patterns Big Data Usage ▪ Formation of key value groups to the order of XcY (where X no. of attributes that are relevant to Fraud and Y is no. of attributes that should be combined to identify patterns) ▪ High speed history data loading from source systems ▪ Efficient Real time fraud detection by identifying patterns through customer behavioral events and processing them over X yrs. of history data – e.g. using HBase Scenario Formation of Fraud pattern reference tables using ▪ Real time data coming from different departments like IVR, WEB, Customer profile, Transactions etc ▪ Real time Mining and analysis of history data to form prior patterns (no. of years in range to 50-100 TB)
  • 13. Fraud Pattern Detection… 13 Legacy Fraud Data Customer Profile Data IVR Audio Data Web / Online Card Transaction Data Fraud Pattern Master Table Fraud Analyst History Data Processing to determine Fraud Patterns over X years Real-time Customer Behavior Analysis for Fraud Detection Customer Behavior Change Events Customer Behavior Change Events Customer Behavior Change Events Real time Analysis of behavior patterns over historical data Real time update to Master Table on New Fraud Patterns Real time alert to 
 Fraud Analyst RDBMS RDBMS (JSON Files) RDBMS Customer Behavior Change Events
  • 15. Benefits 15 BenefitsIndustry Financial services ▪ Customer Insights – Integrating Transactional data (CRM/Payments) and unstructured Social feeds ▪ Regulatory Compliance – Risk exposures across asset classes, LOBs and firms ▪ Fraud Detection in Credit Cards & Financial Crimes (AML) in Banks Travel, Hospitality & Retail ▪ Customer centricity – Customer behavior analysis from Omni channel retailing & Social feeds ▪ Markdown Optimization – Improve markdown based on actual customer buying patters ▪ Market basket analysis – Narrow down market basket analysis by demographics Life Science ▪ Improve targeting & predictions – Automatic Detection of Adverse Drug Effects (ADEs) ▪ Patient data analysis – Longitudinal Patient Data (LPD) analysis ▪ Predictive Sciences – Analyze Preclinical Side Effect Profiles of Marketed Drugs Healthcare (Payers & Providers) ▪ Cost of Care – Drug effectiveness & Cost of Care Analysis based on electronic Health Records (EMR) ▪ Self Service Healthcare – Increase in mHealth & eHealth to allow consumer access to health information ▪ Claims Analytics – Analyze insurance claims data for fraud detection & preferred treatment plans Communication, Media & Entertainment ▪ Discover churn patterns based on Call data records (CDRs) and activity in subscribers’ networks ▪ Digital Asset Management (DAM) – Analyze & capitalize digital data assets Manufacturing ▪ Proactive Maintenance & Recommendation – Sensor Monitoring for automobile, buildings & machinery ▪ Energy Efficiency – Leveraging Smart meters for utility energy consumption ▪ Location or Proximity Tracking – Location based analytics using GPS Data Hi-Tech ▪ Extend and complement conventional information supply chain with big data path ▪ Predictive analysis and real time decision support
  • 28. Analytics - Trends • Big Data Analytics In The Cloud • AWS, AWS-Redshift • Hadoop • Enterprise Data Operating System • Data Analytics Platform • SQL on Hadoop • NoSQL • IoT ( Internet of Things ) 28 • Multi-polar Analytics • Predictive Analytics ( Spark ) • In-memory Analytics • Data Lake • Deep Learning • Machine Learning • Neural Networks • Data Monetization
  • 29. Q & A
  • 30. Thank You ! “Any Sufficiently Advanced Technology Is Indistinguishable From Magic “ - Arthur C. Clarke