SlideShare uma empresa Scribd logo
1 de 30
Baixar para ler offline
Large Scale Data Analytics
Shankar Radhakrishnan
shankar.r3@cognizant.com
linkedin.com/in/connect2shankar
Scenario
• Insurer uses meteorological data for pricing model
• At present data from 2000 weather stations are
collected for analysis
• Plan is to use 10,000 weather station data 

( or more )
• Stochastic simulation needs to run to ID pattern in
weather data, to determine pricing
• Volumetric : peta-bytes of information 

( for 1 region )
2
Trends
3
Data Analytics Is Mostly About $$, Customers, Markets
4
How Widespread Is Data Analytics?
5
Expectations On Payback Period ( Aggressive )
6
Large Scale Data Analytics
7
“Involves using different algorithms, 

distributed platforms, tools and techniques
to analyze big data and provide actionable
insights”
Big Data
“ Data sets that are very large in volume and complex “
8
New platforms, tools and techniques

have emerged to manage Big Data
We broke away from traditional

ways to process and analyze them
Data Structures
 
Vector, Matrix,
Or Complex
Structure
Free Text
Image or
Binary Data
Data “bags”
Iterative
Logic Or
Complex
Branching
Advanced
Analytic
Routines
Rapidly
Repeated
Measurements
Extreme
Low
Latency
Access to
all data
required
Search Ranking X X X X X X
Ad Tracking X X X X X X X X  
Location or Proximity Tracking X   X X     X X  
Social CRM X X X X X X      X
Document Similarity Testing X X X X X X   X X
Genomic Analysis X X X X X
Customer Cohort groups X X   X X X     X
Fraud Detection X X X X X X X X X
Smart Utility Metering X X X X X X
Churn Analysis X X X X X X   X  
Satellite Image Analysis X X X X
Game Gesture Analysis X X X X X X X X
Data Bag Exploration X X X X X X
9
Business Interests : Well Informed Customer Executive
10
Speech to Text
Conversion
Voice Data
Unstructured data Analytical System
Customer Persona
• Customer Persona -
Demographics,

Top interactions, 

Channel Preferences,
Dissatisfies
• Customer Lifetime Value
• Recent Contact History
• Customer Sentiment &
Trend during the call
Customer’s state of mind
Sentimental
Analysis
Social media
Depositions
Complaints
Other Channel
information
(ATM, Branch)
Big Data Warehouse
Traditional Warehouse
Decision Engine • Customer Executive Dashboard
presents all intelligence
required to make a decision
• The decision engine also
presents important decisions
to be taken for the particular
customer issue
Well Informed Customer Executive…
Customer calls
BankingCallCenter
Executive
understands the
customerproblemExecutive authenticates
customer and pulls up
CustomerPersona
Executive reviews
risk of attrition
against Customer
LifetimeValue
Executive reviews
Last 5 call center
and banking
transactions
Executiveviews
customer’s state of
mind (riskof
attrition )through a
barometer chart
Analytical Solution -
Converts Speech to
textAnalytical engine
listens to
customer voice
Suggested top 5
Actions required
DecisionEngine
Executive performs below actions based on his analysis and
recommendations from Decision engine
1. Reversal of overdraft fee
2. One time fee waiver on Cheque book (predicting customer
need based on historic usage cycles )
3. Cash back Reward card for a minimum spend of $X through
debitcard
4. Offer interest revision for investment products or mortgage
5. Promote new mutual funds or credit cards based on
customer willingness
Analytical engine
monitors
sentiment
Executive analyzes Customer
Persona (demographic /
Preferences / Satisfiers /
dissatisfiesetc )
11
Business Interests : Fraud Prevention
12
Envisaged Benefits
▪ New fraud patterns can be identified by building ‘analytical models’ to run against historical data
▪ ‘Web crawling’, ‘Contextual text analysis’, ‘Natural Language Processing’ allows fraud behavior
identification from social media. It may increase Fraud detection success rate
▪ ‘Real time’ models to capture behavioral patters and do pattern analysis against History data to
evaluate Fraud case validity. The model learns by self and updates ‘Fraud pattern master sets.
▪ Brings ‘artificial intelligent’ fraud pattern detection and analysis
▪ ‘Real time’ (in the order of .5-1 minute refresh rate) alerts to Fraud analysts about ‘self learned’ fraud
patterns based on new customer behavior patterns
Big Data Usage
▪ Formation of key value groups to the order of XcY (where X no. of attributes that are relevant to Fraud
and Y is no. of attributes that should be combined to identify patterns)
▪ High speed history data loading from source systems
▪ Efficient Real time fraud detection by identifying patterns through customer behavioral events and
processing them over X yrs. of history data – e.g. using HBase
Scenario
Formation of Fraud pattern reference tables using
▪ Real time data coming from different departments like IVR, WEB, Customer profile, Transactions etc
▪ Real time Mining and analysis of history data to form prior patterns (no. of years in range to 50-100 TB)
Fraud Pattern Detection…
13
Legacy Fraud
Data
Customer
Profile Data
IVR Audio
Data
Web / Online
Card
Transaction
Data
Fraud
Pattern
Master Table
Fraud Analyst
History Data
Processing to
determine
Fraud
Patterns over
X years
Real-time
Customer
Behavior
Analysis for
Fraud
Detection
Customer
Behavior Change
Events
Customer
Behavior Change
Events
Customer
Behavior Change
Events
Real time Analysis of
behavior patterns over
historical data
Real time update to
Master Table on New
Fraud Patterns
Real time alert to 

Fraud Analyst
RDBMS RDBMS
(JSON
Files) RDBMS
Customer
Behavior Change
Events
Fraud Prevention…
14
Benefits
15
BenefitsIndustry
Financial services
▪ Customer Insights – Integrating Transactional data (CRM/Payments) and unstructured Social feeds
▪ Regulatory Compliance – Risk exposures across asset classes, LOBs and firms
▪ Fraud Detection in Credit Cards & Financial Crimes (AML) in Banks
Travel, Hospitality & Retail
▪ Customer centricity – Customer behavior analysis from Omni channel retailing & Social feeds
▪ Markdown Optimization – Improve markdown based on actual customer buying patters
▪ Market basket analysis – Narrow down market basket analysis by demographics
Life Science
▪ Improve targeting & predictions – Automatic Detection of Adverse Drug Effects (ADEs)
▪ Patient data analysis – Longitudinal Patient Data (LPD) analysis
▪ Predictive Sciences – Analyze Preclinical Side Effect Profiles of Marketed Drugs
Healthcare (Payers & Providers)
▪ Cost of Care – Drug effectiveness & Cost of Care Analysis based on electronic Health Records (EMR)
▪ Self Service Healthcare – Increase in mHealth & eHealth to allow consumer access to health information
▪ Claims Analytics – Analyze insurance claims data for fraud detection & preferred treatment plans
Communication,
Media & Entertainment
▪ Discover churn patterns based on Call data records (CDRs) and activity in subscribers’ networks
▪ Digital Asset Management (DAM) – Analyze & capitalize digital data assets
Manufacturing
▪ Proactive Maintenance & Recommendation – Sensor Monitoring for automobile, buildings & machinery
▪ Energy Efficiency – Leveraging Smart meters for utility energy consumption
▪ Location or Proximity Tracking – Location based analytics using GPS Data
Hi-Tech
▪ Extend and complement conventional information supply chain with big data path
▪ Predictive analysis and real time decision support
Hadoop
16
Hadoop - HDFS
17
Hadoop - MapReduce
18
Hadoop - MapReduce
19
Apache Spark
20
Spark
Iterative
Processing
Batch
Processing
Machine
Learning
SQL
Stream
Processing
Graph
Processing
Hadoop
21
NoSQL Databases
22
NoSQL Databases
23
Modern Data Architecture
24
Lambda Architecture
25
Lambda Architecture
26
Data Analytics Lifecycle
27
Analytics - Trends
• Big Data Analytics In The Cloud
• AWS, AWS-Redshift
• Hadoop
• Enterprise Data Operating
System
• Data Analytics Platform
• SQL on Hadoop
• NoSQL
• IoT ( Internet of Things )
28
• Multi-polar Analytics
• Predictive Analytics ( Spark )
• In-memory Analytics
• Data Lake
• Deep Learning
• Machine Learning
• Neural Networks
• Data Monetization
Q & A
Thank You !
“Any Sufficiently Advanced Technology Is
Indistinguishable From Magic “
- Arthur C. Clarke

Mais conteúdo relacionado

Mais procurados

Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure CloudMicrosoft Canada
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banksPankaj Baid
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicagoKM Chicago
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorMichael Haddad
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Swiss Big Data User Group
 
Big Data Retail Banking
Big Data Retail Banking Big Data Retail Banking
Big Data Retail Banking Sandeep Bhagat
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013nkabra
 
Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)Data Science Thailand
 
How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?NexSoftsys
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data AnalyticsUtkarsh Sharma
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPMBob Samuels
 
Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)Huntsman Security
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYCPankaj Baid
 
Analytics in banking preview deck - june 2013
Analytics in banking   preview deck - june 2013Analytics in banking   preview deck - june 2013
Analytics in banking preview deck - june 2013Everest Group
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORarpit bhadoriya
 

Mais procurados (20)

Rulex big data and analytics
Rulex big data and analyticsRulex big data and analytics
Rulex big data and analytics
 
Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure Cloud
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
 
Banking Big Data Analytics
Banking Big Data AnalyticsBanking Big Data Analytics
Banking Big Data Analytics
 
Data mining
Data miningData mining
Data mining
 
Data mining on Financial Data
Data mining on Financial DataData mining on Financial Data
Data mining on Financial Data
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sector
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
 
Big Data Retail Banking
Big Data Retail Banking Big Data Retail Banking
Big Data Retail Banking
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
 
Data science in finance industry
Data science in finance industryData science in finance industry
Data science in finance industry
 
Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)
 
How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
 
Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYC
 
Analytics in banking preview deck - june 2013
Analytics in banking   preview deck - june 2013Analytics in banking   preview deck - june 2013
Analytics in banking preview deck - june 2013
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
 

Semelhante a Large Scale Data Analytics

Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014shankar_radhakrishnan
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?Fabricio Quintanilla
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)Arvind Sathi
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHortonworks
 
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERYEVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERYBig Data Week
 
Cognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServeCognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServeIurii Milovanov
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An OverviewMachinePulse
 
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014MassTLC
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceCloudera, Inc.
 
Big Data Done Right by Successful Organizations
Big Data Done Right by Successful OrganizationsBig Data Done Right by Successful Organizations
Big Data Done Right by Successful OrganizationsEuro IT Group
 
Big data analytics in payments
Big data analytics in payments Big data analytics in payments
Big data analytics in payments Ashish Anand
 
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox Captricity
 
Big Data solution for multi-national Bank
Big Data solution for multi-national BankBig Data solution for multi-national Bank
Big Data solution for multi-national BankRitu Sarkar
 
Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel Data Science Society
 

Semelhante a Large Scale Data Analytics (20)

Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
 
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERYEVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
 
Cognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServeCognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServe
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Predictive Analytics Overview
Predictive Analytics OverviewPredictive Analytics Overview
Predictive Analytics Overview
 
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services Experience
 
Big Data Done Right by Successful Organizations
Big Data Done Right by Successful OrganizationsBig Data Done Right by Successful Organizations
Big Data Done Right by Successful Organizations
 
Big data analytics in payments
Big data analytics in payments Big data analytics in payments
Big data analytics in payments
 
Aanlytics on Telecom
Aanlytics on TelecomAanlytics on Telecom
Aanlytics on Telecom
 
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
 
Big Data solution for multi-national Bank
Big Data solution for multi-national BankBig Data solution for multi-national Bank
Big Data solution for multi-national Bank
 
Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel
 
Claims
ClaimsClaims
Claims
 
Big Data Forum - Phoenix
Big Data Forum - PhoenixBig Data Forum - Phoenix
Big Data Forum - Phoenix
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Último (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Large Scale Data Analytics

  • 1. Large Scale Data Analytics Shankar Radhakrishnan shankar.r3@cognizant.com linkedin.com/in/connect2shankar
  • 2. Scenario • Insurer uses meteorological data for pricing model • At present data from 2000 weather stations are collected for analysis • Plan is to use 10,000 weather station data 
 ( or more ) • Stochastic simulation needs to run to ID pattern in weather data, to determine pricing • Volumetric : peta-bytes of information 
 ( for 1 region ) 2
  • 4. Data Analytics Is Mostly About $$, Customers, Markets 4
  • 5. How Widespread Is Data Analytics? 5
  • 6. Expectations On Payback Period ( Aggressive ) 6
  • 7. Large Scale Data Analytics 7 “Involves using different algorithms, 
 distributed platforms, tools and techniques to analyze big data and provide actionable insights”
  • 8. Big Data “ Data sets that are very large in volume and complex “ 8 New platforms, tools and techniques
 have emerged to manage Big Data We broke away from traditional
 ways to process and analyze them
  • 9. Data Structures   Vector, Matrix, Or Complex Structure Free Text Image or Binary Data Data “bags” Iterative Logic Or Complex Branching Advanced Analytic Routines Rapidly Repeated Measurements Extreme Low Latency Access to all data required Search Ranking X X X X X X Ad Tracking X X X X X X X X   Location or Proximity Tracking X   X X     X X   Social CRM X X X X X X      X Document Similarity Testing X X X X X X   X X Genomic Analysis X X X X X Customer Cohort groups X X   X X X     X Fraud Detection X X X X X X X X X Smart Utility Metering X X X X X X Churn Analysis X X X X X X   X   Satellite Image Analysis X X X X Game Gesture Analysis X X X X X X X X Data Bag Exploration X X X X X X 9
  • 10. Business Interests : Well Informed Customer Executive 10 Speech to Text Conversion Voice Data Unstructured data Analytical System Customer Persona • Customer Persona - Demographics,
 Top interactions, 
 Channel Preferences, Dissatisfies • Customer Lifetime Value • Recent Contact History • Customer Sentiment & Trend during the call Customer’s state of mind Sentimental Analysis Social media Depositions Complaints Other Channel information (ATM, Branch) Big Data Warehouse Traditional Warehouse Decision Engine • Customer Executive Dashboard presents all intelligence required to make a decision • The decision engine also presents important decisions to be taken for the particular customer issue
  • 11. Well Informed Customer Executive… Customer calls BankingCallCenter Executive understands the customerproblemExecutive authenticates customer and pulls up CustomerPersona Executive reviews risk of attrition against Customer LifetimeValue Executive reviews Last 5 call center and banking transactions Executiveviews customer’s state of mind (riskof attrition )through a barometer chart Analytical Solution - Converts Speech to textAnalytical engine listens to customer voice Suggested top 5 Actions required DecisionEngine Executive performs below actions based on his analysis and recommendations from Decision engine 1. Reversal of overdraft fee 2. One time fee waiver on Cheque book (predicting customer need based on historic usage cycles ) 3. Cash back Reward card for a minimum spend of $X through debitcard 4. Offer interest revision for investment products or mortgage 5. Promote new mutual funds or credit cards based on customer willingness Analytical engine monitors sentiment Executive analyzes Customer Persona (demographic / Preferences / Satisfiers / dissatisfiesetc ) 11
  • 12. Business Interests : Fraud Prevention 12 Envisaged Benefits ▪ New fraud patterns can be identified by building ‘analytical models’ to run against historical data ▪ ‘Web crawling’, ‘Contextual text analysis’, ‘Natural Language Processing’ allows fraud behavior identification from social media. It may increase Fraud detection success rate ▪ ‘Real time’ models to capture behavioral patters and do pattern analysis against History data to evaluate Fraud case validity. The model learns by self and updates ‘Fraud pattern master sets. ▪ Brings ‘artificial intelligent’ fraud pattern detection and analysis ▪ ‘Real time’ (in the order of .5-1 minute refresh rate) alerts to Fraud analysts about ‘self learned’ fraud patterns based on new customer behavior patterns Big Data Usage ▪ Formation of key value groups to the order of XcY (where X no. of attributes that are relevant to Fraud and Y is no. of attributes that should be combined to identify patterns) ▪ High speed history data loading from source systems ▪ Efficient Real time fraud detection by identifying patterns through customer behavioral events and processing them over X yrs. of history data – e.g. using HBase Scenario Formation of Fraud pattern reference tables using ▪ Real time data coming from different departments like IVR, WEB, Customer profile, Transactions etc ▪ Real time Mining and analysis of history data to form prior patterns (no. of years in range to 50-100 TB)
  • 13. Fraud Pattern Detection… 13 Legacy Fraud Data Customer Profile Data IVR Audio Data Web / Online Card Transaction Data Fraud Pattern Master Table Fraud Analyst History Data Processing to determine Fraud Patterns over X years Real-time Customer Behavior Analysis for Fraud Detection Customer Behavior Change Events Customer Behavior Change Events Customer Behavior Change Events Real time Analysis of behavior patterns over historical data Real time update to Master Table on New Fraud Patterns Real time alert to 
 Fraud Analyst RDBMS RDBMS (JSON Files) RDBMS Customer Behavior Change Events
  • 15. Benefits 15 BenefitsIndustry Financial services ▪ Customer Insights – Integrating Transactional data (CRM/Payments) and unstructured Social feeds ▪ Regulatory Compliance – Risk exposures across asset classes, LOBs and firms ▪ Fraud Detection in Credit Cards & Financial Crimes (AML) in Banks Travel, Hospitality & Retail ▪ Customer centricity – Customer behavior analysis from Omni channel retailing & Social feeds ▪ Markdown Optimization – Improve markdown based on actual customer buying patters ▪ Market basket analysis – Narrow down market basket analysis by demographics Life Science ▪ Improve targeting & predictions – Automatic Detection of Adverse Drug Effects (ADEs) ▪ Patient data analysis – Longitudinal Patient Data (LPD) analysis ▪ Predictive Sciences – Analyze Preclinical Side Effect Profiles of Marketed Drugs Healthcare (Payers & Providers) ▪ Cost of Care – Drug effectiveness & Cost of Care Analysis based on electronic Health Records (EMR) ▪ Self Service Healthcare – Increase in mHealth & eHealth to allow consumer access to health information ▪ Claims Analytics – Analyze insurance claims data for fraud detection & preferred treatment plans Communication, Media & Entertainment ▪ Discover churn patterns based on Call data records (CDRs) and activity in subscribers’ networks ▪ Digital Asset Management (DAM) – Analyze & capitalize digital data assets Manufacturing ▪ Proactive Maintenance & Recommendation – Sensor Monitoring for automobile, buildings & machinery ▪ Energy Efficiency – Leveraging Smart meters for utility energy consumption ▪ Location or Proximity Tracking – Location based analytics using GPS Data Hi-Tech ▪ Extend and complement conventional information supply chain with big data path ▪ Predictive analysis and real time decision support
  • 28. Analytics - Trends • Big Data Analytics In The Cloud • AWS, AWS-Redshift • Hadoop • Enterprise Data Operating System • Data Analytics Platform • SQL on Hadoop • NoSQL • IoT ( Internet of Things ) 28 • Multi-polar Analytics • Predictive Analytics ( Spark ) • In-memory Analytics • Data Lake • Deep Learning • Machine Learning • Neural Networks • Data Monetization
  • 29. Q & A
  • 30. Thank You ! “Any Sufficiently Advanced Technology Is Indistinguishable From Magic “ - Arthur C. Clarke