SlideShare uma empresa Scribd logo
1 de 26
New Data Project Plan
Anderson, Brykman, Gero, Lakhani, Matthews
PREDICT 480 – SECTION 55
Introduction
Company Background
 DERAK is an analytical support team for a young credit card company
 Company is 5 years old
 Latest venture is to consolidate data sources across the organization as well as
collect data from mobile applications to gain customer loyalty
 Why did we select this company?
 The combination of credit card data, social media, GPS, mobile data, etc. offer
exiting opportunities for modeling to give value to the customer
Introduction
SWOT Analysis
 Strengths
 Company is becoming analytically mature (Stage III on Maturity Model)
 Diversity of products to spread risk
 Weaknesses
 Not yet a significant brand in Credit Card market
 As a new company, we have limited data from 2008 Financial Crisis
 Opportunities
 Gain new insights on customers through mobile app
 Utilize data for predictive modeling
 Explore uses of GPS data
 Sell customer data to third party companies
 Threats
 Credit Card Market is mature with several key players. Gaining significant market share is
difficult and expensive
 Customer privacy concerns and government regulations
Introduction
3 Key Issues to Address with Analytics
1. Limited understanding about our customer
1. We need to know more about our customer in order to offer them more value than
the competition
2. Credit card default risk
1. Increase volume but maintain quality
3. Fraud
1. We need to increase trust with the customer by protecting them financially
Introduction
Database and Data preparation plan
 Our database and data preparation plan will help us solve this issues by:
 Helping us to gather more intimate data on our customer to better understand
their shopping preferences, personal interests, and measure customer loyalty
 We plan to utilize the new data to identify possible natural clusters of our
customers that may help use to identify segments that could carry higher
probability of default risk
 Through mobile applications, we can use GPS data to identify transactions that
may be fraudulent in order to better protect our customers
Literature and Data Sources Overview
 Document your information and data sources.
 Describe methods relevant to:
 Acquiring
 Storing
 Maintaining
 Accessing the data that you need
 Cite references that you have used to guide your thinking about
data sources and methods, and include these in the reference list
a the end of the paper.
Acquiring Data
 Company Databases1
 Demographic information
 Banking plans
 Credit/Debit balances
 Transactional data
 Social media3
 Twitter
 Hashtags
 @Branding
 Facebook
 Status updates
 ‘Likes’
 Contact Network
1 https://customers.microsoft.com/Pages/Download.aspx?id=13928/
2 http://www.sitetechsystems.com/top-10-ways-to-use-gis-in-retail-banking/
2 www.mmaglobal.com/files/mbankingoverview.pdf
3 http://www.bearingpoint.com/ecomaXL/files/0615_WP_EN_Social_CRM_final_web.pdf
 Mobile Phone Data2
 GPS Location
 Mobile client application
 Mobile web
 Short Message Service
(SMS)
 Contact Lists
Bank
MobileSocial
media
Data Integration
Central location
Dealing with data issues
and data preparation
 Integration of results from
various data sources into a
central location for analysis1
 Integration of results from
feeds (structured and
unstructured)1
1 https://www.in.capgemini.com/resource-file-
access/resource/pdf/A_Case_for_Enterprise_Data_Management_for_Banking.pdf
Storage
 Considerations close to real time or longer term
access1
 Governance for retention periods, data ownership
and entry of new data sources2
 Maintain backups2
 Accessible2
 Document sources and validations taking place2
1 http://www.oracle.com/us/products/middleware/data-integration/oracle-goldengate-realtime-access-2031152.pdf
2 http://www.osfi-bsif.gc.ca/eng/docs/data_maint_ja06.pdf
Maintaining data
 Backing up data1
 Deleting data based on established retention1
 Performance optimization2
1 http://www.osfi-bsif.gc.ca/eng/docs/data_maint_ja06.pdf
2 http://www.oracle.com/technetwork/database/bi-datawarehousing/twp-bp-for-stats-gather-12c-1967354.pdf
Accessing
 Access via graphical user interfaces (GUIs) through
applications for
 Structured data or unstructured data
 Real time or ex-post data
 Access for
 System administration and maintenance
 Validation, editing and estimating (VEE)
 Analyzing and modelling processed data (Ultimate Goal)
 Audit access controls1
1 http://www.osfi-bsif.gc.ca/eng/docs/data_maint_ja06.pdf
Criteria
 Describe the systems and methods used to:
 Acquire data
 Store data
 Maintain data
 Access data
 Describe the infrastructure.
 Describe how these systems and methods work together.
Acquire data
 Python and R languages will be used for data acquisition.
 Lightweight and flexible.
 Mature - industry standards.
 Multiple libraries available for data handling and analytics.
 Data acquisition
 Facebook1 and Twitter2 provide APIs for access
 Python supported
 Returns JSON format
 Company data
 Years of company credit card history available.
1 https://developers.facebook.com/docs/graph-api/using-graph-api/v2.5
2 https://dev.twitter.com/overview/api
Storing, Maintaining and Accessing
 Oracle DBMS for internal data
 PostgreSQL for analytics
 Supports structured and unstructured data.
 Better performance than MongoDB
 Purchase history tied to customers maintained for 90 days.
 Data used for analytic models maintained for 90 days.
 Data anonymized and maintained for 2 years.
 Used for general trending and analysis.
 Sold to third parties.
 Views will be created in the PostgreSQL environment to allow access for the
mobile application.
Infrastructure and How it works.
Data Preparation
 Text Analysis
 Cleanse the data for keywords
 GPS Data
 Broad scale demographic segmentation
 Contact Networks
 Identifiable features of people and groups
Data Issues
Problems
 Application Permissions
 Consumer Participation
 Inaccurate Text Analysis
 Data Storage and Processing
Solutions
 Provide benefits for permissions
 Entice consumers with offers
 Segmented Analysis
 Commodity Clustered Servers
Data Quality
 Outliers & Incomplete data
 Used where applicable, likely excluded
 Bootstrapping
 Estimation of additional sampling distributions
 Influential observations
 Used to further define possible segmentations
In Conclusion…
Opportunities
 DERAK is well positioned to create key analytical insights to our credit card
customer, which will benefit its services by:
 Improving its market access
 Gaining new insights on its credit card customers through mobile app
 Utilizing the collected data for predictive modeling (spending habits, etc.)
 Exploring uses of GPS data
 Selling anonymized customer data to third party companies
In Conclusion…
Full Services
In Conclusion…
Data Sources
In Conclusion…
Systems and Methods
In Conclusion…
Data Issues and Preparation
In Conclusion…
Direction and future state
 Refining data collection and
analytics
 Improving data mining methods
 Expanding our services and market
reach to other industries
 Team and Presenters (in order of appearance)
 Aaron Matthews
 Ketan Lakhani
 Eric Gero
 Daniel Anderson
 Raphael Brykman
References
 Royal Bank of Scotland Case Study (2013)
 Mobile Banking Overview (January 2009)
 10 ways to use GIS data in Retail Banking (2012)
 A Case for Enterprise Data Management in Banking (2012)
 Data Maintenance at IRB Institutions (2006)
 Best Practices for Gathering Optimizer Statistics with Oracle Database 12c

Mais conteúdo relacionado

Mais procurados

Customer Segmentation Project
Customer Segmentation ProjectCustomer Segmentation Project
Customer Segmentation ProjectAditya Ekawade
 
MDM AS A METHODOLOGY
MDM AS A METHODOLOGYMDM AS A METHODOLOGY
MDM AS A METHODOLOGYJanet Wetter
 
Big Data Analytics for Contact Centers
Big Data Analytics for Contact CentersBig Data Analytics for Contact Centers
Big Data Analytics for Contact CentersRajender K Salgam
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYCPankaj Baid
 
Mastering Big Data: The Next Big Leap for Master Data Management
Mastering Big Data: The Next Big Leap for Master Data ManagementMastering Big Data: The Next Big Leap for Master Data Management
Mastering Big Data: The Next Big Leap for Master Data ManagementCognizant
 
University Single Constituent View Repository ( SCoRe)
University Single Constituent View Repository ( SCoRe)University Single Constituent View Repository ( SCoRe)
University Single Constituent View Repository ( SCoRe)Hemant Verma
 
Appregatta Portfolio Overview
Appregatta Portfolio OverviewAppregatta Portfolio Overview
Appregatta Portfolio Overviewgueste5e2095c
 
How can banks maximise the value of their customer data?
How can banks maximise the value of their customer data?How can banks maximise the value of their customer data?
How can banks maximise the value of their customer data?Ben Gilchriest
 
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?VIRGOkonsult
 
Big data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer dataBig data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer dataRick Bouter
 
Marketsoft and marketing cube data quality to cc-v3
Marketsoft and marketing cube   data quality to cc-v3Marketsoft and marketing cube   data quality to cc-v3
Marketsoft and marketing cube data quality to cc-v3Marketsoft
 
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...IRJET Journal
 
The Big Returns from Big Data
The Big Returns from Big Data The Big Returns from Big Data
The Big Returns from Big Data EMC
 

Mais procurados (16)

Customer Segmentation Project
Customer Segmentation ProjectCustomer Segmentation Project
Customer Segmentation Project
 
MDM AS A METHODOLOGY
MDM AS A METHODOLOGYMDM AS A METHODOLOGY
MDM AS A METHODOLOGY
 
Big Data Analytics for Contact Centers
Big Data Analytics for Contact CentersBig Data Analytics for Contact Centers
Big Data Analytics for Contact Centers
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYC
 
Mastering Big Data: The Next Big Leap for Master Data Management
Mastering Big Data: The Next Big Leap for Master Data ManagementMastering Big Data: The Next Big Leap for Master Data Management
Mastering Big Data: The Next Big Leap for Master Data Management
 
University Single Constituent View Repository ( SCoRe)
University Single Constituent View Repository ( SCoRe)University Single Constituent View Repository ( SCoRe)
University Single Constituent View Repository ( SCoRe)
 
Appregatta Portfolio Overview
Appregatta Portfolio OverviewAppregatta Portfolio Overview
Appregatta Portfolio Overview
 
How can banks maximise the value of their customer data?
How can banks maximise the value of their customer data?How can banks maximise the value of their customer data?
How can banks maximise the value of their customer data?
 
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
Big Data Alchemy: How can Banks Maximize the Value of their Customer Data?
 
Big data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer dataBig data alchemy - how can banks maximize the value of their customer data
Big data alchemy - how can banks maximize the value of their customer data
 
Marketsoft and marketing cube data quality to cc-v3
Marketsoft and marketing cube   data quality to cc-v3Marketsoft and marketing cube   data quality to cc-v3
Marketsoft and marketing cube data quality to cc-v3
 
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
 
The Big Returns from Big Data
The Big Returns from Big Data The Big Returns from Big Data
The Big Returns from Big Data
 
Unit 4 Advanced Data Analytics
Unit 4 Advanced Data AnalyticsUnit 4 Advanced Data Analytics
Unit 4 Advanced Data Analytics
 
Apics12
Apics12Apics12
Apics12
 
BI
BIBI
BI
 

Destaque

модель создания социотехнических систем о.р.мусина часть4
модель создания социотехнических систем о.р.мусина часть4модель создания социотехнических систем о.р.мусина часть4
модель создания социотехнических систем о.р.мусина часть4Олег Радикович Мусин
 
Shannell Brown-Curriculum Vitae
Shannell Brown-Curriculum VitaeShannell Brown-Curriculum Vitae
Shannell Brown-Curriculum VitaeSHANNELL BROWN
 
Classroom rules
Classroom rulesClassroom rules
Classroom rulestfoster412
 
The 3 Telltale Signs You Need To Move To A New Recruitment Company
The 3 Telltale Signs You Need To Move To A New Recruitment CompanyThe 3 Telltale Signs You Need To Move To A New Recruitment Company
The 3 Telltale Signs You Need To Move To A New Recruitment CompanyGSR2R
 
Derak project plan overview
Derak   project plan overviewDerak   project plan overview
Derak project plan overviewEric Gero
 
Afm nroi project ppt
Afm nroi project pptAfm nroi project ppt
Afm nroi project pptAnoop Sharma
 
Coldplay Magazine Advert Analysis
Coldplay Magazine Advert AnalysisColdplay Magazine Advert Analysis
Coldplay Magazine Advert Analysiselleahmedia
 
Opening address: National Nursing Informatics Conference, HiNZ 2015
Opening address: National Nursing Informatics Conference, HiNZ 2015Opening address: National Nursing Informatics Conference, HiNZ 2015
Opening address: National Nursing Informatics Conference, HiNZ 2015Health Informatics New Zealand
 
You Me At Six, Hold Me Down Advert Analysis
You Me At Six, Hold Me Down Advert AnalysisYou Me At Six, Hold Me Down Advert Analysis
You Me At Six, Hold Me Down Advert Analysiselleahmedia
 
Homeostasis. Documento para tercero medio, biología, plan común.
Homeostasis. Documento para tercero medio, biología, plan común.Homeostasis. Documento para tercero medio, biología, plan común.
Homeostasis. Documento para tercero medio, biología, plan común.Hogar
 

Destaque (13)

модель создания социотехнических систем о.р.мусина часть4
модель создания социотехнических систем о.р.мусина часть4модель создания социотехнических систем о.р.мусина часть4
модель создания социотехнических систем о.р.мусина часть4
 
Correo_primeros
Correo_primerosCorreo_primeros
Correo_primeros
 
Shannell Brown-Curriculum Vitae
Shannell Brown-Curriculum VitaeShannell Brown-Curriculum Vitae
Shannell Brown-Curriculum Vitae
 
Classroom rules
Classroom rulesClassroom rules
Classroom rules
 
Grades.PDF
Grades.PDFGrades.PDF
Grades.PDF
 
The 3 Telltale Signs You Need To Move To A New Recruitment Company
The 3 Telltale Signs You Need To Move To A New Recruitment CompanyThe 3 Telltale Signs You Need To Move To A New Recruitment Company
The 3 Telltale Signs You Need To Move To A New Recruitment Company
 
Derak project plan overview
Derak   project plan overviewDerak   project plan overview
Derak project plan overview
 
Afm nroi project ppt
Afm nroi project pptAfm nroi project ppt
Afm nroi project ppt
 
Coldplay Magazine Advert Analysis
Coldplay Magazine Advert AnalysisColdplay Magazine Advert Analysis
Coldplay Magazine Advert Analysis
 
Opening address: National Nursing Informatics Conference, HiNZ 2015
Opening address: National Nursing Informatics Conference, HiNZ 2015Opening address: National Nursing Informatics Conference, HiNZ 2015
Opening address: National Nursing Informatics Conference, HiNZ 2015
 
Connectography: Parag Khanna
Connectography: Parag Khanna Connectography: Parag Khanna
Connectography: Parag Khanna
 
You Me At Six, Hold Me Down Advert Analysis
You Me At Six, Hold Me Down Advert AnalysisYou Me At Six, Hold Me Down Advert Analysis
You Me At Six, Hold Me Down Advert Analysis
 
Homeostasis. Documento para tercero medio, biología, plan común.
Homeostasis. Documento para tercero medio, biología, plan común.Homeostasis. Documento para tercero medio, biología, plan común.
Homeostasis. Documento para tercero medio, biología, plan común.
 

Semelhante a Derak Presentation

C21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxC21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxAdityaDeshpande674450
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalHarvinder Atwal
 
Business intelligence and analytics
Business intelligence and analyticsBusiness intelligence and analytics
Business intelligence and analyticsYogesh Supekar
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChicago Hadoop Users Group
 
Simplify our analytics strategy
Simplify our analytics strategySimplify our analytics strategy
Simplify our analytics strategysaurabh sethia
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Capgemini
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategyShikhar Gupta
 
Importance of Data Analytics
 Importance of Data Analytics Importance of Data Analytics
Importance of Data AnalyticsProduct School
 
Bridging the Gap Between Business Objectives and Data Strategy
Bridging the Gap Between Business Objectives and Data StrategyBridging the Gap Between Business Objectives and Data Strategy
Bridging the Gap Between Business Objectives and Data StrategyRNayak3
 
Rising Significance of Big Data Analytics for Exponential Growth.docx
Rising Significance of Big Data Analytics for Exponential Growth.docxRising Significance of Big Data Analytics for Exponential Growth.docx
Rising Significance of Big Data Analytics for Exponential Growth.docxSG Analytics
 
Data democratization the key to future proofing data culture
Data democratization the key to future proofing data cultureData democratization the key to future proofing data culture
Data democratization the key to future proofing data culturePolestarsolutions
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Denodo
 
Business Intellegence
Business IntellegenceBusiness Intellegence
Business IntellegenceKallol Sarkar
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperExperian
 

Semelhante a Derak Presentation (20)

C21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxC21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
 
Business intelligence and analytics
Business intelligence and analyticsBusiness intelligence and analytics
Business intelligence and analytics
 
Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
 
Big Data
Big DataBig Data
Big Data
 
Simplify our analytics strategy
Simplify our analytics strategySimplify our analytics strategy
Simplify our analytics strategy
 
uae views on big data
  uae views on  big data  uae views on  big data
uae views on big data
 
Achieving Business Success with Data.pdf
Achieving Business Success with Data.pdfAchieving Business Success with Data.pdf
Achieving Business Success with Data.pdf
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
 
Importance of Data Analytics
 Importance of Data Analytics Importance of Data Analytics
Importance of Data Analytics
 
Bridging the Gap Between Business Objectives and Data Strategy
Bridging the Gap Between Business Objectives and Data StrategyBridging the Gap Between Business Objectives and Data Strategy
Bridging the Gap Between Business Objectives and Data Strategy
 
Rising Significance of Big Data Analytics for Exponential Growth.docx
Rising Significance of Big Data Analytics for Exponential Growth.docxRising Significance of Big Data Analytics for Exponential Growth.docx
Rising Significance of Big Data Analytics for Exponential Growth.docx
 
Data democratization the key to future proofing data culture
Data democratization the key to future proofing data cultureData democratization the key to future proofing data culture
Data democratization the key to future proofing data culture
 
Big Data Analytics Services - BrainCave Soft.pdf
Big Data Analytics Services - BrainCave Soft.pdfBig Data Analytics Services - BrainCave Soft.pdf
Big Data Analytics Services - BrainCave Soft.pdf
 
Big Data Analytics Services - BrainCave Soft.pdf
Big Data Analytics Services - BrainCave Soft.pdfBig Data Analytics Services - BrainCave Soft.pdf
Big Data Analytics Services - BrainCave Soft.pdf
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
 
Business Intellegence
Business IntellegenceBusiness Intellegence
Business Intellegence
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White Paper
 

Último

Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 

Último (20)

Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 

Derak Presentation

  • 1. New Data Project Plan Anderson, Brykman, Gero, Lakhani, Matthews PREDICT 480 – SECTION 55
  • 2. Introduction Company Background  DERAK is an analytical support team for a young credit card company  Company is 5 years old  Latest venture is to consolidate data sources across the organization as well as collect data from mobile applications to gain customer loyalty  Why did we select this company?  The combination of credit card data, social media, GPS, mobile data, etc. offer exiting opportunities for modeling to give value to the customer
  • 3. Introduction SWOT Analysis  Strengths  Company is becoming analytically mature (Stage III on Maturity Model)  Diversity of products to spread risk  Weaknesses  Not yet a significant brand in Credit Card market  As a new company, we have limited data from 2008 Financial Crisis  Opportunities  Gain new insights on customers through mobile app  Utilize data for predictive modeling  Explore uses of GPS data  Sell customer data to third party companies  Threats  Credit Card Market is mature with several key players. Gaining significant market share is difficult and expensive  Customer privacy concerns and government regulations
  • 4. Introduction 3 Key Issues to Address with Analytics 1. Limited understanding about our customer 1. We need to know more about our customer in order to offer them more value than the competition 2. Credit card default risk 1. Increase volume but maintain quality 3. Fraud 1. We need to increase trust with the customer by protecting them financially
  • 5. Introduction Database and Data preparation plan  Our database and data preparation plan will help us solve this issues by:  Helping us to gather more intimate data on our customer to better understand their shopping preferences, personal interests, and measure customer loyalty  We plan to utilize the new data to identify possible natural clusters of our customers that may help use to identify segments that could carry higher probability of default risk  Through mobile applications, we can use GPS data to identify transactions that may be fraudulent in order to better protect our customers
  • 6. Literature and Data Sources Overview  Document your information and data sources.  Describe methods relevant to:  Acquiring  Storing  Maintaining  Accessing the data that you need  Cite references that you have used to guide your thinking about data sources and methods, and include these in the reference list a the end of the paper.
  • 7. Acquiring Data  Company Databases1  Demographic information  Banking plans  Credit/Debit balances  Transactional data  Social media3  Twitter  Hashtags  @Branding  Facebook  Status updates  ‘Likes’  Contact Network 1 https://customers.microsoft.com/Pages/Download.aspx?id=13928/ 2 http://www.sitetechsystems.com/top-10-ways-to-use-gis-in-retail-banking/ 2 www.mmaglobal.com/files/mbankingoverview.pdf 3 http://www.bearingpoint.com/ecomaXL/files/0615_WP_EN_Social_CRM_final_web.pdf  Mobile Phone Data2  GPS Location  Mobile client application  Mobile web  Short Message Service (SMS)  Contact Lists
  • 8. Bank MobileSocial media Data Integration Central location Dealing with data issues and data preparation  Integration of results from various data sources into a central location for analysis1  Integration of results from feeds (structured and unstructured)1 1 https://www.in.capgemini.com/resource-file- access/resource/pdf/A_Case_for_Enterprise_Data_Management_for_Banking.pdf
  • 9. Storage  Considerations close to real time or longer term access1  Governance for retention periods, data ownership and entry of new data sources2  Maintain backups2  Accessible2  Document sources and validations taking place2 1 http://www.oracle.com/us/products/middleware/data-integration/oracle-goldengate-realtime-access-2031152.pdf 2 http://www.osfi-bsif.gc.ca/eng/docs/data_maint_ja06.pdf
  • 10. Maintaining data  Backing up data1  Deleting data based on established retention1  Performance optimization2 1 http://www.osfi-bsif.gc.ca/eng/docs/data_maint_ja06.pdf 2 http://www.oracle.com/technetwork/database/bi-datawarehousing/twp-bp-for-stats-gather-12c-1967354.pdf
  • 11. Accessing  Access via graphical user interfaces (GUIs) through applications for  Structured data or unstructured data  Real time or ex-post data  Access for  System administration and maintenance  Validation, editing and estimating (VEE)  Analyzing and modelling processed data (Ultimate Goal)  Audit access controls1 1 http://www.osfi-bsif.gc.ca/eng/docs/data_maint_ja06.pdf
  • 12. Criteria  Describe the systems and methods used to:  Acquire data  Store data  Maintain data  Access data  Describe the infrastructure.  Describe how these systems and methods work together.
  • 13. Acquire data  Python and R languages will be used for data acquisition.  Lightweight and flexible.  Mature - industry standards.  Multiple libraries available for data handling and analytics.  Data acquisition  Facebook1 and Twitter2 provide APIs for access  Python supported  Returns JSON format  Company data  Years of company credit card history available. 1 https://developers.facebook.com/docs/graph-api/using-graph-api/v2.5 2 https://dev.twitter.com/overview/api
  • 14. Storing, Maintaining and Accessing  Oracle DBMS for internal data  PostgreSQL for analytics  Supports structured and unstructured data.  Better performance than MongoDB  Purchase history tied to customers maintained for 90 days.  Data used for analytic models maintained for 90 days.  Data anonymized and maintained for 2 years.  Used for general trending and analysis.  Sold to third parties.  Views will be created in the PostgreSQL environment to allow access for the mobile application.
  • 16. Data Preparation  Text Analysis  Cleanse the data for keywords  GPS Data  Broad scale demographic segmentation  Contact Networks  Identifiable features of people and groups
  • 17. Data Issues Problems  Application Permissions  Consumer Participation  Inaccurate Text Analysis  Data Storage and Processing Solutions  Provide benefits for permissions  Entice consumers with offers  Segmented Analysis  Commodity Clustered Servers
  • 18. Data Quality  Outliers & Incomplete data  Used where applicable, likely excluded  Bootstrapping  Estimation of additional sampling distributions  Influential observations  Used to further define possible segmentations
  • 19. In Conclusion… Opportunities  DERAK is well positioned to create key analytical insights to our credit card customer, which will benefit its services by:  Improving its market access  Gaining new insights on its credit card customers through mobile app  Utilizing the collected data for predictive modeling (spending habits, etc.)  Exploring uses of GPS data  Selling anonymized customer data to third party companies
  • 23. In Conclusion… Data Issues and Preparation
  • 24. In Conclusion… Direction and future state  Refining data collection and analytics  Improving data mining methods  Expanding our services and market reach to other industries
  • 25.  Team and Presenters (in order of appearance)  Aaron Matthews  Ketan Lakhani  Eric Gero  Daniel Anderson  Raphael Brykman
  • 26. References  Royal Bank of Scotland Case Study (2013)  Mobile Banking Overview (January 2009)  10 ways to use GIS data in Retail Banking (2012)  A Case for Enterprise Data Management in Banking (2012)  Data Maintenance at IRB Institutions (2006)  Best Practices for Gathering Optimizer Statistics with Oracle Database 12c