SlideShare uma empresa Scribd logo
1 de 23
Baixar para ler offline
Bas Geerdink, 8 June 2017
Streaming Analytics at
IT Manager at ING Analytics
Academic background: A.I. & Informatics
Working in IT since 2004
At ING since 2013
@bgeerdink https://nl.linkedin.com/in/geerdink
Bas Geerdink
Strategy
ING is a top financial enterprise, operating since 1881
Customers
33 Million
Private, Corporate and
Institutional Customers
Countries
41
In Europe, Asia,
Australia, North and
South America
Employees
52,000
Market leaders Benelux
Growth markets
Commercial Banking
Challengers
4
ING’s Think Forward strategy requires a data-driven approach
5
We’re making a shift from batch to real-time
6
Secure and reliable
Relevant
Personal
Omnichannel
Predictive Actionable
We’re building a Streaming Analytics platform with
high-end capabilities
7
• High performance & low latency to manage and process
users’ events.
• Scalability & availability to build new features, scenarios and
integration with other external systems.
• Fault tolerance mechanism to ensure high quality output
with strong information consistency.
• Online machine learning and business rules management
capabilities.
• Fraud detection:
• Detect possible fraudulous transactions
• Identify possible money laundering accounts
• Actionable Insights:
• Small financial insights and robotized advise
• Customer-defined alerts and push notifications
• Offer chat/call when customers don’t finish their customer journey
• Marketing and commercial offerings (‘Next Best Action’)
Main use cases for a Streaming Analytics Platform
8
• Wisdom of the crowd:
• making use of similar customers to make better predictions
• Online learning:
• making use of customer feedback in real-time to train models
• Beyond banking:
• enabling IoT
Future scenarios
9
Streaming Data Platform
The Streaming Data Platform has to be probabilistic in nature
11
Deterministic Probabilistic
• Business rules / decision trees
• Fixed logic (if-then-else)
• Rigid
• Machine learning
• Fuzzy logic
• Flexible
Streaming Data Platform Architecture
CEP Engine Machine Learning Engine Post-Processor
“detect pattern”
“determine
relevancy”
“produce
notification”
Application Flow:
Business Flow:
“Data generator”
Formats outgoing messages
e.g. create push message in HTML format
Get
Customer
Profile
Apply
Selection
Criteria
Score
Notificatio
ns
Detect
Pattern
Get
Intermed
Event
Get Raw
Events
Send
Output
Create
Intermed
Event
Get
Business
Event
Create
Business
Event
Format
Event
“Data gatherer”
Load stream of raw events and
combine/aggregate them into a
meaningful business event
e.g. multiple transactions within one
hour lead to detection of ‘out of
money’ pattern
“Event scorer”
Scoring of PMML models to determine
the personal and relevant follow-up
actions
e.g. given customer is running out of
money, give a suggestion to transfer
money from savings to checkings
account
Streaming Data Platform Architecture
13
CEP Engine Machine Learning Engine Post-Processor
Raw
Event
Business
Event
Notification
Event
“detect pattern”
“determine
relevancy”
“produce
notification”
Application Flow :
Kafka Events:
State:
Business Flow:
Customer
Features
Notification
Definitions
Models
Get
Customer
Profile
Apply
Selection
Criteria
Score
Notificatio
ns
Detect
Pattern
Get
Intermed
Event
Get Raw
Events
Send
Output
Business
Users
System configuration: Machine
Learning
Environment
Configuration
GUI
(future scope) Data
Scientists
Create
Intermed
Event
Get
Business
Event
Create
Business
Event
Intermediate
Event
Format
Event
Data Lake
CEP
Functions
Feedback loop
heap
openscoring.io
Technology
Flink:
• originated as research project “Stratosphere” at Technische Universität Berlin
• is an open source framework for distributed, in-memory (big) data analytics
• Contains Java, Scala, and Python APIs for streams (DataStream), batch (DataSet), and
relational data (Table API)
Benefits:
• true streaming: per-event processing, no micro-batching
• high volumes, low latency
Features:
• state management and fault tolerancy: savepointing, checkpointing  exactly-once
semanctics
• time windowing: event time, flexible (e.g. sum all transactions in the past minute)
• Complex Event Processing, Machine Learning, SQL (at various levels of maturity)
What is Flink, and why use it?
15
PMML is the de facto standard for model representation
• A PMML file contains all pre- and post-processing as well as one or more machine learning models
• Support of hot-replacing an algorithm (configuration) at runtime by just providing an updated PMML
model and interpreting (scoring) it with openscoring.io
• PMML models can be created by a large number of tools, including: Spark, SAS, R, Knime, Python, H20.ai
• Large number of supported machine learning models
PMML (Predictive Model Markup Language) bridges the gap
between data scientists and data engineers
16
Type Model support
Simple business rules • Rulesets, score cards
Complex business rules • Sequences, association rules
Predictive models • Naive Bayes
• k-Nearest Neighbors
• Regression: linear, logistic, regularized
• Trees: decision trees, random forest
• Support Vector Machines
Advanced models • Bayesian Network
• Neural Network (Deep Learning)
• Gaussian Process
Other • Time Series
• Cluster Models
Openscoring.io
• Cross functional squads
• business, dev, ops, data scientist
• Reusing ING standard building blocks and processes
• Continuous integration & delivery in DTAP environment
• Develop: Gitlab  Jenkins  Artifactory
• Using Flink’s savepointing mechanism
• No data loss, guaranteed
• Limited downtime
• Scalability, resilience and robustness
• Flink gives us automatic restart on failure, using checkpointing mechnism
• Rolling updates of infrastructure and middleware leads to no downtime
Development & operations according to the ING way of working
17
Design
Develop
TestDeploy
Monitor
The application, in binary form:
• provides a framework for its business users,
• gives just enough configurability to be effective but not
overcomplex
• gives flexibility in the form of a DSL and GUI (future scope)
The application’s internal components (the code, the artifacts):
• Provide a toolbox for developers
• Contain clearly defined building blocks, for maximum
reusability
• Are open source within ING
The best of both worlds – Functional and Modular Flexibility
18
Squad
In ING’s one way of working, ‘business’ and ‘IT’ go hand-in-hand
SquadSquad
Chapter
Guild
DS
DA
OpsOps
CJE
DS
DA
DS
CJE
DS
Ops
DS
CJE
Dev DevDevDevDev
Tribe
Conclusions
 Bigger streaming data volumes to handle, with higher performance demands
 Enterprise-readyness and security of Flink is continuously improving
 With PMML we have a standard format that is very well suited for multidisciplinary
teams, where data scientists work on the same user stories as engineers
 We set up a streaming analytics community within ING, and work together with industry
leaders (Lightbend, DataArtisans)
 ING build a streaming analytics platform is as a generic, multi-purpose solution with
fraud detection and actionable insights as it first use cases
Why is Streaming Analytics with Flink the right choice for ING,
moving forward?
21
22
Questions
Follow us to stay a step ahead
We’re hiring!
ING.com
YouTube.com/ING
SlideShare.net/ING@ING_News LinkedIn.com/company/ING
Flickr.com/INGGroupFacebook.com/ING

Mais conteúdo relacionado

Mais procurados

50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
Lucas Jellema
 
Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...
Sri Ambati
 
Automatic Model Documentation with H2O
Automatic Model Documentation with H2OAutomatic Model Documentation with H2O
Automatic Model Documentation with H2O
Sri Ambati
 

Mais procurados (20)

The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
 
H2O.ai's Driverless AI
H2O.ai's Driverless AIH2O.ai's Driverless AI
H2O.ai's Driverless AI
 
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
 
ML with Power BI for Business and Pros
ML with Power BI for Business and ProsML with Power BI for Business and Pros
ML with Power BI for Business and Pros
 
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DFML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
 
H2O Driverless AI Workshop
H2O Driverless AI WorkshopH2O Driverless AI Workshop
H2O Driverless AI Workshop
 
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with GraphsNeo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)
 
Predicting Banking Customer Needs with an Agile Approach to Analytics in the ...
Predicting Banking Customer Needs with an Agile Approach to Analytics in the ...Predicting Banking Customer Needs with an Agile Approach to Analytics in the ...
Predicting Banking Customer Needs with an Agile Approach to Analytics in the ...
 
platform for Machine Learning
 platform for Machine Learning platform for Machine Learning
platform for Machine Learning
 
Megan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYC
Megan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYCMegan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYC
Megan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYC
 
Building a Modern FinTech Big Data Infrastructure
Building a Modern FinTech Big Data InfrastructureBuilding a Modern FinTech Big Data Infrastructure
Building a Modern FinTech Big Data Infrastructure
 
Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...
 
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
HOW TO APPLY BIG DATA ANALYTICS AND MACHINE LEARNING TO REAL TIME PROCESSING ...
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
Automatic Model Documentation with H2O
Automatic Model Documentation with H2OAutomatic Model Documentation with H2O
Automatic Model Documentation with H2O
 
Neo4j GraphTalk Düsseldorf - How Graphs revolutionise Identity & Access Manag...
Neo4j GraphTalk Düsseldorf - How Graphs revolutionise Identity & Access Manag...Neo4j GraphTalk Düsseldorf - How Graphs revolutionise Identity & Access Manag...
Neo4j GraphTalk Düsseldorf - How Graphs revolutionise Identity & Access Manag...
 
Building Intelligent Solutions with Graphs, Stefan Kolmar, Neo4j
Building Intelligent Solutions with Graphs, Stefan Kolmar, Neo4jBuilding Intelligent Solutions with Graphs, Stefan Kolmar, Neo4j
Building Intelligent Solutions with Graphs, Stefan Kolmar, Neo4j
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
 
Mastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott CordoMastering Your Customer Data on Apache Spark by Elliott Cordo
Mastering Your Customer Data on Apache Spark by Elliott Cordo
 

Semelhante a Fast Data at ING – the why, what and how of the streaming analytics platform in a financial enterprise

Using Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemUsing Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation System
VMware Tanzu
 
Delia_J_Micu_resume_arch_Jan_2015
Delia_J_Micu_resume_arch_Jan_2015Delia_J_Micu_resume_arch_Jan_2015
Delia_J_Micu_resume_arch_Jan_2015
Delia J. Micu
 
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Lviv Startup Club
 
The differing ways to monitor and instrument
The differing ways to monitor and instrumentThe differing ways to monitor and instrument
The differing ways to monitor and instrument
Jonah Kowall
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdf
ssuserd23711
 
Run Book Automation: Why All Roads Lead to It
Run Book Automation: Why All Roads Lead to ItRun Book Automation: Why All Roads Lead to It
Run Book Automation: Why All Roads Lead to It
elliando dias
 

Semelhante a Fast Data at ING – the why, what and how of the streaming analytics platform in a financial enterprise (20)

Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
IMCSummit 2015 - Day 1 Developer Track - Implementing Operational Intelligenc...
 
Ahmed El Mawaziny CV
Ahmed El Mawaziny CVAhmed El Mawaziny CV
Ahmed El Mawaziny CV
 
Accelerate ML Deployment with H2O Driverless AI on AWS
Accelerate ML Deployment with H2O Driverless AI on AWSAccelerate ML Deployment with H2O Driverless AI on AWS
Accelerate ML Deployment with H2O Driverless AI on AWS
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Using Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemUsing Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation System
 
Delia_J_Micu_resume_arch_Jan_2015
Delia_J_Micu_resume_arch_Jan_2015Delia_J_Micu_resume_arch_Jan_2015
Delia_J_Micu_resume_arch_Jan_2015
 
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
 
AI projects - Lifecyle & Best Practices
AI projects - Lifecyle & Best PracticesAI projects - Lifecyle & Best Practices
AI projects - Lifecyle & Best Practices
 
What Does Artificial Intelligence Have to Do with IT Operations?
What Does Artificial Intelligence Have to Do with IT Operations?What Does Artificial Intelligence Have to Do with IT Operations?
What Does Artificial Intelligence Have to Do with IT Operations?
 
Introduction to Big Data using AWS Services
Introduction to Big Data using AWS ServicesIntroduction to Big Data using AWS Services
Introduction to Big Data using AWS Services
 
The differing ways to monitor and instrument
The differing ways to monitor and instrumentThe differing ways to monitor and instrument
The differing ways to monitor and instrument
 
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
Flink Forward Berlin 2017 Keynote: Ferd Scheepers - Taking away customer fric...
 
ITMAGINATION - competences, facts, technologies, clients
ITMAGINATION - competences, facts, technologies, clientsITMAGINATION - competences, facts, technologies, clients
ITMAGINATION - competences, facts, technologies, clients
 
Fractional Chief AI Officer Services For Hire
Fractional Chief AI Officer Services For HireFractional Chief AI Officer Services For Hire
Fractional Chief AI Officer Services For Hire
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdf
 
Ai platform at scale
Ai platform at scaleAi platform at scale
Ai platform at scale
 
Run Book Automation: Why All Roads Lead to It
Run Book Automation: Why All Roads Lead to ItRun Book Automation: Why All Roads Lead to It
Run Book Automation: Why All Roads Lead to It
 
Shaik Niyas Ahamed M Resume
Shaik Niyas Ahamed M ResumeShaik Niyas Ahamed M Resume
Shaik Niyas Ahamed M Resume
 

Último

Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 

Último (20)

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 

Fast Data at ING – the why, what and how of the streaming analytics platform in a financial enterprise

  • 1. Bas Geerdink, 8 June 2017 Streaming Analytics at
  • 2. IT Manager at ING Analytics Academic background: A.I. & Informatics Working in IT since 2004 At ING since 2013 @bgeerdink https://nl.linkedin.com/in/geerdink Bas Geerdink
  • 4. ING is a top financial enterprise, operating since 1881 Customers 33 Million Private, Corporate and Institutional Customers Countries 41 In Europe, Asia, Australia, North and South America Employees 52,000 Market leaders Benelux Growth markets Commercial Banking Challengers 4
  • 5. ING’s Think Forward strategy requires a data-driven approach 5
  • 6. We’re making a shift from batch to real-time 6 Secure and reliable Relevant Personal Omnichannel Predictive Actionable
  • 7. We’re building a Streaming Analytics platform with high-end capabilities 7 • High performance & low latency to manage and process users’ events. • Scalability & availability to build new features, scenarios and integration with other external systems. • Fault tolerance mechanism to ensure high quality output with strong information consistency. • Online machine learning and business rules management capabilities.
  • 8. • Fraud detection: • Detect possible fraudulous transactions • Identify possible money laundering accounts • Actionable Insights: • Small financial insights and robotized advise • Customer-defined alerts and push notifications • Offer chat/call when customers don’t finish their customer journey • Marketing and commercial offerings (‘Next Best Action’) Main use cases for a Streaming Analytics Platform 8
  • 9. • Wisdom of the crowd: • making use of similar customers to make better predictions • Online learning: • making use of customer feedback in real-time to train models • Beyond banking: • enabling IoT Future scenarios 9
  • 11. The Streaming Data Platform has to be probabilistic in nature 11 Deterministic Probabilistic • Business rules / decision trees • Fixed logic (if-then-else) • Rigid • Machine learning • Fuzzy logic • Flexible
  • 12. Streaming Data Platform Architecture CEP Engine Machine Learning Engine Post-Processor “detect pattern” “determine relevancy” “produce notification” Application Flow: Business Flow: “Data generator” Formats outgoing messages e.g. create push message in HTML format Get Customer Profile Apply Selection Criteria Score Notificatio ns Detect Pattern Get Intermed Event Get Raw Events Send Output Create Intermed Event Get Business Event Create Business Event Format Event “Data gatherer” Load stream of raw events and combine/aggregate them into a meaningful business event e.g. multiple transactions within one hour lead to detection of ‘out of money’ pattern “Event scorer” Scoring of PMML models to determine the personal and relevant follow-up actions e.g. given customer is running out of money, give a suggestion to transfer money from savings to checkings account
  • 13. Streaming Data Platform Architecture 13 CEP Engine Machine Learning Engine Post-Processor Raw Event Business Event Notification Event “detect pattern” “determine relevancy” “produce notification” Application Flow : Kafka Events: State: Business Flow: Customer Features Notification Definitions Models Get Customer Profile Apply Selection Criteria Score Notificatio ns Detect Pattern Get Intermed Event Get Raw Events Send Output Business Users System configuration: Machine Learning Environment Configuration GUI (future scope) Data Scientists Create Intermed Event Get Business Event Create Business Event Intermediate Event Format Event Data Lake CEP Functions Feedback loop heap openscoring.io
  • 15. Flink: • originated as research project “Stratosphere” at Technische Universität Berlin • is an open source framework for distributed, in-memory (big) data analytics • Contains Java, Scala, and Python APIs for streams (DataStream), batch (DataSet), and relational data (Table API) Benefits: • true streaming: per-event processing, no micro-batching • high volumes, low latency Features: • state management and fault tolerancy: savepointing, checkpointing  exactly-once semanctics • time windowing: event time, flexible (e.g. sum all transactions in the past minute) • Complex Event Processing, Machine Learning, SQL (at various levels of maturity) What is Flink, and why use it? 15
  • 16. PMML is the de facto standard for model representation • A PMML file contains all pre- and post-processing as well as one or more machine learning models • Support of hot-replacing an algorithm (configuration) at runtime by just providing an updated PMML model and interpreting (scoring) it with openscoring.io • PMML models can be created by a large number of tools, including: Spark, SAS, R, Knime, Python, H20.ai • Large number of supported machine learning models PMML (Predictive Model Markup Language) bridges the gap between data scientists and data engineers 16 Type Model support Simple business rules • Rulesets, score cards Complex business rules • Sequences, association rules Predictive models • Naive Bayes • k-Nearest Neighbors • Regression: linear, logistic, regularized • Trees: decision trees, random forest • Support Vector Machines Advanced models • Bayesian Network • Neural Network (Deep Learning) • Gaussian Process Other • Time Series • Cluster Models Openscoring.io
  • 17. • Cross functional squads • business, dev, ops, data scientist • Reusing ING standard building blocks and processes • Continuous integration & delivery in DTAP environment • Develop: Gitlab  Jenkins  Artifactory • Using Flink’s savepointing mechanism • No data loss, guaranteed • Limited downtime • Scalability, resilience and robustness • Flink gives us automatic restart on failure, using checkpointing mechnism • Rolling updates of infrastructure and middleware leads to no downtime Development & operations according to the ING way of working 17 Design Develop TestDeploy Monitor
  • 18. The application, in binary form: • provides a framework for its business users, • gives just enough configurability to be effective but not overcomplex • gives flexibility in the form of a DSL and GUI (future scope) The application’s internal components (the code, the artifacts): • Provide a toolbox for developers • Contain clearly defined building blocks, for maximum reusability • Are open source within ING The best of both worlds – Functional and Modular Flexibility 18
  • 19. Squad In ING’s one way of working, ‘business’ and ‘IT’ go hand-in-hand SquadSquad Chapter Guild DS DA OpsOps CJE DS DA DS CJE DS Ops DS CJE Dev DevDevDevDev Tribe
  • 21.  Bigger streaming data volumes to handle, with higher performance demands  Enterprise-readyness and security of Flink is continuously improving  With PMML we have a standard format that is very well suited for multidisciplinary teams, where data scientists work on the same user stories as engineers  We set up a streaming analytics community within ING, and work together with industry leaders (Lightbend, DataArtisans)  ING build a streaming analytics platform is as a generic, multi-purpose solution with fraud detection and actionable insights as it first use cases Why is Streaming Analytics with Flink the right choice for ING, moving forward? 21
  • 23. Follow us to stay a step ahead We’re hiring! ING.com YouTube.com/ING SlideShare.net/ING@ING_News LinkedIn.com/company/ING Flickr.com/INGGroupFacebook.com/ING