SlideShare uma empresa Scribd logo
1 de 32
Fighting Financial Crime
with Artificial Intelligence
Tim Seears
Area Practice Director, Data Science – Asia Pacific & Korea
2
So far we have delivered
150+ successful projects for
100+ clients worldwide.
500+ employees
(2017)
Vendor-neutral with an
open source focus.
Full spectrum consulting, data
engineering, data science &
support.
Apache
Hadoop
and cloud
ecosystem
integration.
Founded in 2010
industry thought
leader.
Fixed fee offerings
for data science and
engineering.
Who Is Think Big?
1st Big Data provider 100%
focused around open source.
3
Help customers be financially confident
and achieve their ambitions by making
daily banking and important financial
decisions easy.
19,000
employees
(2017)
1,800 corporate and
institutional customers
Covering Personal Banking, Business
Banking, Corporates & Institutions and
Wealth Management.
2.7 million
personal
customers
236,000 small and
medium-sized
business customers
Making banking
easier for over 145
years.
Who is the customer?
Leading Nordic universal bank with strong
local roots and bridges to the rest of the
world
4
PayPal WalmartJohn Deere Lowe’s Wells FargoJP Morgan
This is Happening in the Enterprise today
5
“
’’
Over the next decade, AI won’t replace
managers, but managers who use AI
will replace those who don’t.
HBR July 2017
7
Data Driven Approach to Fight Fraud
Fast evolving fraud sophistication, AI
Ambitions for Fraud Project
Become client’s advanced
analytics blueprint
Data driven approach to real
time scoring of transactions
Reduce false-positives &
Enhance fraud detection rate
ONLY ~40%
of fraud cases are detected
Low Detection Rate
99.5%
of cases are not
fraud related
Many false positives
Challenges for Fraud Detection
Tens of Millions
€ lost each month
High Fraud Loss
© 2017 Teradata
8
Fraud Types – Customer Initiated
Nigeria/
Investment Scam
CEO
Fraud
Customer
Initiated
© 2017 Teradata
Beneficiary Account
Change
Fake
Invoice
Rental Scam/
Goods Not
Received
9
ID Theft
SPEAR
Phishing
Vishing/Support Scam
Malware
Phishing/Smishing
Fraud Types – Fraudster Initiated
Fraudster
Initiated
© 2017 Teradata
10
Modeling
Challenges
© 2017 Teradata
• Class imbalance
(100,000:1 non-fraud vs. fraud)
• Assigning fraud labels from
historic data
• Fraud is ambiguous
• Not all features available in
real-time
• Most machine learning sees
transactions atomically
11
Advanced Platform for Fraud: Data-Driven Approach
How It Works:
• Understanding the domain
• Gathering and preparing the data
• Automatically generating the rules and recognizing fraudulent patterns by
training models on historical data
• Automatically maintaining the engine by retraining the model
Pros:
• Automatic/data-driven/objective
inference of the rules
• Ability to detect patterns in a high
dimensional data input
• Fast detection of new/changing
fraudulent patterns
Cons:
• Might be unintuitive and hardly
interpretable
• Data preparation and feature aggregation
is time consuming
© 2017 Teradata
12
Phase One
© 2017 Teradata
13
Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul
Project kick-off
30th September
Teamwork to Deliver Value in Each Project Iteration
2016 2017
Kick-off: Data
Scientist track
Go-Live plan set
for the project
19th December
Kick-off:
Engineering track
Models successfully in
Shadow Production
4th March
First round trip
test transaction
First production
virtual machine
Production Hardening
for HA and Security
Full productionisation of
the Fraud Engine
Cross-
functional
Collaboration
Cont. Deep
Learning
modeling
© 2017 Teradata
14
>
Banking
Anti-Fraud
Solution
© 2017 Teradata
Banking Anti-Fraud Solution
By leveraging the power of a
thoughtful and strong data
and analytics strategy, we
unleash high impact business
outcomes.
• Multiple models running in production
at the same time
• Mix of traditional and advance deep
learning methods
• AnalyticsOps: Deploying machine
learning models in production
Model Management
Framework
• Organisation and silos of data
• Real-time data integration
• Security and Procedures: following
existing bank procedures
Data Modelling, Pipeline
and Ingestion
• Hard to operationalize insights
• Availability of analytic capabilities/ skills
and data
• Interpreting the results of machine
learning models
Machine Learning and
Artificial Intelligence
15
Advanced Platform for Fraud: Data-Driven Approach
Manual
Evaluation
eBanking
Mobile
Pay
Business
Online
Global Payment
Interface
Fraud?
Create
Verification
Fraud
Payment
Weblog Data
Basic Customer
Data
Historic
Transactions
Customer Product
Data
Aggregated
Customer Data
Fraud?
Advanced Analytics
Platform for Fraud
Central Fraud Engine
Process Payment to
Beneficiary
Yes/Maybe
No
Yes !
Return Payment
to Customer
Update
Fraud Data
© 2017 Teradata
No
16
Model Management
Real Time Scoring
System
Production
Operations
Source
Systems
Data
Managers
Framework to Enhance AnalyticsOps Capabilities
Log
Transactions
Add Historic
Data
Model
Performance
Promote
Models
© 2017 Teradata
Data Scientists
Development & Deployment
Management
Data Science Lab
Data Analysis &
Model Development
Data
Management
Developers &
Release Managers
17
Key Requirement: Model Interpretation
• We have deployed LIME (Locally Interpretable Model Explanation) for
customers
– Improves trust
– Compliance with EU’s General Data Protection Regulation (GDPR)
© 2017 Teradata
17.6% Fraud Probability
Customer Amount
Debit Amount
Avg. # Trans Cred Acc
# Prev to This Dest
# Xfer Accounts
X% score due to:
+ transfer amount
+ destination country
+ last year monthly spend
What features are most important to
this decision?
18
Machine Learning Results
(Live System: 60 transactions/sec.)
Ensemble of boosted decision trees and
logistic regression.
From online validation of the model:
● 25-30% false positive reduction, with
over 35% increase in detection rate
● Opportunity to expand model with
additional features, retrain on recent data
and add additional models to
the ensemble.
● Models can be expanded to additional
channels
Rule Engine on
validation set
© 2017 Teradata
19
Deep Learning
© 2017 Teradata
20
Current models can
only catch ~70% of all
fraud cases
Deep Learning Opportunity
Traditional ML models view
transactions atomically
Often missed fraud
transactions are part of
a series
Capturing correlation
across many features
© 2017 Teradata
21
Three Deep Learning Architectures to Deliver Value
• Designed for spatial
correlated features, but
by transforming
transactions into a
2D image, we can
learn temporal
correlated features.
• Deeper ConvNet allows
learning more complex
& general features.
Goal: Learn kernels from
temporal & static features
to gain insight into the
characteristics of fraud.
• Learn temporal
information and classify
if the sequence of
transactions
contains fraud.
• Shares knowledge
across learning time.
Goal: Learn transaction
patterns within a window.
Two solutions can be
tested: flag fraud or predict
next transaction and define
an error.
• Learn how to generate
normal transactions,
potentially large volumes
of non-fraud data.
• AE provide a low level
representation of the
data.
Goal: Build a model that
learns how to generate
non-fraud data. To detect
fraud, define a
reconstruction error rate for
the fraud cases
Auto-Encoders
LSTM
ConvNet
© 2017 Teradata
22
How Can We Create an Image From Bank Transactions?
t0 X_0, X_1, ... X_n
dt
t1 X_0, X_1, ... X_n
t2 X_0, X_1, ... X_n
ts X_0, X_1, ... X_n
...
Top k Features Correlation
...
X_0
X_41 X_5
X_30
X_29X_31X_10
X_37
X_3
X_1
X_42 X_40
X_32
X_15X_35X_2
X_16
X_31
X_2
X_3 X_15
X_4
X_1X_28X_40
X_31
X_49
X_n
X_26 X_9
X_40
X_35X_28X_2
X_17
X_1
...
X_0
X_41 X_5
X_30
X_29X_31X_10
X_37
X_3
X_1
X_42 X_40
X_32
X_15X_35X_2
X_16
X_31
X_2
X_3 X_15
X_4
X_1X_28X_40
X_31
X_49
X_n
X_26 X_9
X_40
X_35X_28X_2
X_17
X_1
...
...
...
...
...
X_0
X_41 X_5
X_30
X_29X_31X_10
X_37
X_3
X_1
X_42 X_40
X_32
X_15X_35X_2
X_16
X_31
X_2
X_3 X_15
X_4
X_1X_28X_40
X_31
X_49
X_n
X_26 X_9
X_40
X_35X_28X_2
X_17
X_1
N
dt
t0
t1
ts
Input Output
Raw Features
Add correlated features in a
clock-wise manner
© 2017 Teradata
Image size is:
[10 x 3, 50 x 3, 1]
23
Convolutional Layers for Trans2D
Kernels
of 3x3
...X_0 X_1 X_2 X_n
...X_0 X_1 X_2 X_n...
...
...
...
...X_0 X_1 X_2 X_n
Features
Time
Strides of 3
Strides
of 1
First Convolutional Layer Architecture
© 2017 Teradata
24
2D Transaction Image Example
© 2017 Teradata
Non-fraud Transaction Image
Non-fraud
Fraud Transaction Image
X-axis: features, Y-axis: time
Fraud
Non-fraud
25
Network Architecture for CNNs
Fraud
Normal
50
30
25
15
13
8CNN
Fraud
Normal50
30
25
15
25
15
25
15
25
15
© 2017 Teradata
26
Inside the ResNet model
64 Filters
Activations After
the CNN
Residual Blocks
FraudNon-fraud
© 2017 Teradata
27
Deep Learning
First Results
on the fraud verification dataset
Comparison of the three deep learning
models andthe traditional machine
learning ensemble model.
© 2017 Teradata
• Ensemble model (AUC 0.89)
• ConvNets (AUC 0.95)
• LSTM (AUC 0.90)
• ResNet (AUC 0.94)
28
Lessons Learned: Take-Aways From client project
Deep
learning
adoption
from pictures
to financial
transactions
Enhancement of
data quality &
cluster
capabilities with
data ingestion
Building
Analytics
Ops
capabilities
to support
business
units
Leveraging
experience
from Fraud
advanced
analytics to
deliver extra
use cases
© 2017 Teradata
29
Success: From PowerPoint to production in 8 sprints
Team effort: Thorough collaboration across IMD, GFU and Think Big
Synergy: Successfully spearheaded innovation in all involved systems
Inspiration: Bank advanced analytics blueprint sets a generic scene for combatting new
challenges in advanced analytics
Agile influence: Using an agile approach we were able to quickly deliver within the
challenging timeframe.
© 2017 Teradata
Big Data Team – Lessons Learned
3030 © 2017 Teradata
31
Appendix
© 2017 Teradata
32
Advanced Analytics Fraud Platform
• Develop a scalable and expandable platform which follows the Bank’s blueprint of
digitalization
• 100 % data-driven approach to find patterns in the data and complement the
existing fraud engine
• Use Hadoop to handle the large data volumes for training models on transaction
data
• Implement a real-time solution that can score live transactions such as
e-banking, credit card and mobile payments.
• Reduce amount of false-positives by at least 20-40 %
© 2017 Teradata
33
Ambitions for the Advance Analytics project
• The goal is to enhance fraud detection and
reduce false positives across products.
• Fraud is only the first of many possible
advanced analytics use cases.
• The project leads to the creation of the Bank’s
advanced analytics blueprint.
• Bank’s ambition is to become one of the bank
leading in advanced analytics capabilities.
Challenges in Fraud Detection at client
• Low detection rate: Existing human-written
rule engine catches ~40% of fraud cases.
• Many false positives: 99.5% of all cases
investigated are not fraud related.
• High fraud loss: Tens of millions of € total
fraud per month.
• Mobile payments are quickly growing in
number.
• Fraud evolving rapidly, with increased
sophistication: Bank must modernize its anti-
fraud arsenal.
© 2017 Teradata

Mais conteúdo relacionado

Mais procurados

Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AICMassociates
 
Machine Learning in Banking Sector
Machine Learning in Banking SectorMachine Learning in Banking Sector
Machine Learning in Banking SectorKnoldus Inc.
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfDung Hoang
 
Is Machine learning useful for Fraud Prevention?
Is Machine learning useful for Fraud Prevention?Is Machine learning useful for Fraud Prevention?
Is Machine learning useful for Fraud Prevention?Andrea Dal Pozzolo
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdfQualcomm Research
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banksPankaj Baid
 
Generative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First SessionGenerative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First SessionGene Leybzon
 
Artificial Intelligence in cybersecurity
Artificial Intelligence in cybersecurityArtificial Intelligence in cybersecurity
Artificial Intelligence in cybersecuritySmartlearningUK
 
Introduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its applicationIntroduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its applicationKnoldus Inc.
 
AI and Cybersecurity - Food for Thought
AI and Cybersecurity - Food for ThoughtAI and Cybersecurity - Food for Thought
AI and Cybersecurity - Food for ThoughtNUS-ISS
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...ssuser4edc93
 
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...Edge AI and Vision Alliance
 
Generative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second SessionGenerative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second SessionGene Leybzon
 
An Introduction to Generative AI
An Introduction  to Generative AIAn Introduction  to Generative AI
An Introduction to Generative AICori Faklaris
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scaleMaxim Salnikov
 
BBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial ServicesBBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial ServicesOzgur Karakaya
 

Mais procurados (20)

Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
 
Machine Learning in Banking Sector
Machine Learning in Banking SectorMachine Learning in Banking Sector
Machine Learning in Banking Sector
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
 
Starling Bank - A Cloud Bank
Starling Bank - A Cloud BankStarling Bank - A Cloud Bank
Starling Bank - A Cloud Bank
 
Is Machine learning useful for Fraud Prevention?
Is Machine learning useful for Fraud Prevention?Is Machine learning useful for Fraud Prevention?
Is Machine learning useful for Fraud Prevention?
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
 
Generative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First SessionGenerative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First Session
 
Artificial Intelligence in cybersecurity
Artificial Intelligence in cybersecurityArtificial Intelligence in cybersecurity
Artificial Intelligence in cybersecurity
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Introduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its applicationIntroduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its application
 
AI and Cybersecurity - Food for Thought
AI and Cybersecurity - Food for ThoughtAI and Cybersecurity - Food for Thought
AI and Cybersecurity - Food for Thought
 
Artificial intelligence in financial sector
Artificial intelligence in financial sectorArtificial intelligence in financial sector
Artificial intelligence in financial sector
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
 
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
 
Generative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second SessionGenerative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second Session
 
An Introduction to Generative AI
An Introduction  to Generative AIAn Introduction  to Generative AI
An Introduction to Generative AI
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
 
BBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial ServicesBBS-248 Artificial Intelligence (AI) for Financial Services
BBS-248 Artificial Intelligence (AI) for Financial Services
 
Generative AI.pptx
Generative AI.pptxGenerative AI.pptx
Generative AI.pptx
 

Semelhante a Fighting Financial Crime with Artificial Intelligence

Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceRon Bodkin
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...Santiago Cabrera-Naranjo
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"MDS ap
 
Building for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scaleBuilding for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scaleMichael Nealey
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersNick Smith
 
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...Santiago Cabrera-Naranjo
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. ServicesTigerGraph
 
Emerging opportunities in the age of data
Emerging opportunities in the age of dataEmerging opportunities in the age of data
Emerging opportunities in the age of dataEjaz Siddiqui
 
20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligence20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligenceSantiago Cabrera-Naranjo
 
Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database DataStax
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
The Journey to Big Data Analytics
The Journey to Big Data AnalyticsThe Journey to Big Data Analytics
The Journey to Big Data AnalyticsDr.Stefan Radtke
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data SnapLogic
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance accenture
 
Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016David Patton
 

Semelhante a Fighting Financial Crime with Artificial Intelligence (20)

Fighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligenceFighting financial fraud at Danske Bank with artificial intelligence
Fighting financial fraud at Danske Bank with artificial intelligence
 
Opportunities derived by AI
Opportunities derived by AIOpportunities derived by AI
Opportunities derived by AI
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
Building for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scaleBuilding for the future of AI and Machine Learning at scale
Building for the future of AI and Machine Learning at scale
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customers
 
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
TDWI 17 Munich - Are enterprises ready for the 4th industrial revolution? - S...
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. Services
 
Emerging opportunities in the age of data
Emerging opportunities in the age of dataEmerging opportunities in the age of data
Emerging opportunities in the age of data
 
20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligence20181129 keynote augmented intelligence and artificial intelligence
20181129 keynote augmented intelligence and artificial intelligence
 
Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database Webinar - Fighting Bank Fraud with Real-time Graph Database
Webinar - Fighting Bank Fraud with Real-time Graph Database
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
The Journey to Big Data Analytics
The Journey to Big Data AnalyticsThe Journey to Big Data Analytics
The Journey to Big Data Analytics
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance The Big Data Revolution: The Next Generation of Finance
The Big Data Revolution: The Next Generation of Finance
 
Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016Accountex UK keynote_2016_May 9 2016
Accountex UK keynote_2016_May 9 2016
 

Mais de DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Mais de DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Último

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Último (20)

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Fighting Financial Crime with Artificial Intelligence

  • 1. Fighting Financial Crime with Artificial Intelligence Tim Seears Area Practice Director, Data Science – Asia Pacific & Korea
  • 2. 2 So far we have delivered 150+ successful projects for 100+ clients worldwide. 500+ employees (2017) Vendor-neutral with an open source focus. Full spectrum consulting, data engineering, data science & support. Apache Hadoop and cloud ecosystem integration. Founded in 2010 industry thought leader. Fixed fee offerings for data science and engineering. Who Is Think Big? 1st Big Data provider 100% focused around open source.
  • 3. 3 Help customers be financially confident and achieve their ambitions by making daily banking and important financial decisions easy. 19,000 employees (2017) 1,800 corporate and institutional customers Covering Personal Banking, Business Banking, Corporates & Institutions and Wealth Management. 2.7 million personal customers 236,000 small and medium-sized business customers Making banking easier for over 145 years. Who is the customer? Leading Nordic universal bank with strong local roots and bridges to the rest of the world
  • 4. 4 PayPal WalmartJohn Deere Lowe’s Wells FargoJP Morgan This is Happening in the Enterprise today
  • 5. 5 “ ’’ Over the next decade, AI won’t replace managers, but managers who use AI will replace those who don’t. HBR July 2017
  • 6. 7 Data Driven Approach to Fight Fraud Fast evolving fraud sophistication, AI Ambitions for Fraud Project Become client’s advanced analytics blueprint Data driven approach to real time scoring of transactions Reduce false-positives & Enhance fraud detection rate ONLY ~40% of fraud cases are detected Low Detection Rate 99.5% of cases are not fraud related Many false positives Challenges for Fraud Detection Tens of Millions € lost each month High Fraud Loss © 2017 Teradata
  • 7. 8 Fraud Types – Customer Initiated Nigeria/ Investment Scam CEO Fraud Customer Initiated © 2017 Teradata Beneficiary Account Change Fake Invoice Rental Scam/ Goods Not Received
  • 8. 9 ID Theft SPEAR Phishing Vishing/Support Scam Malware Phishing/Smishing Fraud Types – Fraudster Initiated Fraudster Initiated © 2017 Teradata
  • 9. 10 Modeling Challenges © 2017 Teradata • Class imbalance (100,000:1 non-fraud vs. fraud) • Assigning fraud labels from historic data • Fraud is ambiguous • Not all features available in real-time • Most machine learning sees transactions atomically
  • 10. 11 Advanced Platform for Fraud: Data-Driven Approach How It Works: • Understanding the domain • Gathering and preparing the data • Automatically generating the rules and recognizing fraudulent patterns by training models on historical data • Automatically maintaining the engine by retraining the model Pros: • Automatic/data-driven/objective inference of the rules • Ability to detect patterns in a high dimensional data input • Fast detection of new/changing fraudulent patterns Cons: • Might be unintuitive and hardly interpretable • Data preparation and feature aggregation is time consuming © 2017 Teradata
  • 12. 13 Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Project kick-off 30th September Teamwork to Deliver Value in Each Project Iteration 2016 2017 Kick-off: Data Scientist track Go-Live plan set for the project 19th December Kick-off: Engineering track Models successfully in Shadow Production 4th March First round trip test transaction First production virtual machine Production Hardening for HA and Security Full productionisation of the Fraud Engine Cross- functional Collaboration Cont. Deep Learning modeling © 2017 Teradata
  • 13. 14 > Banking Anti-Fraud Solution © 2017 Teradata Banking Anti-Fraud Solution By leveraging the power of a thoughtful and strong data and analytics strategy, we unleash high impact business outcomes. • Multiple models running in production at the same time • Mix of traditional and advance deep learning methods • AnalyticsOps: Deploying machine learning models in production Model Management Framework • Organisation and silos of data • Real-time data integration • Security and Procedures: following existing bank procedures Data Modelling, Pipeline and Ingestion • Hard to operationalize insights • Availability of analytic capabilities/ skills and data • Interpreting the results of machine learning models Machine Learning and Artificial Intelligence
  • 14. 15 Advanced Platform for Fraud: Data-Driven Approach Manual Evaluation eBanking Mobile Pay Business Online Global Payment Interface Fraud? Create Verification Fraud Payment Weblog Data Basic Customer Data Historic Transactions Customer Product Data Aggregated Customer Data Fraud? Advanced Analytics Platform for Fraud Central Fraud Engine Process Payment to Beneficiary Yes/Maybe No Yes ! Return Payment to Customer Update Fraud Data © 2017 Teradata No
  • 15. 16 Model Management Real Time Scoring System Production Operations Source Systems Data Managers Framework to Enhance AnalyticsOps Capabilities Log Transactions Add Historic Data Model Performance Promote Models © 2017 Teradata Data Scientists Development & Deployment Management Data Science Lab Data Analysis & Model Development Data Management Developers & Release Managers
  • 16. 17 Key Requirement: Model Interpretation • We have deployed LIME (Locally Interpretable Model Explanation) for customers – Improves trust – Compliance with EU’s General Data Protection Regulation (GDPR) © 2017 Teradata 17.6% Fraud Probability Customer Amount Debit Amount Avg. # Trans Cred Acc # Prev to This Dest # Xfer Accounts X% score due to: + transfer amount + destination country + last year monthly spend What features are most important to this decision?
  • 17. 18 Machine Learning Results (Live System: 60 transactions/sec.) Ensemble of boosted decision trees and logistic regression. From online validation of the model: ● 25-30% false positive reduction, with over 35% increase in detection rate ● Opportunity to expand model with additional features, retrain on recent data and add additional models to the ensemble. ● Models can be expanded to additional channels Rule Engine on validation set © 2017 Teradata
  • 19. 20 Current models can only catch ~70% of all fraud cases Deep Learning Opportunity Traditional ML models view transactions atomically Often missed fraud transactions are part of a series Capturing correlation across many features © 2017 Teradata
  • 20. 21 Three Deep Learning Architectures to Deliver Value • Designed for spatial correlated features, but by transforming transactions into a 2D image, we can learn temporal correlated features. • Deeper ConvNet allows learning more complex & general features. Goal: Learn kernels from temporal & static features to gain insight into the characteristics of fraud. • Learn temporal information and classify if the sequence of transactions contains fraud. • Shares knowledge across learning time. Goal: Learn transaction patterns within a window. Two solutions can be tested: flag fraud or predict next transaction and define an error. • Learn how to generate normal transactions, potentially large volumes of non-fraud data. • AE provide a low level representation of the data. Goal: Build a model that learns how to generate non-fraud data. To detect fraud, define a reconstruction error rate for the fraud cases Auto-Encoders LSTM ConvNet © 2017 Teradata
  • 21. 22 How Can We Create an Image From Bank Transactions? t0 X_0, X_1, ... X_n dt t1 X_0, X_1, ... X_n t2 X_0, X_1, ... X_n ts X_0, X_1, ... X_n ... Top k Features Correlation ... X_0 X_41 X_5 X_30 X_29X_31X_10 X_37 X_3 X_1 X_42 X_40 X_32 X_15X_35X_2 X_16 X_31 X_2 X_3 X_15 X_4 X_1X_28X_40 X_31 X_49 X_n X_26 X_9 X_40 X_35X_28X_2 X_17 X_1 ... X_0 X_41 X_5 X_30 X_29X_31X_10 X_37 X_3 X_1 X_42 X_40 X_32 X_15X_35X_2 X_16 X_31 X_2 X_3 X_15 X_4 X_1X_28X_40 X_31 X_49 X_n X_26 X_9 X_40 X_35X_28X_2 X_17 X_1 ... ... ... ... ... X_0 X_41 X_5 X_30 X_29X_31X_10 X_37 X_3 X_1 X_42 X_40 X_32 X_15X_35X_2 X_16 X_31 X_2 X_3 X_15 X_4 X_1X_28X_40 X_31 X_49 X_n X_26 X_9 X_40 X_35X_28X_2 X_17 X_1 N dt t0 t1 ts Input Output Raw Features Add correlated features in a clock-wise manner © 2017 Teradata Image size is: [10 x 3, 50 x 3, 1]
  • 22. 23 Convolutional Layers for Trans2D Kernels of 3x3 ...X_0 X_1 X_2 X_n ...X_0 X_1 X_2 X_n... ... ... ... ...X_0 X_1 X_2 X_n Features Time Strides of 3 Strides of 1 First Convolutional Layer Architecture © 2017 Teradata
  • 23. 24 2D Transaction Image Example © 2017 Teradata Non-fraud Transaction Image Non-fraud Fraud Transaction Image X-axis: features, Y-axis: time Fraud Non-fraud
  • 24. 25 Network Architecture for CNNs Fraud Normal 50 30 25 15 13 8CNN Fraud Normal50 30 25 15 25 15 25 15 25 15 © 2017 Teradata
  • 25. 26 Inside the ResNet model 64 Filters Activations After the CNN Residual Blocks FraudNon-fraud © 2017 Teradata
  • 26. 27 Deep Learning First Results on the fraud verification dataset Comparison of the three deep learning models andthe traditional machine learning ensemble model. © 2017 Teradata • Ensemble model (AUC 0.89) • ConvNets (AUC 0.95) • LSTM (AUC 0.90) • ResNet (AUC 0.94)
  • 27. 28 Lessons Learned: Take-Aways From client project Deep learning adoption from pictures to financial transactions Enhancement of data quality & cluster capabilities with data ingestion Building Analytics Ops capabilities to support business units Leveraging experience from Fraud advanced analytics to deliver extra use cases © 2017 Teradata
  • 28. 29 Success: From PowerPoint to production in 8 sprints Team effort: Thorough collaboration across IMD, GFU and Think Big Synergy: Successfully spearheaded innovation in all involved systems Inspiration: Bank advanced analytics blueprint sets a generic scene for combatting new challenges in advanced analytics Agile influence: Using an agile approach we were able to quickly deliver within the challenging timeframe. © 2017 Teradata Big Data Team – Lessons Learned
  • 29. 3030 © 2017 Teradata
  • 31. 32 Advanced Analytics Fraud Platform • Develop a scalable and expandable platform which follows the Bank’s blueprint of digitalization • 100 % data-driven approach to find patterns in the data and complement the existing fraud engine • Use Hadoop to handle the large data volumes for training models on transaction data • Implement a real-time solution that can score live transactions such as e-banking, credit card and mobile payments. • Reduce amount of false-positives by at least 20-40 % © 2017 Teradata
  • 32. 33 Ambitions for the Advance Analytics project • The goal is to enhance fraud detection and reduce false positives across products. • Fraud is only the first of many possible advanced analytics use cases. • The project leads to the creation of the Bank’s advanced analytics blueprint. • Bank’s ambition is to become one of the bank leading in advanced analytics capabilities. Challenges in Fraud Detection at client • Low detection rate: Existing human-written rule engine catches ~40% of fraud cases. • Many false positives: 99.5% of all cases investigated are not fraud related. • High fraud loss: Tens of millions of € total fraud per month. • Mobile payments are quickly growing in number. • Fraud evolving rapidly, with increased sophistication: Bank must modernize its anti- fraud arsenal. © 2017 Teradata