Real-Time Fraud Detection at Grab
Aravind Velamur Srinivasan
Engineering Lead, Grab Technology Corp
About ScyllaDB
+ The Real-Time Big Data Database
+ Drop-in replacement for Cassandra
+ 10X the performance & consistent, low latency
+ Open source and enterprise editions
+ New: Scylla Cloud, DBaaS
+ Founded by the creators of KVM hypervisor
+ HQs: Palo Alto, CA; Herzelia, Israel
Presenter
+ Engineering Lead at Grab Technologies
  + Streaming Platform
  + Data Platform Infrastructure
+ Worked at Uber Inc prior to Grab
+ Prior to that, worked for Isilon/EMC (now Dell)
+ Master's in Computer Science from University of Virginia (UVa)
+ Contact: aravindvelamur@gmail.com
Agenda
+ Background
+ Overview of Tech Stack at Grab
+ Streaming Ecosystem at Grab
+ Why ScyllaDB
+ ScyllaDB use at Grab (Fraud Detection)
+ Conclusion
+ Q&A
Background
Introduction to Grab
+ One of the most frequently used mobile platforms in Southeast Asia
+ Multiple services (millions of users per day and growing!)
  + Transport
  + Food
  + Payment
  + Shopping
  + Package Delivery
  + ...
Tech Stack Overview
Tech Stack at Grab
+ Backend Services
  + Golang
  + Hundreds of services
+ Inter-Service Communication
  + protobuf/gRPC (for the most part)
Streaming Ecosystem at Grab
[Diagram: Services A through H in the streaming ecosystem]
Why ScyllaDB
+ Recap of Grab Tech
  + Microservices (Golang)
  + Kafka for streaming
  + Why? Loose coupling and scale! doh! :)
Why ScyllaDB
➔ Stream Processing
➔ Transforms
➔ Aggregates
➔ Joins
➔ ...
Why ScyllaDB
+ Required a state store (metadata store) with the following characteristics:
  + Ability to handle very high throughput
  + Ability to handle bursts (doh!)
  + Ability to scale out (i.e., handle hockey-stick-like growth)
  + Very low latencies (both writes and reads) - near real-time
  + Low operational overhead
  + Cost-efficient
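The requirements above describe a key-value state store that sits beside the stream processors. A minimal Go sketch of such a contract follows. The `StateStore` interface, its method names, and the in-memory `memStore` are illustrative assumptions, not Grab's code; in production the interface would presumably be backed by a ScyllaDB client rather than a map.

```go
package main

import (
	"fmt"
	"sync"
)

// StateStore is a hypothetical contract for the stream-processing state store.
type StateStore interface {
	Put(key string, value []byte)
	Get(key string) ([]byte, bool)
}

// memStore is an in-memory stand-in for a database-backed store.
type memStore struct {
	mu   sync.RWMutex
	data map[string][]byte
}

func newMemStore() *memStore {
	return &memStore{data: make(map[string][]byte)}
}

// Put stores value under key, overwriting any previous value.
func (s *memStore) Put(key string, value []byte) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.data[key] = value
}

// Get returns the value for key and whether it was present.
func (s *memStore) Get(key string) ([]byte, bool) {
	s.mu.RLock()
	defer s.mu.RUnlock()
	v, ok := s.data[key]
	return v, ok
}

func main() {
	var store StateStore = newMemStore()
	store.Put("driver:42", []byte("online")) // key format is made up for the example
	if v, ok := store.Get("driver:42"); ok {
		fmt.Println(string(v))
	}
}
```

Keeping the stream processors behind a small interface like this is one way to swap the backing store without touching the processing logic.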
ScyllaDB Usage
Use Cases
+ Low Latency
+ High Throughput
+ Easy Scale
+ Build Stats
ScyllaDB Usage
A number of use cases revolve around:
+ Stream Aggregation &
  + E.g., fraud detection in real time
+ Stream Statistics
+ ...
Real-Time Fraud Detection
ScyllaDB Usage - Fraud Detection
+ Why? E.g., fraud detection:
  + https://www.scmp.com/week-asia/business/article/2154770/who-pays-when-indonesian-ride-sharing-fraud-goes-full-throttle
  + https://www.techrepublic.com/article/seven-ride-sharing-scams-to-watch-out-for/
+ Grab’s own blog:
  + https://www.grab.com/sg/blog/why-anti-fraud-technology-is-the-secret-sauce-to-winning-ride-hailing-platforms/
ScyllaDB Usage - Fraud Detection
+ Why? E.g., fraud detection:
...
The big influx of capital into the industry has led to fraudsters, sometimes individuals, sometimes organised in gangs, trying to game incentive and sign-up schemes. As a result, a stolen ride-hailing driver profile today is worth up to US$30 on the black market, even more than stolen credit card information.
...
ScyllaDB Usage - Why Fraud Detection?
ScyllaDB Usage - Fraud Detection
+ Use Kafka streams to do real-time fraud detection
+ Simple use case:
  + Some scammers use fake GPS to say they are online at multiple locations
ScyllaDB Usage - Fraud Detection
+ Simple use case:
[Diagram: Grab Service -> GPS -> driver_location topic -> Fraud Detection Service (Algorithm)]
ScyllaDB Usage - Fraud Detection
+ Complex use case:
  + Fraudsters evolve :)
  + Consuming one topic is not enough
  + Example: use fake GPS tools and modded phones to simulate driving behaviour and completed rides to game the system.
ScyllaDB Usage - Fraud Detection
[Diagram: Grab Service(s) -> Topic 1, Topic 2, Topic 3 -> Stream Processing Engine -> Combined Topic -> Fraud Detection Service (Algorithm)]
ScyllaDB Usage - Fraud Detection
+ In summary, using ScyllaDB:
  + Joined multiple Kafka streams together in real time!!
  + Like joining SQL tables together!
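The join described above can be sketched as follows: each event writes its fragment into a keyed store under the join key, and a combined record is emitted once every topic has contributed. This is a minimal Go sketch under assumptions. The `Event` and `Joiner` types, the topic names, and the payloads are hypothetical, and the nested map only stands in for rows in a state store such as ScyllaDB; the talk does not show the actual schema or engine.

```go
package main

import (
	"fmt"
	"strings"
)

// Event is a hypothetical message from one of the input topics.
type Event struct {
	Topic   string // source topic name
	Key     string // join key, e.g. a ride or driver ID
	Payload string
}

// Joiner accumulates one payload per topic for each key.
// The nested map stands in for rows in a keyed state store.
type Joiner struct {
	required []string
	state    map[string]map[string]string // key -> topic -> payload
}

func NewJoiner(topics ...string) *Joiner {
	return &Joiner{required: topics, state: make(map[string]map[string]string)}
}

// Add records the event and returns the combined record once every
// required topic has contributed a payload for the event's key.
func (j *Joiner) Add(e Event) (string, bool) {
	row, ok := j.state[e.Key]
	if !ok {
		row = make(map[string]string)
		j.state[e.Key] = row
	}
	row[e.Topic] = e.Payload // a later event from the same topic overwrites the earlier one
	for _, t := range j.required {
		if _, seen := row[t]; !seen {
			return "", false
		}
	}
	parts := make([]string, 0, len(j.required))
	for _, t := range j.required {
		parts = append(parts, t+"="+row[t])
	}
	delete(j.state, e.Key) // the join is complete, so drop the partial state
	return e.Key + ": " + strings.Join(parts, ", "), true
}

func main() {
	j := NewJoiner("topic1", "topic2", "topic3")
	events := []Event{
		{"topic1", "ride-7", "gps"},
		{"topic2", "ride-7", "booking"},
		{"topic3", "ride-7", "payment"},
	}
	for _, e := range events {
		if out, ok := j.Add(e); ok {
			fmt.Println(out) // would be published to the combined topic
		}
	}
}
```

Because the partial state lives in a shared store rather than in one consumer's memory, events for the same key can arrive in any order and through different consumers.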
ScyllaDB Other Usages
ScyllaDB Usage - Stream Statistics
+ Teams like to find out:
  + Counts of … (e.g., count of rides in the past month)
  + Raw stats for a particular key
+ Business users require stats on city/country data
+ Requires a fast store for raw stats
+ Ability to build time series on top of it
Conclusion
ScyllaDB at Grab So Far...
Overall
+ Great experience
+ Cost effective
+ Very responsive team
+ As good as advertised :)
+ Growing within the company…
ScyllaDB at Grab So Far...
+ Some hiccups
  + nodetool repair: write and read timeouts every time the run finishes (even with -pr)
  + Wish nodes could join the cluster faster
    + Right now it takes a really long time: ~2.5 hours to add a node (approx. 1 TB of data on each node across all keyspaces)
    + We understand why, but can we design something asynchronous?
  + Better error logging
Q&A
Stay in touch
aravindvelamur@gmail.com
Thank you
United States
1900 Embarcadero Road
Palo Alto, CA 94303
Israel
11 Galgalei Haplada
Herzelia, Israel
www.scylladb.com
@scylladb

Real-time Fraud Detection for Southeast Asia’s Leading Mobile Platform
