SlideShare uma empresa Scribd logo
1 de 32
Replacing Traditional Technologies with MongoDB
A Single Platform for all Financial Market Data
June 2014
James Blackburn & Gary Collier
Opinions expressed are those of the author and may not be shared by all personnel of Man Group plc
(‘Man’). These opinions are subject to change without notice, and are for information purposes only and do not
constitute an offer or invitation to make an investment in any financial instrument or in any product to which any
member of Man’s group of companies provides investment advisory or any other services. Any forward-looking
statements speak only as of the date on which they are made and are subject to risks and uncertainties that may
cause actual results to differ materially from those contained in the statements. Unless stated otherwise this
information is communicated by Man Investments Limited and AHL Partners LLP which are both authorised and
regulated in the UK by the Financial Conduct Authority.
© Man 2014 2
Legal Stuff
© Man 2014 3
Introductions
Gary Collier James Blackburn
© Man 2014 4
Agenda
The Story of MongoDB at AHL
1. What is a Systematic Fund Manager?
2. Low Frequency Futures and FX Data
3. Single Stock Equity Trading
4. Building a Tick Store
5. Now and the Future?
Prologue
AHL – A Systematic Fund Manager
© Man 2014 5
© Man 2014 6
Systematic Fund Management
Removing the first impedance mismatch…
© Man 2014 7
Quants and Techies Speak the Same Language
© Man 2014 8
Disparate Data Sources
DataAPI
But…
© Man 2014 9
All Data is Behind an API
Performance
User Experience
Cluster Compute
Onboarding
New Data
Impedance Mismatch
Mix of
Technologies
Is there one
Technology
which could
address?
Many
Moving Parts
Reliability
© Man 2013 10
Chapter 1
Starting Small: Low Frequency Data
© Man 2014 11
The Data
8000 rows x 200 markets
100 MB
5000000 rows x 250 markets
500 GB
Parallel Filesystem
© Man 2014 12
Previous Solution
HDF5
HDF5HDF5
HDF5 HDF5
Prop
PropProp
Prop
Prop
RDBMS
RDBMS RDBMS
© Man 2014
13
The Challenge
Fast?
Reliable?
Versionable?
Easy to extend?
© Man 2014 14
MongoDB Solution
node 85 node 96node 86 …node 87
node 1 node 2 node 12
node 73 node 84node 74
…
…
.
.
.
.
.
.
node 3
node 75
.
.
SSD
shard 1 shard 2 shard 3 shard 4
shard 1 shard 2 shard 3 shard 4
shard 1 shard 2 shard 3 shard 4
MongoDB Cluster
Linux
24 cores
96 GB RAM
Bloomberg
Adapter
JPM
Adapter
Markit
Adapter
GS
Adapter
© Man 2014 15
Performance: 200 Future Markets
Previous Solution MongoDB
100x faster to retrieve data
Consistent retrieval times
© Man 2014 16
Performance: EURUSD 1-Minute Data
Previous Solution MongoDB
2-5x faster to retrieve data
Consistent retrieval times
© Man 2014 17
Low Frequency Data - Conclusions
MongoDB faster than previous RDBMS/File Solution at…
• ALL data sizes and ALL client load levels
• …consistently
Game changing new features:
• No impedance mismatch: onboard new data in minutes
• Version Store: can ask “What did the data look like?”
Cost Savings:
• Proprietary parallel filesystem replaced by commodity
SSD’s
© Man 2013 18
Chapter 2
Getting Bigger: Single Stock Equities
© Man 2014 19
Single Stock Data - Scale
Thousands
of Stocks
Many years of
Time-series Data
Tens of different Data
Item for each Stock
Complex trading models with
many Quants sharing the Data
Trading
Signal
Derived Data
Item
Derived Data
Item
Derived Data
Item
Derived Data
Item
Derived Data
Item
Raw Data
ItemsRaw Data
ItemsRaw Data
ItemsRaw Data
ItemsRaw Data
Item
Multi-user, versioned, interactive graph-based computation
© Man 2014 20
Single Stock Data
Source Data
(Managed
RDBMS)
Raw Data
ItemsRaw Data
ItemsRaw Data
ItemsRaw Data
ItemsRaw Data
Item
Derived Data
Item
Derived Data
Item
Derived Data
Item
Derived Data
Item
Derived Data
Item
Trading
Signal
shard 1 shard 2 shard 3 shard 4
shard 1 shard 2 shard 3 shard 4
shard 1 shard 2 shard 3 shard 4
MongoDB Cluster
~1TB Data
~10,000 Stocks
~20 Years
250 Data Items Each Item is 600 MB
Single model ~150GB
Many Quants and models
Hours  Minutes
© Man 2014 21
Single Stock Trading - Conclusions
MongoDB faster than previous RDBMS/File Solution at…
• Fast interactive research
• Read/write a 600MB Data item in < 1 second
• Rebuild complex model: hours  minutes
© Man 2013 22
Chapter 3
MongoDB as a Tick Store
Almost, but not quite
© Man 2014 23
Big Data?
30TB Historic Data
Ticks/1000 per second
Sparse Data
© Man 2014 24
Third-Party Tick Stores
Typically…
• Expensive
• Proprietary query languages
• Database-centric architectures, so…
• Not ideal for cluster compute
• Unless you pay for lots of cores…
• Expensive!
So…
• A real $$$ saving opportunity!
© Man 2014 25
Architecture
Reuters
RMDSMessageBus
Bloomberg
Banks
Kafka Queue
Kafka Queue
Kafka Queue
16 shard cluster
Master + 1 replica
Linux
12 cores
256 GB RAM
96TB Disk
Infiniband network
LZ4 compressed data
MongoDB Cluster
Parallel Access
© Man 2014 26
Tick Store Performance
Infiniband
saturated
25x greater tick throughput
With just 2 machines!
© Man 2014 27
Tick Store: System Load
OtherTick Mongo (x2)N Tasks = 32
© Man 2014 28
Tick Store - Conclusions
Happy Quants!
• 25x improvement in tick throughput
• So fit models 25x as fast
Happy Accountants!
• >40x cost saving of MongoDB Support compared to
previous Tick Store licensing.
© Man 2014 29
Epilogue
Where are we now and where next?
Performance
Low Frequency Data: 100x faster
Equities Models: Hours  Seconds
Tick Data: 25x faster
© Man 2014 30
Key Facts
Cost Savings
Parallel File System  Commodity SSD’s
Proprietary Tick Store  MongoDB
Orders of magnitude $$$ savings…
Efficiencies
4 storage technologies  1
Fully utilise expensive HPC resources
Support load on team down > 50%
Game Changers
Onboard Data: Days  Minutes
Data Versioning
The technology is no longer the bottleneck
“Peopleware”
Attract and retain great Quants
Attract and retain great Techies
And attend a great conference 
© Man 2014 31
Where Next?
1. Extend the data ecosystem further
2. Broader application across the company as a whole
3. Open Source?
© Man 2014 32
Questions
Gary Collier
gcollier@ahl.com
James Blackburn
jblackburn@ahl.com

Mais conteúdo relacionado

Mais procurados

P3O Quick Guide
P3O Quick GuideP3O Quick Guide
P3O Quick Guide
Maven
 
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Spark Summit
 

Mais procurados (20)

Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
 
Presentation on assumption log in Project Management by MM Rahman
Presentation on assumption log in Project Management by MM RahmanPresentation on assumption log in Project Management by MM Rahman
Presentation on assumption log in Project Management by MM Rahman
 
P3O Quick Guide
P3O Quick GuideP3O Quick Guide
P3O Quick Guide
 
Five Key Considerations when Setting your MBCO
Five Key Considerations when Setting your MBCOFive Key Considerations when Setting your MBCO
Five Key Considerations when Setting your MBCO
 
An Outcome Measurement Model: Is your Agile Adoption Moving the Needle?
An Outcome Measurement Model: Is your Agile Adoption Moving the Needle?An Outcome Measurement Model: Is your Agile Adoption Moving the Needle?
An Outcome Measurement Model: Is your Agile Adoption Moving the Needle?
 
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
 
Practitioner class 3: Contract strategy
Practitioner class 3: Contract strategyPractitioner class 3: Contract strategy
Practitioner class 3: Contract strategy
 
The New PMP Exam: Changes and Implications (With Annotation)
The New PMP Exam: Changes and Implications (With Annotation)The New PMP Exam: Changes and Implications (With Annotation)
The New PMP Exam: Changes and Implications (With Annotation)
 
Lean IT Services by Operational Excellence Consulting
Lean IT Services by Operational Excellence ConsultingLean IT Services by Operational Excellence Consulting
Lean IT Services by Operational Excellence Consulting
 
ITIL Demand Management: why August is a bad time for a presentation
ITIL Demand Management: why August is a bad time for a presentationITIL Demand Management: why August is a bad time for a presentation
ITIL Demand Management: why August is a bad time for a presentation
 
Project Management Professional (PMP)
Project Management Professional (PMP) Project Management Professional (PMP)
Project Management Professional (PMP)
 
Value Realization with SAP Ariba Solutions Approach, Measurement, and Success
Value Realization with SAP Ariba Solutions Approach, Measurement, and SuccessValue Realization with SAP Ariba Solutions Approach, Measurement, and Success
Value Realization with SAP Ariba Solutions Approach, Measurement, and Success
 
ITIL and CMMI for service
ITIL and CMMI for serviceITIL and CMMI for service
ITIL and CMMI for service
 
Netflix Keystone - How Netflix Handles Data Streams up to 11M Events/Sec
Netflix Keystone - How Netflix Handles Data Streams up to 11M Events/SecNetflix Keystone - How Netflix Handles Data Streams up to 11M Events/Sec
Netflix Keystone - How Netflix Handles Data Streams up to 11M Events/Sec
 
Creating a colaborative program governance
Creating a colaborative program governanceCreating a colaborative program governance
Creating a colaborative program governance
 
Building a Business Continuity Capability
Building a Business Continuity CapabilityBuilding a Business Continuity Capability
Building a Business Continuity Capability
 
1_Project Management Foundation
1_Project Management Foundation1_Project Management Foundation
1_Project Management Foundation
 
PMO Kick-Off Presentation
PMO Kick-Off PresentationPMO Kick-Off Presentation
PMO Kick-Off Presentation
 
KafkaConsumer - Decoupling Consumption and Processing for Better Resource Uti...
KafkaConsumer - Decoupling Consumption and Processing for Better Resource Uti...KafkaConsumer - Decoupling Consumption and Processing for Better Resource Uti...
KafkaConsumer - Decoupling Consumption and Processing for Better Resource Uti...
 
IT Asset Management by Miradore
IT Asset Management by MiradoreIT Asset Management by Miradore
IT Asset Management by Miradore
 

Destaque

Event-Based Subscription with MongoDB
Event-Based Subscription with MongoDBEvent-Based Subscription with MongoDB
Event-Based Subscription with MongoDB
MongoDB
 
How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...
How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...
How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...
MongoDB
 
Building LinkedIn's Learning Platform with MongoDB
Building LinkedIn's Learning Platform with MongoDBBuilding LinkedIn's Learning Platform with MongoDB
Building LinkedIn's Learning Platform with MongoDB
MongoDB
 
Building a High-Performance Distributed Task Queue on MongoDB
Building a High-Performance Distributed Task Queue on MongoDBBuilding a High-Performance Distributed Task Queue on MongoDB
Building a High-Performance Distributed Task Queue on MongoDB
MongoDB
 

Destaque (20)

Event-Based Subscription with MongoDB
Event-Based Subscription with MongoDBEvent-Based Subscription with MongoDB
Event-Based Subscription with MongoDB
 
How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...
How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...
How Appboy’s Marketing Automation for Apps Platform Grew 40x on the ObjectRoc...
 
MongoDB and the Connectivity Map: Making Connections Between Genetics and Dis...
MongoDB and the Connectivity Map: Making Connections Between Genetics and Dis...MongoDB and the Connectivity Map: Making Connections Between Genetics and Dis...
MongoDB and the Connectivity Map: Making Connections Between Genetics and Dis...
 
Webinar: How MongoDB is Used to Manage Reference Data - May 2014
Webinar: How MongoDB is Used to Manage Reference Data - May 2014Webinar: How MongoDB is Used to Manage Reference Data - May 2014
Webinar: How MongoDB is Used to Manage Reference Data - May 2014
 
Content Management with MongoDB by Mark Helmstetter
 Content Management with MongoDB by Mark Helmstetter Content Management with MongoDB by Mark Helmstetter
Content Management with MongoDB by Mark Helmstetter
 
Migration from SQL to MongoDB - A Case Study at TheKnot.com
Migration from SQL to MongoDB - A Case Study at TheKnot.com Migration from SQL to MongoDB - A Case Study at TheKnot.com
Migration from SQL to MongoDB - A Case Study at TheKnot.com
 
Building an An AI Startup with MongoDB at x.ai
Building an An AI Startup with MongoDB at x.aiBuilding an An AI Startup with MongoDB at x.ai
Building an An AI Startup with MongoDB at x.ai
 
MongoDB Deployment Checklist
MongoDB Deployment ChecklistMongoDB Deployment Checklist
MongoDB Deployment Checklist
 
Building LinkedIn's Learning Platform with MongoDB
Building LinkedIn's Learning Platform with MongoDBBuilding LinkedIn's Learning Platform with MongoDB
Building LinkedIn's Learning Platform with MongoDB
 
The Future of a $6 Trillion Dollar Industry vs the Past
The Future of a $6 Trillion Dollar Industry vs the PastThe Future of a $6 Trillion Dollar Industry vs the Past
The Future of a $6 Trillion Dollar Industry vs the Past
 
Webinar: Elevate Your Enterprise Architecture with In-Memory Computing
Webinar: Elevate Your Enterprise Architecture with In-Memory ComputingWebinar: Elevate Your Enterprise Architecture with In-Memory Computing
Webinar: Elevate Your Enterprise Architecture with In-Memory Computing
 
Unlocking Operational Intelligence from the Data Lake
Unlocking Operational Intelligence from the Data LakeUnlocking Operational Intelligence from the Data Lake
Unlocking Operational Intelligence from the Data Lake
 
Celebrating Diversity in FinTech
Celebrating Diversity in FinTech Celebrating Diversity in FinTech
Celebrating Diversity in FinTech
 
Idea champions testimonials
Idea champions testimonialsIdea champions testimonials
Idea champions testimonials
 
MongoDB Europe 2016 - Distributed Ledgers, Blockchain + MongoDB
MongoDB Europe 2016 - Distributed Ledgers, Blockchain + MongoDBMongoDB Europe 2016 - Distributed Ledgers, Blockchain + MongoDB
MongoDB Europe 2016 - Distributed Ledgers, Blockchain + MongoDB
 
Creators on Creating
Creators on CreatingCreators on Creating
Creators on Creating
 
Securing MongoDB to Serve an AWS-Based, Multi-Tenant, Security-Fanatic SaaS A...
Securing MongoDB to Serve an AWS-Based, Multi-Tenant, Security-Fanatic SaaS A...Securing MongoDB to Serve an AWS-Based, Multi-Tenant, Security-Fanatic SaaS A...
Securing MongoDB to Serve an AWS-Based, Multi-Tenant, Security-Fanatic SaaS A...
 
Building a High-Performance Distributed Task Queue on MongoDB
Building a High-Performance Distributed Task Queue on MongoDBBuilding a High-Performance Distributed Task Queue on MongoDB
Building a High-Performance Distributed Task Queue on MongoDB
 
Introduction to Fintech
Introduction to FintechIntroduction to Fintech
Introduction to Fintech
 
High Frequency Trading and NoSQL database
High Frequency Trading and NoSQL databaseHigh Frequency Trading and NoSQL database
High Frequency Trading and NoSQL database
 

Semelhante a Replacing Traditional Technologies with MongoDB: A Single Platform for All Financial Data at AHL

Mongo Seattle - The Business of MongoDB
Mongo Seattle - The Business of MongoDBMongo Seattle - The Business of MongoDB
Mongo Seattle - The Business of MongoDB
Justin Smestad
 

Semelhante a Replacing Traditional Technologies with MongoDB: A Single Platform for All Financial Data at AHL (20)

Salvatore Incandela "Loyalty cashback - Scaling with MongoDB"
Salvatore Incandela "Loyalty cashback - Scaling with MongoDB"Salvatore Incandela "Loyalty cashback - Scaling with MongoDB"
Salvatore Incandela "Loyalty cashback - Scaling with MongoDB"
 
Myths & Reality - Choose a DBMS tailored to your use cases
Myths & Reality - Choose a DBMS tailored to your use casesMyths & Reality - Choose a DBMS tailored to your use cases
Myths & Reality - Choose a DBMS tailored to your use cases
 
Webinar: How Banks Manage Reference Data with MongoDB
 Webinar: How Banks Manage Reference Data with MongoDB Webinar: How Banks Manage Reference Data with MongoDB
Webinar: How Banks Manage Reference Data with MongoDB
 
Mongo db operations_v2
Mongo db operations_v2Mongo db operations_v2
Mongo db operations_v2
 
Webinar: Enterprise Trends for Database-as-a-Service
Webinar: Enterprise Trends for Database-as-a-ServiceWebinar: Enterprise Trends for Database-as-a-Service
Webinar: Enterprise Trends for Database-as-a-Service
 
MongoDB.local Atlanta: Modern Data Backup and Recovery from On-Premises to th...
MongoDB.local Atlanta: Modern Data Backup and Recovery from On-Premises to th...MongoDB.local Atlanta: Modern Data Backup and Recovery from On-Premises to th...
MongoDB.local Atlanta: Modern Data Backup and Recovery from On-Premises to th...
 
Retour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenantRetour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenant
 
Ops Jumpstart: Admin 101
Ops Jumpstart: Admin 101Ops Jumpstart: Admin 101
Ops Jumpstart: Admin 101
 
Webinar: The OpEx Business Plan for NoSQL
 Webinar: The OpEx Business Plan for NoSQL Webinar: The OpEx Business Plan for NoSQL
Webinar: The OpEx Business Plan for NoSQL
 
Bangalore Executive Seminar 2015: Elephant In The Room - Relational to MongoDB
Bangalore Executive Seminar 2015: Elephant In The Room - Relational to MongoDBBangalore Executive Seminar 2015: Elephant In The Room - Relational to MongoDB
Bangalore Executive Seminar 2015: Elephant In The Room - Relational to MongoDB
 
Case study migration from cm13 to cm14 - Oracle Primavera P6 Collaborate 14
Case study migration from cm13 to cm14 - Oracle Primavera P6 Collaborate 14Case study migration from cm13 to cm14 - Oracle Primavera P6 Collaborate 14
Case study migration from cm13 to cm14 - Oracle Primavera P6 Collaborate 14
 
Mongo Seattle - The Business of MongoDB
Mongo Seattle - The Business of MongoDBMongo Seattle - The Business of MongoDB
Mongo Seattle - The Business of MongoDB
 
Analytics, Big Data and Nonvolatile Memory Architectures – Why you Should Car...
Analytics, Big Data and Nonvolatile Memory Architectures – Why you Should Car...Analytics, Big Data and Nonvolatile Memory Architectures – Why you Should Car...
Analytics, Big Data and Nonvolatile Memory Architectures – Why you Should Car...
 
MongoDB World 2019: Modern Data Backup and Recovery from On-premises to the P...
MongoDB World 2019: Modern Data Backup and Recovery from On-premises to the P...MongoDB World 2019: Modern Data Backup and Recovery from On-premises to the P...
MongoDB World 2019: Modern Data Backup and Recovery from On-premises to the P...
 
Evolution of DBA in the Cloud Era
 Evolution of DBA in the Cloud Era Evolution of DBA in the Cloud Era
Evolution of DBA in the Cloud Era
 
MongoDB Days Silicon Valley: Jumpstart: Ops/Admin 101
MongoDB Days Silicon Valley: Jumpstart: Ops/Admin 101MongoDB Days Silicon Valley: Jumpstart: Ops/Admin 101
MongoDB Days Silicon Valley: Jumpstart: Ops/Admin 101
 
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
 
MongoDB Evenings Houston: Implementing EDW Using MongoDB by Purvesh Patel, Ch...
MongoDB Evenings Houston: Implementing EDW Using MongoDB by Purvesh Patel, Ch...MongoDB Evenings Houston: Implementing EDW Using MongoDB by Purvesh Patel, Ch...
MongoDB Evenings Houston: Implementing EDW Using MongoDB by Purvesh Patel, Ch...
 
Powering Microservices with MongoDB, Docker, Kubernetes & Kafka – MongoDB Eur...
Powering Microservices with MongoDB, Docker, Kubernetes & Kafka – MongoDB Eur...Powering Microservices with MongoDB, Docker, Kubernetes & Kafka – MongoDB Eur...
Powering Microservices with MongoDB, Docker, Kubernetes & Kafka – MongoDB Eur...
 
Webinar: An Enterprise Architect’s View of MongoDB
Webinar: An Enterprise Architect’s View of MongoDBWebinar: An Enterprise Architect’s View of MongoDB
Webinar: An Enterprise Architect’s View of MongoDB
 

Mais de MongoDB

Mais de MongoDB (20)

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

Replacing Traditional Technologies with MongoDB: A Single Platform for All Financial Data at AHL

  • 1. Replacing Traditional Technologies with MongoDB A Single Platform for all Financial Market Data June 2014 James Blackburn & Gary Collier
  • 2. Opinions expressed are those of the author and may not be shared by all personnel of Man Group plc (‘Man’). These opinions are subject to change without notice, and are for information purposes only and do not constitute an offer or invitation to make an investment in any financial instrument or in any product to which any member of Man’s group of companies provides investment advisory or any other services. Any forward-looking statements speak only as of the date on which they are made and are subject to risks and uncertainties that may cause actual results to differ materially from those contained in the statements. Unless stated otherwise this information is communicated by Man Investments Limited and AHL Partners LLP which are both authorised and regulated in the UK by the Financial Conduct Authority. © Man 2014 2 Legal Stuff
  • 3. © Man 2014 3 Introductions Gary Collier James Blackburn
  • 4. © Man 2014 4 Agenda The Story of MongoDB at AHL 1. What is a Systematic Fund Manager? 2. Low Frequency Futures and FX Data 3. Single Stock Equity Trading 4. Building a Tick Store 5. Now and the Future?
  • 5. Prologue AHL – A Systematic Fund Manager © Man 2014 5
  • 6. © Man 2014 6 Systematic Fund Management
  • 7. Removing the first impedance mismatch… © Man 2014 7 Quants and Techies Speak the Same Language
  • 8. © Man 2014 8 Disparate Data Sources DataAPI
  • 9. But… © Man 2014 9 All Data is Behind an API Performance User Experience Cluster Compute Onboarding New Data Impedance Mismatch Mix of Technologies Is there one Technology which could address? Many Moving Parts Reliability
  • 10. © Man 2013 10 Chapter 1 Starting Small: Low Frequency Data
  • 11. © Man 2014 11 The Data 8000 rows x 200 markets 100 MB 5000000 rows x 250 markets 500 GB
  • 12. Parallel Filesystem © Man 2014 12 Previous Solution HDF5 HDF5HDF5 HDF5 HDF5 Prop PropProp Prop Prop RDBMS RDBMS RDBMS
  • 13. © Man 2014 13 The Challenge Fast? Reliable? Versionable? Easy to extend?
  • 14. © Man 2014 14 MongoDB Solution node 85 node 96node 86 …node 87 node 1 node 2 node 12 node 73 node 84node 74 … … . . . . . . node 3 node 75 . . SSD shard 1 shard 2 shard 3 shard 4 shard 1 shard 2 shard 3 shard 4 shard 1 shard 2 shard 3 shard 4 MongoDB Cluster Linux 24 cores 96 GB RAM Bloomberg Adapter JPM Adapter Markit Adapter GS Adapter
  • 15. © Man 2014 15 Performance: 200 Future Markets Previous Solution MongoDB 100x faster to retrieve data Consistent retrieval times
  • 16. © Man 2014 16 Performance: EURUSD 1-Minute Data Previous Solution MongoDB 2-5x faster to retrieve data Consistent retrieval times
  • 17. © Man 2014 17 Low Frequency Data - Conclusions MongoDB faster than previous RDBMS/File Solution at… • ALL data sizes and ALL client load levels • …consistently Game changing new features: • No impedance mismatch: onboard new data in minutes • Version Store: can ask “What did the data look like?” Cost Savings: • Proprietary parallel filesystem replaced by commodity SSD’s
  • 18. © Man 2013 18 Chapter 2 Getting Bigger: Single Stock Equities
  • 19. © Man 2014 19 Single Stock Data - Scale Thousands of Stocks Many years of Time-series Data Tens of different Data Item for each Stock Complex trading models with many Quants sharing the Data
  • 20. Trading Signal Derived Data Item Derived Data Item Derived Data Item Derived Data Item Derived Data Item Raw Data ItemsRaw Data ItemsRaw Data ItemsRaw Data ItemsRaw Data Item Multi-user, versioned, interactive graph-based computation © Man 2014 20 Single Stock Data Source Data (Managed RDBMS) Raw Data ItemsRaw Data ItemsRaw Data ItemsRaw Data ItemsRaw Data Item Derived Data Item Derived Data Item Derived Data Item Derived Data Item Derived Data Item Trading Signal shard 1 shard 2 shard 3 shard 4 shard 1 shard 2 shard 3 shard 4 shard 1 shard 2 shard 3 shard 4 MongoDB Cluster ~1TB Data ~10,000 Stocks ~20 Years 250 Data Items Each Item is 600 MB Single model ~150GB Many Quants and models Hours  Minutes
  • 21. © Man 2014 21 Single Stock Trading - Conclusions MongoDB faster than previous RDBMS/File Solution at… • Fast interactive research • Read/write a 600MB Data item in < 1 second • Rebuild complex model: hours  minutes
  • 22. © Man 2013 22 Chapter 3 MongoDB as a Tick Store
  • 23. Almost, but not quite © Man 2014 23 Big Data? 30TB Historic Data Ticks/1000 per second Sparse Data
  • 24. © Man 2014 24 Third-Party Tick Stores Typically… • Expensive • Proprietary query languages • Database-centric architectures, so… • Not ideal for cluster compute • Unless you pay for lots of cores… • Expensive! So… • A real $$$ saving opportunity!
  • 25. © Man 2014 25 Architecture Reuters RMDSMessageBus Bloomberg Banks Kafka Queue Kafka Queue Kafka Queue 16 shard cluster Master + 1 replica Linux 12 cores 256 GB RAM 96TB Disk Infiniband network LZ4 compressed data MongoDB Cluster
  • 26. Parallel Access © Man 2014 26 Tick Store Performance Infiniband saturated 25x greater tick throughput With just 2 machines!
  • 27. © Man 2014 27 Tick Store: System Load OtherTick Mongo (x2)N Tasks = 32
  • 28. © Man 2014 28 Tick Store - Conclusions Happy Quants! • 25x improvement in tick throughput • So fit models 25x as fast Happy Accountants! • >40x cost saving of MongoDB Support compared to previous Tick Store licensing.
  • 29. © Man 2014 29 Epilogue Where are we now and where next?
  • 30. Performance Low Frequency Data: 100x faster Equities Models: Hours  Seconds Tick Data: 25x faster © Man 2014 30 Key Facts Cost Savings Parallel File System  Commodity SSD’s Proprietary Tick Store  MongoDB Orders of magnitude $$$ savings… Efficiencies 4 storage technologies  1 Fully utilise expensive HPC resources Support load on team down > 50% Game Changers Onboard Data: Days  Minutes Data Versioning The technology is no longer the bottleneck “Peopleware” Attract and retain great Quants Attract and retain great Techies And attend a great conference 
  • 31. © Man 2014 31 Where Next? 1. Extend the data ecosystem further 2. Broader application across the company as a whole 3. Open Source?
  • 32. © Man 2014 32 Questions Gary Collier gcollier@ahl.com James Blackburn jblackburn@ahl.com

Notas do Editor

  1. Everything running orders of magnitude faster Move from proprietary tech  commodity and MongoDB has realised significant cost savings Complexity down, and getting more out of what we have, both in hardware and people Including onboarding new data. Peopleware: often overlooked, but really the most important factor in our sorts of industries “The reason I love working here so much is because the technology is soooo good”