SCALED RISK
Next Generation of Financial Platform
NoSQL in Financial Industry
Distributed Matters - Barcelona – 21 November 2015
Pierre Bittner - CTO
What is Scaled Risk?
WHAT? Integrated Big Data & Analytics Platform
HOW? Hadoop/HBase + Low latency + External Consistency + Flexible Data Schema + In-Memory OLAP
FOR? Real-Time Risk Management
WHERE? SaaS or On-Premise
Why NoSQL Matters in Financial Industry?
• Volume / Velocity
§ New York Stock Exchange generates about 4−5 terabytes of data per day.
§ Algo trading and high-frequency trading accounted for 50% of all US equity trading volume in 2012.
Trades execute in milliseconds or even microseconds.
• Coherency / Availability / Security
§ Regulatory reporting: intraday monitoring
§ MTTR, Data Spikes on Market Event, Disaster Recovery, ACL
• Mixed workloads: Streaming and Historical Analysis – Point In Time Comparison
§ BackTesting, Replay (UTC Timestamping of all events, FIFO)
§ Lambda-architecture, Kappa-architecture
• Need for multi-tenancy and data & process governance (Data Lake / Data-Centric Architecture)
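The backtesting and replay requirement above (UTC timestamping of all events, FIFO) can be illustrated with a minimal Python sketch. The event tuples, the `utc` helper and the `until` cut-off are hypothetical names chosen for the example; this is not Scaled Risk code.

```python
from datetime import datetime, timezone


def utc(second):
    """Hypothetical helper: build a UTC timestamp for the example events."""
    return datetime(2015, 11, 21, 9, 0, second, tzinfo=timezone.utc)


def replay(events, until=None):
    """Return events ordered by UTC timestamp, optionally up to a cut-off.

    sorted() is stable, so events sharing a timestamp keep their arrival
    order, which gives the FIFO behavior among simultaneous events.
    """
    ordered = sorted(events, key=lambda event: event[0])
    return [event for event in ordered if until is None or event[0] <= until]


# Events in arrival order; timestamps deliberately arrive out of order.
events = [
    (utc(2), "order B new"),
    (utc(1), "order A new"),
    (utc(2), "order B cancel"),
    (utc(1), "trade on A"),
]

full_replay = [label for _, label in replay(events)]
point_in_time = [label for _, label in replay(events, until=utc(1))]
```

Replaying with a cut-off is one simple way to obtain the point-in-time comparison mentioned above: the same history is read twice with two different `until` values.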
Why NoSQL Matters in Financial Industry?
Real-Time Enterprise-Wide Risk Management
Improved and trustworthy view of global risk; supports implementation of upcoming regulations
Real-Time Fraud Detection
Pre-check, real-time and historical data verification for trades, payments, orders, …
Real-Time Market Analytics
On-demand live and historical data analysis on global markets
Customer Story:
Market Exchange
Market Surveillance
OTC Market
• Foreign Exchange
• Debt Market (Bond)
• Commodities
• Bloomberg, FXAll,…
• …
Regulated Market
• Securities
• Options
• NYSE, Eurex
• LSE
• …
Structured Data Feeds
Booking Systems
• Trader Positions
• Intraday Events
• Valorization
• Volatility, Correlation
Referential Data
• Counterparts
• Analytical Structure
• Products Definition
• Mappings
Unstructured Data Feeds
News & Mkt Analysis
• Reuters, BBG
• Research
Social Media
• Twitter
• LinkedIn, …
Trading
• Global Positions
• Intraday funding & forecasting
• Collateral Optimization
RT Aggregated Positions
Sales
• Credit Line
• Profitability Indicator
• Customer Interests
Global
• Market Flows
• Analyst/Market Correlation
On-Demand Analysis
Market Risk Analysis
• Stress per Counterparty
Sales
• Customer alerts on Market Trends
• Recommendation & Lead Generation
Live Report & Alerts
• On Market Events
• Custom scenario
• Market Surveillance
Today’s Trading Challenge: On-Demand Live Analysis & Alerts
Risk
• CVA, Counterparty Exposure
• Limit, Stress Test
Intraday Limit Risk
• Automatic Monitoring
Customer Story: On Demand Market Surveillance for Exchange
Context
Extreme performance and resilience:
Peak activity > 1M orders per second
Low latency
Objective
On-demand market analytics out of real-time & historical data
Resilient primary storage
Problems
High volumes, difficult access to history
SLAs for data & service availability
Solution
Scaled Risk at the outflow of the matching engine
Benefits
Streamlined process, consistent view
High availability and scalability
Reduced TCO
Result
A single system for storage and computation of spot & historical data for market surveillance
On Demand Market Surveillance: Pilot Perimeter
High-Level Architecture Candidate
On Demand Market Surveillance: Pilot Perimeter
Focus on evaluating the HBase framework
• HBase performance on read/write
• HBase behavior during a node failure
• HBase process isolation
• Global consistency
Key parts of the architecture
• Message bus (Kafka)
• Storage system (HDFS)
• Operational database (HBase)
• Real-time analytics tool (Scaled Risk)
• History & data analytics tool (SR & Spark)
Benefits of the architecture (streamlined process, cost, …) are not covered in this step.
Confirm the Hadoop/HBase technical stack
Evaluate Scaled Risk performance
Explore Scaled Risk features
Pilot Perimeter
Suitability of HBase and Scaled Risk in terms of properties and performance.
Pilot duration: 2 months
HBase: Random Access to your Planet-Wide Data
HBase in a few words
HBase is an open-source, distributed, versioned, non-relational, scalable, wide-column data store.
• It is the Hadoop database, built mainly on HDFS.
• Based on Google's Bigtable storage system.
Key-value data organization per row. Table is a namespace.
Each cell is timestamped
ACID per row; rowkey for fast access and data distribution
Four primary operations: Get, Put, Delete and Scan
Server-side operations with coprocessors (Observer, Endpoint)
Linear scalability, automatic sharding and failover support
Strictly consistent
Hadoop ecosystem integration (YARN), MapReduce, Hive, Spark
Phoenix for a SQL flavor
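To make the data model above concrete, here is a toy in-memory sketch in Python of a table with timestamped cells and the four operations (Get, Put, Delete, Scan). `ToyTable` and its method signatures are hypothetical and are not the HBase client API; in particular, a real HBase delete writes tombstone markers rather than removing data immediately.

```python
class ToyTable:
    """Toy model: rowkey -> column -> {timestamp: value}."""

    def __init__(self):
        self.rows = {}

    def put(self, rowkey, column, value, timestamp):
        # Every write adds a new timestamped version of the cell.
        self.rows.setdefault(rowkey, {}).setdefault(column, {})[timestamp] = value

    def get(self, rowkey, column, max_timestamp=None):
        # Return the newest version, optionally no newer than max_timestamp.
        versions = self.rows.get(rowkey, {}).get(column, {})
        eligible = [ts for ts in versions
                    if max_timestamp is None or ts <= max_timestamp]
        return versions[max(eligible)] if eligible else None

    def delete(self, rowkey):
        # Simplification: drop the whole row at once.
        self.rows.pop(rowkey, None)

    def scan(self, start_row, stop_row):
        # Rows are visited in rowkey order; stop_row is exclusive.
        return [key for key in sorted(self.rows) if start_row <= key < stop_row]


table = ToyTable()
table.put("order#A", "d:price", 49350, timestamp=1)
table.put("order#A", "d:price", 49345, timestamp=2)
table.put("order#B", "d:price", 49350, timestamp=1)
table.put("trade#1630", "d:qty", 1, timestamp=1)

latest_price = table.get("order#A", "d:price")
earlier_price = table.get("order#A", "d:price", max_timestamp=1)
order_rows = table.scan("order#", "order#~")
```

Because rows are kept in rowkey order, a common prefix such as `order#` turns a range scan into a cheap way to read related rows together, which is why rowkey design matters for both access speed and distribution.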
NoSQL Wide Column Store Real-Time Distributed OLAP
• Dynamic Data Schema
• Schema on read and write
• Fast, Random R/W access
• Fast In-Memory Data Processing
• Full Consistency; Linear Scalability
• Open API (Valuation)
On-Demand Market Surveillance: Functional Architecture
Low Latency Internal Bus
Read-Isolations
As Of Date
HBase As Storage
Injector(Thrift)
• Advanced Index and search for
Data Classification and Correlation
• Semantic reconciliation
Real Time Indexing
Real-Time Alerting
[Chart: values from 0 to 6 per contract (Contract 1, Contract 2, Contract 3, Contract 4)]
Alert on Analytics
Volume Matching Cancel Rate
Alert on Data
REST/API/WebSocket
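As an illustration of an alert on analytics such as the cancel rate named on this slide, the following Python sketch counts messages per contract and flags contracts above a threshold. The definition used here (cancels divided by new orders), the threshold and the sample messages are all assumptions made for the example; the slides do not define them.

```python
from collections import Counter


def cancel_rate_alerts(messages, threshold):
    """Return {contract: cancel_rate} for contracts whose rate exceeds threshold.

    messages is an iterable of (contract, kind) pairs where kind is
    "New" or "Cancel". Cancel rate is assumed to be cancels / new orders.
    """
    new_orders = Counter()
    cancels = Counter()
    for contract, kind in messages:
        if kind == "New":
            new_orders[contract] += 1
        elif kind == "Cancel":
            cancels[contract] += 1

    alerts = {}
    for contract, new_count in new_orders.items():
        rate = cancels[contract] / new_count
        if rate > threshold:
            alerts[contract] = rate
    return alerts


# Hypothetical message flow for two contracts.
sample = (
    [("Contract 1", "New")] * 4 + [("Contract 1", "Cancel")] * 1
    + [("Contract 2", "New")] * 2 + [("Contract 2", "Cancel")] * 2
)
alerts = cancel_rate_alerts(sample, threshold=0.5)
```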
On-Demand Market surveillance: Technical Architecture
Head Node: Name Node
Head Node: Secondary Name Node
Head Node: HBase Master
6 x Worker Node: Region Server, Data Node
HP Loader
3 x Hadoop head nodes:
HP ProLiant DL360 Gen9 Server
8 x 900GB 10k rpm SAS, 128 GB RAM, 2 x (10 cores) Intel Xeon CPU E5-2660 v3 @ 2.60GHz,
4 x 1GbE ports and 2 x 10GbE ports
6 x Hadoop worker nodes:
HP ProLiant DL380 Gen9 Server
2 x 120GB SSD OS,
15 x 3TB 7.2k rpm SATA, 128 GB RAM, 2 x (10 cores) Intel Xeon CPU E5-2660 v3 @ 2.60GHz,
4 x 1GbE ports and 2 x 10GbE ports
1 x HP Smart HBA H240ar, 1 x HP Smart HBA H240
1 x HP Loader:
HP ProLiant DL380 Gen8
14 x 1TB 7.2k rpm SAS, 128 GB RAM, 2 x (10 cores) Intel Xeon CPU E5-2670 v2 @ 2.50GHz
Cluster size and components
Hadoop cluster details :
• Hadoop HDFS usable size : 60TB (Block
replication 3, no compression)
• Hadoop HDFS data disk RAW size : 241TB
• Hadoop cluster memory : 6 x 128GB = 768GB
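The sizing figures above can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers from the slides; the conversion from decimal terabytes to binary units and the reading of the remaining gap as reserved headroom are assumptions, since the slides do not explain how 241TB and 60TB were derived.

```python
WORKERS = 6
DISKS_PER_WORKER = 15
DISK_TB = 3                 # nominal (decimal) terabytes per data disk
RAM_GB_PER_WORKER = 128
REPLICATION = 3
RAW_TB_REPORTED = 241       # raw HDFS data disk size stated on the slide
USABLE_TB_REPORTED = 60     # usable HDFS size stated on the slide

# Nominal raw capacity of the data disks: 6 x 15 x 3TB.
nominal_raw_tb = WORKERS * DISKS_PER_WORKER * DISK_TB

# The same capacity in binary units (TiB), which sits close to the reported 241.
nominal_raw_tib = nominal_raw_tb * 10**12 / 2**40

# Cluster memory, matching the slide's 6 x 128GB = 768GB.
cluster_ram_gb = WORKERS * RAM_GB_PER_WORKER

# With block replication 3, each stored byte uses three bytes of raw disk.
after_replication_tb = RAW_TB_REPORTED / REPLICATION

# Share of the post-replication capacity that is reported as usable.
usable_share = USABLE_TB_REPORTED / after_replication_tb
```

Under these assumptions the reported usable size is roughly three quarters of the raw size divided by the replication factor; the rest may be headroom for temporary data and growth, but that is a guess.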
Hadoop components and associated services
• Hadoop distribution: Hortonworks HDP 2.2 stack
• Cluster management: HP Insight CMU v7.3
• HDFS v2.6.0
• HBase v0.98.4
• ZooKeeper v3.4.6
Other details :
• OS : RHEL - RedHat Enterprise Linux v6.5 – 64bit
• Linux filesystem for Hadoop data : ext4
• JVM used for Hadoop : Oracle Java 1.7.0_67
On Demand Market Surveillance : Functional Consistency
• Market Exchange Data types
§ A unique Data flow containing all types of message
§ Order messages
§ Trade messages
§ Test injector generates 1.5M messages in 7 minutes (client limitation)
• Scaled Risk Data exhaustiveness control
§ Dynamic data model with two tables
§ Trade and Order messages are split
§ Test method: Messages count
Message Type   Message sub type     Count
Order          New                  792,546
Order          Replace              645,889
Order          Status               40,821
Order          Others               80
Order          (unique order ids)   792,886
Cancel         n/a                  680,626
Trade          n/a                  137,573
• Order and Trade Life-cycle Control
§ Message fields consistency control
§ Test method: Data sampling
Table         Row Count
Order Table   792,886
Trade Table   137,573
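The message-count test can be sketched as follows in Python: derive the expected row count of each table from the raw flow and compare it with what the tables hold. The rule used here (one Order-table row per unique order id, one Trade-table row per trade message) is an assumption that matches the counts shown (792,886 and 137,573); the message tuples and function names are hypothetical.

```python
def expected_row_counts(messages):
    """Derive the row count each table should hold from the raw message flow.

    Each message is a (message_type, sub_type, key) tuple, where key is the
    order id for Order messages and the trade id for Trade messages.
    """
    order_ids = set()
    trade_rows = 0
    for message_type, _sub_type, key in messages:
        if message_type == "Order":
            order_ids.add(key)   # assumed: one row per unique order id
        elif message_type == "Trade":
            trade_rows += 1      # assumed: one row per trade message
    return {"Order Table": len(order_ids), "Trade Table": trade_rows}


def count_mismatches(messages, table_counts):
    """Return {table: (expected, actual)} for every table whose count differs."""
    expected = expected_row_counts(messages)
    return {
        table: (count, table_counts.get(table))
        for table, count in expected.items()
        if count != table_counts.get(table)
    }


# Hypothetical flow: order C is first seen through a Status message.
flow = [
    ("Order", "New", "A"),
    ("Order", "Replace", "A"),
    ("Order", "New", "B"),
    ("Order", "Status", "C"),
    ("Trade", None, 1630),
    ("Trade", None, 1630),
]
mismatches = count_mismatches(flow, {"Order Table": 3, "Trade Table": 2})
```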
Order Id   Trader   Contract          Qty   Price   Side
A          6C9      JFFCE150500000F   1     49350   Buy
B          W90      JFFCE150500000F   2     49350   Sell
C          MAT      JFFCE150500000F   1     49345   Buy
Trade Id   Trader   Contract          Qty   Price   Side
1630       6C9      JFFCE150500000F   1     49350   Buy
1630       W90      JFFCE150500000F   1     49350   Sell
1631       MAT      JFFCE150500000F   1     49345   Buy
1631       W90      JFFCE150500000F   1     49345   Sell
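A field-consistency check on sampled trades might look like the Python sketch below, which uses the four trade rows shown above. The rule it applies (each trade id has exactly one Buy leg and one Sell leg that agree on contract, quantity and price) is an assumption about what "message fields consistency" covers; the function name is hypothetical.

```python
from collections import defaultdict

# (trade_id, trader, contract, qty, price, side), copied from the sample above.
TRADES = [
    (1630, "6C9", "JFFCE150500000F", 1, 49350, "Buy"),
    (1630, "W90", "JFFCE150500000F", 1, 49350, "Sell"),
    (1631, "MAT", "JFFCE150500000F", 1, 49345, "Buy"),
    (1631, "W90", "JFFCE150500000F", 1, 49345, "Sell"),
]


def inconsistent_trades(trade_legs):
    """Return the sorted trade ids whose legs do not form a matching pair."""
    legs_by_id = defaultdict(list)
    for leg in trade_legs:
        legs_by_id[leg[0]].append(leg)

    bad = []
    for trade_id, legs in legs_by_id.items():
        sides = sorted(leg[5] for leg in legs)
        # Both legs must agree on contract, quantity and price.
        same_terms = len({(leg[2], leg[3], leg[4]) for leg in legs}) == 1
        if sides != ["Buy", "Sell"] or not same_terms:
            bad.append(trade_id)
    return sorted(bad)


sample_result = inconsistent_trades(TRADES)
```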
On Demand Market Surveillance : Performance Indicators
Sender/Trade (per region)
• 130 K trades per second
• 800 K on cluster
Test Scenario
• 7 minutes
• 1,479,335 messages
• Stats only on Order Table
End-to-end
• Nominal latency ~200 ms
• 90% of messages under 412 ms
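For readers who want to reproduce indicators of this kind, the Python sketch below computes the average injection rate from the test scenario figures and a nearest-rank percentile over latency samples. The latency samples are invented for illustration and are not measurements from the pilot; the slides do not say which percentile method was used.

```python
def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = (pct * len(ordered) + 99) // 100   # integer ceiling of pct * n / 100
    return ordered[max(rank, 1) - 1]


# Figures from the test scenario on the slide.
MESSAGES = 1_479_335
DURATION_S = 7 * 60
average_rate = MESSAGES / DURATION_S   # messages per second over the run

# Hypothetical end-to-end latencies in milliseconds.
latencies_ms = [180, 190, 195, 200, 200, 205, 210, 230, 400, 900]
p50 = percentile(latencies_ms, 50)
p90 = percentile(latencies_ms, 90)
```

The average rate works out to roughly 3,500 messages per second, which reflects the injector (the slide notes a client limitation), not the cluster's capacity.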
On Demand Market Surveillance : Fault Tolerance Test
HBase is designed to be fault tolerant.
• The node failure shows as a white stripe across the whole width of the graph.
• All nodes are impacted by the failure, not only the killed node (as expected: remember that HBase is a CP system).
• Another white rectangle appears before the node failure.
• It represents the messages that were correctly inserted before the failure but never flushed to disk.
• Because the WAL was deactivated by the trade injector (an option), those messages were lost when regions were moved from the killed node to other nodes.
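The loss described above follows from how a write-ahead log protects unflushed writes. The toy Python model below is a simplified illustration, not HBase code: `ToyRegionServer` and its methods are hypothetical, and recovery is reduced to replaying the log over the flushed files.

```python
class ToyRegionServer:
    """Toy model of one region server with a memstore and an optional WAL."""

    def __init__(self, use_wal):
        self.use_wal = use_wal
        self.wal = []           # durable log: survives a crash
        self.memstore = {}      # in memory only: lost on a crash
        self.store_files = {}   # flushed data: survives a crash

    def put(self, rowkey, value):
        if self.use_wal:
            self.wal.append((rowkey, value))
        self.memstore[rowkey] = value

    def flush(self):
        # Persist the memstore; the log entries it covered are no longer needed.
        self.store_files.update(self.memstore)
        self.memstore.clear()
        self.wal.clear()

    def crash_and_recover(self):
        # Another node reopens the region from flushed files plus WAL replay.
        recovered = dict(self.store_files)
        for rowkey, value in self.wal:
            recovered[rowkey] = value
        return recovered


def surviving_rows(use_wal):
    server = ToyRegionServer(use_wal)
    server.put("order-1", "new")
    server.flush()
    server.put("order-2", "new")   # accepted, but not flushed before the crash
    return sorted(server.crash_and_recover())
```

With the log enabled both rows survive; without it only the flushed row does, which corresponds to the white rectangle before the failure. Skipping the log trades durability for write throughput.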
The X axis is the rowkey prefix, showing how insertions are distributed across the cluster. The Y axis is time.
Points displayed over the entire width of the X axis mean that the distribution is correct.
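One common way to obtain an even spread of rowkey prefixes is to prepend a hash-derived bucket to the natural key. The slides do not say how the pilot built its rowkeys, so the Python sketch below is only an assumed illustration; the bucket count, the separator and the key format are hypothetical.

```python
import hashlib
from collections import Counter


def salted_rowkey(natural_key, buckets=16):
    """Prefix the key with a bucket derived from its hash.

    The same natural key always maps to the same prefix, so point reads
    remain possible, while consecutive keys spread over all buckets.
    """
    digest = hashlib.sha256(natural_key.encode("utf-8")).digest()
    prefix = digest[0] % buckets
    return f"{prefix:02x}|{natural_key}"


# Sequential order ids would otherwise all land in the same key range.
keys = [salted_rowkey(f"order-{i:08d}") for i in range(10_000)]
per_prefix = Counter(key.split("|", 1)[0] for key in keys)
```

The price of salting is that a range scan over natural keys must be issued once per bucket and merged by the client.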
On Demand Market Surveillance : Fault Tolerance Test
A second test confirms that HBase remains available even if a node fails.
The test consists of inserting data into HBase from both YCSB and trade injector clients.
• YCSB inserts data into a table distributed across 5 nodes.
• The trade injector inserts data into a table distributed across 4 nodes.
• The killed node does not impact trade injection.
On Demand Market Surveillance: Next Steps
Deeper evaluation of HBase
Impact of volumes on performance
Evaluation of HA Region Servers for data access
Wider view of the targeted architecture
Overall resilience
Overall latency
Simplification
Hot zone/Cold zone
TCO
Business requirements of the project:
MiFID II impact
New services
Extreme flexibility thanks to our OLAP cube and Data Schema
• 360 view of the position (As Of Date, explain, multi-aggregation level)
• In-memory distributed calculation
• Sub-second end-to-end (push architecture)
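The "As Of Date" and multi-aggregation-level ideas above can be illustrated with a small Python sketch: compute net positions from events up to a cut-off, then roll them up to a coarser level. The event layout, the integer timestamps and the function names are hypothetical; this is not the Scaled Risk OLAP engine.

```python
from collections import defaultdict

CONTRACT = "JFFCE150500000F"

# (timestamp, trader, contract, side, qty); timestamps are made up.
EVENTS = [
    (1, "6C9", CONTRACT, "Buy", 1),
    (1, "W90", CONTRACT, "Sell", 1),
    (2, "MAT", CONTRACT, "Buy", 1),
    (2, "W90", CONTRACT, "Sell", 1),
]


def positions_as_of(events, as_of):
    """Net signed quantity per (trader, contract) from events up to as_of."""
    totals = defaultdict(int)
    for timestamp, trader, contract, side, qty in events:
        if timestamp <= as_of:
            totals[(trader, contract)] += qty if side == "Buy" else -qty
    return dict(totals)


def roll_up(positions, level):
    """Aggregate positions by one key component: 0 = trader, 1 = contract."""
    totals = defaultdict(int)
    for key, qty in positions.items():
        totals[key[level]] += qty
    return dict(totals)


snapshot_t1 = positions_as_of(EVENTS, as_of=1)
snapshot_t2 = positions_as_of(EVENTS, as_of=2)
by_contract = roll_up(snapshot_t2, level=1)
```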
Low latency internal bus
• UDP unicast, acknowledgement by UDP
• No region location pain
• Exactly once delivery, no message resent, multicast storm prevention
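A generic way to get exactly-once, in-order delivery on top of an unreliable transport is to number messages and have the receiver drop duplicates and hold back out-of-order packets. The Python sketch below shows that receiver-side logic only; it is an assumed illustration and not the Scaled Risk bus, and the class and method names are hypothetical.

```python
class InOrderReceiver:
    """Delivers each sequence number exactly once, in order."""

    def __init__(self):
        self.next_seq = 0
        self.pending = {}     # out-of-order payloads waiting for a gap to fill
        self.delivered = []

    def on_packet(self, seq, payload):
        # Ignore anything already delivered or already buffered.
        if seq >= self.next_seq and seq not in self.pending:
            self.pending[seq] = payload
        # Deliver every payload that is now contiguous.
        while self.next_seq in self.pending:
            self.delivered.append(self.pending.pop(self.next_seq))
            self.next_seq += 1
        return seq            # value to acknowledge back to the sender

    def missing(self):
        # Sequence numbers that a recovery path would need to fetch.
        if not self.pending:
            return []
        return [s for s in range(self.next_seq, max(self.pending))
                if s not in self.pending]


receiver = InOrderReceiver()
receiver.on_packet(0, "a")
receiver.on_packet(2, "c")
gap_after_loss = receiver.missing()
receiver.on_packet(0, "a")    # duplicate, ignored
receiver.on_packet(1, "b")    # fills the gap
receiver.on_packet(2, "c")    # duplicate, ignored
```

In this sketch `missing()` is where a separate recovery channel would plug in, for example fetching the lost messages from storage rather than asking the sender to resend them.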
Resiliency
• HBase RPC poll on message loss
• HDFS message storage on overflow and region events
Overview of Scaled Risk implementation
Open Architecture
• Open Standards: seamless integration to HBase (coprocessor)
• Open API (Valuation, FIFO), Toolkit approach
www.scaledrisk.com

Mais conteúdo relacionado

Mais procurados

Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...confluent
 
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...ScyllaDB
 
Kafka website activity architecture
Kafka website activity architectureKafka website activity architecture
Kafka website activity architectureOmid Vahdaty
 
Extending MariaDB with user-defined functions
Extending MariaDB with user-defined functionsExtending MariaDB with user-defined functions
Extending MariaDB with user-defined functionsMariaDB plc
 
Scylla Summit 2022: What’s New in ScyllaDB Operator for Kubernetes
Scylla Summit 2022: What’s New in ScyllaDB Operator for KubernetesScylla Summit 2022: What’s New in ScyllaDB Operator for Kubernetes
Scylla Summit 2022: What’s New in ScyllaDB Operator for KubernetesScyllaDB
 
Scylla Summit 2016: Graph Processing with Titan and Scylla
Scylla Summit 2016: Graph Processing with Titan and ScyllaScylla Summit 2016: Graph Processing with Titan and Scylla
Scylla Summit 2016: Graph Processing with Titan and ScyllaScyllaDB
 
MariaDB Platform for hybrid transactional/analytical workloads
MariaDB Platform for hybrid transactional/analytical workloadsMariaDB Platform for hybrid transactional/analytical workloads
MariaDB Platform for hybrid transactional/analytical workloadsMariaDB plc
 
How MariaDB is approaching DBaaS
How MariaDB is approaching DBaaSHow MariaDB is approaching DBaaS
How MariaDB is approaching DBaaSMariaDB plc
 
Building Distributed Systems With Riak and Riak Core
Building Distributed Systems With Riak and Riak CoreBuilding Distributed Systems With Riak and Riak Core
Building Distributed Systems With Riak and Riak CoreAndy Gross
 
Scylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and FutureScylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and FutureScyllaDB
 
Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?Severalnines
 
Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More!
Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More! Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More!
Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More! Redis Labs
 
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScyllaDB
 
Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph StorageRed Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph StorageRed_Hat_Storage
 
Scylla Summit 2022: ORM and Query Building in Rust
Scylla Summit 2022: ORM and Query Building in RustScylla Summit 2022: ORM and Query Building in Rust
Scylla Summit 2022: ORM and Query Building in RustScyllaDB
 
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDBScylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDBScyllaDB
 
Large Scale Data Analytics with Spark and Cassandra on the DSE Platform
Large Scale Data Analytics with Spark and Cassandra on the DSE PlatformLarge Scale Data Analytics with Spark and Cassandra on the DSE Platform
Large Scale Data Analytics with Spark and Cassandra on the DSE PlatformDataStax Academy
 
Scylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them All
Scylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them AllScylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them All
Scylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them AllScyllaDB
 

Mais procurados (20)

Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
Performance Tuning RocksDB for Kafka Streams' State Stores (Dhruba Borthakur,...
 
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
Scylla Summit 2022: Building Zeotap's Privacy Compliant Customer Data Platfor...
 
Kafka website activity architecture
Kafka website activity architectureKafka website activity architecture
Kafka website activity architecture
 
Extending MariaDB with user-defined functions
Extending MariaDB with user-defined functionsExtending MariaDB with user-defined functions
Extending MariaDB with user-defined functions
 
Scylla Summit 2022: What’s New in ScyllaDB Operator for Kubernetes
Scylla Summit 2022: What’s New in ScyllaDB Operator for KubernetesScylla Summit 2022: What’s New in ScyllaDB Operator for Kubernetes
Scylla Summit 2022: What’s New in ScyllaDB Operator for Kubernetes
 
Scylla Summit 2016: Graph Processing with Titan and Scylla
Scylla Summit 2016: Graph Processing with Titan and ScyllaScylla Summit 2016: Graph Processing with Titan and Scylla
Scylla Summit 2016: Graph Processing with Titan and Scylla
 
MariaDB Platform for hybrid transactional/analytical workloads
MariaDB Platform for hybrid transactional/analytical workloadsMariaDB Platform for hybrid transactional/analytical workloads
MariaDB Platform for hybrid transactional/analytical workloads
 
How MariaDB is approaching DBaaS
How MariaDB is approaching DBaaSHow MariaDB is approaching DBaaS
How MariaDB is approaching DBaaS
 
Building Distributed Systems With Riak and Riak Core
Building Distributed Systems With Riak and Riak CoreBuilding Distributed Systems With Riak and Riak Core
Building Distributed Systems With Riak and Riak Core
 
Scylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and FutureScylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and Future
 
Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?Webinar slides: How to Measure Database Availability?
Webinar slides: How to Measure Database Availability?
 
Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More!
Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More! Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More!
Redis in a Multi Tenant Environment–High Availability, Monitoring & Much More!
 
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech CycleScylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
 
Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph StorageRed Hat Storage Day New York - What's New in Red Hat Ceph Storage
Red Hat Storage Day New York - What's New in Red Hat Ceph Storage
 
Scylla Summit 2022: ORM and Query Building in Rust
Scylla Summit 2022: ORM and Query Building in RustScylla Summit 2022: ORM and Query Building in Rust
Scylla Summit 2022: ORM and Query Building in Rust
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core Concepts
 
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDBScylla Summit 2022: New AWS Instances Perfect for ScyllaDB
Scylla Summit 2022: New AWS Instances Perfect for ScyllaDB
 
Large Scale Data Analytics with Spark and Cassandra on the DSE Platform
Large Scale Data Analytics with Spark and Cassandra on the DSE PlatformLarge Scale Data Analytics with Spark and Cassandra on the DSE Platform
Large Scale Data Analytics with Spark and Cassandra on the DSE Platform
 
Red Hat Storage Roadmap
Red Hat Storage RoadmapRed Hat Storage Roadmap
Red Hat Storage Roadmap
 
Scylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them All
Scylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them AllScylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them All
Scylla Summit 2022: ScyllaDB Rust Driver: One Driver to Rule Them All
 

Semelhante a NoSQL in Financial Industry - Pierre Bittner

BigDataSpain 2016: Introduction to Apache Apex
BigDataSpain 2016: Introduction to Apache ApexBigDataSpain 2016: Introduction to Apache Apex
BigDataSpain 2016: Introduction to Apache ApexThomas Weise
 
Azure Cosmos DB - The Swiss Army NoSQL Cloud Database
Azure Cosmos DB - The Swiss Army NoSQL Cloud DatabaseAzure Cosmos DB - The Swiss Army NoSQL Cloud Database
Azure Cosmos DB - The Swiss Army NoSQL Cloud DatabaseBizTalk360
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDBMongoDB
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Dataconomy Media
 
Lessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatternsLessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatternsClaudiu Barbura
 
Aerospike Hybrid Memory Architecture
Aerospike Hybrid Memory ArchitectureAerospike Hybrid Memory Architecture
Aerospike Hybrid Memory ArchitectureAerospike, Inc.
 
Kylin and Druid Presentation
Kylin and Druid PresentationKylin and Druid Presentation
Kylin and Druid Presentationargonauts007
 
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBaseHBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBaseHBaseCon
 
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
How KeyBank Used Elastic to Build an Enterprise Monitoring SolutionHow KeyBank Used Elastic to Build an Enterprise Monitoring Solution
How KeyBank Used Elastic to Build an Enterprise Monitoring SolutionElasticsearch
 
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Dataconomy Media
 
Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Apache Apex
 
Informix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceInformix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceKeshav Murthy
 
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice MachineSpark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice MachineData Con LA
 
Presentacion day f-core v1.2.1.2-technical - english
Presentacion day f-core v1.2.1.2-technical - englishPresentacion day f-core v1.2.1.2-technical - english
Presentacion day f-core v1.2.1.2-technical - englishJose Luis Sanchez del Coso
 
SQL and Machine Learning on Hadoop
SQL and Machine Learning on HadoopSQL and Machine Learning on Hadoop
SQL and Machine Learning on HadoopMukund Babbar
 
Dev Lakhani, Data Scientist at Batch Insights "Real Time Big Data Applicatio...
Dev Lakhani, Data Scientist at Batch Insights  "Real Time Big Data Applicatio...Dev Lakhani, Data Scientist at Batch Insights  "Real Time Big Data Applicatio...
Dev Lakhani, Data Scientist at Batch Insights "Real Time Big Data Applicatio...Dataconomy Media
 
Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...
Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...
Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...Matt Stubbs
 
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache ApexApache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache ApexApache Apex
 

Semelhante a NoSQL in Financial Industry - Pierre Bittner (20)

BigDataSpain 2016: Introduction to Apache Apex
BigDataSpain 2016: Introduction to Apache ApexBigDataSpain 2016: Introduction to Apache Apex
BigDataSpain 2016: Introduction to Apache Apex
 
Azure Cosmos DB - The Swiss Army NoSQL Cloud Database
Azure Cosmos DB - The Swiss Army NoSQL Cloud DatabaseAzure Cosmos DB - The Swiss Army NoSQL Cloud Database
Azure Cosmos DB - The Swiss Army NoSQL Cloud Database
 
Cassandra in xPatterns
Cassandra in xPatternsCassandra in xPatterns
Cassandra in xPatterns
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDB
 
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
 
Lessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatternsLessons learned from embedding Cassandra in xPatterns
Lessons learned from embedding Cassandra in xPatterns
 
Aerospike Hybrid Memory Architecture
Aerospike Hybrid Memory ArchitectureAerospike Hybrid Memory Architecture
Aerospike Hybrid Memory Architecture
 
Kylin and Druid Presentation
Kylin and Druid PresentationKylin and Druid Presentation
Kylin and Druid Presentation
 
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBaseHBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
 
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
How KeyBank Used Elastic to Build an Enterprise Monitoring SolutionHow KeyBank Used Elastic to Build an Enterprise Monitoring Solution
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
 
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
 
Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex
 
Kafka & Hadoop in Rakuten
Kafka & Hadoop in RakutenKafka & Hadoop in Rakuten
Kafka & Hadoop in Rakuten
 
Informix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceInformix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performance
 
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice MachineSpark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
Spark as part of a Hybrid RDBMS Architecture-John Leach Cofounder Splice Machine
 
Presentacion day f-core v1.2.1.2-technical - english
Presentacion day f-core v1.2.1.2-technical - englishPresentacion day f-core v1.2.1.2-technical - english
Presentacion day f-core v1.2.1.2-technical - english
 
SQL and Machine Learning on Hadoop
SQL and Machine Learning on HadoopSQL and Machine Learning on Hadoop
SQL and Machine Learning on Hadoop
 
Dev Lakhani, Data Scientist at Batch Insights "Real Time Big Data Applicatio...
Dev Lakhani, Data Scientist at Batch Insights  "Real Time Big Data Applicatio...Dev Lakhani, Data Scientist at Batch Insights  "Real Time Big Data Applicatio...
Dev Lakhani, Data Scientist at Batch Insights "Real Time Big Data Applicatio...
 
Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...
Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...
Big Data LDN 2016: Kick Start your Big Data project with Hyperconverged Infra...
 
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache ApexApache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
 

Mais de distributed matters

Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik RüttimannCloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimann
Cloud Apps - Running Fully Distributed on Mobile Devices - Dominik Rüttimanndistributed matters
 
What and Why and How: Apache Drill ! - Tugdual Grall
What and Why and How: Apache Drill ! - Tugdual GrallWhat and Why and How: Apache Drill ! - Tugdual Grall
What and Why and How: Apache Drill ! - Tugdual Gralldistributed matters
 
Functional Operations - Susan Potter
Functional Operations - Susan PotterFunctional Operations - Susan Potter
Functional Operations - Susan Potterdistributed matters
 
Joins in a distributed world - Lucian Precup
Joins in a distributed world - Lucian Precup Joins in a distributed world - Lucian Precup
Joins in a distributed world - Lucian Precup distributed matters
 
Meet Spilo, Zalando’s HIGH-AVAILABLE POSTGRESQL CLUSTER - Feike Steenbergen
NoSQL in Financial Industry - Pierre Bittner

  • 1. SCALED RISK: Next Generation of Financial Platform. NoSQL in Financial Industry. Distributed Matters, Barcelona, 21 November 2015. Pierre Bittner, CTO, SCALED RISK.
  • 2. What is Scaled Risk?
      • WHAT? Integrated Big Data & Analytics Platform
      • WHERE? SaaS or On-Premise
      • HOW? Hadoop/HBase + Low latency + External Consistency + Flexible Data Schema + In-Memory OLAP
      • FOR? Real-Time Risk Management
  • 3. Why NoSQL Matters in Financial Industry?
      • Volume / Velocity
        § The New York Stock Exchange generates about 4-5 terabytes of data per day.
        § Algo Trading, High Frequency Trading: accounted for 50% of all US equity trading volume in 2012. Trades execute in milliseconds and even microseconds.
      • Coherency / Availability / Security
        § Regulatory Report: Intraday Monitoring
        § MTTR, Data Spikes on Market Event, Disaster Recovery, ACL
      • Mixed workloads: Streaming and Historical Analysis, Point In Time Comparison
        § BackTesting, Replay (UTC Timestamping of all events, FIFO)
        § Lambda-architecture, Kappa-architecture
      • Need for Multi-tenancy / Data & Process Governance (Data Lake / Data Centric Arch.)
  • 4. Why NoSQL Matters in Financial Industry?
      • Real-Time Enterprise-Wide Risk Management: improved and trustworthy view of global risk, and support for implementing upcoming regulations
      • Real-Time Fraud Detection: pre-check, real-time and historical data verification for trades, payments, orders, …
      • Real-Time Market Analytics: on-demand live and historical data analysis on global markets
      • Customer Story: Market Exchange, Market Surveillance
  • 5. Today’s Trading Challenge: On-Demand Live Analysis & Alerts
      • Structured Data Feeds
        § OTC Market: Foreign Exchange; Debt Market (Bond); Commodities; Bloomberg, FXAll, …
        § Regulated Market: Securities; Options; NYSE, Eurex; LSE; …
        § Booking Systems: Trader Positions; Intraday Events; Valuation; Volatility, Correlation
        § Referential Data: Counterparties; Analytical Structure; Products Definition; Mappings
      • Unstructured Data Feeds
        § News & Mkt Analysis: Reuters, BBG; Research
        § Social Media: Twitter; LinkedIn, …
      • RT Aggregated Positions
        § Trading: Global Positions; Intraday funding & forecasting; Collateral Optimization
        § Sales: Credit Line; Profitability Indicator; Customer Interests
        § Risk: CVA, Counterparty Exposure; Limit, Stress Test
      • On-Demand Analysis
        § Global: Market Flows; Analyst/Market Correlation
        § Market Risk Analysis: Stress per Counterparty
        § Sales: Customer alerts on Market Trends; Recommendation & Lead Generation
      • Live Report & Alerts
        § On Market Events; Custom scenario; Market Surveillance
        § Intraday Limit Risk: Automatic Monitoring
  • 6. Customer Story: On Demand Market Surveillance for Exchange
      • Context: extreme performance and resilience; peak activity > 1M orders per second; low latency
      • Objective: on-demand market analytics out of real-time & historical data; resilient primary storage
      • Problems: high volumes, difficult access to history; SLAs for data & service availability
      • Solution: Scaled Risk at the outflow of the matching engine
      • Benefits: streamlined process, consistent view; high availability and scalability; reduced TCO
      • Result: a single system for storage and computation of spot & historical data for market surveillance
  • 7. On Demand Market Surveillance: Pilot Perimeter. High Level Architecture Candidate
  • 8. On Demand Market Surveillance: Pilot Perimeter
      • Focus on evaluating the HBase framework
        § HBase performance on Read/Write
        § HBase behavior during a node failure
        § HBase process isolation
        § Global consistency
      • Key parts of the architecture
        § Message Bus (Kafka)
        § Storage System (HDFS)
        § Operational Database (HBase)
        § Real-Time Analytics tool (Scaled Risk)
        § History & Data Analytics tool (SR & Spark)
      • Benefits of the architecture (streamlined process, cost, …) are not covered in this step.
      • Confirm the Hadoop/HBase technical stack
      • Evaluate Scaled Risk performance
      • Explore Scaled Risk features
      • Pilot perimeter: suitability of HBase and Scaled Risk in terms of properties and performance. Pilot duration: 2 months
  • 9. HBase: Random Access to your Planet-Wide Data
      • HBase in a few words: HBase is an open-source, distributed, versioned, non-relational, scalable, wide-column data store.
        § It is the Hadoop database, built mainly on HDFS.
        § Based on the Google BigTable storage system.
      • Key-value data organization per row. A table is a namespace.
      • Each cell is timestamped
      • ACID per row; rowkey for fast access and data distribution
      • Four primary operations: Get, Put, Delete and Scan
      • Server-side operations with Coprocessors (Observer, Endpoint)
      • Linear scalability, automatic sharding and failover support
      • Strictly consistent
      • Hadoop ecosystem integration: YARN, MapReduce, Hive, Spark
      • Phoenix for a SQL flavor
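To make the data model on this slide concrete, here is a minimal in-memory sketch of timestamped cells addressed by rowkey, with the four primary operations and an "as of" read. It is a toy model written for illustration only: the class `ToyTable`, its method signatures and the sample rowkeys are assumptions, not the HBase client API, and real HBase deletes write tombstones rather than dropping data immediately.

```python
class ToyTable:
    """Toy model of a wide-column table. NOT the HBase client API."""

    def __init__(self):
        # rowkey -> column -> list of (timestamp, value), oldest first
        self._rows = {}

    def put(self, rowkey, column, value, ts):
        # Every write adds a new timestamped version of the cell.
        versions = self._rows.setdefault(rowkey, {}).setdefault(column, [])
        versions.append((ts, value))
        versions.sort(key=lambda v: v[0])

    def get(self, rowkey, column, as_of=None):
        # Return the newest version, or the newest one not later than as_of.
        versions = self._rows.get(rowkey, {}).get(column, [])
        for ts, value in reversed(versions):
            if as_of is None or ts <= as_of:
                return value
        return None

    def delete(self, rowkey):
        # Simplification: drop the row at once (no tombstones).
        self._rows.pop(rowkey, None)

    def scan(self, start, stop):
        # Rowkeys are kept in sorted order, so a scan is a key range read.
        for rowkey in sorted(self._rows):
            if start <= rowkey < stop:
                yield rowkey


# Hypothetical sample data, loosely modeled on the order fields shown later.
table = ToyTable()
table.put("order#A", "price", 49350, ts=1)
table.put("order#A", "price", 49345, ts=5)
table.put("order#B", "price", 49350, ts=2)

latest = table.get("order#A", "price")            # newest version
as_of_3 = table.get("order#A", "price", as_of=3)  # point-in-time read
keys = list(table.scan("order#", "order#~"))      # range scan on the rowkey
```

Because every cell keeps its timestamp, a point-in-time read is just a filter on versions, which is the property the "As Of Date" reads in the functional architecture rely on.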
  • 10. On-Demand Market Surveillance: Functional Architecture
      • Injector (Thrift)
      • Low Latency Internal Bus
      • NoSQL Wide Column Store: Dynamic Data Schema; Schema on read and write; Fast, Random R/W access
      • Real-Time Distributed OLAP: Fast In-Memory Data Processing; Full Consistency; Linear Scalability; Open API (Valuation)
      • Read-Isolations, As Of Date
      • HBase As Storage
      • Real Time Indexing: Advanced Index and search for Data Classification and Correlation; Semantic reconciliation
      • Real-Time Alerting: Alert on Analytics (Volume, Matching, Cancel Rate; chart per contract, Contract 1 to Contract 4); Alert on Data
      • REST/API/WebSocket
  • 11. On-Demand Market Surveillance: Technical Architecture
      • Node layout
        § 3 x Head Node: Name Node; Secondary Name Node; HBase Master
        § 6 x Worker Node: Region Server + Data Node on each
        § 1 x HP Loader
      • Cluster size and components
        § 3 x Hadoop head nodes: HP ProLiant DL360 Gen9 Server, 8 x 900GB 10k rpm SAS, 128 GB RAM, 2 x (10 cores) Intel Xeon CPU E5-2660 v3 @ 2.60GHz, 4 x 1GbE ports and 2 x 10GbE ports
        § 6 x Hadoop worker nodes: HP ProLiant DL380 Gen9 Server, 2 x 120GB SSD OS, 15 x 3TB 7.2k rpm SATA, 128 GB RAM, 2 x (10 cores) Intel Xeon CPU E5-2660 v3 @ 2.60GHz, 4 x 1GbE ports and 2 x 10GbE ports, 1 x HP Smart HBA H240ar, 1 x HP Smart HBA H240
        § 1 x HP Loader: HP ProLiant DL380 Gen8, 14 x 1TB 7.2k rpm SAS, 128 GB RAM, 2 x (10 cores) Intel Xeon CPU E5-2670 v2 @ 2.50GHz
      • Hadoop cluster details
        § Hadoop HDFS usable size: 60TB (block replication 3, no compression)
        § Hadoop HDFS data disk RAW size: 241TB
        § Hadoop cluster memory: 6 x 128GB = 768GB
      • Hadoop components and associated services
        § Hadoop Distribution: HortonWorks HDP 2.2 Stack
        § Cluster management: HP Insight CMU v7.3
        § HDFS v2.6.0
        § HBase v0.98.4
        § ZooKeeper v3.4.6
      • Other details
        § OS: RHEL (Red Hat Enterprise Linux) v6.5, 64-bit
        § Linux filesystem for Hadoop data: ext4
        § JVM used for Hadoop: Oracle Java 1.7.0_67
  • 12. On Demand Market Surveillance: Functional Consistency
      • Market Exchange Data types
        § A unique data flow containing all types of message
        § Order messages
        § Trade messages
        § Test injector generates 1.5M messages in 7 minutes (client limitation)
      • Scaled Risk Data exhaustiveness control
        § Dynamic data model with two tables
        § Trade and Order messages are split
        § Test method: Messages count

        Message Type   Message sub type     Count
        Order          New                  792,546
        Order          Replace              645,889
        Order          Status               40,821
        Order          Others               80
        Order          (unique order ids)   792,886
        Cancel         n/a                  680,626
        Trade          n/a                  137,573

        Table          Count
        Order Table    792,886
        Trade Table    137,573

      • Order and Trade Life-cycle Control
        § Message fields consistency control
        § Test method: Data sampling

        Order Id   Trader   Contract          Qty   Price   Side
        A          6C9      JFFCE150500000F   1     49350   Buy
        B          W90      JFFCE150500000F   2     49350   Sell
        C          MAT      JFFCE150500000F   1     49345   Buy

        Trade Id   Trader   Contract          Qty   Price   Side
        1630       6C9      JFFCE150500000F   1     49350   Buy
        1630       W90      JFFCE150500000F   1     49350   Sell
        1631       MAT      JFFCE150500000F   1     49345   Buy
        1631       W90      JFFCE150500000F   1     49345   Sell
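The exhaustiveness control described on this slide (count messages by type, split them into an order table and a trade table, then compare counts) could be sketched as follows. This is an illustration under assumptions: the message dictionaries, their field names and the function `split_and_count` are hypothetical, and how Cancel messages are stored is not stated on the slide, so here they are only counted.

```python
from collections import Counter


def split_and_count(messages):
    """Count messages per (type, subtype) and split them into two tables."""
    counts = Counter()
    order_table = {}   # one row per unique order id (latest message wins)
    trade_table = []   # one row per trade message
    for msg in messages:
        counts[(msg["type"], msg.get("subtype"))] += 1
        if msg["type"] == "Order":
            order_table[msg["order_id"]] = msg
        elif msg["type"] == "Trade":
            trade_table.append(msg)
        # Assumption: Cancel messages are counted but create no new row.
    return counts, order_table, trade_table


# Hypothetical single data flow mixing all message types.
messages = [
    {"type": "Order", "subtype": "New", "order_id": "A"},
    {"type": "Order", "subtype": "Replace", "order_id": "A"},
    {"type": "Order", "subtype": "New", "order_id": "B"},
    {"type": "Cancel", "order_id": "B"},
    {"type": "Trade", "trade_id": 1630, "order_id": "A"},
    {"type": "Trade", "trade_id": 1630, "order_id": "B"},
]

counts, order_table, trade_table = split_and_count(messages)

# Control: every injected message was counted, the order table holds one row
# per unique order id, and the trade table holds one row per trade message.
total_ok = sum(counts.values()) == len(messages)
orders_ok = len(order_table) == len({m["order_id"] for m in messages if m["type"] == "Order"})
trades_ok = len(trade_table) == counts[("Trade", None)]
```

In the pilot figures the same two checks appear as the order table count matching the unique order ids (792,886) and the trade table count matching the trade messages (137,573).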
  • 13. On Demand Market Surveillance: Performance Indicators
      • Sender/Trade (per region): 130 K trades per second; 800 K on cluster
      • Test Scenario: 7 minutes; 1,479,335 messages; stats only on Order Table
      • End-to-end: nominal latency ~200 ms; 90% of messages under 412 ms
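Indicators of this kind can be derived from raw measurements with a few lines of code. The sketch below computes the average injection rate from the test scenario figures on the slide and a nearest-rank percentile over latency samples. The latency samples are invented for illustration and the slide does not say which percentile method was used, so nearest-rank is an assumption.

```python
def percentile(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% of samples <= it."""
    ordered = sorted(samples)
    # Integer ceiling of pct * n / 100, avoiding floating-point rounding.
    rank = (pct * len(ordered) + 99) // 100
    return ordered[max(rank, 1) - 1]


# Test scenario from the slide: 1,479,335 messages injected in 7 minutes.
avg_rate = 1_479_335 / (7 * 60)   # messages per second, averaged over the run

# Hypothetical end-to-end latencies in milliseconds (not measured data).
latencies_ms = [180, 190, 195, 200, 200, 205, 210, 250, 400, 900]
p50 = percentile(latencies_ms, 50)
p90 = percentile(latencies_ms, 90)
```

The average rate works out to roughly 3,500 messages per second, far below the per-region sender figure, which is consistent with the injector being the limiting client in this test.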
  • 14. On Demand Market Surveillance: Fault Tolerance Test
      HBase is designed to be fault tolerant.
      • The node failure is where the white stripe spans the whole width of the graph.
      • All nodes are impacted by the failure, not only the killed node (as expected: remember that HBase is CP).
      • Another white rectangle is displayed before the node failure.
      • It represents all the messages that were correctly inserted before the failure but never flushed to disk.
      • Because the WAL is deactivated by the trade injector (an option), those messages were lost when regions were moved from the killed node to other nodes.
      Graph axes: the X axis is the rowkey prefix, to show the distribution of insertions across the cluster. The Y axis is the time. Points displayed over the entire width of the X axis mean that the distribution is correct.
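The loss of unflushed messages when the WAL is deactivated can be illustrated with a small simulation. This is a toy model under assumptions: `ToyRegion` and its methods are invented for the example and only mimic the general write path (log, in-memory store, flush to disk); it is not HBase code.

```python
class ToyRegion:
    """Toy write path: write-ahead log, in-memory store, flushed store."""

    def __init__(self):
        self.wal = []        # durable log, survives a crash
        self.memstore = {}   # in memory only, lost on a crash
        self.store = {}      # flushed to disk, survives a crash

    def put(self, key, value, use_wal=True):
        if use_wal:
            self.wal.append((key, value))   # logged before being acknowledged
        self.memstore[key] = value

    def flush(self):
        # Persist in-memory data; logged entries are no longer needed.
        self.store.update(self.memstore)
        self.memstore.clear()
        self.wal.clear()

    def crash_and_recover(self):
        # Memory is lost; only what was logged can be replayed.
        self.memstore.clear()
        for key, value in self.wal:
            self.memstore[key] = value

    def get(self, key):
        return self.memstore.get(key, self.store.get(key))


region = ToyRegion()
region.put("k1", "flushed")
region.flush()
region.put("k2", "logged", use_wal=True)
region.put("k3", "not logged", use_wal=False)   # like the injector option
region.crash_and_recover()

survivors = {k: region.get(k) for k in ("k1", "k2", "k3")}
```

After the simulated crash, the flushed and the logged writes are still readable while the write that skipped the log is gone, which matches the white rectangle described above.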
  • 15. On Demand Market Surveillance: Fault Tolerance Test
      A second test confirms that HBase remains available even if a node fails. The test consists of inserting data into HBase from both YCSB and trade injector clients.
      • YCSB inserts data in a table distributed on 5 nodes.
      • The trade injector inserts data in a table distributed on 4 nodes.
      • The killed node does not impact trade injection.
      Graph axes: the X axis is the rowkey prefix, to show the distribution of insertions across the cluster. The Y axis is the time. Points displayed over the entire width of the X axis mean that the distribution is correct.
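The graphs use the rowkey prefix as a proxy for how evenly insertions spread across the cluster. The slides do not say how the prefixes are built, so the sketch below assumes a common technique, a hash-derived prefix ("salting"), and shows how a per-prefix histogram can check the spread. The names `BUCKETS`, `salted_rowkey` and `prefix_histogram`, the key format and the order ids are all hypothetical.

```python
import hashlib
from collections import Counter

BUCKETS = 16  # hypothetical number of distinct rowkey prefixes


def salted_rowkey(order_id: str) -> str:
    """Prefix the key with a stable hash bucket so writes spread over regions."""
    digest = hashlib.md5(order_id.encode("utf-8")).digest()
    bucket = digest[0] % BUCKETS
    return f"{bucket:02x}|{order_id}"


def prefix_histogram(rowkeys):
    """Count rowkeys per prefix, the quantity shown on the X axis."""
    return Counter(key.split("|", 1)[0] for key in rowkeys)


# Hypothetical order ids; sequential ids would all land in one key range
# without the prefix.
keys = [salted_rowkey(f"ORD{i:07d}") for i in range(10_000)]
hist = prefix_histogram(keys)
```

A histogram with every prefix populated at a similar level corresponds to points covering the full width of the X axis; a gap in one prefix range would point at the region or node that stopped receiving writes.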
  • 16. On Demand Market Surveillance: Next Steps
      • Deeper evaluation of HBase
        § Impact of volumes on performance
        § Evaluation of HA Region Servers for data access
      • Wider view of the targeted architecture
        § Overall resilience
        § Overall latency
        § Simplification
        § Hot zone/Cold zone
        § TCO
      • Business requirements of the project
        § MiFID II impact
        § New services
  • 17. Overview of Scaled Risk implementation
      • Extreme flexibility thanks to our OLAP cube and Data Schema
        § 360 view of the position (As Of Date, explain, multi-aggregation level)
        § In-memory distributed calculation
        § Sub-second end-to-end (push architecture)
      • Low latency internal bus
        § UDP unicast, acknowledgement by UDP
        § No region location pain
        § Exactly-once delivery, no message resend, multicast storm prevention
      • Resiliency
        § HBase RPC poll on message losses
        § HDFS message storage on overflow and region events
      • Open Architecture
        § Open Standards: seamless integration with HBase (coprocessor)
        § Open API (Valuation, FIFO), Toolkit approach