SlideShare a Scribd company logo
1 of 21
Download to read offline
IMC BENEFITS FROM
MEMORY & STORAGE CONVERGENCE
DR. AMIT GOLANDER
PLEXISTOR, CTO
Data
set
ABSTRACT
In-memory compute gave up on Storage and moved the working set to Memory.
This brings tremendous performance gains, but also:
1. Consumes expensive DRAM resources
2. Puts data at risk
3. Suffers from slow recovery time when power failures occur
…
The big Question:
How will IMC look like when Memory and Storage converge?
Working
set
2
Agenda:
 History &The convergence of Memory & Storage
 Benefits – Out-of-the-box
 Benefits –That require some work
3
A LONG TIME AGO…
Ideal
Storage
Requirements for Ideal Storage:
1. Low latency reads
2. High volume persistent writes
3. Reasonable cost
4. Transparent & easy to use
CostLatency
Persistency
DRAM
HDD
SSD
Unfortunately such Storage (#2) did not exist
Big Data Middleware
4
SO MIDDLEWARE DEVELOPERS & USERS COMPROMISED
Commit
Log
Memory
Table
Storage
Table
Persistent,
Pretty Fast
Cheap
Fast
Search
acceleration
1. Storage had Horrible latency for persistent writes,
but not as bad if sequentially written
2. So IMC middleware compensated by using:
- Sequential writes at the expense of read latency
- Async writes at the risk of data loss
- Caching like crazy at the expanse of HW cost (DRAM)
- Write amplification at the expanse of HW cost (Storage)
- Compaction at the expense of HW cost (CPU)
Original requirements Vs. IMC reality:
1. Low latency reads
2. High volume (eventual) persistent writes
3. Reasonable cost
4. Transparent & easy to use 5
WHAT HAS CHANGED?
Memory & Storage are converging:
 New HW - Persistent Memory (PM, e.g. NVDIMM-N)
 New SW - Software Defined Memory (SDM)
Persistency
DRAM
HDD
SSD
PM
PM+SDM delivers:
1. Low latency reads
2. HighVolume persistent writes
3. Reasonable cost
4. Transparent & easy to use
CostLatency
SDM
SDM-ephemeral delivers:
1. Low latency reads
2. High volume persistent* writes
3. Reasonable cost
4. Transparent & easy to use**
* Persistent on orderly shutdowns, not power failures
** Easy to use within share nothing architecturesPersistency
DRAM
HDD
SSD
CostLatency
SDM-ephemeral
6
HOWTO LEVERAGE SDM?
SDM
Scenario II
New Middleware / Some work to existing
Scenario I
Existing Middleware – Out of the box
SDM
SDM
SDM
7
Agenda:
 History & the convergence of Memory & Storage
 Benefits – Out-of-the-box
 Benefits –That require some work
8
OUT OF THE BOX INTEGRATION
DRAM/PM FLASH DISK
I/O Path Memory Path
Fast Storage Huge Memory
Data Services
Virtual MemoryHDFS POSIX
Plexistor FS (Multi Tier, DAX)
Linux
1. Download & Install SDM
2. Mount m1fs
3. Run your application
9
OOB BENEFIT 1: LARGE WORKING SETS
Work set 2x Memory size
SDM at 17,000 ops/sec
XFS at 2,000 ops/sec
 Performance is highly sensitive to
Working set size > Aggregated memory size
 Working set size is dynamic and hard to predict
 Large clusters are expensive
Cassandra v3.0.2
I2.4xlarge instance on AWS
Data
set
Working
set
10
OOB BENEFIT 2: PERSISTENCY
 Performance is highly sensitive to persistency/durability requirements
 Replication/Mirroring between nodes without persistency is vulnerable to Power Failures
 Data loss risk is often not well explained. Confusion leads to wasteful behavior (#copies, Network)
0
30,000
60,000
90,000
120,000
150,000
180,000
Ops/sec
TheTraditionalTradeoff
(B) Balanced (D) Durable
MongoDB v3.2
E5-2650v3, CloudSpeed SSD
*
(*) – This actually writes two persistent copies: in Memory Table and in Commit Log
11
OOB BENEFIT 3: LONG RE-BUILD TIMES
 Nodes occasionally fail in large clusters
 Re-build take many hours to complete
due to extra pressure on the storage layer
ClientsClients
Couchbase
server
Couchbase
server
Couchbase
server
Couchbase
server
Couchbase
server
X
Couchbase v4.5 beta
E5-2650v4, CloudSpeed SSD
12
OOB BENEFIT 4: PREDICTABILITY
 No hiccups due to separate memory and storage stacks
 Highly predictable performance
time
TPS
MySQL v5.6
E5-2680v3, HGST SN150
DB load generator runs at target (not maximal) speed 13
Agenda:
 History & the convergence of Memory & Storage
 Benefits – Out-of-the-box
 Benefits –That require some work
14
BENEFITS THAT REQUIRE WORK AT THE MIDDLEWARE LAYER
 A lot of potential for Fast Queries & Simplicity
SDM
Storage
Big Data middleware
File-level FIO
E5-2650v3, CloudSpeed SSD 15
EXAMPLE - AMPOOL
16
• Fast & Standard access throughout
the data pipeline
• 56x faster ingest
3-4x faster OLTP&OLAP than HBase
6x faster Spark than Tachyon
DESIGNING MIDDLEWARE IN THE SDM ERA
1. Realize that you’re a storage/memory billionaire
– focus on your business logic
2. Use standard POSIX API and share files between frameworks (polyglot)
3. Use SDM zero-cost Clones (cp –reflink)
4. Rely on SDM Auto-tiering (If you must – hint via fadvise/madvise)
5. Consider relying on SDM Mirroring capabilities
6. Use SDM monitoring tools to understand your resource consumption
17
SUMMARY
 Memory and Storage have already started converging (SDM)
 IMC best practices are no longer the “best”
 SDM provides value to IMC out-of-the-box
but
There is even greater opportunity for those willing to integrate
Efficiency
Simplicity
18
Q & A
Free SDM download - www.plexistor.com/download/
White papers - www.plexistor.com/resources/
Blog - www.plexistor.com/blog/
amit@plexistor.com
19
HIGH AVAILABILITY - CLARIFICATION
 Almost zero-latency added for having a 2nd copy, providing that high-speed RDMA network is in place
 Public cloud deployments – Keep using your current HA strategy
 On premise deployments – Can substitute most copies with storage redundancy
App server 1
Plexistor SDM
App server 2
Plexistor SDM
App server N
Plexistor SDM
High-speed
RDMA
Open
Brick 1
Open
Brick M
20
SDM VS. XFS-DAX VS. NVML - CLARIFICATION
Plexistor
ext4/xfs
DAX
NVML
Scale Out Application
AutoTiering Application
Snapshots/Clones Application
Legacy Applications
NVML support
High availability Application
IT policy hooks
DRAM/PM
Memory Path
Virtual MemoryPOSIX
FS w/ DAX support*
Linux
App using
mmap
App using
NVML
(*) Who supports DAX:
- Plexistor SDM
- Linux xfs-dax, and ext4-dax (WIP)
- MS ReFS-dax (WIP) 21

More Related Content

Viewers also liked

Storage and memory devices
Storage and memory devicesStorage and memory devices
Storage and memory devicesHassaan13
 
Chapter 7 Storage
Chapter 7 StorageChapter 7 Storage
Chapter 7 Storagenorzaini
 
Fpga(field programmable gate array)
Fpga(field programmable gate array) Fpga(field programmable gate array)
Fpga(field programmable gate array) Iffat Anjum
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIn-Memory Computing Summit
 
computer memory ,., .
computer memory ,., .computer memory ,., .
computer memory ,., .santokh00700
 
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...In-Memory Computing Summit
 
09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)
09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)
09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)Akhila Dakshina
 
14827 shift registers
14827 shift registers14827 shift registers
14827 shift registersSandeep Kumar
 
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...In-Memory Computing Summit
 

Viewers also liked (18)

Storage and memory devices
Storage and memory devicesStorage and memory devices
Storage and memory devices
 
Fpga 02-memory-and-pl ds
Fpga 02-memory-and-pl dsFpga 02-memory-and-pl ds
Fpga 02-memory-and-pl ds
 
Chapter 7 Storage
Chapter 7 StorageChapter 7 Storage
Chapter 7 Storage
 
Fpga(field programmable gate array)
Fpga(field programmable gate array) Fpga(field programmable gate array)
Fpga(field programmable gate array)
 
Flipflop
FlipflopFlipflop
Flipflop
 
MSI Shift Registers
MSI Shift RegistersMSI Shift Registers
MSI Shift Registers
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
 
Computer Memory
Computer MemoryComputer Memory
Computer Memory
 
computer memory ,., .
computer memory ,., .computer memory ,., .
computer memory ,., .
 
FPGA Introduction
FPGA IntroductionFPGA Introduction
FPGA Introduction
 
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
 
09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)
09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)
09. Memory, Storage (RAM, Cache, HDD, ODD, SSD, Flashdrives)
 
What is FPGA?
What is FPGA?What is FPGA?
What is FPGA?
 
14827 shift registers
14827 shift registers14827 shift registers
14827 shift registers
 
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
 
FPGA
FPGAFPGA
FPGA
 
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
 
Shift Registers
Shift RegistersShift Registers
Shift Registers
 

More from In-Memory Computing Summit

IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIn-Memory Computing Summit
 
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise GradeIMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise GradeIn-Memory Computing Summit
 
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of StatelessnessIMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of StatelessnessIn-Memory Computing Summit
 
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...In-Memory Computing Summit
 
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...In-Memory Computing Summit
 
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...
IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...
IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...In-Memory Computing Summit
 
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...In-Memory Computing Summit
 
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...In-Memory Computing Summit
 
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIn-Memory Computing Summit
 
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...In-Memory Computing Summit
 

More from In-Memory Computing Summit (20)

IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
 
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
 
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
 
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
 
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise GradeIMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
 
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
 
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of StatelessnessIMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
 
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
 
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
 
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
 
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
 
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
 
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
IMC Summit 2016 Keynote - Robert Barr - In Memory Computing for Financial Ser...
 
IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...
IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...
IMC Summit 2016 Breakout - Nikita Ivanov - Shared In-Memory RDDs – Missing Li...
 
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
 
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
IMC Summit 2016 Keynote - Jason Stamper - In-Memory: The Foundation of the In...
 
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
 
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
IMCSummit 2016 Keynote - Abe Kleinfeld - The In-Memory Computing Landscape: L...
 

Recently uploaded

How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

IMC Summit 2016 Breakout - Amit Golander - The Benefits of Memory and Storage Convergence to In-Memory Computing

  • 1. IMC BENEFITS FROM MEMORY & STORAGE CONVERGENCE DR. AMIT GOLANDER PLEXISTOR, CTO
  • 2. Data set ABSTRACT In-memory compute gave up on Storage and moved the working set to Memory. This brings tremendous performance gains, but also: 1. Consumes expensive DRAM resources 2. Puts data at risk 3. Suffers from slow recovery time when power failures occur … The big Question: How will IMC look like when Memory and Storage converge? Working set 2
  • 3. Agenda:  History &The convergence of Memory & Storage  Benefits – Out-of-the-box  Benefits –That require some work 3
  • 4. A LONG TIME AGO… Ideal Storage Requirements for Ideal Storage: 1. Low latency reads 2. High volume persistent writes 3. Reasonable cost 4. Transparent & easy to use CostLatency Persistency DRAM HDD SSD Unfortunately such Storage (#2) did not exist Big Data Middleware 4
  • 5. SO MIDDLEWARE DEVELOPERS & USERS COMPROMISED Commit Log Memory Table Storage Table Persistent, Pretty Fast Cheap Fast Search acceleration 1. Storage had Horrible latency for persistent writes, but not as bad if sequentially written 2. So IMC middleware compensated by using: - Sequential writes at the expense of read latency - Async writes at the risk of data loss - Caching like crazy at the expanse of HW cost (DRAM) - Write amplification at the expanse of HW cost (Storage) - Compaction at the expense of HW cost (CPU) Original requirements Vs. IMC reality: 1. Low latency reads 2. High volume (eventual) persistent writes 3. Reasonable cost 4. Transparent & easy to use 5
  • 6. WHAT HAS CHANGED? Memory & Storage are converging:  New HW - Persistent Memory (PM, e.g. NVDIMM-N)  New SW - Software Defined Memory (SDM) Persistency DRAM HDD SSD PM PM+SDM delivers: 1. Low latency reads 2. HighVolume persistent writes 3. Reasonable cost 4. Transparent & easy to use CostLatency SDM SDM-ephemeral delivers: 1. Low latency reads 2. High volume persistent* writes 3. Reasonable cost 4. Transparent & easy to use** * Persistent on orderly shutdowns, not power failures ** Easy to use within share nothing architecturesPersistency DRAM HDD SSD CostLatency SDM-ephemeral 6
  • 7. HOWTO LEVERAGE SDM? SDM Scenario II New Middleware / Some work to existing Scenario I Existing Middleware – Out of the box SDM SDM SDM 7
  • 8. Agenda:  History & the convergence of Memory & Storage  Benefits – Out-of-the-box  Benefits –That require some work 8
  • 9. OUT OF THE BOX INTEGRATION DRAM/PM FLASH DISK I/O Path Memory Path Fast Storage Huge Memory Data Services Virtual MemoryHDFS POSIX Plexistor FS (Multi Tier, DAX) Linux 1. Download & Install SDM 2. Mount m1fs 3. Run your application 9
  • 10. OOB BENEFIT 1: LARGE WORKING SETS Work set 2x Memory size SDM at 17,000 ops/sec XFS at 2,000 ops/sec  Performance is highly sensitive to Working set size > Aggregated memory size  Working set size is dynamic and hard to predict  Large clusters are expensive Cassandra v3.0.2 I2.4xlarge instance on AWS Data set Working set 10
  • 11. OOB BENEFIT 2: PERSISTENCY  Performance is highly sensitive to persistency/durability requirements  Replication/Mirroring between nodes without persistency is vulnerable to Power Failures  Data loss risk is often not well explained. Confusion leads to wasteful behavior (#copies, Network) 0 30,000 60,000 90,000 120,000 150,000 180,000 Ops/sec TheTraditionalTradeoff (B) Balanced (D) Durable MongoDB v3.2 E5-2650v3, CloudSpeed SSD * (*) – This actually writes two persistent copies: in Memory Table and in Commit Log 11
  • 12. OOB BENEFIT 3: LONG RE-BUILD TIMES  Nodes occasionally fail in large clusters  Re-build take many hours to complete due to extra pressure on the storage layer ClientsClients Couchbase server Couchbase server Couchbase server Couchbase server Couchbase server X Couchbase v4.5 beta E5-2650v4, CloudSpeed SSD 12
  • 13. OOB BENEFIT 4: PREDICTABILITY  No hiccups due to separate memory and storage stacks  Highly predictable performance time TPS MySQL v5.6 E5-2680v3, HGST SN150 DB load generator runs at target (not maximal) speed 13
  • 14. Agenda:  History & the convergence of Memory & Storage  Benefits – Out-of-the-box  Benefits –That require some work 14
  • 15. BENEFITS THAT REQUIRE WORK AT THE MIDDLEWARE LAYER  A lot of potential for Fast Queries & Simplicity SDM Storage Big Data middleware File-level FIO E5-2650v3, CloudSpeed SSD 15
  • 16. EXAMPLE - AMPOOL 16 • Fast & Standard access throughout the data pipeline • 56x faster ingest 3-4x faster OLTP&OLAP than HBase 6x faster Spark than Tachyon
  • 17. DESIGNING MIDDLEWARE IN THE SDM ERA 1. Realize that you’re a storage/memory billionaire – focus on your business logic 2. Use standard POSIX API and share files between frameworks (polyglot) 3. Use SDM zero-cost Clones (cp –reflink) 4. Rely on SDM Auto-tiering (If you must – hint via fadvise/madvise) 5. Consider relying on SDM Mirroring capabilities 6. Use SDM monitoring tools to understand your resource consumption 17
  • 18. SUMMARY  Memory and Storage have already started converging (SDM)  IMC best practices are no longer the “best”  SDM provides value to IMC out-of-the-box but There is even greater opportunity for those willing to integrate Efficiency Simplicity 18
  • 19. Q & A Free SDM download - www.plexistor.com/download/ White papers - www.plexistor.com/resources/ Blog - www.plexistor.com/blog/ amit@plexistor.com 19
  • 20. HIGH AVAILABILITY - CLARIFICATION  Almost zero-latency added for having a 2nd copy, providing that high-speed RDMA network is in place  Public cloud deployments – Keep using your current HA strategy  On premise deployments – Can substitute most copies with storage redundancy App server 1 Plexistor SDM App server 2 Plexistor SDM App server N Plexistor SDM High-speed RDMA Open Brick 1 Open Brick M 20
  • 21. SDM VS. XFS-DAX VS. NVML - CLARIFICATION Plexistor ext4/xfs DAX NVML Scale Out Application AutoTiering Application Snapshots/Clones Application Legacy Applications NVML support High availability Application IT policy hooks DRAM/PM Memory Path Virtual MemoryPOSIX FS w/ DAX support* Linux App using mmap App using NVML (*) Who supports DAX: - Plexistor SDM - Linux xfs-dax, and ext4-dax (WIP) - MS ReFS-dax (WIP) 21