SlideShare uma empresa Scribd logo
1 de 17
Cloudian®
S3 Cloud Storage Platform
Case Study:
Implementing Hadoop and Elastic Map
Reduce on Scale-out Object Storage
Paul Turner
Cloudian Inc.
June 11th 2014
About Cloudian
• Hybrid cloud storage startup in Silicon Valley
– Strong venture backing: Goldman Sachs, Intel Capital
– Solid management with storage, big data, enterprise software and telco
expertise
– 50 employees, offices in Foster City, Japan and China
• Production hardened product
• Target market: mid- to large-enterprises & regional service providers
• GTM: traditional storage distribution/VARs
CLOUDIAN PARTNERS
The Challenge
• Business problem = Analysis of log data from our
customer systems to improve support (classic
‘Internet of Things’ content)
• Existing system required transformation of the data
into HDFS for analytics (slow and costly)
Goal : Reduce cost and provide faster results
6/16/2014 3
Use Case : Support Analytics
• Compare system statistics and usage
patterns to previous normal results
6/16/2014 4
Abnormal Operations
Analysis
End User Analysis
to root cause issues
Trend Analysis for
Capacity Planning and
Traffic Patterns
• Identify all operations for a particular user
and review patterns and any faults
• Build capacity and traffic trend lines based
on statistical analysis of all traffic
100tps S3 Server = 83million lines info log = 3.5GB/Day
10 Server System = 35GB/Day ~ 1TB/month
100 Customer Systems => 1.2PB Annually
Traditional Big Data Flow
Event Processing
Platform
Big Data Storage Platform
Analytics PlatformContent Storage
Consumer Activity
(Events, GPS, WiFi)
Social MediaDevice Tracking and Logs
(Event, Configuration, Usage, Performance, )
Real Time
Events
Big Data
Result of analysis
6/16/2014 5
Traditional Big Data Flow
Event Processing
Platform
Analytics Platform
(HDFS)Content
Storage
(Object, NAS)
• Wasted storage = storage for content and analytics
• Transform of data into HDFS can be costly
• High overhead of HDFS (3copy replica) for content which may
be poor quality
Logs, Config
6/16/2014 6
S3 and Hadoop
• Apache Hadoop supports S3 since Jan 2008
– http://wiki.apache.org/hadoop/AmazonS3
• Well-proven by Amazon with Elastic MapReduce
• State-of-the-art and advancing quickly to provide
much easier Hadoop over S3 – e.g. Netflix Genie
– https://github.com/Netflix/genie
6/16/2014 7
Cloudian Approach
Event Processing
Platform
AnalyticsCloudian HyperStore
Storage
• No redundant storage of data
• Hyperstore scales out with your data – adding nodes for I/O
• Analyze more - allows for efficient bulk data analysis in place
• Take advantage of multi-core CPUs – makes sense for MapReduce
• Can feed smarter data for subsequent analytic systems
• Faster time to decision
6/16/2014 8
Cloudian Hadoop Configuration
• Hadoop 2.2
• Configured for native S3 file system (etc/hadoop/core-site.xml)
– S3N native file system for reading and writing regular files on S3. The
advantage of this file system is that you can access files on S3 that were
written with other tools. Conversely, other tools can access files written using
Hadoop.
• Configure Hadoop to use Cloudian (etc/hadoop/jets3t.properties)
– s3service.s3-endpoint=CLOUDIAN_ENDPOINT
– s3service.s3-endpoint-http-port=CLOUDIAN_PORT
6/16/2014 9
Note: you can also dedicate a bucket for Hadoop analytics and then
Hadoop will chunk the content into blocks for storage – like HDFS
S3
NFS
Cloudian HyperStore® Software
 Scalable peer-to-peer architecture
 Multi-data center replication
 Multi-Tenancy and Chargeback
 Hybrid cloud-ready (any S3 cloud)
 100s of supported applications
 Optimized for any workload
 Storage for OpenStack & CloudStack
6/16/2014 10
Elastic, Distributed and Reliable
NOSQL database distributes
and replicates data
Logical Ring
Data is
automatically
replicated to
multiple nodes.
Location of data can be
designated, for instance, to
multiple datacenters and
per rack.
DC1
DC2
In theory, # of nodes in
a logical ring can be up
to 2127 (almost infinite).
Data load can be
rebalanced when a node is
added or removed.
Jun-14
116/16/2014
Enhanced HyperStore® Technology
• Policies tailored for different
object types
• Optimized for all data
• Chunking for better
performance
• Erasure Coding for deep
archive efficiency
• Reliable storage across
multi-node failures
HyperStore
Patent Pending
Small Objects
Large Objects
Active Content
File System
NOSQL DB
Erasure Coding
Deep
Archives
6/16/2014 12
Cloudian Complete S3 API
• Core REST API – Get, Put, Post, Head, Delete
• Multi-part uploads: Allows uploading large objects
in multiple parts
• Versioning: Multiple versions of same object
• Bucket Lifecycle: Auto-expiration using rules
• Server side encryption: Managed by Cloudian
• Location Constraint: Assign data to specific region
(e.g. for HIPAA compliance)
• Bucket Website: Create buckets as websites to
host web content
• Access control lists (ACLs) define access rights to
bucket and object
• And more...
Cloudian Complete S3 API
Products S3 API
Cloudian
AmpliData
Basho
Caringo
Cleversafe
EMC Atmos
NetApp Bycast
Scality
OpenStack Swift
6/16/2014 13
Seamless tiering to Amazon S3, Glacier and
other S3 Service Providers
146/16/2014
• Cloudian deployed as On-Premises
S3 cloud behind the firewall
• Automatically migrates data to AWS
using Bucket Lifecycle Policies
– Optional migration to Glacier
– Metadata maintained for
search/list of objects
• Configurable to reduce
overhead
• Read/Writes to migrated objects
– restore by default, option to
redirect to AWS/S3 Service
Provider
On-Premises S3
S3
Client/Application
Content migrated
or restored via
Bucket Lifecycle
Policies
Option to redirect
migrated content
Amazon S3
Firewall
Amazon Glacier
Big Data Storage Platform
15
Event Processing Platform Big Data Storage Platform
Input I/F Recommend
CEP Engine
Filter Judge Aggregate
Real Time Analysis
Big Data Analysis
Analyze Recommend
Data Analysis and Storage Platform
Content Storage
Consumer Activity
(Events, GPS, WiFi)
Social mediaBusiness Tracking
(goods, inventory, campaign, sales)
Smarter
Business
6/16/2014
Future Work
• Delivery of Cloudian Hadoop-ready
object storage (2HCY14)
• Integration with key Hadoop
distributions
• Locality awareness
• Potentially use new drive technology for
processing (eg HGST Ethernet drive)
• Find out more – Booth 139
6/16/2014 16
Cloudian®
S3 Cloud Storage Platform
Thank You!
Questions?
www.cloudian.com
“The Leading Provider of Hybrid Cloud Storage”

Mais conteúdo relacionado

Mais procurados

Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...Alluxio, Inc.
 
Powering Data Science and AI with Apache Spark, Alluxio, and IBM
Powering Data Science and AI with Apache Spark, Alluxio, and IBMPowering Data Science and AI with Apache Spark, Alluxio, and IBM
Powering Data Science and AI with Apache Spark, Alluxio, and IBMAlluxio, Inc.
 
Presto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data Stores
Presto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data StoresPresto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data Stores
Presto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data StoresAlluxio, Inc.
 
Qubole - Big data in cloud
Qubole - Big data in cloudQubole - Big data in cloud
Qubole - Big data in cloudDmitry Tolpeko
 
ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...
ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...
ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...Alluxio, Inc.
 
Big Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 TelcoBig Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 TelcoBlueData, Inc.
 
Backup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipesBackup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipesLeandro Totino Pereira
 
Azure Big Data Story
Azure Big Data StoryAzure Big Data Story
Azure Big Data StoryLynn Langit
 
The Practice of Presto & Alluxio in E-Commerce Big Data Platform
The Practice of Presto & Alluxio in E-Commerce Big Data PlatformThe Practice of Presto & Alluxio in E-Commerce Big Data Platform
The Practice of Presto & Alluxio in E-Commerce Big Data PlatformAlluxio, Inc.
 
Spark Infrastructure Made Easy
Spark Infrastructure Made EasySpark Infrastructure Made Easy
Spark Infrastructure Made EasyBlueData, Inc.
 
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...Spark Summit
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseData Con LA
 
Introducing Cloudian HyperStore 6.0
Introducing Cloudian HyperStore 6.0Introducing Cloudian HyperStore 6.0
Introducing Cloudian HyperStore 6.0Cloudian
 
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...Data Con LA
 
Presto query optimizer: pursuit of performance
Presto query optimizer: pursuit of performancePresto query optimizer: pursuit of performance
Presto query optimizer: pursuit of performanceDataWorks Summit
 
Move your on prem data to a lake in a Lake in Cloud
Move your on prem data to a lake in a Lake in CloudMove your on prem data to a lake in a Lake in Cloud
Move your on prem data to a lake in a Lake in CloudCAMMS
 
Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016StampedeCon
 

Mais procurados (20)

Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
 
Powering Data Science and AI with Apache Spark, Alluxio, and IBM
Powering Data Science and AI with Apache Spark, Alluxio, and IBMPowering Data Science and AI with Apache Spark, Alluxio, and IBM
Powering Data Science and AI with Apache Spark, Alluxio, and IBM
 
Presto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data Stores
Presto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data StoresPresto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data Stores
Presto: Fast SQL-on-Anything Across Data Lakes, DBMS, and NoSQL Data Stores
 
Qubole - Big data in cloud
Qubole - Big data in cloudQubole - Big data in cloud
Qubole - Big data in cloud
 
ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...
ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...
ALLUXIO (formerly Tachyon): Unify Data at Memory Speed - Effective using Spar...
 
Big Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 TelcoBig Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 Telco
 
Big data in Azure
Big data in AzureBig data in Azure
Big data in Azure
 
Backup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipesBackup multi-cloud solution based on named pipes
Backup multi-cloud solution based on named pipes
 
Architecting a datalake
Architecting a datalakeArchitecting a datalake
Architecting a datalake
 
Azure Big Data Story
Azure Big Data StoryAzure Big Data Story
Azure Big Data Story
 
The Practice of Presto & Alluxio in E-Commerce Big Data Platform
The Practice of Presto & Alluxio in E-Commerce Big Data PlatformThe Practice of Presto & Alluxio in E-Commerce Big Data Platform
The Practice of Presto & Alluxio in E-Commerce Big Data Platform
 
Spark Infrastructure Made Easy
Spark Infrastructure Made EasySpark Infrastructure Made Easy
Spark Infrastructure Made Easy
 
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
HIPAA Compliance in the Cloud
HIPAA Compliance in the CloudHIPAA Compliance in the Cloud
HIPAA Compliance in the Cloud
 
Introducing Cloudian HyperStore 6.0
Introducing Cloudian HyperStore 6.0Introducing Cloudian HyperStore 6.0
Introducing Cloudian HyperStore 6.0
 
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
 
Presto query optimizer: pursuit of performance
Presto query optimizer: pursuit of performancePresto query optimizer: pursuit of performance
Presto query optimizer: pursuit of performance
 
Move your on prem data to a lake in a Lake in Cloud
Move your on prem data to a lake in a Lake in CloudMove your on prem data to a lake in a Lake in Cloud
Move your on prem data to a lake in a Lake in Cloud
 
Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016Introduction to Kudu - StampedeCon 2016
Introduction to Kudu - StampedeCon 2016
 

Destaque

Limewood Event - EMC
Limewood Event - EMC Limewood Event - EMC
Limewood Event - EMC BlueChipICT
 
Design at Scale: A Storage Case Study
Design at Scale: A Storage Case StudyDesign at Scale: A Storage Case Study
Design at Scale: A Storage Case StudyDesignMap
 
Green Networks by Neenu
Green Networks by NeenuGreen Networks by Neenu
Green Networks by NeenuNeenu Ks
 
Twitter case study
Twitter case studyTwitter case study
Twitter case studydivya_binu
 
Microservice architecture case study
Microservice architecture case studyMicroservice architecture case study
Microservice architecture case studyRudra Tripathy
 
Cisco Systems Case Study: The Architecture Review Process Improving the IT P...
Cisco Systems Case Study: The Architecture Review  Process Improving the IT P...Cisco Systems Case Study: The Architecture Review  Process Improving the IT P...
Cisco Systems Case Study: The Architecture Review Process Improving the IT P...Susan Bouchard
 
Green storage
Green storageGreen storage
Green storagemnalls
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDBRavi Teja
 

Destaque (11)

Limewood Event - EMC
Limewood Event - EMC Limewood Event - EMC
Limewood Event - EMC
 
Why consolidation of data centers smart business move
Why consolidation of data centers smart business moveWhy consolidation of data centers smart business move
Why consolidation of data centers smart business move
 
Design at Scale: A Storage Case Study
Design at Scale: A Storage Case StudyDesign at Scale: A Storage Case Study
Design at Scale: A Storage Case Study
 
Green Networks by Neenu
Green Networks by NeenuGreen Networks by Neenu
Green Networks by Neenu
 
Twitter case study
Twitter case studyTwitter case study
Twitter case study
 
Microservice architecture case study
Microservice architecture case studyMicroservice architecture case study
Microservice architecture case study
 
Hadoop Trends
Hadoop TrendsHadoop Trends
Hadoop Trends
 
Cisco Systems Case Study: The Architecture Review Process Improving the IT P...
Cisco Systems Case Study: The Architecture Review  Process Improving the IT P...Cisco Systems Case Study: The Architecture Review  Process Improving the IT P...
Cisco Systems Case Study: The Architecture Review Process Improving the IT P...
 
Green storage
Green storageGreen storage
Green storage
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Semelhante a Case Study: Implementing Hadoop and Elastic Map Reduce on Scale-out Object Storage

Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Alluxio, Inc.
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...Alluxio, Inc.
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAlluxio, Inc.
 
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAccelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAlluxio, Inc.
 
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloadsAlluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloadsAlluxio, Inc.
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFSUSE Italy
 
Shaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric ArchitectureShaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric ArchitectureDenodo
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAlluxio, Inc.
 
Moving data to the cloud BY CESAR ROJAS from Pivotal
Moving data to the cloud BY CESAR ROJAS from PivotalMoving data to the cloud BY CESAR ROJAS from Pivotal
Moving data to the cloud BY CESAR ROJAS from PivotalVMware Tanzu Korea
 
Choosing the Right Data Storage Solution
Choosing the Right Data Storage SolutionChoosing the Right Data Storage Solution
Choosing the Right Data Storage SolutionAmazon Web Services
 
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data AnalyticsApache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data AnalyticsDataWorks Summit
 
Big Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the ExpertsBig Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the ExpertsDataWorks Summit/Hadoop Summit
 
Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015
Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015
Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015Cloud Native Day Tel Aviv
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Cloudera, Inc.
 
Hadoop in the Cloud – The What, Why and How from the Experts
Hadoop in the Cloud – The What, Why and How from the ExpertsHadoop in the Cloud – The What, Why and How from the Experts
Hadoop in the Cloud – The What, Why and How from the ExpertsDataWorks Summit/Hadoop Summit
 
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindDeliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindAvere Systems
 
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...DataWorks Summit
 
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)Amazon Web Services
 

Semelhante a Case Study: Implementing Hadoop and Elastic Map Reduce on Scale-out Object Storage (20)

Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS
 
HDFCloud Workshop: HDF5 in the Cloud
HDFCloud Workshop: HDF5 in the CloudHDFCloud Workshop: HDF5 in the Cloud
HDFCloud Workshop: HDF5 in the Cloud
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAccelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
 
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloadsAlluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
Alluxio 2.0 Deep Dive – Simplifying data access for cloud workloads
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
 
Shaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric ArchitectureShaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric Architecture
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
Moving data to the cloud BY CESAR ROJAS from Pivotal
Moving data to the cloud BY CESAR ROJAS from PivotalMoving data to the cloud BY CESAR ROJAS from Pivotal
Moving data to the cloud BY CESAR ROJAS from Pivotal
 
Choosing the Right Data Storage Solution
Choosing the Right Data Storage SolutionChoosing the Right Data Storage Solution
Choosing the Right Data Storage Solution
 
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data AnalyticsApache Ignite vs Alluxio: Memory Speed Big Data Analytics
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
 
Big Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the ExpertsBig Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the Experts
 
Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015
Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015
Yaron Haviv, Iguaz.io - OpenStack and BigData - OpenStack Israel 2015
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera

 
Hadoop in the Cloud – The What, Why and How from the Experts
Hadoop in the Cloud – The What, Why and How from the ExpertsHadoop in the Cloud – The What, Why and How from the Experts
Hadoop in the Cloud – The What, Why and How from the Experts
 
Hadoop ppt1
Hadoop ppt1Hadoop ppt1
Hadoop ppt1
 
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindDeliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
 
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
 
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
AWS re:Invent 2016: Extending Hadoop and Spark to the AWS Cloud (GPST304)
 

Mais de Cloudian

S3 Deduplication with StorReduce and Cloudian
S3 Deduplication with StorReduce and CloudianS3 Deduplication with StorReduce and Cloudian
S3 Deduplication with StorReduce and CloudianCloudian
 
Dynamic Object Routing
Dynamic Object RoutingDynamic Object Routing
Dynamic Object RoutingCloudian
 
Cloudian and Rubrik - Hybrid Cloud based Disaster Recovery
Cloudian and Rubrik - Hybrid Cloud based Disaster RecoveryCloudian and Rubrik - Hybrid Cloud based Disaster Recovery
Cloudian and Rubrik - Hybrid Cloud based Disaster RecoveryCloudian
 
Network Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Network Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage ServiceNetwork Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Network Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage ServiceCloudian
 
Quick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Quick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage ServiceQuick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Quick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage ServiceCloudian
 
Cloudian HyperStore with IBM Spectrum Protect
Cloudian HyperStore with IBM Spectrum ProtectCloudian HyperStore with IBM Spectrum Protect
Cloudian HyperStore with IBM Spectrum ProtectCloudian
 
Cloudian Object Storage for Accelerite CloudPlatform
Cloudian Object Storage for Accelerite CloudPlatformCloudian Object Storage for Accelerite CloudPlatform
Cloudian Object Storage for Accelerite CloudPlatformCloudian
 
NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...
NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...
NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...Cloudian
 
Cloudian HyperStore Streamlines Scientific Collaboration
Cloudian HyperStore Streamlines Scientific CollaborationCloudian HyperStore Streamlines Scientific Collaboration
Cloudian HyperStore Streamlines Scientific CollaborationCloudian
 
Cloudian HyperStore Features and Benefits
Cloudian HyperStore Features and BenefitsCloudian HyperStore Features and Benefits
Cloudian HyperStore Features and BenefitsCloudian
 
Cloudian HyperStore Enables Healthcare Data Storage
Cloudian HyperStore Enables Healthcare Data StorageCloudian HyperStore Enables Healthcare Data Storage
Cloudian HyperStore Enables Healthcare Data StorageCloudian
 
Data Protection & Ensuring Data Availability with Commvault Next-Generation S...
Data Protection & Ensuring Data Availability with Commvault Next-Generation S...Data Protection & Ensuring Data Availability with Commvault Next-Generation S...
Data Protection & Ensuring Data Availability with Commvault Next-Generation S...Cloudian
 
Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...
Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...
Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...Cloudian
 
Workload Centric Scale-Out Storage for Next Generation Datacenter
Workload Centric Scale-Out Storage for Next Generation DatacenterWorkload Centric Scale-Out Storage for Next Generation Datacenter
Workload Centric Scale-Out Storage for Next Generation DatacenterCloudian
 
Cloudian HyperStore 'Forever Live' Storage Platform
Cloudian HyperStore 'Forever Live' Storage PlatformCloudian HyperStore 'Forever Live' Storage Platform
Cloudian HyperStore 'Forever Live' Storage PlatformCloudian
 
Hadoop and Cloudian HyperStore
Hadoop and Cloudian HyperStoreHadoop and Cloudian HyperStore
Hadoop and Cloudian HyperStoreCloudian
 
Cloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's NewCloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's NewCloudian
 
Object Storage Overview
Object Storage OverviewObject Storage Overview
Object Storage OverviewCloudian
 
How to configure Cloudian HyperStore with Dragon Disk S3 Client
How to configure Cloudian HyperStore with Dragon Disk S3 ClientHow to configure Cloudian HyperStore with Dragon Disk S3 Client
How to configure Cloudian HyperStore with Dragon Disk S3 ClientCloudian
 
Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...
Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...
Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...Cloudian
 

Mais de Cloudian (20)

S3 Deduplication with StorReduce and Cloudian
S3 Deduplication with StorReduce and CloudianS3 Deduplication with StorReduce and Cloudian
S3 Deduplication with StorReduce and Cloudian
 
Dynamic Object Routing
Dynamic Object RoutingDynamic Object Routing
Dynamic Object Routing
 
Cloudian and Rubrik - Hybrid Cloud based Disaster Recovery
Cloudian and Rubrik - Hybrid Cloud based Disaster RecoveryCloudian and Rubrik - Hybrid Cloud based Disaster Recovery
Cloudian and Rubrik - Hybrid Cloud based Disaster Recovery
 
Network Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Network Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage ServiceNetwork Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Network Setup Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
 
Quick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Quick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage ServiceQuick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
Quick-Start Guide: Deploying Your Cloudian HyperStore Hybrid Storage Service
 
Cloudian HyperStore with IBM Spectrum Protect
Cloudian HyperStore with IBM Spectrum ProtectCloudian HyperStore with IBM Spectrum Protect
Cloudian HyperStore with IBM Spectrum Protect
 
Cloudian Object Storage for Accelerite CloudPlatform
Cloudian Object Storage for Accelerite CloudPlatformCloudian Object Storage for Accelerite CloudPlatform
Cloudian Object Storage for Accelerite CloudPlatform
 
NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...
NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...
NTT Communications Delivers Object Storage to Enterprise Cloud Service with C...
 
Cloudian HyperStore Streamlines Scientific Collaboration
Cloudian HyperStore Streamlines Scientific CollaborationCloudian HyperStore Streamlines Scientific Collaboration
Cloudian HyperStore Streamlines Scientific Collaboration
 
Cloudian HyperStore Features and Benefits
Cloudian HyperStore Features and BenefitsCloudian HyperStore Features and Benefits
Cloudian HyperStore Features and Benefits
 
Cloudian HyperStore Enables Healthcare Data Storage
Cloudian HyperStore Enables Healthcare Data StorageCloudian HyperStore Enables Healthcare Data Storage
Cloudian HyperStore Enables Healthcare Data Storage
 
Data Protection & Ensuring Data Availability with Commvault Next-Generation S...
Data Protection & Ensuring Data Availability with Commvault Next-Generation S...Data Protection & Ensuring Data Availability with Commvault Next-Generation S...
Data Protection & Ensuring Data Availability with Commvault Next-Generation S...
 
Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...
Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...
Modernize the Disperse Enterprise with CTERA Cloud Storage Gateways and Cloud...
 
Workload Centric Scale-Out Storage for Next Generation Datacenter
Workload Centric Scale-Out Storage for Next Generation DatacenterWorkload Centric Scale-Out Storage for Next Generation Datacenter
Workload Centric Scale-Out Storage for Next Generation Datacenter
 
Cloudian HyperStore 'Forever Live' Storage Platform
Cloudian HyperStore 'Forever Live' Storage PlatformCloudian HyperStore 'Forever Live' Storage Platform
Cloudian HyperStore 'Forever Live' Storage Platform
 
Hadoop and Cloudian HyperStore
Hadoop and Cloudian HyperStoreHadoop and Cloudian HyperStore
Hadoop and Cloudian HyperStore
 
Cloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's NewCloudian HyperStore 5.0 Release What's New
Cloudian HyperStore 5.0 Release What's New
 
Object Storage Overview
Object Storage OverviewObject Storage Overview
Object Storage Overview
 
How to configure Cloudian HyperStore with Dragon Disk S3 Client
How to configure Cloudian HyperStore with Dragon Disk S3 ClientHow to configure Cloudian HyperStore with Dragon Disk S3 Client
How to configure Cloudian HyperStore with Dragon Disk S3 Client
 
Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...
Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...
Kumo Meets Customer Demand for Cloud Backup with Cloudian Object Storage and ...
 

Último

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Último (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Case Study: Implementing Hadoop and Elastic Map Reduce on Scale-out Object Storage

  • 1. Cloudian® S3 Cloud Storage Platform Case Study: Implementing Hadoop and Elastic Map Reduce on Scale-out Object Storage Paul Turner Cloudian Inc. June 11th 2014
  • 2. About Cloudian • Hybrid cloud storage startup in Silicon Valley – Strong venture backing: Goldman Sachs, Intel Capital – Solid management with storage, big data, enterprise software and telco expertise – 50 employees, offices in Foster City, Japan and China • Production hardened product • Target market: mid- to large-enterprises & regional service providers • GTM: traditional storage distribution/VARs CLOUDIAN PARTNERS
  • 3. The Challenge • Business problem = Analysis of log data from our customer systems to improve support (classic ‘Internet of Things’ content) • Existing system required transformation of the data into HDFS for analytics (slow and costly) Goal : Reduce cost and provide faster results 6/16/2014 3
  • 4. Use Case : Support Analytics • Compare system statistics and usage patterns to previous normal results 6/16/2014 4 Abnormal Operations Analysis End User Analysis to root cause issues Trend Analysis for Capacity Planning and Traffic Patterns • Identify all operations for a particular user and review patterns and any faults • Build capacity and traffic trend lines based on statistical analysis of all traffic 100tps S3 Server = 83million lines info log = 3.5GB/Day 10 Server System = 35GB/Day ~ 1TB/month 100 Customer Systems => 1.2PB Annually
  • 5. Traditional Big Data Flow Event Processing Platform Big Data Storage Platform Analytics PlatformContent Storage Consumer Activity (Events, GPS, WiFi) Social MediaDevice Tracking and Logs (Event, Configuration, Usage, Performance, ) Real Time Events Big Data Result of analysis 6/16/2014 5
  • 6. Traditional Big Data Flow Event Processing Platform Analytics Platform (HDFS)Content Storage (Object, NAS) • Wasted storage = storage for content and analytics • Transform of data into HDFS can be costly • High overhead of HDFS (3copy replica) for content which may be poor quality Logs, Config 6/16/2014 6
  • 7. S3 and Hadoop • Apache Hadoop supports S3 since Jan 2008 – http://wiki.apache.org/hadoop/AmazonS3 • Well-proven by Amazon with Elastic MapReduce • State-of-the-art and advancing quickly to provide much easier Hadoop over S3 – e.g. Netflix Genie – https://github.com/Netflix/genie 6/16/2014 7
  • 8. Cloudian Approach Event Processing Platform AnalyticsCloudian HyperStore Storage • No redundant storage of data • Hyperstore scales out with your data – adding nodes for I/O • Analyze more - allows for efficient bulk data analysis in place • Take advantage of multi-core CPUs – makes sense for MapReduce • Can feed smarter data for subsequent analytic systems • Faster time to decision 6/16/2014 8
  • 9. Cloudian Hadoop Configuration • Hadoop 2.2 • Configured for native S3 file system (etc/hadoop/core-site.xml) – S3N native file system for reading and writing regular files on S3. The advantage of this file system is that you can access files on S3 that were written with other tools. Conversely, other tools can access files written using Hadoop. • Configure Hadoop to use Cloudian (etc/hadoop/jets3t.properties) – s3service.s3-endpoint=CLOUDIAN_ENDPOINT – s3service.s3-endpoint-http-port=CLOUDIAN_PORT 6/16/2014 9 Note: you can also dedicate a bucket for Hadoop analytics and then Hadoop will chunk the content into blocks for storage – like HDFS
  • 10. S3 NFS Cloudian HyperStore® Software  Scalable peer-to-peer architecture  Multi-data center replication  Multi-Tenancy and Chargeback  Hybrid cloud-ready (any S3 cloud)  100s of supported applications  Optimized for any workload  Storage for OpenStack & CloudStack 6/16/2014 10
  • 11. Elastic, Distributed and Reliable NOSQL database distributes and replicates data Logical Ring Data is automatically replicated to multiple nodes. Location of data can be designated, for instance, to multiple datacenters and per rack. DC1 DC2 In theory, # of nodes in a logical ring can be up to 2127 (almost infinite). Data load can be rebalanced when a node is added or removed. Jun-14 116/16/2014
  • 12. Enhanced HyperStore® Technology • Policies tailored for different object types • Optimized for all data • Chunking for better performance • Erasure Coding for deep archive efficiency • Reliable storage across multi-node failures HyperStore Patent Pending Small Objects Large Objects Active Content File System NOSQL DB Erasure Coding Deep Archives 6/16/2014 12
  • 13. Cloudian Complete S3 API • Core REST API – Get, Put, Post, Head, Delete • Multi-part uploads: Allows uploading large objects in multiple parts • Versioning: Multiple versions of same object • Bucket Lifecycle: Auto-expiration using rules • Server side encryption: Managed by Cloudian • Location Constraint: Assign data to specific region (e.g. for HIPAA compliance) • Bucket Website: Create buckets as websites to host web content • Access control lists (ACLs) define access rights to bucket and object • And more... Cloudian Complete S3 API Products S3 API Cloudian AmpliData Basho Caringo Cleversafe EMC Atmos NetApp Bycast Scality OpenStack Swift 6/16/2014 13
  • 14. Seamless tiering to Amazon S3, Glacier and other S3 Service Providers 146/16/2014 • Cloudian deployed as On-Premises S3 cloud behind the firewall • Automatically migrates data to AWS using Bucket Lifecycle Policies – Optional migration to Glacier – Metadata maintained for search/list of objects • Configurable to reduce overhead • Read/Writes to migrated objects – restore by default, option to redirect to AWS/S3 Service Provider On-Premises S3 S3 Client/Application Content migrated or restored via Bucket Lifecycle Policies Option to redirect migrated content Amazon S3 Firewall Amazon Glacier
  • 15. Big Data Storage Platform 15 Event Processing Platform Big Data Storage Platform Input I/F Recommend CEP Engine Filter Judge Aggregate Real Time Analysis Big Data Analysis Analyze Recommend Data Analysis and Storage Platform Content Storage Consumer Activity (Events, GPS, WiFi) Social mediaBusiness Tracking (goods, inventory, campaign, sales) Smarter Business 6/16/2014
  • 16. Future Work • Delivery of Cloudian Hadoop-ready object storage (2HCY14) • Integration with key Hadoop distributions • Locality awareness • Potentially use new drive technology for processing (eg HGST Ethernet drive) • Find out more – Booth 139 6/16/2014 16
  • 17. Cloudian® S3 Cloud Storage Platform Thank You! Questions? www.cloudian.com “The Leading Provider of Hybrid Cloud Storage”