Cassandra Summit 2012 (#cassandra12)



        Using Cassandra in Cloudian,
           an S3 Cloud Storage System

                                  August 8, 2012
                                 Gary Ogasawara
                                   Cloudian, Inc.

                    Copyright © 2012 Cloudian Inc. & KK All Rights Reserved.   Page 1
What is Cloudian?



Cloudian =
           S3 Cloud Storage
           as Packaged Software

Cloudian Features
     1.   Full Amazon S3 API Compatibility, including error codes

     2.   Multi-datacenter, peer-to-peer architecture. No single point of failure.

     3.   Multi-tenant: QoS controls, billing, reporting by each User and each Group

     4.   Public and Private Clouds.

     5.   Elastic capacity: start small and scale out as needed

     6.    System, Group, and User management by Management Console or REST API

     7.   Easy to Use Packaged Software, backed by 24x7 carrier grade support.




Cloudian Objectives
 1. S3 API full compatibility
       • Use S3 ecosystem applications “as is”.
       • API already designed.

 2. Fully packaged software
       • Hide NoSQL complexity
       • Easy install/upgrade
       • HyperStore: Best fit store
       • Easy to deploy on existing hardware/network.
       • Flexible for different customer types.
       • Scalable. Start small and grow.

 3. Complete service platform
       • User/Group Provisioning
       • Cluster Management
       • Reporting
       • Billing
       • Turnkey system.
       • Can choose integration points with existing systems.
Object vs. File vs. Block Storage


    Storage type    Access protocol     Abstraction level
    OBJECTS         HTTP                Application level
    FILES           NAS (NFS, CIFS)     OS user level
    BLOCKS          SAN (iSCSI)         OS kernel level

S3 Ecosystem
         Libraries, applications, gateways, etc. using Amazon S3 can be
         simply re-pointed to Cloudian.
    [Figure: S3 ecosystem spanning public, hybrid, and private clouds]



S3 Functions
    • HTTP REST API. PUT, POST, GET, DELETE, HEAD.
    • Objects organized into buckets.
    • Security. Requests authenticated using keyed HMAC with symmetric keys.
        Also, HTTPS option, client-side encryption, server-side encryption.
    • Access control lists (ACLs) define access rights to bucket and object.
    • Accounting of bytes inbound, outbound, stored and HTTP request counts.
        Billing by tiered rating plans per accounting type, per-region.
    • Multi-part uploads. Allows uploading large objects in multiple parts.
    • Versioning. Multiple versions of same object.
    • Location constraint. Buckets can be assigned to a specific region. Each
        region has own domain.
    • …
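
The keyed-HMAC authentication listed above can be sketched as follows. This is a minimal illustration of the Amazon S3 Signature Version 2 scheme that was current in 2012 (HMAC-SHA1 over a canonical string, Base64-encoded). The access key, secret key, bucket, and object names are made-up placeholders, and the sketch assumes the request carries no x-amz-* headers. The slides do not say how Cloudian's server verifies signatures internally.

```python
import base64
import hashlib
import hmac


def sign_s3_v2(secret_key, verb, resource, date, content_md5="", content_type=""):
    """Return the Base64 HMAC-SHA1 signature for a simple S3 request.

    Assumes no x-amz-* headers; a real client must canonicalize those too.
    """
    string_to_sign = "\n".join([verb, content_md5, content_type, date, resource])
    digest = hmac.new(secret_key.encode("utf-8"),
                      string_to_sign.encode("utf-8"),
                      hashlib.sha1).digest()
    return base64.b64encode(digest).decode("ascii")


def authorization_header(access_key, signature):
    """Build the value of the HTTP Authorization header."""
    return "AWS {}:{}".format(access_key, signature)


# Hypothetical credentials and object path, for illustration only.
ACCESS_KEY = "EXAMPLEACCESSKEY"
SECRET_KEY = "example-secret-key"
REQUEST_DATE = "Wed, 08 Aug 2012 17:00:00 +0000"

signature = sign_s3_v2(SECRET_KEY, "GET", "/mybucket/photo.jpg", REQUEST_DATE)
header = authorization_header(ACCESS_KEY, signature)
```

The server recomputes the same signature from the request and the stored secret key, and rejects the request if the two values differ.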
Works with leading Cloud Compute Platforms
     Cloudian-Citrix CloudStack (May 9, 2012)
     Cloudian-OpenStack (October 21, 2011)




Cloudian Customers




    [Figure: customers grouped by cloud type (public, hybrid, private), plus channel partners]


Why Cassandra?
     Scalable
       • Add capacity by adding nodes to a running system.
       • Distributed (P2P architecture), no single point of failure
     Reliable
       • Resilient to network or hardware failures.
       • Multi-datacenter replication
       • Tunable data consistency level.
     Features
       • TTL, secondary indexes, counters, compression, encryption, …
     Fast
       • Write path especially fast.
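
As a rough illustration of tunable consistency: with replication factor N, a read from R replicas is guaranteed to overlap a write acknowledged by W replicas whenever R + W > N. The helper names below are illustrative and are not part of Cassandra or Hector.

```python
def quorum(replication_factor):
    """Number of replicas in a QUORUM: a strict majority."""
    return replication_factor // 2 + 1


def reads_see_latest_write(n, w, r):
    """True when every read replica set must overlap every write replica set."""
    return r + w > n


N = 3
for name, w, r in [("write ONE / read ONE", 1, 1),
                   ("write QUORUM / read QUORUM", quorum(N), quorum(N)),
                   ("write ALL / read ONE", N, 1)]:
    print(name, reads_see_latest_write(N, w, r))
```

Lower levels trade this overlap guarantee for lower latency and higher availability, which is why the level is chosen per operation.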
Cassandra in Cloudian

    • v1.0.7 in use (started at 0.7.x)
    • Forked to add customizations
    • Hector client
    • Data stored includes:
           • Object metadata
           • Reports/logs
           • Counters for rate control
           •…



Cloudian: Logical Architecture

    [Figure: logical architecture diagram. Recoverable labels:]
    • Web UI: Login (HTTPS), Account profile / Security keys (HTTPS),
      Reports (HTTP), Data Explorer (HTTP)
    • Management Console (servlets), connected over HTTP to the servers
    • Applications: HTTP or HTTPS (S3)
    • Servers: Admin Server, S3 Server, Data Servers
    • Databases: Credentials DB, UserData DB (Cassandra),
      AccountInfo & QoS DB (Cassandra), Reports DB (Cassandra)




Minimum Redundant Configuration

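    [Figure: minimum redundant configuration]
    • Browser requests for the UI arrive over HTTPS; application requests
      for S3 arrive over HTTP/HTTPS.
    • A load balancer (LB) with sticky sessions fronts two identical nodes.
    • Each node runs: Servlets, HTTP/S Server, Credentials DB, Cassandra.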



Multi-Datacenter Example
    2 datacenters / 4 nodes per datacenter

    [Figure: DC1 and DC2, four nodes each. Every node runs CMC,
    S3/Admin/HyperStore, Cassandra, and Redis. One node in DC1 holds the
    Redis master (M); all other nodes hold Redis slaves (S).]

    • Storage objects, reports, and profiles are replicated across DCs by
      Cassandra.
    • Credentials DB (Redis) has a local slave in each DC and a single
      global master.
Network Scaling Example



    [Figure: network scaling across three regions]
    • Region 1: DC 1-1, DC 1-2
    • Region 2: DC 2-1
    • Region 3: DC 3-1, DC 3-2
Cassandra for Object Store

    • Dynamically decide how to store each object (Cassandra or file system).
        • Cassandra better for small objects.
    • Large objects split into multiple parts and chunks.

    [Figure: column family layout. The Random Partitioner places each row
    key; the row holds a column (name, value).]
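
A minimal sketch of the size-based decision described above. The 64 KB threshold, the 1 MB chunk size, and all function names are assumptions chosen for illustration; the slides do not give Cloudian's actual values or logic.

```python
SMALL_OBJECT_LIMIT = 64 * 1024   # assumed threshold, not from the slides
CHUNK_SIZE = 1024 * 1024         # assumed chunk size, not from the slides


def choose_store(size_bytes):
    """Pick a backing store from the object size alone."""
    return "cassandra" if size_bytes <= SMALL_OBJECT_LIMIT else "filesystem"


def split_into_chunks(data, chunk_size=CHUNK_SIZE):
    """Split a byte string into consecutive chunks of at most chunk_size bytes."""
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]


payload = b"x" * (2 * CHUNK_SIZE + 10)
chunks = split_into_chunks(payload)
store = choose_store(len(payload))
```

Here the payload is larger than the assumed threshold, so it would go to the file system in three chunks, the last one holding the 10 leftover bytes.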



Cassandra for Object Metadata

    • Size, Etag, MD5, timestamp, ACL, part info, version, etc.
    • Old versions of metadata format supported.

    [Figure: column family layout. The Random Partitioner places each row
    key; the row holds many columns (name, value), sorted by column name.]
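
To make the row layout concrete, the sketch below models one metadata row as a mapping from column name to value and reads it back in column-name order. The row key format, column names, and values are hypothetical; the slides do not show the real schema.

```python
def sorted_columns(row):
    """Return (name, value) pairs ordered by column name."""
    return sorted(row.items())


# Hypothetical row key and columns, for illustration only.
row_key = "mybucket/photo.jpg"
metadata_row = {
    "size": "1048576",
    "etag": "example-etag",
    "timestamp": "2012-08-08T17:00:00Z",
    "acl": "private",
    "version": "1",
}

column_names = [name for name, _ in sorted_columns(metadata_row)]
```

Because columns within a row are kept sorted, a slice such as "all columns from etag to timestamp" is a contiguous range read.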



Cassandra for Account Info
    DATA MODEL
    • User
           - ID, name, contact info, etc.
    • Group
           - ID, name, contact info, etc.
    • Rating Plan
    • Security Credentials
    • QoS Counters


    NOTES
    • “Static” data. Fixed number of columns.
    • Could be put in a relational DB like MySQL, but no need to add another
      component.

Quality of Service / SLA Management
 • Configurable maximum limits per region at the per-user, per-group, and
   system level.
        •   Requests/minute
        •   Storage bytes
        •   Storage objects
        •   Data Bytes Inbound
        •   Data Bytes Outbound
 • Once a limit is reached, further requests are rejected.
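
The limit check can be sketched as below for the requests/minute case. This is a minimal in-memory illustration with hypothetical class and field names; a real deployment would keep the counters in Cassandra (the deck mentions counters for rate control) rather than in a Python dict.

```python
from collections import defaultdict


class QosLimiter:
    """Rejects requests once a per-minute limit is reached at any level."""

    def __init__(self, limits):
        # limits maps a level ("user", "group", "system") to max requests/minute.
        self.limits = limits
        self.counts = defaultdict(int)

    def allow(self, minute, user, group):
        keys = {
            "user": ("user", user, minute),
            "group": ("group", group, minute),
            "system": ("system", None, minute),
        }
        # Check every level first so a rejected request is not counted anywhere.
        for level, key in keys.items():
            if level in self.limits and self.counts[key] >= self.limits[level]:
                return False
        for key in keys.values():
            self.counts[key] += 1
        return True


limiter = QosLimiter({"user": 2, "group": 100, "system": 1000})
results = [limiter.allow(0, "alice", "eng") for _ in range(3)]
```

With a user limit of 2 requests per minute, the third request from the same user in the same minute is rejected, while other users and later minutes are unaffected.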




Cassandra for Reports
    DATA MODEL
    • “Raw” column family
           -   User, Group, System
           -   Transaction type (HTTP GET, PUT, DELETE)
           -   Object path
           -   Size
           -   …
    • “Rollup” column families
           -   RollupHour. Summarizes data for each hour using Raw data.
           -   RollupDay. Summarizes data for each day using RollupHour data.
           -   RollupMonth. Summarizes data for each month using RollupDay data.


    NOTES
    • High write rate. Low read rate.
    • Rollup tables used for direct queries.
    • Automatic deletion using Cassandra TTL (time-to-live).
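
The rollup chain for reports can be sketched as below. The record layout (timestamp, user, bytes) is a simplified stand-in for the Raw column family, and the function is illustrative only; it does not reflect Cloudian's actual schema or job scheduling.

```python
from collections import defaultdict
from datetime import datetime


def rollup(records, bucket_format):
    """Sum byte counts per (time bucket, user).

    records: iterable of (datetime, user, size_bytes)
    bucket_format: strftime pattern that truncates the timestamp
    """
    totals = defaultdict(int)
    for ts, user, size in records:
        # Format then re-parse to truncate the timestamp to the bucket start.
        bucket = datetime.strptime(ts.strftime(bucket_format), bucket_format)
        totals[(bucket, user)] += size
    return [(bucket, user, total)
            for (bucket, user), total in sorted(totals.items())]


raw = [
    (datetime(2012, 8, 8, 10, 5), "alice", 100),
    (datetime(2012, 8, 8, 10, 40), "alice", 50),
    (datetime(2012, 8, 8, 11, 15), "alice", 25),
    (datetime(2012, 8, 9, 9, 0), "alice", 10),
]
hourly = rollup(raw, "%Y-%m-%d %H")   # RollupHour, built from Raw
daily = rollup(hourly, "%Y-%m-%d")    # RollupDay, built from RollupHour
monthly = rollup(daily, "%Y-%m")      # RollupMonth, built from RollupDay
```

Each level reads only the level below it, so queries against the rollups never scan the high-volume raw data.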
Cassandra: Wish List
    1. Repair
           •   Slow, impact on performance, difficult to monitor progress, manual
               operator action required.
    2. Compaction
           •   Heavy performance impact. Hard to tune. Capacity planning difficult.
    3. Schema changes
           •   Fixed in 1.1.
    4. Large column slices.
    5. Caches (row and key) not useful. Slower performance, large
       memory use.
    6. JMX too slow. Need to directly use and expose Java interfaces.

HyperStore™
 HyperStore: Management policies tailored for different object types.

    • Object metadata is still stored in Cassandra.
    • Use Cassandra’s distributed systems methods for data partitioning,
      replication, node health detection.
    • Fork Cassandra source for customizations.

 Benefits:
    • Better performance
    • More capacity per node
    • Higher disk utilization
    • Storage layer flexibility

 [Figure: Cloudian S3 Storage Server. Labeled components: NFS, S3 REST API,
 HyperStore Manager, Admin, Credentials, Reporting (Cassandra),
 Accounting (Cassandra), Data Store (Cassandra), Data Store (File System).]
HyperStore: Hybrid Storage Example

    [Figure: performance metric U plotted against X for Storage 1,
    Storage 2, and the optimal mixture]

    • Optimal solution is to choose the storage method that minimizes latency.
    • Generally, you want to maximize or minimize U, a performance metric,
      based on random variables X, using a mixture of N storage layers.
    • In a simple case,
          U : average latency
          X = {object size}
          N = {cassandra, ext4 fs}.
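
For the simple case above, the choice reduces to picking, for each object size, the storage layer with the lowest expected latency. The latency models below are invented placeholders with a plausible shape (Cassandra cheaper for small objects, the file system cheaper for large ones); they are not measurements from the talk.

```python
# Hypothetical latency models: milliseconds as a function of object size in KB.
LATENCY_MODELS = {
    "cassandra": lambda size_kb: 2.0 + 0.08 * size_kb,
    "ext4 fs": lambda size_kb: 6.0 + 0.02 * size_kb,
}


def best_store(size_kb, models=LATENCY_MODELS):
    """Return the storage layer with the lowest modeled latency U for size X."""
    return min(models, key=lambda name: models[name](size_kb))


for size_kb in (0.5, 5, 50, 500):
    print(size_kb, "KB ->", best_store(size_kb))
```

With these assumed models the crossover sits near 67 KB; in practice the models would be fitted from measured latencies for each storage layer.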

HyperStore: Faster Read & Writes
    [Chart: PUT latency (ms) vs. object size (0.5, 5, 50, 500 KB),
    PUT-Cass vs. PUT-HS. HyperStore is >30% faster.]
    [Chart: GET latency (ms) vs. object size (0.5, 5, 50, 500 KB),
    GET-Cass vs. GET-HS. HyperStore is >400% faster.]
HyperStore: Less Compaction
                         No HyperStore                          With HyperStore
                     PUT      GET      LIST    DELETE       PUT      GET      LIST    DELETE
    Operations       50478    1679     3642    422          50559    9195     3575    2224
    Latency (msec)   149.78   314.80   41.60   34.50        96.64    35.63    28.14   23.93

    [Charts for each configuration: iostat % utilization; io read/write (MB)]

    Test load: 20 tps, 10 threads, 2MB data
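
The relative latency change implied by the table above can be computed directly from its figures. The snippet only restates the slide's numbers; the function name is illustrative.

```python
# (latency without HyperStore, latency with HyperStore) in msec, from the table.
latency_ms = {
    "PUT": (149.78, 96.64),
    "GET": (314.80, 35.63),
    "LIST": (41.60, 28.14),
    "DELETE": (34.50, 23.93),
}


def reduction_pct(before, after):
    """Percentage by which latency dropped relative to the original value."""
    return (before - after) / before * 100.0


for op, (before, after) in latency_ms.items():
    print("{}: {:.1f}% lower latency with HyperStore".format(
        op, reduction_pct(before, after)))
```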
Finally

    Cassandra and other enabling technologies have “leveled the playing
    field” for cloud storage providers.

    Info: www.cloudian.com
              Download trial version.

              Coming soon:



                       #1 best seller in “Database” category on amazon.co.jp.



Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Hadoop and Cloudian HyperStore
Hadoop and Cloudian HyperStoreHadoop and Cloudian HyperStore
Hadoop and Cloudian HyperStore
 
Webinar: Sizing Up Object Storage for the Enterprise
Webinar: Sizing Up Object Storage for the EnterpriseWebinar: Sizing Up Object Storage for the Enterprise
Webinar: Sizing Up Object Storage for the Enterprise
 
Databricks secure deployments and security baselines, doug march 2022
Databricks secure deployments and security baselines, doug march 2022Databricks secure deployments and security baselines, doug march 2022
Databricks secure deployments and security baselines, doug march 2022
 
Cloud - aber sicher
Cloud - aber sicherCloud - aber sicher
Cloud - aber sicher
 
AWS Partner Presentation-Symantec-AWS Cloud Storage for the Enterprise 2012
AWS Partner Presentation-Symantec-AWS Cloud Storage for the Enterprise 2012AWS Partner Presentation-Symantec-AWS Cloud Storage for the Enterprise 2012
AWS Partner Presentation-Symantec-AWS Cloud Storage for the Enterprise 2012
 
Apache Ignite: In-Memory Hammer for Your Data Science Toolkit
Apache Ignite: In-Memory Hammer for Your Data Science ToolkitApache Ignite: In-Memory Hammer for Your Data Science Toolkit
Apache Ignite: In-Memory Hammer for Your Data Science Toolkit
 
Big SQL: Powerful SQL Optimization - Re-Imagined for open source
Big SQL: Powerful SQL Optimization - Re-Imagined for open sourceBig SQL: Powerful SQL Optimization - Re-Imagined for open source
Big SQL: Powerful SQL Optimization - Re-Imagined for open source
 
Loading data into Apache Ignite
Loading data into Apache IgniteLoading data into Apache Ignite
Loading data into Apache Ignite
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartch
 
Cloudian and Rubrik - Hybrid Cloud based Disaster Recovery
Cloudian and Rubrik - Hybrid Cloud based Disaster RecoveryCloudian and Rubrik - Hybrid Cloud based Disaster Recovery
Cloudian and Rubrik - Hybrid Cloud based Disaster Recovery
 
Azure Storage – Foundation for Building Secure, Scalable Cloud Applications
Azure Storage – Foundation for Building Secure, Scalable Cloud ApplicationsAzure Storage – Foundation for Building Secure, Scalable Cloud Applications
Azure Storage – Foundation for Building Secure, Scalable Cloud Applications
 
Multi-Tenant Operations with Cloudera 5.7 & BT
Multi-Tenant Operations with Cloudera 5.7 & BTMulti-Tenant Operations with Cloudera 5.7 & BT
Multi-Tenant Operations with Cloudera 5.7 & BT
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
 
Storage As A Service (StAAS)
Storage As A Service (StAAS)Storage As A Service (StAAS)
Storage As A Service (StAAS)
 
Big Data Security with Hadoop
Big Data Security with HadoopBig Data Security with Hadoop
Big Data Security with Hadoop
 
Big Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 TelcoBig Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 Telco
 
Enterprise File Share and Sync with CleverSafe
Enterprise File Share and Sync with CleverSafeEnterprise File Share and Sync with CleverSafe
Enterprise File Share and Sync with CleverSafe
 
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
 
Unlock Hadoop Success with Cloudera Navigator Optimizer
Unlock Hadoop Success with Cloudera Navigator OptimizerUnlock Hadoop Success with Cloudera Navigator Optimizer
Unlock Hadoop Success with Cloudera Navigator Optimizer
 

Destaque

What, beyond the curriculum, does your school teach? Administrators Conferen...
What, beyond the curriculum, does your school teach?  Administrators Conferen...What, beyond the curriculum, does your school teach?  Administrators Conferen...
What, beyond the curriculum, does your school teach? Administrators Conferen...
slater_45
 
Four Tips to improve & analyse your Sales Performance Scorecard Effectively
Four Tips to improve & analyse your Sales Performance Scorecard EffectivelyFour Tips to improve & analyse your Sales Performance Scorecard Effectively
Four Tips to improve & analyse your Sales Performance Scorecard Effectively
Andre Vonk
 

Destaque (20)

クラウドストレージの基礎知識(Cloudian white paper)
クラウドストレージの基礎知識(Cloudian white paper)クラウドストレージの基礎知識(Cloudian white paper)
クラウドストレージの基礎知識(Cloudian white paper)
 
Cloudian update (Japanese:日本語)
Cloudian update (Japanese:日本語)Cloudian update (Japanese:日本語)
Cloudian update (Japanese:日本語)
 
Cloudian presentation for OSCA
Cloudian presentation for OSCA Cloudian presentation for OSCA
Cloudian presentation for OSCA
 
Cloudian presentation for Cassandra Conference 2012 in Tokyo
Cloudian presentation for Cassandra Conference 2012 in TokyoCloudian presentation for Cassandra Conference 2012 in Tokyo
Cloudian presentation for Cassandra Conference 2012 in Tokyo
 
Cloudian presentation at idc japan sv2016
Cloudian presentation at idc japan sv2016Cloudian presentation at idc japan sv2016
Cloudian presentation at idc japan sv2016
 
What, beyond the curriculum, does your school teach? Administrators Conferen...
What, beyond the curriculum, does your school teach?  Administrators Conferen...What, beyond the curriculum, does your school teach?  Administrators Conferen...
What, beyond the curriculum, does your school teach? Administrators Conferen...
 
Bena Brochure
Bena BrochureBena Brochure
Bena Brochure
 
Twitter terms of service
Twitter terms of serviceTwitter terms of service
Twitter terms of service
 
Cloudian update at cloudian seminar 2014
Cloudian update at cloudian seminar 2014Cloudian update at cloudian seminar 2014
Cloudian update at cloudian seminar 2014
 
Seasons Greetings Presentation
Seasons Greetings PresentationSeasons Greetings Presentation
Seasons Greetings Presentation
 
Cloudian meets CloudStack
Cloudian meets CloudStackCloudian meets CloudStack
Cloudian meets CloudStack
 
Dostoevskiy unizennie
Dostoevskiy unizennieDostoevskiy unizennie
Dostoevskiy unizennie
 
Solgenicin
SolgenicinSolgenicin
Solgenicin
 
Cardionics EKG Netwerk project cardionics.be presentatie doc nl
Cardionics EKG  Netwerk project cardionics.be presentatie   doc nlCardionics EKG  Netwerk project cardionics.be presentatie   doc nl
Cardionics EKG Netwerk project cardionics.be presentatie doc nl
 
Cloudian next steps at cloudian seminar 2014
Cloudian next steps at cloudian seminar 2014Cloudian next steps at cloudian seminar 2014
Cloudian next steps at cloudian seminar 2014
 
Pp no 26 tahun 2008
Pp no 26 tahun 2008Pp no 26 tahun 2008
Pp no 26 tahun 2008
 
Eleven Tips To Build Customer Loyalty and Satisfaction
Eleven Tips To Build Customer Loyalty and SatisfactionEleven Tips To Build Customer Loyalty and Satisfaction
Eleven Tips To Build Customer Loyalty and Satisfaction
 
Informe EPT 2009 Resum en català
Informe EPT 2009 Resum en catalàInforme EPT 2009 Resum en català
Informe EPT 2009 Resum en català
 
Usa
UsaUsa
Usa
 
Four Tips to improve & analyse your Sales Performance Scorecard Effectively
Four Tips to improve & analyse your Sales Performance Scorecard EffectivelyFour Tips to improve & analyse your Sales Performance Scorecard Effectively
Four Tips to improve & analyse your Sales Performance Scorecard Effectively
 

Semelhante a Cloudian_Cassandra Summit 2012

AWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloudAWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloud
Amazon Web Services
 
Netflix web-adrian-qcon
Netflix web-adrian-qconNetflix web-adrian-qcon
Netflix web-adrian-qcon
Yiwei Ma
 
Inaugural address manjusha - Indicthreads cloud computing conference 2011
Inaugural address manjusha -  Indicthreads cloud computing conference 2011Inaugural address manjusha -  Indicthreads cloud computing conference 2011
Inaugural address manjusha - Indicthreads cloud computing conference 2011
IndicThreads
 
Cloud computing
Cloud computingCloud computing
Cloud computing
Med Zaibi
 
Mon1420 build clouds-oliviermaes-citrix
Mon1420 build clouds-oliviermaes-citrixMon1420 build clouds-oliviermaes-citrix
Mon1420 build clouds-oliviermaes-citrix
eurocloud
 

Semelhante a Cloudian_Cassandra Summit 2012 (20)

Cloudian at cassandra conference in tokyo
Cloudian at cassandra conference in tokyoCloudian at cassandra conference in tokyo
Cloudian at cassandra conference in tokyo
 
AWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloudAWS Summit 2011: Architecting in the cloud
AWS Summit 2011: Architecting in the cloud
 
Netflix and Open Source
Netflix and Open SourceNetflix and Open Source
Netflix and Open Source
 
Netflix Global Cloud Architecture
Cloudian_Cassandra Summit 2012

  • 1. Cassandra Summit 2012 (#cassandra12): Using Cassandra in Cloudian, an S3 Cloud Storage System. August 8, 2012. Gary Ogasawara, Cloudian, Inc. Copyright © 2012 Cloudian Inc. & KK All Rights Reserved.
  • 2. What is Cloudian? Cloudian = S3 Cloud Storage as Packaged Software.
  • 3. Cloudian Features
      1. Full Amazon S3 API compatibility, including error codes.
      2. Multi-datacenter, peer-to-peer architecture. No single point of failure.
      3. Multi-tenant: QoS controls, billing, and reporting by each User and each Group.
      4. Public and private clouds.
      5. Elastic capacity: small start and scale-out as needed.
      6. System, Group, and User management by Management Console or REST API.
      7. Easy-to-use packaged software, backed by 24x7 carrier-grade support.
  • 4. Cloudian Objectives
      1. S3 API full compatibility
          • Use S3 ecosystem applications “as is”.
          • API already designed.
      2. Fully packaged software
          • Hide NoSQL complexity.
          • Easy install/upgrade.
          • HyperStore: best-fit store.
          • Easy to deploy on existing hardware/network.
          • Flexible for different customer types.
          • Scalable. Start small and grow.
      3. Complete service platform
          • User/Group provisioning.
          • Cluster management.
          • Reporting.
          • Billing.
          • Turnkey system.
          • Can choose integration points with existing systems.
  • 5. Object vs. File vs. Block Storage (by abstraction level)
      • Application level: HTTP, objects.
      • OS user level: NAS (NFS, CIFS), files.
      • OS kernel level: SAN (iSCSI), blocks.
  • 6. S3 Ecosystem
      • Libraries, applications, gateways, etc. using Amazon S3 can be simply re-pointed to Cloudian.
      • [Figure labels: Public, Hybrid, Private]
  • 7. S3 Functions
      • HTTP REST API: PUT, POST, GET, DELETE, HEAD.
      • Objects organized into buckets.
      • Security: requests authenticated using keyed HMAC with symmetric keys. Also an HTTPS option, client-side encryption, and server-side encryption.
      • Access control lists (ACLs) define access rights to buckets and objects.
      • Accounting of bytes inbound, outbound, and stored, and of HTTP request counts. Billing by tiered rating plans per accounting type, per region.
      • Multi-part uploads: allow uploading large objects in multiple parts.
      • Versioning: multiple versions of the same object.
      • Location constraint: buckets can be assigned to a specific region. Each region has its own domain.
      • …
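The keyed-HMAC request authentication mentioned on the S3 Functions slide can be sketched as follows. This is a minimal illustration, assuming the Amazon S3 "signature version 2" style (HMAC-SHA1 over a newline-joined string-to-sign, then Base64); the slides do not say which variant Cloudian implements, the `x-amz-*` header canonicalization is omitted, and the function names `sign_request` and `verify_request` are hypothetical.

```python
import base64
import hashlib
import hmac


def sign_request(secret_key: str, method: str, date: str, resource: str,
                 content_md5: str = "", content_type: str = "") -> str:
    """Return a Base64 HMAC-SHA1 signature over a simplified string-to-sign.

    Assumption: S3 signature-v2 layout without canonicalized x-amz-* headers.
    """
    string_to_sign = "\n".join([method, content_md5, content_type, date, resource])
    digest = hmac.new(secret_key.encode("utf-8"),
                      string_to_sign.encode("utf-8"),
                      hashlib.sha1).digest()
    return base64.b64encode(digest).decode("ascii")


def verify_request(secret_key: str, signature: str, method: str, date: str,
                   resource: str, content_md5: str = "",
                   content_type: str = "") -> bool:
    """Server side: recompute the signature with the shared key and compare."""
    expected = sign_request(secret_key, method, date, resource,
                            content_md5, content_type)
    # Constant-time comparison avoids leaking how many characters matched.
    return hmac.compare_digest(expected, signature)


if __name__ == "__main__":
    date = "Wed, 08 Aug 2012 00:00:00 GMT"
    sig = sign_request("example-secret", "GET", date, "/bucket/key")
    print(verify_request("example-secret", sig, "GET", date, "/bucket/key"))
```

Because the key is symmetric, the client and the server run the same computation; any change to the method, date, or resource invalidates the signature.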
  • 8. Works with Leading Cloud Compute Platforms
      • Cloudian-Citrix CloudStack (May 9, 2012).
      • Cloudian-OpenStack (October 21, 2011).
  • 9. Cloudian Customers
      • [Logo slide; labels: Public, Hybrid, Private, Channel Partners]
  • 10. Why Cassandra?
      • Scalable
          • Add capacity by adding nodes to a running system.
          • Distributed (P2P architecture), no single point of failure.
      • Reliable
          • Resilient to network or hardware failures.
          • Multi-datacenter replication.
          • Tuneable data consistency level.
      • Features
          • TTL, secondary indexes, counters, compression, encryption, …
      • Fast
          • Write path especially fast.
  • 11. Cassandra in Cloudian
      • v1.0.7 in use (started at 0.7.x).
      • Forked to add customizations.
      • Hector client.
      • Data stored includes:
          • Object metadata
          • Reports/logs
          • Counters for rate control
          • …
  • 12. Cloudian: Logical Architecture
      • [Diagram. Components: Web UI (Management Console, Data Explorer); Applications; Admin Server (servlets); S3 Server (servlets); Credentials DB (login, account profile, security keys); UserData DB; AccountInfo & QoS DB (Cassandra); Reports DB (Cassandra); Data Servers (S3). Connections are labeled HTTP or HTTPS.]
  • 13. Minimum Redundant Configuration
      • [Diagram. A browser sends HTTPS requests for the UI (sticky sessions) and an application sends HTTP/HTTPS requests for S3 through a load balancer (LB) to two servers. Each server runs the servlets, a Credentials DB, and Cassandra.]
  • 14. Multi-Datacenter Example
      • 2 datacenters (DC1, DC2), 4 nodes per datacenter.
      • Each node runs CMC, Redis, S3/Admin, and Cassandra/HyperStore. One node runs the Redis master (M); the other seven run Redis slaves (S).
      • Storage objects, reports, and profiles are replicated across DCs by Cassandra.
      • Credentials DB (Redis) has a local DC slave and a single global master.
  • 15. Network Scaling Example
      • [Diagram. Region 1: DC 1-1, DC 1-2. Region 2: DC 2-1. Region 3: DC 3-1, DC 3-2.]
  • 16. Cassandra for Object Store
      • Dynamically decide how to store each object (Cassandra or file system).
      • Cassandra is better for small objects.
      • Large objects are split into multiple parts and chunks.
      • [Diagram labels: Random Partitioner, row key, column name, value, column family]
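Splitting a large object into chunks stored as columns under one row key, as the slide above describes, might look roughly like this. It is a sketch only: the slides do not give Cloudian's chunk size or column naming, so the `chunk-00000000` naming scheme and the helper names are hypothetical. Zero-padded names are used so that sorting by column name returns the chunks in order.

```python
def split_into_chunks(object_key: str, data: bytes, chunk_size: int):
    """Return (row_key, columns) where columns is a list of (name, value).

    Assumption: one row per object, one column per chunk, with zero-padded
    column names so lexical order equals chunk order.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    columns = []
    for index, offset in enumerate(range(0, len(data), chunk_size)):
        column_name = f"chunk-{index:08d}"
        columns.append((column_name, data[offset:offset + chunk_size]))
    return object_key, columns


def reassemble(columns) -> bytes:
    """Rebuild the object by concatenating chunk values in column-name order."""
    return b"".join(value for _, value in sorted(columns))


if __name__ == "__main__":
    payload = bytes(range(256)) * 10  # 2560 bytes of sample data
    row_key, cols = split_into_chunks("bucket/photo.jpg", payload, 1000)
    print(row_key, [name for name, _ in cols])
    print(reassemble(cols) == payload)
```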
  • 17. Cassandra for Object Metadata
      • Size, ETag, MD5, timestamp, ACL, part info, version, etc.
      • Old versions of the metadata format are supported.
      • [Diagram labels: Random Partitioner, row key, column family with multiple column name/value pairs, sorted by column name]
  • 18. Cassandra for Account Info
      • Data model
          • User: ID, name, contact info, etc.
          • Group: ID, name, contact info, etc.
          • Rating Plan
          • Security Credentials
          • QoS Counters
      • Notes
          • “Static” data. Fixed number of columns.
          • Could be put in a relational DB like MySQL, but there is no need to add another component.
  • 19. Quality of Service / SLA Management
      • Configurable maximum limits per region at the per-user, per-group, and system levels:
          • Requests/minute
          • Storage bytes
          • Storage objects
          • Data bytes inbound
          • Data bytes outbound
      • While a limit is reached, requests are rejected.
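The requests/minute limit on the QoS slide can be illustrated with a fixed-window counter. This is a simplified sketch: an in-memory dictionary stands in for the Cassandra counters the slides mention, only the per-user requests/minute limit is modeled, and the class name `RequestRateLimiter` is hypothetical.

```python
from collections import defaultdict


class RequestRateLimiter:
    """Fixed-window requests/minute limiter keyed by user.

    Assumption: an in-memory dict replaces the distributed counters used
    in a real deployment.
    """

    def __init__(self, max_requests_per_minute: int):
        self.max_requests_per_minute = max_requests_per_minute
        self.counts = defaultdict(int)  # (user_id, minute_bucket) -> count

    def allow(self, user_id: str, now_seconds: float) -> bool:
        """Count the request and return True, or return False if over limit."""
        bucket = (user_id, int(now_seconds // 60))
        if self.counts[bucket] >= self.max_requests_per_minute:
            return False  # limit reached for this minute: reject
        self.counts[bucket] += 1
        return True


if __name__ == "__main__":
    limiter = RequestRateLimiter(max_requests_per_minute=2)
    for t in (0, 10, 59, 60):
        print(t, limiter.allow("alice", t))
```

Rejected requests are not counted, so a user regains access as soon as the next minute window starts.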
  • 20. Cassandra for Reports
      • Data model
          • “Raw” column family
              - User, Group, System
              - Transaction type (HTTP GET, PUT, DELETE), …
              - Object path
              - Size
              - …
          • “Rollup” column families
              - RollupHour: summarizes data for each hour using Raw data.
              - RollupDay: summarizes data for each day using RollupHour data.
              - RollupMonth: summarizes data for each month using RollupDay data.
      • Notes
          • High write rate. Low read rate.
          • Rollup tables are used for direct queries.
          • Automatic deletion using Cassandra TTL (time-to-live).
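The cascading rollups on the Reports slide (Raw to RollupHour to RollupDay) can be sketched with one aggregation function applied twice. The record fields (`user`, `op`, `ts`, `bytes`, `requests`) are hypothetical, plain Python lists stand in for the column families, and TTL-based expiry is not modeled.

```python
from collections import defaultdict


def rollup(records, bucket_seconds: int):
    """Sum request counts and bytes per (user, op, time bucket).

    Raw records count as one request each; already-rolled-up records
    carry their own "requests" total.
    """
    totals = defaultdict(lambda: {"requests": 0, "bytes": 0})
    for rec in records:
        bucket_start = rec["ts"] - rec["ts"] % bucket_seconds
        key = (rec["user"], rec["op"], bucket_start)
        totals[key]["requests"] += rec.get("requests", 1)
        totals[key]["bytes"] += rec["bytes"]
    return [
        {"user": user, "op": op, "ts": ts, **values}
        for (user, op, ts), values in sorted(totals.items())
    ]


if __name__ == "__main__":
    raw = [
        {"user": "alice", "op": "PUT", "ts": 10, "bytes": 100},
        {"user": "alice", "op": "PUT", "ts": 20, "bytes": 50},
        {"user": "alice", "op": "PUT", "ts": 3700, "bytes": 25},
        {"user": "alice", "op": "GET", "ts": 30, "bytes": 100},
    ]
    hourly = rollup(raw, 3600)      # RollupHour from Raw
    daily = rollup(hourly, 86400)   # RollupDay from RollupHour
    print(hourly)
    print(daily)
```

Building each level from the one below keeps the expensive scan over raw rows to a single pass per hour, which suits the high-write, low-read pattern noted on the slide.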
  • 21. Cassandra: Wish List
      1. Repair: slow, impact on performance, difficult to monitor progress, manual operator action required.
      2. Compaction: heavy performance impact. Hard to tune. Capacity planning difficult.
      3. Schema changes: fixed in 1.1.
      4. Large column slices.
      5. Caches (row and key) not useful. Slower performance, large memory use.
      6. JMX too slow. Need to directly use and expose Java interfaces.
  • 22. HyperStore™
      • HyperStore: management policies tailored for different object types.
          • Object metadata is still stored in Cassandra.
          • Use Cassandra’s distributed systems methods for data partitioning, replication, and node health detection.
          • Fork Cassandra source for customizations.
      • Benefits:
          • Better performance
          • More capacity per node
          • Higher disk utilization
          • Storage layer flexibility
      • [Diagram: Cloudian S3 Storage Server. Labels: Admin, NFS, Credentials, S3 REST API, Reporting (Cassandra), Accounting (Cassandra), HyperStore Manager, Data Store (Cassandra), Data Store (File System)]
  • 23. HyperStore: Hybrid Storage Example
      • [Plot: U against X for Storage 1 and Storage 2, with the optimal choice marked]
      • The optimal solution is to choose the storage method that minimizes latency.
      • Generally, you want to maximize/minimize U, a performance metric, based on random variables X using a mixture of N storage layers.
      • In a simple case:
          • U: average latency
          • X = {object size}
          • N = {cassandra, ext4 fs}
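The simple case on the slide above (U = average latency, X = object size, N = {cassandra, ext4 fs}) amounts to picking the layer with the lowest predicted latency for each object. The sketch below assumes a latency model per layer is available as a function of size; the linear models in the usage example are made-up placeholders, not measurements from the talk, and `choose_storage_layer` is a hypothetical name.

```python
from typing import Callable, Dict


def choose_storage_layer(object_size_kb: float,
                         latency_models: Dict[str, Callable[[float], float]]) -> str:
    """Return the layer whose model predicts the lowest latency for this size."""
    if not latency_models:
        raise ValueError("at least one storage layer is required")
    return min(latency_models,
               key=lambda layer: latency_models[layer](object_size_kb))


if __name__ == "__main__":
    # Placeholder models (ms as a function of KB), for illustration only:
    # low fixed cost but steep growth vs. higher fixed cost but flat growth.
    toy_models = {
        "cassandra": lambda kb: 2.0 + 0.10 * kb,
        "ext4": lambda kb: 8.0 + 0.01 * kb,
    }
    for size in (0.5, 5, 50, 500):
        print(size, choose_storage_layer(size, toy_models))
```

With these toy models small objects go to Cassandra and large ones to the file system; in practice the models would be fitted from measured latencies.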
  • 24. HyperStore: Faster Reads & Writes
      • [Chart: PUT latency (ms) vs. object size (0.5, 5, 50, 500 KB); series PUT-Cass and PUT-HS; annotation: >30% faster]
      • [Chart: GET latency (ms) vs. object size (0.5, 5, 50, 500 KB); series GET-Cass and GET-HS; annotation: >400% faster]
  • 25. HyperStore: Less Compaction
      • Test load: 20 tps, 10 threads, 2 MB data.

        No HyperStore        PUT      GET     LIST   DELETE
        Operations         50478     1679     3642      422
        Latency (msec)    149.78   314.80    41.60    34.50

        With HyperStore      PUT      GET     LIST   DELETE
        Operations         50559     9195     3575     2224
        Latency (msec)     96.64    35.63    28.14    23.93

      • [Charts for each configuration: iostat % utilization; io read/write (MB)]
  • 26. Finally
      • Cassandra and other enabling technologies have allowed “leveling the playing field” for cloud storage providers.
      • Info: www.cloudian.com
          • Download trial version.
          • Coming soon:
          • #1 best seller in “Database” category on amazon.co.jp.

Editor's Notes

  1. Add symbols for different files from motohashi ppt.