SCALING STORAGE WITH CEPH

            Ross Turk, Inktank
WHO?


Ross Turk
VP Community, Inktank

 ross@inktank.com
 @rossturk

inktank.com | ceph.com
me
APP        LIBRADOS   A library allowing apps to directly access RADOS, with
                      support for C, C++, Java, Python, Ruby, and PHP
APP        RADOSGW    A bucket-based REST gateway, compatible with S3 and Swift
HOST/VM    RBD        A reliable and fully-distributed block device, with a
                      Linux kernel client and a QEMU/KVM driver
CLIENT     CEPH FS    A POSIX-compliant distributed file system, with a Linux
                      kernel client and support for FUSE

RADOS

A reliable, autonomous, distributed object store comprised of self-healing,
self-managing, intelligent storage nodes
IN THE BEGINNING
Magic Madzik, Flickr / CC BY 2.0
EARLY INFORMATION STORAGE
Chico.Ferreira, Flickr / CC BY 2.0
WRITING > CAVE PAINTINGS
kevingessner, Flickr / CC BY-SA 2.0
==
x1000        x1
PEOPLE BEGIN WRITING A LOT
Moyan_Brenn, Flickr / CC BY-ND 2.0
WRITING IS TIME-CONSUMING
trekkyandy, Flickr / CC BY 2.0
THE INDUSTRIALIZATION OF WRITING
FateDenied, Flickr / CC BY 2.0
magnet       +   tape   =    magnetic tape




                   ==
         x1000              x1
STORAGE BECOMES MECHANICAL
Erik Pitti, Wikipedia / CC BY-ND 2.0
HUMAN   ROCK

HUMAN   INK   PAPER

HUMAN   COMPUTER   TAPE
COMPUTERS NEED PEOPLE TO WORK
USDAgov, Flickr / CC BY 2.0
HUMAN   COMPUTER   TAPE
11101011 10110110
     10110101 10101001
     00100100 01001001
==   10100100 10100101
     01011010 01101010
     10101010 10101010
     01010110 01010011
THROUGHPUT BECOMES IMPORTANT
Zane Luke, Flickr / CC BY-ND 2.0
LAZ0R B3AMS CHANGE EVERYTHING!!
Jeff Kubina, Flickr / CC-BY-SA 2.0
HARD DRIVES ARE TOTALLY BETTER




              amazing spinny hard drives   sucky stupid tape
                                            slow
EVERYTHING GETS MESSY
Rob!, Flickr / CC BY 2.0
aa



      ab               111010               ac


101   ba    bb                        bc    111   010




da    110   db   011            010   000   dc




000                                         110   001
file

owner: rturk
created: aug12
last viewed: aug17
size: 42025
perms: 644

11101011 10110110 10110101
10101001 00100100 01001001
10100100 10100101 01011010
01101010 10101010 10101010
aa



      ab              111010               ac


101   ba    bb                       bc    111   010




da    110   db   01            010   000   dc
                 10

000                                        110   001
WE OUTGROW THE HARD DRIVE
Mr. T in DC, Flickr / CC BY 2.0
[Diagram: three HUMANs, one COMPUTER, seven DISKs]
[Diagram: many HUMANs, one (COMPUTER), twelve DISKs]
(actually more like this…)
[Diagram: three HUMANs, twelve COMPUTER + DISK pairs]
aa



      ab               111010               ac


101   ba    bb                        bc    111   010




da    110   db   011            010   000   dc




000                                         110   001
object

pace: quick
driver: frog
license: expired
expression: agog

11101011 10110110 10110101
10101001 00100100 01001001
10100100 10100101 01011010
01101010 10101010 10101010
[Diagrams: an APP, then three VMs, in front of twelve COMPUTER + DISK pairs; in between, one more COMPUTER + DISK pair joins the group]
STORAGE THROUGHOUT HISTORY
[Timeline: Painting, Writing, Computers, Shared storage, Distributed storage, Cloud computing, Ceph]
Time-scale: Roughly logarithmic. Content: Whatever the opposite of “scientific” is.
[Diagram: three HUMANs and twelve COMPUTER + DISK pairs, with each pair then abbreviated to "C D"]
STORAGE APPLIANCES
Michael Moll, Wikipedia / CC BY-SA 2.0
6.4 MILLION SQ FT OF FACTORIES
Dude94111, Flickr / CC BY 2.0
TECHNOLOGY IS A COMMODITY
RaeAllen, Flickr / CC-BY 2.0
COMMODITY PRICES FLUCTUATE
[Chart: time axis from May-07 to May-12]
Hardware Appliances are Mysterious Black Boxes
Abode of Chaos, Flickr / CC BY 2.0
[Diagram: a HUMAN [DEVELOPER] facing twelve "C D" units, marked "!!"]
THE WORLD NEEDS AN OPEN STORAGE TECHNOLOGY THAT SCALES
SAGE WEIL



 Co-founder of DreamHost

 Inventor of Ceph

 CEO of Inktank
philosophy           design

OPEN SOURCE
OPEN SOURCE SPREADS IDEAS
orchidgalore, Flickr / CC BY 2.0
philosophy           design

OPEN SOURCE
COMMUNITY-FOCUSED
WE ARE SMARTER TOGETHER
rturk, Linkedin Inmap
CEPH BELONGS TO ALL OF US
wackybadger, Flickr / CC BY 2.0
philosophy           design

OPEN SOURCE          SCALABLE
COMMUNITY-FOCUSED
CEPH IS BUILT TO SCALE
Too much for a cave
Too much for a book
Too much for a drive
Too much for a computer
Too much for a room
Ceph
philosophy           design

OPEN SOURCE          SCALABLE
COMMUNITY-FOCUSED    NO SINGLE POINT OF FAILURE
ARIOLIMAX CALIFORNICUS
aroid, Flickr / CC BY 2.0
single point of failure
highly-available
replicated




THE OCTOPUS (A METAPHOR)
I love speaking in metaphors.
THE BEEHIVE (ANOTHER METAPHOR)
blumenbiene, Flickr / CC BY 2.0
philosophy           design

OPEN SOURCE          SCALABLE
COMMUNITY-FOCUSED    NO SINGLE POINT OF FAILURE
                     SOFTWARE BASED
philosophy           design

OPEN SOURCE          SCALABLE
COMMUNITY-FOCUSED    NO SINGLE POINT OF FAILURE
                     SOFTWARE BASED
                     SELF-MANAGING
DISKS = JUST TINY RECORD PLAYERS
jon_a_ross, Flickr / CC BY 2.0
D (disk) x 1 MILLION  =  55 times / day
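One way to read the arithmetic on this slide is sketched below. It assumes that "55 times / day" counts drive failures and that each drive has an annualized failure rate of about 2%. Neither the word "failures" nor the rate appears on the slide, so both are assumptions.

```python
# Assumption: each drive has a 2% chance of failing in a given year.
ANNUAL_FAILURE_RATE = 0.02  # hypothetical figure, not from the slides
DRIVES = 1_000_000          # "x 1 MILLION" on the slide


def expected_failures_per_day(drives: int, annual_failure_rate: float) -> float:
    """Expected number of drive failures per day across the whole fleet."""
    return drives * annual_failure_rate / 365


print(round(expected_failures_per_day(DRIVES, ANNUAL_FAILURE_RATE)))
```

Under those assumptions the result rounds to the slide's figure, which is why a store of this size has to manage failures without waiting for a person.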
IT ALL STARTED WITH A DREAM
+
APP        LIBRADOS   A library allowing apps to directly access RADOS, with
                      support for C, C++, Java, Python, Ruby, and PHP
APP        RADOSGW    A bucket-based REST gateway, compatible with S3 and Swift
HOST/VM    RBD        A reliable and fully-distributed block device, with a
                      Linux kernel client and a QEMU/KVM driver
CLIENT     CEPH FS    A POSIX-compliant distributed file system, with a Linux
                      kernel client and support for FUSE

RADOS

A reliable, autonomous, distributed object store comprised of self-healing,
self-managing, intelligent storage nodes
OSD    OSD    OSD    OSD    OSD
FS     FS     FS     FS     FS      (btrfs, xfs, ext4)
DISK   DISK   DISK   DISK   DISK

M      M      M
[Diagram: a HUMAN and three monitors (M)]
Monitors:
 Maintain cluster map
 Provide consensus for distributed decision-making
 Should run in odd numbers (a majority must agree)
 These do not serve stored objects to clients

OSDs:
 One per disk (recommended)
 At least three in a cluster
 Serve stored objects to clients
 Intelligently peer to perform replication tasks
 Support object classes
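The odd-number guidance for monitors follows from majority voting. Here is a minimal sketch, assuming a simple majority quorum. The function names are illustrative and not part of Ceph.

```python
def quorum_size(monitors: int) -> int:
    """Smallest number of monitors that forms a strict majority."""
    return monitors // 2 + 1


def tolerated_failures(monitors: int) -> int:
    """How many monitors can be lost while a majority remains."""
    return monitors - quorum_size(monitors)


# Columns: monitors, quorum size, tolerated failures.
for n in range(1, 7):
    print(n, quorum_size(n), tolerated_failures(n))
```

Going from three monitors to four does not let the cluster survive any more monitor losses, while going from three to five does. That is the usual argument for odd counts.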
[Diagram: an APP with LIBRADOS talks natively to the cluster; three monitors (M) shown]

LIBRADOS
 Provides direct access to RADOS for applications
 C, C++, Python, PHP, Java
 No HTTP overhead
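As a rough illustration of direct access, the sketch below uses Ceph's Python `rados` binding to write one object and read it back. The pool name, the key, and the config path are placeholders, and the helper `store_and_fetch` is hypothetical. It needs a reachable cluster to do anything useful.

```python
def store_and_fetch(pool: str, key: str, payload: bytes,
                    conffile: str = "/etc/ceph/ceph.conf") -> bytes:
    """Write one object to a RADOS pool and read it back."""
    # Imported lazily so this sketch can be loaded without Ceph installed.
    import rados

    cluster = rados.Rados(conffile=conffile)
    cluster.connect()
    try:
        ioctx = cluster.open_ioctx(pool)  # I/O context bound to one pool
        try:
            ioctx.write_full(key, payload)  # replace the whole object
            return ioctx.read(key, length=len(payload))
        finally:
            ioctx.close()
    finally:
        cluster.shutdown()
```

A call might look like `store_and_fetch("data", "greeting", b"hello")`, assuming a pool named `data` exists.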
[Diagram: two APPs talk REST to two RADOSGW instances, each built on LIBRADOS, which talk natively to the cluster; three monitors (M) shown]

RADOS Gateway:
 REST-based interface to RADOS
 Supports buckets, accounting
 Compatible with S3 and Swift applications
[Diagrams: a VM in a VIRTUALIZATION CONTAINER using LIBRBD on LIBRADOS; a VM between two CONTAINERs, each with LIBRBD on LIBRADOS; a HOST using KRBD (KERNEL MODULE) on LIBRADOS]

RADOS Block Device:
 Storage of virtual disks in RADOS
 Allows decoupling of VMs and containers
     Live migration!
 Images are striped across the cluster
 Boot support in QEMU, KVM, and OpenStack Nova
 Mount support in the Linux kernel
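The striping bullet can be pictured as cutting the virtual disk into fixed-size objects. This is a minimal sketch, assuming 4 MiB objects and a made-up object naming scheme. Neither detail is stated on the slides.

```python
OBJECT_SIZE = 4 * 1024 * 1024  # assumed object size (4 MiB)


def locate(image: str, offset: int, object_size: int = OBJECT_SIZE) -> tuple:
    """Map a byte offset in a virtual disk to (object name, offset inside it)."""
    index, inner = divmod(offset, object_size)
    # Hypothetical naming scheme, for illustration only.
    return f"{image}.{index:016x}", inner


print(locate("vm-disk-1", 0))
print(locate("vm-disk-1", 10 * 1024 * 1024))
```

Because each object is placed independently, reads and writes to different parts of one image can land on different storage nodes.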
[Diagram: a CLIENT with separate metadata and data paths into the cluster; three monitors (M) shown]

Metadata Server
 Manages metadata for a POSIX-compliant shared filesystem
     Directory hierarchy
     File metadata (owner, timestamps, mode, etc.)
 Stores metadata in RADOS
 Does not serve file data to clients
 Only required for shared filesystem
WHAT MAKES CEPH UNIQUE?
HOW DO YOU FIND YOUR KEYS?
azmeen, Flickr / CC BY 2.0
[Diagram: an APP facing twelve "C D" nodes, marked "??"]
[Diagram: the APP looks for "F*" among nodes labeled by name range: A-G, H-N, O-T, U-Z]
I ALWAYS PUT MY KEYS ON THE HOOK
vitamindave, Flickr / CC BY 2.0
[Diagram: an APP facing twelve "C D" nodes]
DEAR DIARY: KEYS = IN THE KITCHEN
Barnaby, Flickr / CC BY 2.0
HOW DO YOU FIND YOUR KEYS WHEN YOUR HOUSE IS INFINITELY BIG AND ALWAYS CHANGING?
THE ANSWER: CRUSH!!
pasukaru76, Flickr / CC SA 2.0
10 10 01 01 10 10 01 11 01 10

Step 1: hash(object name) % num pg
Step 2: CRUSH(pg, cluster state, rule set)
CRUSH
 Pseudo-random placement algorithm
 Ensures even distribution
 Repeatable, deterministic
 Rule-based configuration
     Replica count
     Infrastructure topology
     Weighting
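The two-step lookup can be sketched in a few lines. The first function is the slide's `hash(object name) % num pg`. The second is not CRUSH: it uses rendezvous hashing as a simple stand-in that is also pseudo-random, repeatable, and computed by the client rather than looked up. Topology rules and weighting are left out, and every name here is illustrative.

```python
import hashlib


def _h(*parts: str) -> int:
    """Stable 64-bit hash of the given strings."""
    digest = hashlib.sha256("/".join(parts).encode()).digest()
    return int.from_bytes(digest[:8], "big")


def object_to_pg(object_name: str, num_pg: int) -> int:
    """Step 1 from the slide: hash(object name) % num pg."""
    return _h(object_name) % num_pg


def pg_to_osds(pg: int, osds: list, replicas: int) -> list:
    """Step 2 stand-in: rank OSDs by a per-(pg, osd) hash and keep the top ones.

    This is rendezvous hashing, not CRUSH; it ignores topology and weights.
    """
    ranked = sorted(osds, key=lambda osd: _h(str(pg), osd), reverse=True)
    return ranked[:replicas]


osds = [f"osd.{i}" for i in range(12)]
pg = object_to_pg("my-object", num_pg=64)
print(pg, pg_to_osds(pg, osds, replicas=3))
```

Any client with the same inputs computes the same answer, so no central lookup table is needed. Removing an OSD that was not chosen leaves the placement unchanged.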
HOW DO YOU SPIN UP THOUSANDS OF VMs INSTANTLY AND EFFICIENTLY?
instant copy
144   0   0   0   0   = 144

write (CLIENT)
144   4   = 148

read (CLIENT)
144   4   = 148
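The counts on these slides fit a copy-on-write scheme: a clone starts with no objects of its own, writes land in the clone, and reads fall through to the parent. The toy model below reproduces the 144 and 148 totals. The `Image` class is hypothetical and tracks whole objects only, which is simpler than a real block device.

```python
class Image:
    """A block image that stores only its own objects and reads through to a parent."""

    def __init__(self, parent=None):
        self.parent = parent
        self.objects = {}  # object index -> bytes owned by this image

    def clone(self):
        """Instant copy: the child starts with no objects of its own."""
        return Image(parent=self)

    def write(self, index, data):
        """Store the object in this image only; the parent is never modified."""
        self.objects[index] = data

    def read(self, index):
        """Return the nearest copy of the object, walking up the parent chain."""
        image = self
        while image is not None:
            if index in image.objects:
                return image.objects[index]
            image = image.parent
        return None  # never written anywhere in the chain


base = Image()
for i in range(144):
    base.write(i, b"base")

clones = [base.clone() for _ in range(4)]
stored = len(base.objects) + sum(len(c.objects) for c in clones)
print(stored)  # 144: the four clones add nothing yet

for i in range(4):
    clones[0].write(i, b"new")
stored = len(base.objects) + sum(len(c.objects) for c in clones)
print(stored)  # 148: only the four written objects are new
```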
HOW DO YOU MANAGE DIRECTORY HIERARCHY WITHOUT A SINGLE POINT OF FAILURE?
FILESYSTEMS REQUIRE METADATA
Barnaby, Flickr / CC BY 2.0
[Diagram: a CLIENT, file data, and three metadata servers (M)]

one tree
three metadata servers
??

DYNAMIC SUBTREE PARTITIONING
AND NOW BACKPEDALING
ALMOST EVERYTHING WORKS
LIBRADOS   AWESOME
RADOSGW    AWESOME
RBD        AWESOME
CEPH FS    NEARLY AWESOME
RADOS      AWESOME
LAN SCALE!! *
* OR REALLY REALLY SCARY FAST WAN
CEPH AND CLOUDSTACK
tableatny, Flickr / CC BY 2.0
RBD SUPPORT IN CLOUDSTACK

 Allows storage of virtual disks inside RADOS
    Works with KVM only right now
    No snapshots yet
 Upcoming in CloudStack 4
 More information can be found on the mailing list:
    ceph-devel / incubator-cloudstack-dev:
     http://article.gmane.org/gmane.comp.file-systems.ceph.devel/7505
QUESTIONS?


Ross Turk
VP Community, Inktank

 ross@inktank.com
 @rossturk

inktank.com | ceph.com

 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Destaque (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Build A Cloud Day - Chicago

  • 1. SCALING STORAGE WITH CEPH Ross Turk, Inktank
  • 2. WHO? Ross Turk, VP Community, Inktank. ross@inktank.com, @rossturk. inktank.com | ceph.com
  • 3.
  • 4.
  • 5. me
  • 6. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 7. IN THE BEGINNING Magic Madzik, Flickr / CC BY 2.0
  • 8. EARLY INFORMATION STORAGE Chico.Ferreira, Flickr / CC BY 2.0
  • 9. WRITING > CAVE PAINTINGS kevingessner, Flickr / CC BY-SA 2.0
  • 10. == x1000 x1
  • 11. PEOPLE BEGIN WRITING A LOT Moyan_Brenn, Flickr / CC BY-ND 2.0
  • 12. WRITING IS TIME-CONSUMING trekkyandy, Flickr / CC BY 2.0
  • 13. THE INDUSTRIALIZATION OF WRITING FateDenied, Flickr / CC BY 2.0
  • 14. magnet + tape = magnetic tape == x1000 x1
  • 15. STORAGE BECOMES MECHANICAL Erik Pitti, Wikipedia / CC BY-ND 2.0
  • 16. [Diagram] HUMAN, ROCK; HUMAN, INK, PAPER; HUMAN, COMPUTER, TAPE
  • 17. COMPUTERS NEED PEOPLE TO WORK USDAgov, Flickr / CC BY 2.0
  • 18. [Diagram] HUMAN, COMPUTER, TAPE
  • 19. 11101011 10110110 10110101 10101001 00100100 01001001 == 10100100 10100101 01011010 01101010 10101010 10101010 01010110 01010011
  • 20. THROUGHPUT BECOMES IMPORTANT Zane Luke, Flickr / CC BY-ND 2.0
  • 21. LAZ0R B3AMS CHANGE EVERYTHING!! Jeff Kubina, Flickr / CC-BY-SA 2.0
  • 22. HARD DRIVES ARE TOTALLY BETTER: amazing spinny hard drives; sucky stupid tape slow
  • 23. EVERYTHING GETS MESSY Rob!, Flickr / CC BY 2.0
  • 24. aa ab 111010 ac 101 ba bb bc 111 010 da 110 db 011 010 000 dc 000 110 001
  • 25. file: owner: rturk; created: aug12; last viewed: aug17; size: 42025; perms: 644; data: 11101011 10110110 10110101 10101001 00100100 01001001 10100100 10100101 01011010 01101010 10101010 10101010
  • 26. aa ab 111010 ac 101 ba bb bc 111 010 da 110 db 01 010 000 dc 10 000 110 001
  • 27. WE OUTGROW THE HARD DRIVE Mr. T in DC, Flickr / CC BY 2.0
  • 28. [Diagram: three HUMANs, one COMPUTER, seven DISKs]
  • 29. [Diagram: many HUMANs and DISKs around one (COMPUTER)] (actually more like this…)
  • 30. [Diagram: three HUMANs, twelve COMPUTER+DISK pairs]
  • 31. aa ab 111010 ac 101 ba bb bc 111 010 da 110 db 011 010 000 dc 000 110 001
  • 32. object: pace: quick; driver: frog; license: expired; expression: agog; data: 11101011 10110110 10110101 10101001 00100100 01001001 10100100 10100101 01011010 01101010 10101010 10101010
  • 33. [Diagram: one APP and twelve COMPUTER+DISK pairs]
  • 34. [Diagram: thirteen COMPUTER+DISK pairs]
  • 35. [Diagram: three VMs and twelve COMPUTER+DISK pairs]
  • 36. STORAGE THROUGHOUT HISTORY: Ceph, Cloud computing, Distributed storage, Shared storage, Computers, Writing, Painting. Time-scale: Roughly logarithmic. Content: Whatever the opposite of “scientific” is.
  • 37. [Diagram: three HUMANs, twelve COMPUTER+DISK pairs]
  • 38. [Diagram: twelve COMPUTER+DISK pairs]
  • 39. C D C D C D C D C D C D C D C D C D C D C D C D
  • 40. C D C D HUMAN C D C D C D C D HUMAN C D C D C D HUMAN C D C D C D
  • 41. STORAGE APPLIANCES Michael Moll, Wikipedia / CC BY-SA 2.0
  • 42. 6.4 MILLION SQ FT OF FACTORIES Dude94111, Flickr / CC BY 2.0
  • 43. TECHNOLOGY IS A COMMODITY RaeAllen, Flickr / CC-BY 2.0
  • 44. COMMODITY PRICES FLUCTUATE May-07 May-08 May-09 May-10 May-11 May-12
  • 45. Hardware Appliances are Mysterious Black Boxes Abode of Chaos, Flickr / CC BY 2.0
  • 46. C D C D C D C D C D HUMAN !! C D C D [DEVELOPER] C D C D C D C D C D
  • 47. C D C D C C D C D D C D C+ C D C D + C D C D C D C D
  • 48. C D C D C C D C D D C D C+ C D C D + C D C D C D C D
  • 49. THE WORLD NEEDS AN OPEN STORAGE TECHNOLOGY THAT SCALES
  • 50. SAGE WEIL: Co-founder of DreamHost; Inventor of Ceph; CEO of Inktank
  • 51. philosophy: OPEN SOURCE. design: (empty)
  • 52. OPEN SOURCE SPREADS IDEAS orchidgalore, Flickr / CC BY 2.0
  • 53. philosophy: OPEN SOURCE, COMMUNITY-FOCUSED. design: (empty)
  • 54. WE ARE SMARTER TOGETHER rturk, LinkedIn InMap
  • 55. CEPH BELONGS TO ALL OF US wackybadger, Flickr / CC BY 2.0
  • 56. philosophy: OPEN SOURCE, COMMUNITY-FOCUSED. design: SCALABLE
  • 57. CEPH IS BUILT TO SCALE: Ceph; Too much for a room; Too much for a computer; Too much for a drive; Too much for a book; Too much for a cave
  • 58. philosophy: OPEN SOURCE, COMMUNITY-FOCUSED. design: SCALABLE, NO SINGLE POINT OF FAILURE
  • 59. ARIOLIMAX CALIFORNICUS aroid, Flickr / CC BY 2.0
  • 60. THE OCTOPUS (A METAPHOR): single point of failure; highly-available; replicated. I love speaking in metaphors.
  • 61. THE BEEHIVE (ANOTHER METAPHOR) blumenbiene, Flickr / CC BY 2.0
  • 62. philosophy: OPEN SOURCE, COMMUNITY-FOCUSED. design: SCALABLE, NO SINGLE POINT OF FAILURE, SOFTWARE BASED
  • 63. C D C D C C D C D D C D C+ C D C D + C D C D C D C D
  • 64. C D C D C C D C D D C D C+ C D C D + C D C D C D C D
  • 65. philosophy: OPEN SOURCE, COMMUNITY-FOCUSED. design: SCALABLE, NO SINGLE POINT OF FAILURE, SOFTWARE BASED, SELF-MANAGING
  • 66. DISKS = JUST TINY RECORD PLAYERS jon_a_ross, Flickr / CC BY 2.0
  • 67. D D D D D D = D D x 1 MILLION 55 times / day
  • 68.
  • 69. IT ALL STARTED WITH A DREAM
  • 70. +
  • 71. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 72. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 73. [Diagram] Five OSDs, each on its own FS (btrfs, xfs, or ext4) on a DISK; three monitors (M)
  • 74. HUMAN M M M
  • 75. Monitors (M): Maintain cluster map; Provide consensus for distributed decision-making; Must have an odd number; These do not serve stored objects to clients. OSDs: One per disk (recommended); At least three in a cluster; Serve stored objects to clients; Intelligently peer to perform replication tasks; Supports object classes
  • 76. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 77. APP LIBRADOS native M M M
  • 78. LIBRADOS: Provides direct access to RADOS for applications; C, C++, Python, PHP, Java; No HTTP overhead
  • 79. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 80. APP APP REST RADOSGW RADOSGW LIBRADOS LIBRADOS native M M M
  • 81. RADOS Gateway: REST-based interface to RADOS; Supports buckets, accounting; Compatible with S3 and Swift applications
  • 82. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 83. VM VIRTUALIZATION CONTAINER LIBRBD LIBRADOS M M M
  • 84. CONTAINER VM CONTAINER LIBRBD LIBRBD LIBRADOS LIBRADOS M M M
  • 85. HOST KRBD (KERNEL MODULE) LIBRADOS M M M
  • 86. RADOS Block Device: Storage of virtual disks in RADOS; Allows decoupling of VMs and containers (live migration!); Images are striped across the cluster; Boot support in QEMU, KVM, and OpenStack Nova; Mount support in the Linux kernel
  • 87. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes.
  • 88. CLIENT metadata 01 data 10 M M M
  • 89. Metadata Server: Manages metadata for a POSIX-compliant shared filesystem (directory hierarchy; file metadata such as owner, timestamps, mode, etc.); Stores metadata in RADOS; Does not serve file data to clients; Only required for shared filesystem
  • 90. WHAT MAKES CEPH UNIQUE?
  • 91. HOW DO YOU FIND YOUR KEYS? azmeen, Flickr / CC BY 2.0
  • 92. C D C D C D C D C D ?? APP C D C D C D C D C D C D C D
  • 93. C D C D A-G C D C D C D H-N APP F* C D C D C D O-T C D C D C D U-Z C D
  • 94. I ALWAYS PUT MY KEYS ON THE HOOK vitamindave, Flickr / CC BY 2.0
  • 95. C D C D C D C D C D APP C D C D C D C D C D C D C D
  • 96. DEAR DIARY: KEYS = IN THE KITCHEN Barnaby, Flickr / CC BY 2.0
  • 97. HOW DO YOU FIND YOUR KEYS WHEN YOUR HOUSE IS INFINITELY BIG AND ALWAYS CHANGING?
  • 98. THE ANSWER: CRUSH!! pasukaru76, Flickr / CC SA 2.0
  • 99. 10 10 01 01 10 10 01 11 01 10 hash(object name) % num pg 10 10 01 01 10 10 01 11 01 10 CRUSH(pg, cluster state, rule set)
  • 100. 10 10 01 01 10 10 01 11 01 10 10 10 01 01 10 10 01 11 01 10
  • 101. CRUSH: Pseudo-random placement algorithm; Ensures even distribution; Repeatable, deterministic; Rule-based configuration; Replica count; Infrastructure topology; Weighting
  • 102. CLIENT ??
  • 103.
  • 104.
  • 105. CLIENT ??
  • 106. VM VIRTUALIZATION CONTAINER LIBRBD LIBRADOS M M M
  • 107. HOW DO YOU SPIN UP THOUSANDS OF VMs INSTANTLY AND EFFICIENTLY?
  • 108. instant copy 144 0 0 0 0 = 144
  • 109. write CLIENT write write write 144 4 = 148
  • 110. read read CLIENT read 144 4 = 148
  • 111. HOW DO YOU MANAGE DIRECTORY HIERARCHY WITHOUT A SINGLE POINT OF FAILURE?
  • 112. FILESYSTEMS REQUIRE METADATA Barnaby, Flickr / CC BY 2.0
  • 113. CLIENT 01 10 M M M
  • 114. M M M
  • 115. one tree three metadata servers ??
  • 116.
  • 117.
  • 118.
  • 119.
  • 123. [Diagram] LIBRADOS (used by APP): A library allowing apps to directly access RADOS, with support for C, C++, Java, Python, Ruby, and PHP. RADOSGW (used by APP): A bucket-based REST gateway, compatible with S3 and Swift. RBD (used by HOST/VM): A reliable and fully-distributed block device, with a Linux kernel client and a QEMU/KVM driver. CEPH FS (used by CLIENT): A POSIX-compliant distributed file system, with a Linux kernel client and support for FUSE. RADOS: A reliable, autonomous, distributed object store comprised of self-healing, self-managing, intelligent storage nodes. Status labels on the components (in extracted order): AWESOME, AWESOME, NEARLY AWESOME, AWESOME. Status label on RADOS: AWESOME.
  • 124. * LAN SCALE!! * OR REALLY REALLY SCARY FAST WAN
  • 125. CEPH AND CLOUDSTACK tableatny, Flickr / CC BY 2.0
  • 126. RBD SUPPORT IN CLOUDSTACK: Allows storage of virtual disks inside RADOS; Works with KVM only right now; No snapshots yet; Upcoming in CloudStack 4; More information can be found on the mailing list: ceph-devel / incubator-cloudstack-dev: http://article.gmane.org/gmane.comp.file-systems.ceph.devel/7505
  • 127. QUESTIONS? Ross Turk, VP Community, Inktank. ross@inktank.com, @rossturk. inktank.com | ceph.com
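
Slides 99 and 101 describe placement as two steps: the object name is hashed to a placement group (`hash(object name) % num pg`), and CRUSH then maps that placement group to storage nodes from the cluster state and a rule set. The following is a minimal Python sketch of that idea only. It is not Ceph's CRUSH algorithm: the SHA-256 hash, the weighted rendezvous ranking used as a stand-in, the function names `pg_for_object` and `place_pg`, and the OSD names and weights are all assumptions made for illustration.

```python
import hashlib
import math


def stable_hash(text: str) -> int:
    """Deterministic 64-bit hash (Python's built-in hash() is salted per process)."""
    return int.from_bytes(hashlib.sha256(text.encode()).digest()[:8], "big")


def pg_for_object(object_name: str, num_pgs: int) -> int:
    """Step 1 from slide 99: hash(object name) % num pg."""
    return stable_hash(object_name) % num_pgs


def place_pg(pg: int, osd_weights: dict[str, float], replicas: int) -> list[str]:
    """Step 2, a stand-in for CRUSH: rank OSDs by a weighted pseudo-random score.

    This is weighted rendezvous hashing, NOT the real CRUSH algorithm. It only
    mimics the properties named on slide 101: pseudo-random, repeatable,
    deterministic, and aware of a replica count and per-node weighting.
    """
    def score(osd: str) -> float:
        # Map 52 hash bits to a float strictly between 0 and 1.
        u = ((stable_hash(f"{pg}:{osd}") >> 12) + 0.5) / 2**52
        # Heavier OSDs get proportionally more chances to rank first.
        return -osd_weights[osd] / math.log(u)

    return sorted(osd_weights, key=score, reverse=True)[:replicas]


# Hypothetical cluster description: OSD name -> weight.
cluster = {"osd.0": 1.0, "osd.1": 1.0, "osd.2": 2.0, "osd.3": 1.0}
pg = pg_for_object("my-object", num_pgs=64)
print(pg, place_pg(pg, cluster, replicas=3))
```

Because any client can recompute the same answer from the object name and the cluster description, no central lookup table is required, which is the point the "keys on the hook" slides are making.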

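Slides 107 to 110 show the arithmetic behind spinning up many VMs from one image: an instant copy of a 144-unit image stores nothing extra (144 + 0 = 144), writes add only what changed (144 + 4 = 148), and reads are served from those same 148 units. A minimal sketch of that copy-on-write behaviour follows. The classes `Image` and `Clone`, the helper `blocks_stored`, and the choice of one "unit" per block are hypothetical; this is not the RBD API.

```python
class Image:
    """A read-only parent image: block index -> data."""

    def __init__(self, blocks: dict[int, bytes]):
        self.blocks = dict(blocks)


class Clone:
    """A copy-on-write child that stores only blocks written after cloning."""

    def __init__(self, parent: Image):
        self.parent = parent
        self.own: dict[int, bytes] = {}  # starts empty, so the copy is instant

    def write(self, index: int, data: bytes) -> None:
        self.own[index] = data  # new data lands in the clone, never the parent

    def read(self, index: int) -> bytes:
        # Prefer the clone's own block, otherwise fall through to the parent.
        if index in self.own:
            return self.own[index]
        return self.parent.blocks[index]


def blocks_stored(parent: Image, clones: list[Clone]) -> int:
    """Total blocks held by the parent plus every clone's private blocks."""
    return len(parent.blocks) + sum(len(c.own) for c in clones)


parent = Image({i: b"base" for i in range(144)})
vm = Clone(parent)
print(blocks_stored(parent, [vm]))  # 144 (144 + 0)
for i in range(4):
    vm.write(i, b"new")
print(blocks_stored(parent, [vm]))  # 148 (144 + 4)
```
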
Editor's Notes

  1. People have been trying to capture knowledge for a very long time. I guess the first form of captured knowledge is the cave painting.
  2. TODO: change this slide. Man + magnet + tape = magnetic tape. 1000 books on one tape.
  3. People learned how to store data on magnetic tape. Many, many, many books could be stored on a single tape.
  4. TODO: animate so that they show progressively