SlideShare uma empresa Scribd logo
1 de 28
Baixar para ler offline
Running ZFS without
                           a system pool
                           Bill Pijewski
                           Software Engineer, Joyent
                           @pijewski

Tuesday, October 2, 2012
Agenda

              • Why ZFS is important to Joyent
              • Evolution of USB and PXE boot architectures
              • Running with no system pool




Tuesday, October 2, 2012
ZFS at Joyent

              • We run a production cloud with many servers in
                   datacenters worldwide

              • Two kinds of zones (covered in detail in other talks):
                • Zones: sparse zones share libraries with the
                           platform

                    • VMs: fully virtualized GNU/Linux, Windows,
                           FreeBSD, etc. machines

              • Use small number of NFS machines to provide
                   additional storage capacity in each datacenter



Tuesday, October 2, 2012
ZFS for Zones and VMs

              • Zones are allocated two ZFS datasets
                • One dataset for data in that zone
                • Another for core files -- to prevent cores from
                           exceeding quota

              • VMs have a ZFS volume into which the VM image is
                   installed, plus one or more additional volumes
                   presented to guest as disks

              • Guest filesystems are installed into volumes


Tuesday, October 2, 2012
ZFS in different contexts

              • For Joyent, two main contexts: SmartOS and SDC
              • SmartOS: community distribution, illumos +
                   lightweight virtualization tools

              • SmartDataCenter (SDC): SmartOS + full cloud
                   management and orchestration stack




Tuesday, October 2, 2012
Important ZFS features

              • As with all ZFS users, we take for granted rely on
                   end-to-end data integrity

              • Copy-on-write architecture: snapshots, clones
              • Compression
              • Space management tools: quotas and reservations
              • Replication to move customers around between
                   different machines




Tuesday, October 2, 2012
Delegated administration!

              • In our next SDC release, we enable delegated
                   administration

              • Allows customers to:
                • Take snapshots outside of Joyentʼs API
                • Create child datasets
                • Snapshot and clone datasets
                • Replicate or migrate data between instances
              • Open work: basic limits on delegated activity to
                   avoid DOS

Tuesday, October 2, 2012
ZFS Performance

              • SSDs for ZIL
              • ARC
                • We hold back some portion of a serverʼs total
                           memory, knowing that a good portion of this
                           memory will be consumed by the ARC

                    • Committing memory achieves greater I/O
                           performance

              • ZFS I/O throttle for QoS controls
              • For more information, check out Brendan Greggʼs
                   excellent talk next door

Tuesday, October 2, 2012
Read-only system pool

              • At Fishworks, we decided to have a read-only
                   system pool

              • Necessary for OS install as well as analytics data
              • Simplified some things:
                • No unnecessary customizations from customers
                • Discouraged hot patching
              • Other disadvantages:
                • Upgrade, rollback, and factory reset were tricky

Tuesday, October 2, 2012
SmartOS USB Boot

              • Instead of installing OS to root disks, SmartOS boots
                   from a USB key

              • Entire kernel and userland fit in about 200 MB
                   (compressed)

              • Other software can be installed from pkgsrc
              • Single ZFS pool for all zones



Tuesday, October 2, 2012
USB Boot Advantages

              • All disks are available for zone/VM storage, thereby
                   increasing both performance and capacity

              • Encourages users to provision a zone for each
                   application rather than using the global zone

              • Discourages customization and one-off patching
              • Fast to get up and running
              • Easy to “bring your OS with you”


Tuesday, October 2, 2012
SmartDataCenter (SDC) Architecture

              • Two kinds of servers: head nodes and compute
                   nodes

                    • Head nodes run management, provisioning,
                           monitoring, and boot services

                    • Compute nodes contain customer zones
              • Head nodes are similar to SmartOS installs
              • Each compute node PXE boots its platform from the
                   head node

              • Both head nodes and compute nodes have a single
                   ZFS pool

Tuesday, October 2, 2012
SDC Diagram

                           DC 0              DC 1              DC 2

                  Headnode              Headnode          Headnode
                                  PXE               PXE               PXE


                     CN 0               CN 10             CN 20


                     CN 1               CN 11             CN 21


                     CN 2               CN 12             CN 22




                                                           ......
                                         ......
                      ......




Tuesday, October 2, 2012
PXE Boot Advantages

              • Ben Rockwood, 10/1/2012:
                           “Apparently other people spend time installing
                           software. I think that's stupid.”

              • As with SmartOS, operators encouraged to put
                   applications in zones instead of global

              • Upgrade = rollback = reboot, nothing more
              • Newer platforms can be staged and machines
                   rebooted later

              • Any machine which hits a known fixed problem will
                   automatically boot onto fresh platform

Tuesday, October 2, 2012
Storage pools!

              • Most OSes assume the existence of a “system” pool
                   -- a pool onto which the OS, applications, and
                   configuration information is installed

              • Joyent moving away from single-vdev pools backed
                   by hardware RAID

              • Embracing hybrid storage pool (HSP) using an SSD
                   for the ZFS intent log (ZIL)

              • Everything else worked on RAID-Z pools except for
                   saving a crash dump



Tuesday, October 2, 2012
RAID-Z Crash Dump

              • Problem: have only one RAID-Z or mirrored pool but
                   cannot save crash dump on said pool

              • Implement crash dumps on RAID-Z (majority of
                   work) and pools with multiple vdevs

              • Not necessarily to save parity bits for crash dump
                   data:

                    • Crash dump is immediately saved upon reboot
                    • Needs to be reliable, simple, and (hopefully) fast


Tuesday, October 2, 2012
Why no parity bits?

              • Since DVAs on the dump device are preallocated,
                   use those 128K blocks for each write

              • Most calls into dump entry point are not block
                   aligned

              • Rather than write variable size, use original 128K
              • I first calculated parity bits, only my test machine
                   took three hours to save a crash dump

              • No parity calculated -- on a pool with n vdevs, each
                   write could require n-1 (synchronous) reads



Tuesday, October 2, 2012
Other system components

              • Swap device (thankfully) supports RAID-Z pools
              • /var, /opt have their own datasets
              • /etc not persistent
              • /root also not persistent, again incentivizing people
                   to configure applications in zones rather than using
                   the GZ




Tuesday, October 2, 2012
Summary

              • The single ZFS pool has simplified Joyentʼs
                   deployment

              • Delegated administration has given customers more
                   power

              • ZFS has been and will continue to be a crucial
                   component of our architecture for many years




Tuesday, October 2, 2012
Questions?




Tuesday, October 2, 2012
Running ZFS without
                           a system pool
                           Bill Pijewski
                           Software Engineer, Joyent
                           @pijewski

Tuesday, October 2, 2012
Backup slides




Tuesday, October 2, 2012
ZFS 101

              • ZFS is a copy-on-write filesystem from Sun originally
                   shipped with Solaris 10

              • Many innovative features: data compression,
                   snapshot/rollback, ZFS send/receive, SSD
                   integration

              • Enterprise-grade reliability and data integrity
              • Two main components relevant here:
                • ZFS pools
                • ZFS datasets
Tuesday, October 2, 2012
ZFS Pools

              • Aggregate disks into a single storage pool from
                   which “datasets” are allocated

              • No parted/LVM needed
              • Mix both spinning disks and SSDs:
                • L2ARC: extends filesystem buffer cache
                • ZIL: absorbs synchronous write activity



Tuesday, October 2, 2012
ZFS Datasets

              • Datasets are a tree of blocks within the storage pool,
                   presented as either:

                    • A filesystem (file interface)
                    • A volume (block interface)
              • Datasets can be flexibly resized, and volumes can
                   even be thinly provisioned

              • Administrative controls on datasets


Tuesday, October 2, 2012
Zones and VMs

              • A zone is a lightweight software-virtualized container
                • Uses the systemʼs OS platform
                • Allocated its own ZFS filesystem (more in a sec)
              • A VM is a hardware-virtualized container for GNU/
                   Linux, Windows, BSD, etc.

                    • Uses its own ZFS volume
                    • VMʼs filesystem installed into ZFS volume
              • Both machines have resource controls for CPU,
                   memory, and disk I/O

Tuesday, October 2, 2012
Advantages of ZFS

              • Snapshots: zone/VM backup and recovery
              • Space management: reservations and quota flexibly
                   allocate space between zones

              • Delegated administration: each tenant can
                   administer their own dataset:

                    • Set compression level and other properties
                    • Take snapshots of application data
                    • Generate send streams for replication/backup

Tuesday, October 2, 2012
Advantages of ZFS (2)

              • Data integrity: verifies data of VM guest filesystems
                   (ext4, XFS, NTFS, etc.)

              • Multiple storage configurations available: mirrored,
                   RAID-Z2, and others

              • System fully supported on any storage
                   configurations, can even take a crash dump to a
                   RAID-Z pool




Tuesday, October 2, 2012

Mais conteúdo relacionado

Mais procurados

What is new in Citrix xen Client
What is new in Citrix xen ClientWhat is new in Citrix xen Client
What is new in Citrix xen ClientDigicomp Academy AG
 
Integrating Apple Macs Using Novell Technologies
Integrating Apple Macs Using Novell TechnologiesIntegrating Apple Macs Using Novell Technologies
Integrating Apple Macs Using Novell TechnologiesNovell
 
Cloud Computing Hypervisors and Comparison Xen KVM
Cloud Computing Hypervisors and Comparison Xen KVM Cloud Computing Hypervisors and Comparison Xen KVM
Cloud Computing Hypervisors and Comparison Xen KVM cloudresearcher
 
cloud computing:Types of virtualization
cloud computing:Types of virtualizationcloud computing:Types of virtualization
cloud computing:Types of virtualizationDr.Neeraj Kumar Pandey
 
Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...
Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...
Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...Novell
 
Virtualization in cloud
Virtualization in cloudVirtualization in cloud
Virtualization in cloudAshok Kumar
 
Novell ZENworks Patch Management Best Practices
Novell ZENworks Patch Management Best PracticesNovell ZENworks Patch Management Best Practices
Novell ZENworks Patch Management Best PracticesNovell
 
Principles of Virtualization - Introduction to Virtualization Software
Principles of Virtualization - Introduction to Virtualization Software Principles of Virtualization - Introduction to Virtualization Software
Principles of Virtualization - Introduction to Virtualization Software Rubal Sagwal
 
Novell iPrint: Advanced Features on Linux
Novell iPrint: Advanced Features on LinuxNovell iPrint: Advanced Features on Linux
Novell iPrint: Advanced Features on LinuxNovell
 
Tips, Tricks and Cool Solutions for Novell ZENworks Configuration Management
Tips, Tricks and Cool Solutions for Novell ZENworks Configuration ManagementTips, Tricks and Cool Solutions for Novell ZENworks Configuration Management
Tips, Tricks and Cool Solutions for Novell ZENworks Configuration ManagementNovell
 
Xen and the Art of Virtualization
Xen and the Art of VirtualizationXen and the Art of Virtualization
Xen and the Art of VirtualizationSusheel Thakur
 
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...Circling Cycle
 
XenDesktop 5 und das letzte "Puzzleteil"
XenDesktop 5 und das letzte "Puzzleteil"XenDesktop 5 und das letzte "Puzzleteil"
XenDesktop 5 und das letzte "Puzzleteil"netlogix
 

Mais procurados (20)

DTraceCloud2012
DTraceCloud2012DTraceCloud2012
DTraceCloud2012
 
What is new in Citrix xen Client
What is new in Citrix xen ClientWhat is new in Citrix xen Client
What is new in Citrix xen Client
 
Energy efficient storage in vm
Energy efficient storage in vmEnergy efficient storage in vm
Energy efficient storage in vm
 
Xen io
Xen ioXen io
Xen io
 
Ian Pratt Usenix 08 Keynote
Ian Pratt Usenix 08 KeynoteIan Pratt Usenix 08 Keynote
Ian Pratt Usenix 08 Keynote
 
Integrating Apple Macs Using Novell Technologies
Integrating Apple Macs Using Novell TechnologiesIntegrating Apple Macs Using Novell Technologies
Integrating Apple Macs Using Novell Technologies
 
Cloud Computing Hypervisors and Comparison Xen KVM
Cloud Computing Hypervisors and Comparison Xen KVM Cloud Computing Hypervisors and Comparison Xen KVM
Cloud Computing Hypervisors and Comparison Xen KVM
 
cloud computing:Types of virtualization
cloud computing:Types of virtualizationcloud computing:Types of virtualization
cloud computing:Types of virtualization
 
Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...
Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...
Migrating from Novell ZENworks 7 Desktop Management to Novell ZENworks Config...
 
Virtualization in cloud
Virtualization in cloudVirtualization in cloud
Virtualization in cloud
 
Ian Pratt Nsdi Keynote Apr2008
Ian Pratt Nsdi Keynote Apr2008Ian Pratt Nsdi Keynote Apr2008
Ian Pratt Nsdi Keynote Apr2008
 
Novell ZENworks Patch Management Best Practices
Novell ZENworks Patch Management Best PracticesNovell ZENworks Patch Management Best Practices
Novell ZENworks Patch Management Best Practices
 
Xen Project: Windows PV Drivers
Xen Project: Windows PV DriversXen Project: Windows PV Drivers
Xen Project: Windows PV Drivers
 
Principles of Virtualization - Introduction to Virtualization Software
Principles of Virtualization - Introduction to Virtualization Software Principles of Virtualization - Introduction to Virtualization Software
Principles of Virtualization - Introduction to Virtualization Software
 
Novell iPrint: Advanced Features on Linux
Novell iPrint: Advanced Features on LinuxNovell iPrint: Advanced Features on Linux
Novell iPrint: Advanced Features on Linux
 
Tips, Tricks and Cool Solutions for Novell ZENworks Configuration Management
Tips, Tricks and Cool Solutions for Novell ZENworks Configuration ManagementTips, Tricks and Cool Solutions for Novell ZENworks Configuration Management
Tips, Tricks and Cool Solutions for Novell ZENworks Configuration Management
 
Xen and the Art of Virtualization
Xen and the Art of VirtualizationXen and the Art of Virtualization
Xen and the Art of Virtualization
 
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
 
XenDesktop 5 und das letzte "Puzzleteil"
XenDesktop 5 und das letzte "Puzzleteil"XenDesktop 5 und das letzte "Puzzleteil"
XenDesktop 5 und das letzte "Puzzleteil"
 
Xen & virtualization
Xen & virtualizationXen & virtualization
Xen & virtualization
 

Destaque

Intervento di Sergio Bizzarro, PORCELANOSA GRUPO
Intervento di Sergio Bizzarro, PORCELANOSA GRUPOIntervento di Sergio Bizzarro, PORCELANOSA GRUPO
Intervento di Sergio Bizzarro, PORCELANOSA GRUPOinfoprogetto
 
Complete Baths By Porcelanosa
Complete Baths By PorcelanosaComplete Baths By Porcelanosa
Complete Baths By PorcelanosaBarbiefixit
 
Intervento di Lorenzo Bagnoli, PORCELANOSA GRUPO
Intervento di Lorenzo Bagnoli, PORCELANOSA GRUPOIntervento di Lorenzo Bagnoli, PORCELANOSA GRUPO
Intervento di Lorenzo Bagnoli, PORCELANOSA GRUPOinfoprogetto
 
PORCELANOSA GRUPO 8 giugno 2015
PORCELANOSA GRUPO 8 giugno 2015PORCELANOSA GRUPO 8 giugno 2015
PORCELANOSA GRUPO 8 giugno 2015infoprogetto
 
Contract e progettazione integrata, menichini - porcelanosa
Contract e progettazione integrata, menichini -  porcelanosaContract e progettazione integrata, menichini -  porcelanosa
Contract e progettazione integrata, menichini - porcelanosainfoprogetto
 
Intervento di Francesco Giordano, PORCELANOSA Spa
Intervento di Francesco Giordano, PORCELANOSA SpaIntervento di Francesco Giordano, PORCELANOSA Spa
Intervento di Francesco Giordano, PORCELANOSA Spainfoprogetto
 
Matteo Martiniello, PORCELANOSA GRUPO
 Matteo Martiniello, PORCELANOSA GRUPO  Matteo Martiniello, PORCELANOSA GRUPO
Matteo Martiniello, PORCELANOSA GRUPO infoprogetto
 
Short Seminars and Workshops by Antonio Paraiso
Short Seminars and Workshops by Antonio ParaisoShort Seminars and Workshops by Antonio Paraiso
Short Seminars and Workshops by Antonio ParaisoAntonio Paraiso
 
Porcelanosa - Organizational plan
Porcelanosa - Organizational planPorcelanosa - Organizational plan
Porcelanosa - Organizational planErica Catala
 
Modelli di business ed innovazione
Modelli di business ed innovazioneModelli di business ed innovazione
Modelli di business ed innovazioneStefano Mizio
 
La percezione estetica e visiva.
La percezione estetica e visiva.La percezione estetica e visiva.
La percezione estetica e visiva.Dara Bellinvia
 
Presentazione innovazione
Presentazione innovazionePresentazione innovazione
Presentazione innovazioneLuca Foresti
 
Decori na Porcelanosa
Decori na PorcelanosaDecori na Porcelanosa
Decori na PorcelanosaDecori
 
Il come e il perchè dell'innovazione
Il come e il perchè dell'innovazioneIl come e il perchè dell'innovazione
Il come e il perchè dell'innovazioneRaffaele Vincenti
 

Destaque (17)

Intervento di Sergio Bizzarro, PORCELANOSA GRUPO
Intervento di Sergio Bizzarro, PORCELANOSA GRUPOIntervento di Sergio Bizzarro, PORCELANOSA GRUPO
Intervento di Sergio Bizzarro, PORCELANOSA GRUPO
 
Complete Baths By Porcelanosa
Complete Baths By PorcelanosaComplete Baths By Porcelanosa
Complete Baths By Porcelanosa
 
Intervento di Lorenzo Bagnoli, PORCELANOSA GRUPO
Intervento di Lorenzo Bagnoli, PORCELANOSA GRUPOIntervento di Lorenzo Bagnoli, PORCELANOSA GRUPO
Intervento di Lorenzo Bagnoli, PORCELANOSA GRUPO
 
PORCELANOSA GRUPO 8 giugno 2015
PORCELANOSA GRUPO 8 giugno 2015PORCELANOSA GRUPO 8 giugno 2015
PORCELANOSA GRUPO 8 giugno 2015
 
Contract e progettazione integrata, menichini - porcelanosa
Contract e progettazione integrata, menichini -  porcelanosaContract e progettazione integrata, menichini -  porcelanosa
Contract e progettazione integrata, menichini - porcelanosa
 
Intervento di Francesco Giordano, PORCELANOSA Spa
Intervento di Francesco Giordano, PORCELANOSA SpaIntervento di Francesco Giordano, PORCELANOSA Spa
Intervento di Francesco Giordano, PORCELANOSA Spa
 
Matteo Martiniello, PORCELANOSA GRUPO
 Matteo Martiniello, PORCELANOSA GRUPO  Matteo Martiniello, PORCELANOSA GRUPO
Matteo Martiniello, PORCELANOSA GRUPO
 
Krion arq es
Krion arq esKrion arq es
Krion arq es
 
Porcelanosa
PorcelanosaPorcelanosa
Porcelanosa
 
Short Seminars and Workshops by Antonio Paraiso
Short Seminars and Workshops by Antonio ParaisoShort Seminars and Workshops by Antonio Paraiso
Short Seminars and Workshops by Antonio Paraiso
 
Porcelanosa - Organizational plan
Porcelanosa - Organizational planPorcelanosa - Organizational plan
Porcelanosa - Organizational plan
 
Modelli di business ed innovazione
Modelli di business ed innovazioneModelli di business ed innovazione
Modelli di business ed innovazione
 
La percezione estetica e visiva.
La percezione estetica e visiva.La percezione estetica e visiva.
La percezione estetica e visiva.
 
Krion PPT
Krion PPTKrion PPT
Krion PPT
 
Presentazione innovazione
Presentazione innovazionePresentazione innovazione
Presentazione innovazione
 
Decori na Porcelanosa
Decori na PorcelanosaDecori na Porcelanosa
Decori na Porcelanosa
 
Il come e il perchè dell'innovazione
Il come e il perchè dell'innovazioneIl come e il perchè dell'innovazione
Il come e il perchè dell'innovazione
 

Semelhante a Running without a ZFS system pool

SmartOS ZFS Architecture
SmartOS ZFS ArchitectureSmartOS ZFS Architecture
SmartOS ZFS ArchitectureBill Pijewski
 
Private cloud virtual reality to reality a partner story daniel mar_technicom
Private cloud virtual reality to reality a partner story daniel mar_technicomPrivate cloud virtual reality to reality a partner story daniel mar_technicom
Private cloud virtual reality to reality a partner story daniel mar_technicomMicrosoft Singapore
 
Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2
Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2
Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2John Heaton
 
Flexible DevOps Deployment of Enterprise Test Environments in the Cloud
Flexible DevOps Deployment of Enterprise Test Environments in the CloudFlexible DevOps Deployment of Enterprise Test Environments in the Cloud
Flexible DevOps Deployment of Enterprise Test Environments in the CloudDevOps for Enterprise Systems
 
Vancouver bug enterprise storage and zfs
Vancouver bug   enterprise storage and zfsVancouver bug   enterprise storage and zfs
Vancouver bug enterprise storage and zfsRami Jebara
 
Optimizing VM images for OpenStack with KVM/QEMU
Optimizing VM images for OpenStack with KVM/QEMUOptimizing VM images for OpenStack with KVM/QEMU
Optimizing VM images for OpenStack with KVM/QEMUOpenStack Foundation
 
1. Docker Introduction.pdf
1. Docker Introduction.pdf1. Docker Introduction.pdf
1. Docker Introduction.pdfAmarGautam15
 
RHEL5 XEN HandOnTraining_v0.4.pdf
RHEL5 XEN HandOnTraining_v0.4.pdfRHEL5 XEN HandOnTraining_v0.4.pdf
RHEL5 XEN HandOnTraining_v0.4.pdfPaul Yang
 
Operate your hadoop cluster like a high eff goldmine
Operate your hadoop cluster like a high eff goldmineOperate your hadoop cluster like a high eff goldmine
Operate your hadoop cluster like a high eff goldmineDataWorks Summit
 
Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications OpenEBS
 
Kubernetes - Hosted OSG Services
Kubernetes - Hosted OSG ServicesKubernetes - Hosted OSG Services
Kubernetes - Hosted OSG ServicesIgor Sfiligoi
 
제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustreTommy Lee
 
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...Ceph Community
 
Database as a Service on the Oracle Database Appliance Platform
Database as a Service on the Oracle Database Appliance PlatformDatabase as a Service on the Oracle Database Appliance Platform
Database as a Service on the Oracle Database Appliance PlatformMaris Elsins
 
Presentation architecting a cloud infrastructure
Presentation   architecting a cloud infrastructurePresentation   architecting a cloud infrastructure
Presentation architecting a cloud infrastructurexKinAnx
 
Presentation architecting a cloud infrastructure
Presentation   architecting a cloud infrastructurePresentation   architecting a cloud infrastructure
Presentation architecting a cloud infrastructuresolarisyourep
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservicesBigstep
 

Semelhante a Running without a ZFS system pool (20)

SmartOS ZFS Architecture
SmartOS ZFS ArchitectureSmartOS ZFS Architecture
SmartOS ZFS Architecture
 
Private cloud virtual reality to reality a partner story daniel mar_technicom
Private cloud virtual reality to reality a partner story daniel mar_technicomPrivate cloud virtual reality to reality a partner story daniel mar_technicom
Private cloud virtual reality to reality a partner story daniel mar_technicom
 
Xen @ Google, 2011
Xen @ Google, 2011Xen @ Google, 2011
Xen @ Google, 2011
 
Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2
Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2
Virtualization VM VirtualBox + Oracle Enterprise Linux With Oracle 11GR2
 
Flexible DevOps Deployment of Enterprise Test Environments in the Cloud
Flexible DevOps Deployment of Enterprise Test Environments in the CloudFlexible DevOps Deployment of Enterprise Test Environments in the Cloud
Flexible DevOps Deployment of Enterprise Test Environments in the Cloud
 
Vancouver bug enterprise storage and zfs
Vancouver bug   enterprise storage and zfsVancouver bug   enterprise storage and zfs
Vancouver bug enterprise storage and zfs
 
Optimizing VM images for OpenStack with KVM/QEMU
Optimizing VM images for OpenStack with KVM/QEMUOptimizing VM images for OpenStack with KVM/QEMU
Optimizing VM images for OpenStack with KVM/QEMU
 
How swift is your Swift - SD.pptx
How swift is your Swift - SD.pptxHow swift is your Swift - SD.pptx
How swift is your Swift - SD.pptx
 
1. Docker Introduction.pdf
1. Docker Introduction.pdf1. Docker Introduction.pdf
1. Docker Introduction.pdf
 
RHEL5 XEN HandOnTraining_v0.4.pdf
RHEL5 XEN HandOnTraining_v0.4.pdfRHEL5 XEN HandOnTraining_v0.4.pdf
RHEL5 XEN HandOnTraining_v0.4.pdf
 
Operate your hadoop cluster like a high eff goldmine
Operate your hadoop cluster like a high eff goldmineOperate your hadoop cluster like a high eff goldmine
Operate your hadoop cluster like a high eff goldmine
 
Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications
 
Kubernetes - Hosted OSG Services
Kubernetes - Hosted OSG ServicesKubernetes - Hosted OSG Services
Kubernetes - Hosted OSG Services
 
제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre
 
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
Ceph Day London 2014 - Best Practices for Ceph-powered Implementations of Sto...
 
Database as a Service on the Oracle Database Appliance Platform
Database as a Service on the Oracle Database Appliance PlatformDatabase as a Service on the Oracle Database Appliance Platform
Database as a Service on the Oracle Database Appliance Platform
 
Presentation architecting a cloud infrastructure
Presentation   architecting a cloud infrastructurePresentation   architecting a cloud infrastructure
Presentation architecting a cloud infrastructure
 
Presentation architecting a cloud infrastructure
Presentation   architecting a cloud infrastructurePresentation   architecting a cloud infrastructure
Presentation architecting a cloud infrastructure
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
 
Xen revisited
Xen revisitedXen revisited
Xen revisited
 

Último

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Último (20)

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Running without a ZFS system pool

  • 1. Running ZFS without a system pool Bill Pijewski Software Engineer, Joyent @pijewski Tuesday, October 2, 2012
  • 2. Agenda • Why ZFS is important to Joyent • Evolution of USB and PXE boot architectures • Running with no system pool Tuesday, October 2, 2012
  • 3. ZFS at Joyent • We run a production cloud with many servers in datacenters worldwide • Two kinds of zones (covered in detail in other talks): • Zones: sparse zones share libraries with the platform • VMs: fully virtualized GNU/Linux, Windows, FreeBSD, etc. machines • Use small number of NFS machines to provide additional storage capacity in each datacenter Tuesday, October 2, 2012
  • 4. ZFS for Zones and VMs • Zones are allocated two ZFS datasets • One dataset for data in that zone • Another for core files -- to prevent cores from exceeding quota • VMs have a ZFS volume into which the VM image is installed, plus one or more additional volumes presented to guest as disks • Guest filesystems are installed into volumes Tuesday, October 2, 2012
  • 5. ZFS in different contexts • For Joyent, two main contexts: SmartOS and SDC • SmartOS: community distribution, illumos + lightweight virtualization tools • SmartDataCenter (SDC): SmartOS + full cloud management and orchestration stack Tuesday, October 2, 2012
  • 6. Important ZFS features • As with all ZFS users, we take for granted rely on end-to-end data integrity • Copy-on-write architecture: snapshots, clones • Compression • Space management tools: quotas and reservations • Replication to move customers around between different machines Tuesday, October 2, 2012
  • 7. Delegated administration! • In our next SDC release, we enable delegated administration • Allows customers to: • Take snapshots outside of Joyentʼs API • Create child datasets • Snapshot and clone datasets • Replicate or migrate data between instances • Open work: basic limits on delegated activity to avoid DOS Tuesday, October 2, 2012
  • 8. ZFS Performance • SSDs for ZIL • ARC • We hold back some portion of a serverʼs total memory, knowing that a good portion of this memory will be consumed by the ARC • Committing memory achieves greater I/O performance • ZFS I/O throttle for QoS controls • For more information, check out Brendan Greggʼs excellent talk next door Tuesday, October 2, 2012
  • 9. Read-only system pool • At Fishworks, we decided to have a read-only system pool • Necessary for OS install as well as analytics data • Simplified some things: • No unnecessary customizations from customers • Discouraged hot patching • Other disadvantages: • Upgrade, rollback, and factory reset were tricky Tuesday, October 2, 2012
  • 10. SmartOS USB Boot • Instead of installing OS to root disks, SmartOS boots from a USB key • Entire kernel and userland fit in about 200 MB (compressed) • Other software can be installed from pkgsrc • Single ZFS pool for all zones Tuesday, October 2, 2012
  • 11. USB Boot Advantages • All disks are available for zone/VM storage, thereby increasing both performance and capacity • Encourages users to provision a zone for each application rather than using the global zone • Discourages customization and one-off patching • Fast to get up and running • Easy to “bring your OS with you” Tuesday, October 2, 2012
  • 12. SmartDataCenter (SDC) Architecture • Two kinds of servers: head nodes and compute nodes • Head nodes run management, provisioning, monitoring, and boot services • Compute nodes contain customer zones • Head nodes are similar to SmartOS installs • Each compute node PXE boots its platform from the head node • Both head nodes and compute nodes have a single ZFS pool Tuesday, October 2, 2012
  • 13. SDC Diagram DC 0 DC 1 DC 2 Headnode Headnode Headnode PXE PXE PXE CN 0 CN 10 CN 20 CN 1 CN 11 CN 21 CN 2 CN 12 CN 22 ...... ...... ...... Tuesday, October 2, 2012
  • 14. PXE Boot Advantages • Ben Rockwood, 10/1/2012: “Apparently other people spend time installing software. I think that's stupid.” • As with SmartOS, operators encouraged to put applications in zones instead of global • Upgrade = rollback = reboot, nothing more • Newer platforms can be staged and machines rebooted later • Any machine which hits a known fixed problem will automatically boot onto fresh platform Tuesday, October 2, 2012
  • 15. Storage pools! • Most OSes assume the existence of a “system” pool -- a pool onto which the OS, applications, and configuration information is installed • Joyent moving away from single-vdev pools backed by hardware RAID • Embracing hybrid storage pool (HSP) using an SSD for the ZFS intent log (ZIL) • Everything else worked on RAID-Z pools except for saving a crash dump Tuesday, October 2, 2012
  • 16. RAID-Z Crash Dump • Problem: have only one RAID-Z or mirrored pool but cannot save crash dump on said pool • Implement crash dumps on RAID-Z (majority of work) and pools with multiple vdevs • Not necessarily to save parity bits for crash dump data: • Crash dump is immediately saved upon reboot • Needs to be reliable, simple, and (hopefully) fast Tuesday, October 2, 2012
  • 17. Why no parity bits? • Since DVAs on the dump device are preallocated, use those 128K blocks for each write • Most calls into dump entry point are not block aligned • Rather than write variable size, use original 128K • I first calculated parity bits, only my test machine took three hours to save a crash dump • No parity calculated -- on a pool with n vdevs, each write could require n-1 (synchronous) reads Tuesday, October 2, 2012
  • 18. Other system components • Swap device (thankfully) supports RAID-Z pools • /var, /opt have their own datasets • /etc not persistent • /root also not persistent, again incentivizing people to configure applications in zones rather than using the GZ Tuesday, October 2, 2012
  • 19. Summary • The single ZFS pool has simplified Joyentʼs deployment • Delegated administration has given customers more power • ZFS has been and will continue to be a crucial component of our architecture for many years Tuesday, October 2, 2012
  • 21. Running ZFS without a system pool Bill Pijewski Software Engineer, Joyent @pijewski Tuesday, October 2, 2012
  • 23. ZFS 101 • ZFS is a copy-on-write filesystem from Sun originally shipped with Solaris 10 • Many innovative features: data compression, snapshot/rollback, ZFS send/receive, SSD integration • Enterprise-grade reliability and data integrity • Two main components relevant here: • ZFS pools • ZFS datasets Tuesday, October 2, 2012
  • 24. ZFS Pools • Aggregate disks into a single storage pool from which “datasets” are allocated • No parted/LVM needed • Mix both spinning disks and SSDs: • L2ARC: extends filesystem buffer cache • ZIL: absorbs synchronous write activity Tuesday, October 2, 2012
  • 25. ZFS Datasets • Datasets are a tree of blocks within the storage pool, presented as either: • A filesystem (file interface) • A volume (block interface) • Datasets can be flexibly resized, and volumes can even be thinly provisioned • Administrative controls on datasets Tuesday, October 2, 2012
  • 26. Zones and VMs • A zone is a lightweight software-virtualized container • Uses the systemʼs OS platform • Allocated its own ZFS filesystem (more in a sec) • A VM is a hardware-virtualized container for GNU/ Linux, Windows, BSD, etc. • Uses its own ZFS volume • VMʼs filesystem installed into ZFS volume • Both machines have resource controls for CPU, memory, and disk I/O Tuesday, October 2, 2012
  • 27. Advantages of ZFS • Snapshots: zone/VM backup and recovery • Space management: reservations and quota flexibly allocate space between zones • Delegated administration: each tenant can administer their own dataset: • Set compression level and other properties • Take snapshots of application data • Generate send streams for replication/backup Tuesday, October 2, 2012
  • 28. Advantages of ZFS (2) • Data integrity: verifies data of VM guest filesystems (ext4, XFS, NTFS, etc.) • Multiple storage configurations available: mirrored, RAID-Z2, and others • System fully supported on any storage configurations, can even take a crash dump to a RAID-Z pool Tuesday, October 2, 2012