2. About Me
• CTO of Tuangru, a data center management software company
• 22+ years of experience in technology
• Education in science (Physics) and business (MBA)
5. Typical Enterprise Storage Needs
• Data services for operations such as Windows network file sharing and email server back ends for a single enterprise
• Multi-tenant cloud for service providers
• Archival grade for backup and long term storage
• Specialized, e.g. low-latency applications such as high-frequency trading and low-latency databases
6. Components and Technologies
• Direct attach, e.g. locally attached disks, NVMe, NVDIMM, SAS JBODs, etc.
• SAN, e.g. Fibre Channel, iSCSI
• NAS, e.g. CIFS & NFS
• Object, e.g. S3 or Swift Object Storage
• Archival, e.g. BlackPearl from SpectraLogic and Everspan from Sony
7. Storage Tiers
• Tier 0: High performance, e.g. very busy OLTP databases $$$$
• Tier 1: General purpose, e.g. Web server $$$
• Tier 2: Low performance, e.g. backup site or backup target $$
• Tier 3: Cheap and deep, e.g. object store $
• Deep Archive: Write once read never (e.g. Archival tape libraries) $
8. Typical Concerns for a Storage Admin
• Cost
• Security (isolation of traffic and data)
• Performance (peak load, average load, percentiles, etc.)
• QoS (dealing with noisy neighbors)
• Scale management (more applications, more clients, more data, etc.)
• Growth management (scale up vs. scale out)
• Data integrity (silent corruption & device failure)
• Service availability (backup and business continuity)
• Programmability (prescriptive applications)
11. How does ZFS fit in?
• Brief history of ZFS
• Introduction to ZFS concepts
• Using ZFS in production
12. FreeBSD and ZFS
FreeBSD is used as the base system for NetApp, EMC Isilon, Dell Compellent, SpectraLogic, iXsystems TrueNAS and FreeNAS, and many more. However, not all of these use ZFS.
ZFS is the base storage filesystem for SpectraLogic, Oracle, FreeNAS, TrueNAS, Delphix, Nexenta, Netgear, OS Nexus, Datto, Joyent Cloud and many more. However, not all of these use FreeBSD.
13. Short History
• 2005 – Released as part of OpenSolaris under the CDDL license.
• 2007 – Integrated into FreeBSD as part of 7.0-RELEASE.
• 2010 – Oracle closed source development; open development continued in the illumos fork and later coalesced into the OpenZFS project.
• Open-ZFS.org is a vibrant, productive and open community that supports ZFS on Solaris variants (mainly illumos), FreeBSD, Linux and OS X.
14. ZFS Basics
ZFS is a copy on write (COW) file system that is designed to keep large amounts of data for an
indefinite period of time.
Its limits are designed not to be reached in practice.
Its design tolerates:
• Normal hardware failure scenarios, e.g. drive failure
• Data corruption, detected and repaired using checksums, parity information and data copies. This includes normal corruption due to disk failure as well as silent corruption/bit rot
16. Types of VDEVs
• Disk: An entire disk or a partition
• File: A file of at least 128 MB in size. This is typically for testing or experimentation
• Mirror: AKA RAID 1
• RAIDZ(1,2,3): roughly equivalent to RAID levels 5, 6, and a theoretical triple-parity 7
• Spare: Special pseudo-device. This is for hot spares to be used with “zpool replace”
• Cache: AKA L2ARC, used for read caching
• Log: A dedicated ZIL (ZFS Intent Log) device, used to capture synchronous writes before they are flushed to disk
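As a rough sketch (pool and device names here are hypothetical), a pool combining several of these vdev types could be assembled like this:

  # hypothetical names: striped pair of mirrors for data,
  # plus dedicated log, cache and spare devices
  zpool create tank mirror ada0 ada1 mirror ada2 ada3
  zpool add tank log ada4
  zpool add tank cache ada5
  zpool add tank spare ada6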
17. Datasets
• ZFS datasets are the basic building blocks for data management in ZFS
• Datasets are thin provisioned and share the pool
• Each dataset has system properties like mount point, compression, case sensitivity, read-only and many more
• Datasets can have user properties to further annotate them
• Datasets can be nested
• Dataset administration can be delegated
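A minimal sketch of day-to-day dataset management (the dataset names and the user property are hypothetical):

  # hypothetical names: create nested datasets and set system properties
  zfs create tank/projects
  zfs create tank/projects/alpha
  zfs set compression=lz4 tank/projects
  zfs set readonly=on tank/projects/alpha
  # user properties are namespaced with a colon
  zfs set com.example:owner=alice tank/projects/alpha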
18. ZFS Volumes
Volumes are a special type of dataset. They allow the storage admin to export a portion
of the pool as a block device that can be formatted to another file system, like UFS, EXT4
or NTFS.
Volumes work well for exporting block devices via iSCSI and can serve as a disk
backend for a VM.
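For example (names and size hypothetical), a 20 GB volume can be created and formatted with another file system; on FreeBSD the block device appears under /dev/zvol:

  # hypothetical names: create a volume and put UFS on it
  zfs create -V 20G tank/vmdisk0
  newfs /dev/zvol/tank/vmdisk0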
19. Snapshots
ZFS allows for nearly instantaneous read-only snapshots. Snapshots do not initially use any space in
the pool but will start to use space as the original diverges from the snapshot.
Snapshots can be used to:
• Restore a dataset or a single file
• Clone a dataset
Snapshots are not recursive by default. Be careful with nested datasets.
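A short sketch of the snapshot workflow (dataset and snapshot names hypothetical):

  # hypothetical names: take, roll back to, and clone a snapshot
  zfs snapshot tank/projects@before-upgrade
  zfs rollback tank/projects@before-upgrade
  zfs clone tank/projects@before-upgrade tank/projects-test
  # -r snapshots nested datasets recursively
  zfs snapshot -r tank/projects@nightly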
20. Replication
• Snapshots are the basis of replication
• A storage administrator can use zfs send to serialize a snapshot and write it to a file, or send it to another pool or system via SSH
• The zfs send command can also produce incremental streams
• The zfs receive command turns the stream from a send operation back into a dataset
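A sketch of a full send followed by an incremental one (host, dataset and snapshot names hypothetical):

  # hypothetical names: full replication, then only the changes
  zfs snapshot tank/data@mon
  zfs send tank/data@mon | ssh backuphost zfs receive backup/data
  zfs snapshot tank/data@tue
  # -i sends only the delta between @mon and @tue; -F keeps the target in sync
  zfs send -i @mon tank/data@tue | ssh backuphost zfs receive -F backup/data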
21. More Cool Things About ZFS
• Every zpool keeps a history of the commands that affected it and when the action was done. This can be accessed with the zpool history command
• ZFS has a robust quota system
• ZFS is NFS aware and sharing for datasets can be controlled with the sharenfs
property.
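For illustration (pool, dataset names and values hypothetical):

  # hypothetical names and values
  zpool history tank
  zfs set quota=50G tank/projects/alpha
  zfs set sharenfs=on tank/projects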
22. Preparing for Production Deployment
• Map out your performance versus data protection strategy
• Decide if you need to do any acceleration with ZIL and L2ARC
• Consider day 2 operations like pool expansion and hardware failure
• Look at your data and consider if compression & de-duplication will be of any use
• Look at application-specific optimizations, for example for databases like PostgreSQL and MySQL
• Measure twice, cut once! Remember that some ZFS settings and components are immutable and some operations are not reversible.
23. DOs and DON’Ts
DOs:
• Use ECC RAM (lots of it!)
• Use reliable IT-mode HBAs and storage controllers
• Monitor ARC & L2ARC cache hit rates
• Consider using a ZIL and L2ARC, especially with network file systems
• Disable atime unless absolutely needed, especially for SSDs
• Prefer 4K-native enterprise drives & SSDs
• Be very careful with de-duplication
• Use the right ashift value for your drives
• Scrub your pools periodically
• Look at SMART stats for drives
• Use GPT partitioning
• Turn on compression where needed
DON’Ts:
• Desktop RAM
• IR-mode RAID controllers
• Desktop-grade drives
• Filling up your pool
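A few of the DOs translate directly into commands. A sketch for FreeBSD (pool and device names hypothetical; smartctl comes from sysutils/smartmontools):

  # hypothetical names: scrub, check health, inspect SMART data
  zpool scrub tank
  zpool status tank
  smartctl -a /dev/ada0
  # prefer 4K alignment for newly created pools (FreeBSD sysctl)
  sysctl vfs.zfs.min_auto_ashift=12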
24. Example Applications and Tools for ZFS
iocage, a jail manager (FreeBSD)
chyves, a bhyve virtual machine manager (FreeBSD)
LXD, OS Container hypervisor (Ubuntu Server)
Docker, with ZFS as a storage backend (various Linux distros)
FreeNAS, a NAS implementation on top of FreeBSD
25. Emerging Trends and Final Thoughts
• Flash is winning the online storage game
• NVMe is the future on the hardware side
• Distributed, programmable and object storage technologies are the future
• There is room for ZFS, as it can offer the base layer or be part of a solution
• Open-source innovation is driving the future of storage
There is a lot of information and misinformation out there when it comes to storage and ZFS.
These slides are based on my personal experience designing, building and selling storage solutions.
I hope you find the information useful.
Data services are typically provided by traditional DAS, NAS and SAN technologies.
Multi-tenant cloud is unique because of the mixture of applications and organizations.
Archival storage is needed for business continuity and regulatory requirements. The scope of requirements is typically set by the regulation or the business, e.g. these records need to be kept for 7 years, those for 100 years, and so on.
Specialized applications have low-latency or high-throughput requirements, e.g. high-frequency trading. Vendor examples here are Fusion-io and IBM FlashSystem.
Tier 2 is getting squeezed out by Tier 1 and Tier 3 technologies, especially as prices drop for flash, disk and compute.
Vendor stability and talent risk are typically looked at as well. This is important when evaluating a solution beyond its technical merits.
For FreeBSD, the announcement email was sent to the freebsd-current mailing list by Pawel Jakub Dawidek on April 6, 2007. The work was supported by the FreeBSD Foundation, wheel.pl and Sentex.net.
https://lists.freebsd.org/pipermail/freebsd-current/2007-April/070544.html
Copy on write means that the original block in a write operation is never overwritten. Writes are redirected to a new empty block, and once the write completes the pointers are updated. The ramification of this is that your file system is always consistent. The caveat with copy on write is that fragmentation will increase, though in practice this is manageable.
Capacity: source: https://en.wikipedia.org/wiki/ZFS
ZFS is a 128-bit file system, so it can address 1.84 × 10^19 times more data than 64-bit systems such as Btrfs. The maximum limits of ZFS are designed to be so large that they should never be encountered in practice. For instance, fully populating a single zpool with 2^128 bits of data would require on the order of 10^24 3 TB hard disk drives.
Some theoretical limits in ZFS are:
• 2^48: number of entries in any individual directory
• 16 exbibytes (2^64 bytes): maximum size of a single file
• 16 exbibytes: maximum size of any attribute
• 256 quadrillion zebibytes (2^128 bytes): maximum size of any zpool
• 2^56: number of attributes of a file (actually constrained to 2^48 for the number of files in a directory)
• 2^64: number of devices in any zpool
• 2^64: number of zpools in a system
• 2^64: number of file systems in a zpool
Depending on the application, L2ARC and ZIL are optional and may not be needed.
ZIL and L2ARC provide a separation between performance and the underlying hardware.
Note the separation between data layout and how the data is stored
Source: https://www.freebsd.org/doc/handbook/zfs-term.html
A pool is made up of one or more vdevs, which themselves can be a single disk or a group of disks, in the case of a RAID transform. When multiple vdevs are used, ZFS spreads data across the vdevs to increase performance and maximize usable space.
Disk - The most basic type of vdev is a standard block device. This can be an entire disk (such as /dev/ada0 or /dev/da0) or a partition (/dev/ada0p3). On FreeBSD, there is no performance penalty for using a partition rather than the entire disk. This differs from recommendations made by the Solaris documentation.
File - In addition to disks, ZFS pools can be backed by regular files; this is especially useful for testing and experimentation. Use the full path to the file as the device path in zpool create. All vdevs must be at least 128 MB in size.
Mirror - When creating a mirror, specify the mirror keyword followed by the list of member devices for the mirror. A mirror consists of two or more devices; all data will be written to all member devices. A mirror vdev will only hold as much data as its smallest member. A mirror vdev can withstand the failure of all but one of its members without losing any data.
Note: A regular single disk vdev can be upgraded to a mirror vdev at any time with zpool attach.
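For instance (device names hypothetical), attaching a second disk turns a single-disk vdev into a mirror:

  # hypothetical names: mirror existing ada0 onto new ada1
  zpool attach tank ada0 ada1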
RAID-Z - ZFS implements RAID-Z, a variation on standard RAID-5 that offers better distribution of parity and eliminates the “RAID-5 write hole” in which the data and parity information become inconsistent after an unexpected restart. ZFS supports three levels of RAID-Z which provide varying levels of redundancy in exchange for decreasing levels of usable storage. The types are named RAID-Z1 through RAID-Z3 based on the number of parity devices in the array and the number of disks which can fail while the pool remains operational.
In a RAID-Z1 configuration with four disks, each 1 TB, usable storage is 3 TB and the pool will still be able to operate in degraded mode with one faulted disk. If an additional disk goes offline before the faulted disk is replaced and resilvered, all data in the pool can be lost.
In a RAID-Z3 configuration with eight disks of 1 TB, the volume will provide 5 TB of usable space and still be able to operate with three faulted disks. Sun™ recommends no more than nine disks in a single vdev. If the configuration has more disks, it is recommended to divide them into separate vdevs and the pool data will be striped across them.
A configuration of two RAID-Z2 vdevs consisting of 8 disks each would create something similar to a RAID-60 array. A RAID-Z group's storage capacity is approximately the size of the smallest disk multiplied by the number of non-parity disks. Four 1 TB disks in RAID-Z1 has an effective size of approximately 3 TB, and an array of eight 1 TB disks in RAID-Z3 will yield 5 TB of usable space.
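A sketch of that RAID-60-like layout, assuming sixteen hypothetical da(4) devices:

  # hypothetical names: two RAID-Z2 vdevs of eight disks each
  zpool create tank \
      raidz2 da0 da1 da2 da3 da4 da5 da6 da7 \
      raidz2 da8 da9 da10 da11 da12 da13 da14 da15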
Spare - ZFS has a special pseudo-vdev type for keeping track of available hot spares. Note that installed hot spares are not deployed automatically; they must manually be configured to replace the failed device using zpool replace.
Log - ZFS Log Devices, also known as ZFS Intent Log (ZIL) move the intent log from the regular pool devices to a dedicated device, typically an SSD. Having a dedicated log device can significantly improve the performance of applications with a high volume of synchronous writes, especially databases. Log devices can be mirrored, but RAID-Z is not supported. If multiple log devices are used, writes will be load balanced across them.
Cache - Adding a cache vdev to a pool will add the storage of the cache to the L2ARC. Cache devices cannot be mirrored. Since a cache device only stores additional copies of existing data, there is no risk of data loss.
Coolest use is boot environments (IMHO)
This allows you to determine if the target system is an online replica or a backup target.
This can all be automated via cron and there are tools that make this easier.
Note that the sharesmb property has no effect on FreeBSD.
For optimizations, please note that things should work out of the box, but some applications can benefit from extra tuning; for example, if the application already compresses its data there may be no need to ask ZFS to compress the dataset.
Immutable: for example, vdev settings like ashift (visible with zdb).
One-way operations: for example, upgrading a pool to a new version of ZFS.
These are mostly for production.
Normally you want to add more VDEVs or delete unneeded files when the pool reaches around 80% capacity.
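For example (pool and device names hypothetical), check the CAP column and grow the pool by adding a vdev:

  # hypothetical names: check capacity, then expand the pool
  zpool list tank
  zpool add tank mirror da10 da11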
There are many more, like zrep, zfstools and zfsstats; all are applications that make it easy to get stats from ZFS and to manage replication. Your mileage will vary depending on the tool and what you are trying to do.
My recommendation is to not boil the ocean. Stick to basics and add tools when absolutely needed to automate things you understand.