SlideShare uma empresa Scribd logo
1 de 16
Baixar para ler offline
White Paper




VNX WITH THE CLOUD TIERING APPLIANCE
A Detailed Review




                    Abstract
                    This paper describes the EMC Cloud Tiering Appliance (CTA). The
                    CTA enables NAS data tiering, allowing administrators to move
                    inactive data from high-performance storage to less-expensive
                    archival storage, thus enabling cost-effective use of file storage.
                    The CTA also facilitates data migration which moves data to new
                    shares or exports.

                    July 2012
Copyright © 2012 EMC Corporation. All Rights Reserved.

EMC believes the information in this publication is accurate as
of its publication date. The information is subject to change
without notice.

The information in this publication is provided “as is.” EMC
Corporation makes no representations or warranties of any kind
with respect to the information in this publication, and
specifically disclaims implied warranties of merchantability or
fitness for a particular purpose.

Use, copying, and distribution of any EMC software described in
this publication requires an applicable software license.

For the most up-to-date listing of EMC product names, see EMC
Corporation Trademarks on EMC.com.

VMware and VMware ESX are registered trademarks or
trademarks of VMware, Inc. in the United States and/or other
jurisdictions. All other trademarks used herein are the property
of their respective owners.

Part Number h10777




                                 VNX with the Cloud Tiering Appliance   2
Table of Contents
Executive summary ........................................................................................... 4
   Business case .................................................................................................................... 4
   Solution overview ............................................................................................................... 4
Introduction ..................................................................................................... 5
   Scope ................................................................................................................................. 5
   Audience ............................................................................................................................ 5
   Terminology........................................................................................................................ 5
Archiving ......................................................................................................... 6
   Overview ............................................................................................................................ 6
   Hierarchical Storage Management ...................................................................................... 6
   FileMover............................................................................................................................ 7
   Archive Policies .................................................................................................................. 7
   Providing data to the CTA.................................................................................................... 8
   Scheduler ........................................................................................................................... 8
   Simulation.......................................................................................................................... 8
   Recall ................................................................................................................................. 8
   Recall using CTA-HA ............................................................................................................ 9
   Archive requirements for the source NAS server.................................................................. 9
   Archive requirements for the target repository server ........................................................ 10
      Compression and encryption ........................................................................................ 10
   Multi-tiered archive .......................................................................................................... 11
   CTA database ................................................................................................................... 11
   Stub scanner jobs............................................................................................................. 12
   Orphans ........................................................................................................................... 12
   Reporting.......................................................................................................................... 13
Migration ....................................................................................................... 13
   Overview .......................................................................................................................... 13
   Migration source............................................................................................................... 13
   Migration targets .............................................................................................................. 14
   The migration process ...................................................................................................... 14
   Other CTA interactions with VNX ....................................................................................... 15
   Miscellaneous topics........................................................................................................ 16
   Summary .......................................................................................................................... 16




                                                                                        VNX with the Cloud Tiering Appliance                    3
Executive summary
The EMC® Cloud Tiering Appliance (CTA) optimizes primary NAS storage by
automatically archiving inactive files to less-expensive secondary storage. The
secondary storage can be of lower cost, such as an NL-SAS or SATA disk on a NAS
device, or it can consist of public or private clouds platforms. After a file is archived, a
small stub file remains on the primary storage, so that the file appears to the user as
if it were in its original location. File tiering dramatically improves storage efficiency,
and shortens the time to back up and restore data.
In addition to archiving data from primary to secondary storage, the Cloud Tiering
Appliance can also permanently migrate files from a source to a destination without
leaving a stub, as when NAS server hardware requires a technology refresh.

Business case
From the early days of computing, storage has existed in tiers or levels, with
differences in cost, performance, availability, or redundancy, and so forth. For
example, newer flash storage outperforms older storage which consists of NL-SAS or
SATA, but costs more per gigabyte.
NAS data also exists in tiers and does not all have the same value. Typically, as data
ages, users access it less frequently. To optimize use of a normal tiered-NAS-storage
environment, a customer must ensure that the less-valuable, less-frequently-
accessed data does not consume high-speed, expensive storage resources. High-
speed storage should be reserved for the active, important data, while the less-active
data should reside on cheaper storage.
In addition, the storage tiering process must work automatically, so that it does not
add to the storage administration overhead.

Solution overview
The CTA employs Hierarchical Storage Management (HSM), which has been a staple
of the mainframe world for decades. Hierarchical Storage Management moves a file
from primary storage to lower-cost, secondary storage, and leaves a small stub
pointer file in the file’s original location. This process of relocating data and stubbing
describes archiving or tiering.
The CTA acts as a policy engine by interacting with a VNX share or export, and
identifying files that fit predefined criteria. For these files, the CTA initiates movement
to a lower tier repository, for example NAS, Centera, or cloud, and leaves a stub file on
the VNX share. When a client that is accessing the VNX share or export tries to read
the stub, the CTA recalls the original file from the repository tier. To the user, the file
appears to be in its original location on high-performance VNX storage. However,
instead of the space required to store the entire file, only an 8 KB stub file is on the
primary tier.
If the storage administrator wants to move data in the share or export to another
location, for example, to replace on old Celerra with a new VNX, the CTA can help. The




                                                         VNX with the Cloud Tiering Appliance   4
CTA migration feature relocates multiprotocol data including stub files, from one
share or export to another.
When used for archiving or tiering, the CTA will automatically move inactive data to
lower tiers of storage. This allows more efficient use of the most expensive, highest-
performing NAS storage.
When used for file migration, the CTA enables relocation of NAS data, within a NAS
server, across NAS servers, and across NAS servers from different vendors.


Introduction
Scope
This paper outlines CTA features, how CTA functions, and the business problems that
CTA helps to solve. In a technical overview of CTA, this paper also describes how to
manage CTA and implement solutions in a VNX NAS environment.

Audience
This white paper is intended for users who have a basic understanding of VNX Unified
Storage or, a general grasp of NAS storage concepts.

Terminology
Archive repository – Lower tier storage than the NAS storage that is accessible to the
CIFS or NFS clients. The repository is the target of a file archival process. In an
archiving operation, CTA moves data from the primary or source tier to the repository
and leaves a stub file on the primary tier. The stub points to the file in the repository.
A repository tier is a NAS share/export, an EMC Centera, an EMC Atmos cloud, or the
Amazon S3 cloud.
File archiving – A primary CTA function that scans a NAS file server for files that meet
defined criteria, and moves the files to a lower tier of storage. CTA replaces the file on
the NAS server with a stub file that points to the real file on the archive repository.
File migration – The movement of files from one export or share to another as when
replacing a NAS server.
FileMover –VNX Data Movers include FileMover software. FileMover or DHSM enables
stubbing and recall of archived files, and provides an API that the CTA uses for both
archiving and migration. To use file archiving with a Celerra or VNX file system,
export, or share, enable FileMover for the Data Mover.
File tiering – See “File archiving.”
FPolicy –NetApp Filers include FPolicy software. FPolicy enables stubbing and recall
of archived files, and provides an API that the CTA uses for both archiving and
migration. The CTA uses the FPolicy interface to archive files from NetApp servers.
Orphan file – When a file has been archived, a stub on the source NAS server points
to the archived file. Deleting a stub does not automatically delete the archived file.



                                                         VNX with the Cloud Tiering Appliance   5
Instead, the archived file becomes an orphan or a repository file without a stub
pointing to it. To delete the orphan, the CTA user runs an orphan delete job on the
repository.
Policy – Rules for migration, or a rules and one or more repository destinations for
archiving and tiering. For example, an archiving policy can send a file that has not
been accessed in 1 year, to a company’s private Atmos cloud server and a that file
has not been accessed in 2 years, to a public Amazon S3 cloud.
Primary storage – The storage tier that CIFS and NFS clients mount on the VNX.
Source tier – See “Primary storage.”


Archiving
Overview
The CTA provides two primary functions: archiving or tiering, and migration. Archiving
moves inactive data to a lower tier of storage and leaves behind a stub file.

Hierarchical Storage Management
Historically, mainframe computer users implemented a tiered storage system
because disk storage was expensive and tape storage was inexpensive. To maintain
the limited free disk space on the mainframe, the user would manually move less-
important data to tape. To retrieve data later, the user had to record the tape volume
where the data was stored.
Mainframe system developers began to consider solutions to the problem of how to
move data to cheaper, external storage, without requiring the user to remember
where it was stored and to manually run commands to recall the data when it was
needed.
The solution was Hierarchical Storage Management (HSM). This system would scan
the data, find files that had not been accessed for some time, and automatically
move them to a lower tier of storage. In place of the file, the system would store a
small stub that contained an internal pointer to the file. The user would see the stub
as if it were the actual file, but when trying to access it, the system would tell the user
to wait while it automatically retrieved the file and restored it to the original location
on the user’s disk.
HSM is the basis for the CTA archiving function. The concept is straightforward,
robust, and time-tested. The CTA supports archiving to disk storage (cloud, CAS, or
NAS). The existence of storage tiers in most customer environments make the HSM
tiering solution as important today as it was decades ago, during the mainframe era.




                                                         VNX with the Cloud Tiering Appliance   6
FileMover
FileMover is NFS and CIFS file services software for the VNX and Celerra file system
that allows HSM-style archiving. On the VNX, the primary FileMover command is
DHSM, which shows its HSM roots.
At a basic level, FileMover intercepts client access to data, and takes action before
the client accesses the data. CTA is an external system that can direct FileMover. The
CTA user defines a task that directs FileMover to perform a series of actions, for
example:
•   To scan a share for files that are more than 60 days old
•   To move the files to the archive
•   To replace the files with stubs
•   To activate the CIFS offline bit on the stubs
Once the stubs are in place, FileMover monitors the stubs. When a client tries to read
or write to the file, FileMover will intercept that access, and recall the data using
information contained in the stub.
Every VNX or Celerra that functions as an archive source must have the FileMover API
enabled and configured. The configuration is part of the CTA setup.
FPolicy is an API for the NetApp that is similar to FileMover.
CTA needs an API similar to FileMover or FPolicy to archive from a NAS system.
Because these types of APIs are not available on other platforms such as Linux or
Windows, CTA can only archive data from VNX, Celerra, and NetApp.

Archive Policies
A policy in CTA archiving or tiering context consists of rules and one or more
destinations. An example of a simple policy is “if this file has not been accessed in
six months, send it to the Atmos cloud, and replace it with a stub.” The one-rule, one-
destination is common, and many CTA users will use this type of policy on their data.
However, CTA rules are flexible. You can create more complex rules, that archive to
multiple tiers. For example, a single policy could be:
“If any file has not been accessed in more than one year, and is larger than 1 MB in
size, send it to my Isilon, unless it’s a PDF file, in which case don’t archive it. Then
when these files have not been accessed for two years, move them from the Isilon to
the Atmos cloud, and update the stub file to point to their new location.”
Policy rules are based on attributes such as access time, modify time, inode change
time, file size, file name, or directory name. The archive policy action is to “archive”
or “don’t archive.” A single expression or a combination of expressions define the
archive policy.




                                                         VNX with the Cloud Tiering Appliance   7
A policy does not contain a share name. You can define one policy to evaluate shares
A, B, and C, but define another policy to evaluate share D. Or you can define several
different policies to evaluate a single share.

Providing data to the CTA
Administrators usually direct a CTA archive policy to evaluate a file system, CIFS
share, or NFS export. The CTA scans the files, and applies the policy rules to each file,
one at a time. If there are multiple rules in the file, the CTA contines to apply the rules
until a rule evaluates to “true.” It then takes the action associated with the rule, such
as archive, or don’t archive, and moves to the next file.
There is another way to provide files to a CTA policy. Instead of directing the CTA to
scan the files in a share, the CTA administrator imports a list of filenames, and the
CTA only scans and applies the archive policy to the files in that list. This feature,
called “file ingest,” is primarily for third-party vendors with software products that
have their own scanning systems, but want to use the CTA archive engine. The Cloud
Tiering Appliance Imported File List Archive Task Technical Notes document describes
the file ingest feature and is available from Powerlink.

Scheduler
You use the scheduler to set the job start time. For example, a CTA administrator
schedules a batch job to start at 2:00 a.m. on Saturday, scan share01, and evaluate
the files with a policy for archiving.
An administrator usually schedules a job to run weekly, every other week, or monthly.
The first time the archiving job runs, the policy will often select and archive at least
half of the data. So the first archive job can require a long time to run and can move a
lot of data. Future jobs will move incremental amounts of data.

Simulation
The CTA can simulate archive jobs. You can schedule an archive job with a policy, but
run it in simulation mode. The CTA will scan the source share and apply the policy
rules against each file, but not take any archive action. Instead, the CTA tracks the
number of files and amount of data it would have archived, and at the end of the
simulation, it displays a report. Simulation is a good way to test the effectiveness of a
policy and to edit the policy rules, before running a real archive job.

Recall
When a file has been archived to a repository, leaving a stub on the source NAS
share, the NAS client expects the stub to look and behave like the original file. “File
recall”is the process by which the user clicks on the stub file and quickly accesses
the original file.
The stub file contains all the information needed to find the actual file. The VNX set
the offline bit on the stub when the file was archived. When a user attempts to read a
stub file, FileMover interacts with CTA and intercepts the read request to begin the
process of recalling the file from the repository. If the repository is on a CIFS or NFS



                                                         VNX with the Cloud Tiering Appliance   8
share, FileMover recalls the file using CIFS or NFS. If the repository is a CAS or cloud
(such as Centera, Atmos, or Amazon), then the VNX sends the recall request to the
CTA, which will effect the recall and pass the file to the VNX.
After recalling the file, the VNX either:
•   Provides the file to the user, but leaves the stub in place, known as “passthrough
    recall.”
•   Writes the file back to its original location and deletes the stub, known as“full
    recall.”
A FileMover command on the VNX Control Station sets the recall style, which can be
set on a file system-by-file system basis.

Recall using CTA-HA
If an archive or migration job batch job fails, there is no loss of data, You would
correct the problem and rerun the job. For this reason, the complexity of a High
Availability (HA) configuration for archival or migration is not necessary or justified.
However, recall is mission-critical, because it affects the ability of clients to access
their data. For this reason, configurations where the CTA is in the recall path, such as
archiving from VNX/Celerra to Centera, Atmos, or Amazon, or all archival from NetApp,
require an HA.
The HA configuration pairs the CTA-HA physical appliance or the CTA/VE-HA virtual
appliance with one or more CTA or CTA/VE systems. The CTA-HA system is a recall-
only version of the CTA. If the source NAS server cannot perform the recall, either the
CTA recall host or its CTA-HA partner can perform the recall.
By creating a DNS hostname that maps to the IP addresses of both the CTA and the
CTA-HA, and by configuring the VNX to find the CTA using that hostname, the VNX can
use both recall hosts alternately, in a round-robin fashion. This balances the recall
load, and if one recall host fails, the other can perform recalls until the failed host
returns to service. This configuration also allows maintenance of one host while the
other continues to perform recalls.
The CTA-HA also performs keystore replication for encryption keys that are generated
on the CTA when the encryption feature is selected during archival to Atmos or
Amazon clouds as described in Compression and encryption on page 10.

Archive requirements for the source NAS server
CTA can archive data from CIFS or NFS shares on VNX, Celerra, or NetApp NAS servers.
FileMover for VNX or Celerra, and FPolicy for NetApp, both provide archiving services.
To identify stub files, both FileMover and FPolicy read the offline bit on the stubs. The
CIFS protocol supports offline bits, but NFS does not, so the VNX/Celerra will handle
offline bits internally for NFS-only archival. The CTA communicates with VNX and
Celerra using the DHSM API. Before archiving data from a VNX or Celerra, the CTA
configuration must include the VNX or Celerra properties and the DHSM connections
that link file systems to one or more repositories. The CTA and FileMover



                                                         VNX with the Cloud Tiering Appliance   9
automatically create the DHSM connections when needed. The Cloud Tiering
Appliance Getting Started Guide provides the configuration procedure for the CTA
with the VNX or Celerra.
Deleting the DHSM connection on a VNX or Celerra Control Statio will optionally
trigger a recall of all stubbed data from the repository linked to the VNX or Celerra file
system that uses that connection.
The CTA and CTA-HA must have full control of the source shares. If the source includes
NFS exports, these exports must have root and read/write permission for the CTA and
CTA-HA IP addresses.
When archiving from CIFS shares, the source server must belong to a domain and the
CTA configuration settings for that server require a username from that domain. The
username must be in the local administrator’s group of the CIFS server associated
with the source.

Archive requirements for the target repository server
The CTA can archive to three kinds of repositories:
•   NAS (CIFS or NFS), such as VNX, Celerra, VNXe, Data Domain, Isilon, Windows, or
    NetApp
•   CAS such as Centera
•   Cloud such as Atmos or Amazon S3
Each repository has slightly different configuration requirements.
•   Requirements for NAS repositories are similar to the requirements for source
    servers. A CIFS domain user must be in the local admin group of the CIFS server.
    NFS exports must have root and read/write permission for the CTA and CTA-HA IPs.
•   Centera configuration requires a PEA file or “anonymous.”
•   Cloud configuration requires a tenant user for Atmos or a bucket user for Amazon
    S3.
One repository can serve as an archive target for multiple CTAs.
One CTA can archive to multiple repositories.
A CTA repository migration job will move all the archived data from one repository to
another, and update the stubs to point to the new location.
Only the CTA or CTA-HA can have access to the CTA repositories. The NAS share that
serves as the repository is visible, but the layout of archived data is proprietary.
Changes to the repository can render archived data unrecallable.

Compression and encryption
Compression and encryption are options when archiving to either of the two cloud
repository tiers, Atmos, or Amazon S3. A compression style such as fast or strong is a
policy option. Encryption is also a policy option, but encryption prerequisites are:




                                                         VNX with the Cloud Tiering Appliance   10
1. You must configure keystore replication between the CTA and a CTA-HA machine.
2. You must generate a key using the CTA GUI.
The CTA stores the key in the keystore and replicates it to the CTA-HA. Every archive
task that uses a policy with encryption uses the key. If CTA generates and replicates a
new key, it applies the key to new encrypted archive tasks. The old keys that remain
in the keystore continue to apply for files encrypted using the old keys.
Keystore replication will be sufficient for normal outages, but after generating a new
key, back up the CTA configuration to preserve the keystore.

Multi-tiered archive
Consider the following example:
“Find NAS files that have not been accessed in six months, and archive them to my
private cloud storage. Find NAS files that have not been accessed in one year, and
send them to the public cloud such as Amazon S3 or AT&T’s Atmos-based cloud. And
if any files archived on the private cloud have not been accessed for one year, move
the files from the private to the public cloud, and update the stub files to point to the
new location.”
This scenario describes CTA’s multi-tiered archiving feature. By creating a multi-tiered
policy type with several rules, each with a different repository, you can design an
archiving scheme to fit this example. This multi-tiered archiving policy would have the
following rules:
•   if atime > 1 year, archive to Amazon S3 cloud
•   if atime > 6 months, archive to private Atmos cloud
The order of the rules is important. If a policy has several rules, the rules are applied
one at a time. When the first rule evaluates as true, CTA takes the “archive” or “don’t
archive” action. CTA does not apply the subsequent rules, and the policy moves on to
the next file.
In this example, if the order of rules were reversed, you would get an unintended
result. The 6-month old rule would be applied first, and the 1-year old rule would
never be applied, because any file older than 1 year is also older than 6 months. All
data older than 6 months would be archived to the private Atmos cloud, and no files
would be archived to the Amazon S3.

CTA database
When a file is archived to a repository, the stub on the source NAS tier points to the
file location in the repository. However, the file in the repository has no pointer back
to the source. The repository files have no connection to the source. The CTA
database solves this problem. Each time a file is archived, an entry in the CTA
database records the file’s original location on the source, and the file’s location in
the repository.




                                                        VNX with the Cloud Tiering Appliance   11
CTA does not use the database for recalls, because the stub on the source includes
the information necessary to locate the file in the repository. However, the database
inlcudes entries for every archived file. It contains statistical data and orphan
information as described in Orphans on page 12. To protect the database, schedule
regular CTA backups. If a CTA fails, import the most recent backup into a new CTA.

Stub scanner jobs
For every scheduled archive job, the CTA automatically schedules a monthly stub
scanner job. The stub scanner is a utility that reads the stubs in a share and
compares them to the entries in the CTA database. If stubs move to different locations
or orphans appear, the stub scanner will ensure that the CTA database is kept
current.
Because a stub on the source has the information necessary to recall a file from the
repository, CTA does not need to query stub and repository file locations in the CTA
database. However, CTA can manage repository storage more efficiently if
information in the database matches the stub and repository file locations on the
system, so stub scanner jobs run every 30 days by default.

Orphans
If stub files are deleted from the source repository, the actual files in the repository
become orphans, and are not automatically deleted for the following reason.
Generally a storage administrator will back up stubs when backing up CTA-archived
shares. Many NDMP-based backup programs back up stubs by default. With
protected repositories and small stubs, a NAS server that employs CTA benefits
greatly from faster backups and smaller backup windows. However, when a backup is
restored, the stubs need to point to something. If the CTA had deleted archived files
when the stubs were deleted, restoring the backup would restore stubs that point to
nothing.
To delete orphan files, and recover space on the repository, run an “orphan_delete”
job periodically. Do not delete orphans until you are certain that you will not restore
stubs that point to the orphans. For example, if you keep backups for six months,
then define the orphan deletion job to delete files that have been orphans for at least
eight months.
The CTA database and the stub scanner play important roles in the management of
orphan files. Every time the stub scanner sees a stub, the CTA records a “last seen”
time in the database. If the stub is deleted, the stub scanner identifies the file in the
repository that was linked to the stub as an orphan. Because the “last seen” time is
in the database, the CTA knows how long the file has been an orphan. CTA uses the
orphan age to determine which orphans to delete.
If the CTA database is lost, the location and age of orphan files in the repository is
lost. CTA database backup is therefore an important process.




                                                         VNX with the Cloud Tiering Appliance   12
Reporting
CTA generates reports on the files it archives or migrates. For archived files, CTA
reports will display the size, number of files archived, and breakdown by file types,
but the CTA does not give a detailed profile of the data in the file system. You can run
archive simulations to obtain information on file ages. For example, multiple
simulations, filtering for access times of various ages, can yield an age profile for the
files in the file system.


Migration
Overview
The CTA provides two primary functions: archiving (or tiering), and migration.
Migration moves files from one share or export to another.
NAS migration copies data from one share or export on one system to a another.
Migration is useful when replacing servers. Administrators use CTA to move data from
old to new servers with minimum disruption to the NAS client users. The CTA can
perform multi-protocol, incremental, stub-aware, cross-vendor migrations, which can
greatly reduce the effort and complexity of implementing new NAS technology.
A CTA migration is a batch job that moves data from a source NAS share to a target.
The target share must be sufficiently large to hold the data that will move from the
source share. The target share does not need to be the same size or layout as the
source share.
CTA supports CIFS, NFS, or multi-protocol CIFS/NFS file systems. The supported
source platforms are: VNX, Celerra, NetApp, Windows. The supported target
platforms are: VNX, VNXe, Celerra, Isilon. CTA migrates data from VNX, Celerra, or
NetApp source to any target. When CTA migrates data from a Windows source server,
only VNX or VNXe can be used as targets.

Migration source
CTA migration uses NDMP as its file transfer engine when migrating from VNX, Celerra,
and NetApp. CTA uses EMCopy when migrating from Windows.
•   The NDMP-style migrations are policy-based. You would first create a policy,
    similar to the archive policies. For example, to migrate all the data in share1 to
    newshare1, but do not wish to include PDF files that are more than three years
    old, you can create a rule to omit these files from the migration. If you want to
    copy everything without filtering, you would create a trivial policy with a rule such
    as:
       if size >= 0 bytes, then migrate
•   The EMCopy-based migration is not policy-based. When migrating from Windows
    to VNX, CTA copies all the data.




                                                        VNX with the Cloud Tiering Appliance   13
The CTA uses the FileMover or DHSM API to migrate files from VNX or Celerra. The CTA
creates snapshots on the source file system by making API calls. Snapshots enable
users to continue to access and write to the source share while the migration is taking
place.
For migration tasks, the CTA needs the same kind of access permissions on the
source and target shares and exports as for archive tasks. For a CIFS source server, a
CIFS domain user must be in the local admin group of the CIFS server. For an NFS
source server, NFS exports must have root and read/write permission for the CTA IPs.
Multi-protocol migrations require both CIFS and NFS access permissions.
NOTE: File migration does not use the CTA-HA.
An NDMP user must exist on the source server for NDMP-style migrations. The server
configuration on the CTA requires the NDMP userid and password. The Cloud Tiering
Appliance Getting Started Guide provides details.
To perform migrations using a Windows source , install the EMWS copy agent on the
Windows server. CTA supports Windows 2003 and 2008 as migration source
platforms.

Migration targets
Before starting a migration task, target shares or exports must exist on target servers.
The CTA will not create them automatically. Migrating files to the root level in these
shares requires that the shares be empty, except for file system files such as
.lost+found. However, you can migrate files to any empty directory in a non-empty
share.
The CTA needs the same permissions on target file systems as on the source. For CIFS
targets, a CIFS domain user must be in the local admin group of the CIFS server. For
NFS targets, NFS exports must have root and read/write permission for the CTA IPs.
Multi-protocol migrations require both CIFS and NFS credentials.
The protocol of the target share or export (CIFS, NFS, or mixed protocol) must match
the protocol of the source. For the NDMP-style migrations, an NDMP user must exist
on the target server. The server configuration on the CTA requires the NDMP userid
and password.

The migration process
Migrations run as scheduled batch jobs. Schedule the migration task on the CTA
Schedule page or through the CLI. Specify:
•   a source share
•   a policy for NDMP-style migrations (except for Windows-to-VNX EMCopy-style
    migrations, which don’t accept policies)
•   an empty target share or directory




                                                       VNX with the Cloud Tiering Appliance   14
When the job begins, CTA creates a snapshot on the source. Then CTA copies all the
data to the target, applying the policy, if applicable. The migration can begin
anywhere in the source share (for example at the top level or in a subdirectory), and
can move files to any empty directory on the target.
After the first pass completes, the target will have a copy of the data in from the
snapshot on the source. However, if users have continued updating the source during
migration, the source share will probably be different from the snapshot. So, you can
run a second pass to pick up those incremental changes. The CTA will create another
snapshot, and compare the second to the first, to pick up changes such as new files,
deleted files, metadata changes, and so forth.
You can continue running passes to pick up changes. For the final pass, put the
source in offline or in read-only mode to the clients to ensure that that the target is
identical to the source. To complete the migration, transfer the client mounts to the
target share. The target has copies of all CIFS and/or NFS ACLs or permissions. After
transferring the clients, you can delete the source share and recover its storage space
on the source NAS system.
If you wish to throttle CTA migration tasks, you can specify a maximum MB/sec rate
when you create the task schedule, so that the migration comsumes less network
bandwidth. You can also use a SID translation tool that CTA provides to create a SID
mapping of local to domain SIDs. CTA applies the mapping at migration time, to
ensure that the SIDs on the target will be correct.
CTA can run migration tasks in a continuous mode. The migration task has the option
to run incremental migrations until a files-moved threshold is reached. For example,
“keep running incremental migrations until fewer than 1000 files are moved in one
run. Then stop and notify the administrator.” Customers typically perform
incremental runs during scheduled off hour periods over the course of several
evenings, with the final locked cutover also during off hours.
CTA migration is not only unique because it can be asymmetrical, for example
between file systems of different size, different disk layouts, across different
platforms. CTA is also unique because it provides profile-filtering, multi-protocol
migration, for example, from NetApp “multi” to VNX “multi”.
CTA also migrates stubs. If the source and target both support CTA stubs, CTA
migrates the stubs without recalling the archived data, and the stubs will continue to
function after migration.
•   If the source is VNX or Celerra and the target is Isilon or VNXe, which do not
    support stubs, CTA migrates the actual files. Stubs are not migrated.

Other CTA interactions with VNX
The CTA supports VNX File-Level Retention (FLR). When archiving to a VNX, Celerra, or
Centera, you can specify retention times on the archived data. You can also specify
retention times on stub files, but FileMover manages the stub retention on the source.
CTA does not support FLR-enabled source file systems.




                                                        VNX with the Cloud Tiering Appliance   15
Historically, the access times during a migration could not be preserved. NDMP
always set access times equal to modification times when moving the data. However,
with the release of VNX 7.1, access times during a file migration can be preserved
when VNX 7.1 is the migration source.
De-duplication on the repository shares of a VNX greatly enhance the efficiency of
repository storage. The use of de-duplication for source shares is not recommended.
Stub files do not benefit from de-duplication.

Miscellaneous topics
In addition to archiving and migration, CTA can perform other tasks.
“Delete stub with policy” is the only task type that deletes stubs, deletes the
repository file that the stubs point to, and removes the reference to stubs from the
CTA database. Use the “delete stub with policy task” with caution.
“Multi-tier stub with policy” is similar to the “Multi-tier archive,” except that it only
scans stubs. The task moves archived data that matches the multi_tier_stub policy
from one repository to another. Like the “delete stub with policy” task, it only scans
stub files.

Summary
The Cloud Tiering Appliance (CTA) enables NAS data tiering, allowing administrators
to move inactive data off of high-performance storage to less-expensive archival
storage, thereby enabling more cost-effective use of file storage. The CTA also
enables relocation of NAS data to new shares or exports, on the same or different
servers, even from different vendors.




                                                         VNX with the Cloud Tiering Appliance   16

Mais conteúdo relacionado

Mais procurados

Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)
Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)
Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)DataCore APAC
 
twp-oracledatabasebackupservice-2183633
twp-oracledatabasebackupservice-2183633twp-oracledatabasebackupservice-2183633
twp-oracledatabasebackupservice-2183633Arush Jain
 
White Paper: Hadoop on EMC Isilon Scale-out NAS
White Paper: Hadoop on EMC Isilon Scale-out NAS   White Paper: Hadoop on EMC Isilon Scale-out NAS
White Paper: Hadoop on EMC Isilon Scale-out NAS EMC
 
EMC Isilon Multitenancy for Hadoop Big Data Analytics
EMC Isilon Multitenancy for Hadoop Big Data AnalyticsEMC Isilon Multitenancy for Hadoop Big Data Analytics
EMC Isilon Multitenancy for Hadoop Big Data AnalyticsEMC
 
CloudByte Technology Whitepaper
CloudByte Technology WhitepaperCloudByte Technology Whitepaper
CloudByte Technology WhitepaperCloudByte Inc.
 
Survey of distributed storage system
Survey of distributed storage systemSurvey of distributed storage system
Survey of distributed storage systemZhichao Liang
 
Technical Report NetApp Clustered Data ONTAP 8.2: An Introduction
Technical Report NetApp Clustered Data ONTAP 8.2: An IntroductionTechnical Report NetApp Clustered Data ONTAP 8.2: An Introduction
Technical Report NetApp Clustered Data ONTAP 8.2: An IntroductionNetApp
 
Learn the facts about replication in mainframe storage webinar
Learn the facts about replication in mainframe storage webinarLearn the facts about replication in mainframe storage webinar
Learn the facts about replication in mainframe storage webinarHitachi Vantara
 
How to choose a server for your data center's needs
How to choose a server for your data center's needsHow to choose a server for your data center's needs
How to choose a server for your data center's needsIT Tech
 
CDW: SAN vs. NAS
CDW: SAN vs. NASCDW: SAN vs. NAS
CDW: SAN vs. NASSpiceworks
 
Personal storage to enterprise storage system journey
Personal storage to enterprise storage system journeyPersonal storage to enterprise storage system journey
Personal storage to enterprise storage system journeySoumen Sarkar
 

Mais procurados (20)

Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)
Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)
Virtual SAN - A Deep Dive into Converged Storage (technical whitepaper)
 
twp-oracledatabasebackupservice-2183633
twp-oracledatabasebackupservice-2183633twp-oracledatabasebackupservice-2183633
twp-oracledatabasebackupservice-2183633
 
White Paper: Hadoop on EMC Isilon Scale-out NAS
White Paper: Hadoop on EMC Isilon Scale-out NAS   White Paper: Hadoop on EMC Isilon Scale-out NAS
White Paper: Hadoop on EMC Isilon Scale-out NAS
 
Datastores
DatastoresDatastores
Datastores
 
San nas-
San nas-San nas-
San nas-
 
My sql
My sqlMy sql
My sql
 
EMC Isilon Multitenancy for Hadoop Big Data Analytics
EMC Isilon Multitenancy for Hadoop Big Data AnalyticsEMC Isilon Multitenancy for Hadoop Big Data Analytics
EMC Isilon Multitenancy for Hadoop Big Data Analytics
 
CloudByte Technology Whitepaper
CloudByte Technology WhitepaperCloudByte Technology Whitepaper
CloudByte Technology Whitepaper
 
OUTBOARD SERVERS
OUTBOARD SERVERSOUTBOARD SERVERS
OUTBOARD SERVERS
 
Survey of distributed storage system
Survey of distributed storage systemSurvey of distributed storage system
Survey of distributed storage system
 
Technical Report NetApp Clustered Data ONTAP 8.2: An Introduction
Technical Report NetApp Clustered Data ONTAP 8.2: An IntroductionTechnical Report NetApp Clustered Data ONTAP 8.2: An Introduction
Technical Report NetApp Clustered Data ONTAP 8.2: An Introduction
 
Distributed storage system
Distributed storage systemDistributed storage system
Distributed storage system
 
Hitachi Data Services. Business Continuity
Hitachi Data Services. Business ContinuityHitachi Data Services. Business Continuity
Hitachi Data Services. Business Continuity
 
BCP and DR Plan With NAS Solution
 BCP and DR Plan  With NAS Solution BCP and DR Plan  With NAS Solution
BCP and DR Plan With NAS Solution
 
Learn the facts about replication in mainframe storage webinar
Learn the facts about replication in mainframe storage webinarLearn the facts about replication in mainframe storage webinar
Learn the facts about replication in mainframe storage webinar
 
DAS RAID NAS SAN
DAS RAID NAS SANDAS RAID NAS SAN
DAS RAID NAS SAN
 
How to choose a server for your data center's needs
How to choose a server for your data center's needsHow to choose a server for your data center's needs
How to choose a server for your data center's needs
 
CDW: SAN vs. NAS
CDW: SAN vs. NASCDW: SAN vs. NAS
CDW: SAN vs. NAS
 
Personal storage to enterprise storage system journey
Personal storage to enterprise storage system journeyPersonal storage to enterprise storage system journey
Personal storage to enterprise storage system journey
 
Vmware san connectivity
Vmware san connectivityVmware san connectivity
Vmware san connectivity
 

Destaque

White Paper: EMC Isilon OneFS Operating System
White Paper: EMC Isilon OneFS Operating System  White Paper: EMC Isilon OneFS Operating System
White Paper: EMC Isilon OneFS Operating System EMC
 
White Paper: EMC Isilon OneFS — A Technical Overview
White Paper: EMC Isilon OneFS — A Technical Overview   White Paper: EMC Isilon OneFS — A Technical Overview
White Paper: EMC Isilon OneFS — A Technical Overview EMC
 
What must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace cultureWhat must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace cultureDaleCarnegieIndia1
 
Improving Online Campaign Effectiveness in a Fragmented Digital World
Improving Online Campaign Effectiveness in a Fragmented Digital WorldImproving Online Campaign Effectiveness in a Fragmented Digital World
Improving Online Campaign Effectiveness in a Fragmented Digital WorldResearch Now
 
Stomp presentation v1.5.1
Stomp presentation v1.5.1Stomp presentation v1.5.1
Stomp presentation v1.5.1Patrick Cannon
 
Remembering God- Your Future Depends on It
Remembering God- Your Future Depends on ItRemembering God- Your Future Depends on It
Remembering God- Your Future Depends on Itlcvtrainer
 
UP THERE, EVERYWHERE Partners
UP THERE, EVERYWHERE PartnersUP THERE, EVERYWHERE Partners
UP THERE, EVERYWHERE PartnersShari Monnes
 
Misson Impossible 3 Trailer Analysis
Misson Impossible 3 Trailer AnalysisMisson Impossible 3 Trailer Analysis
Misson Impossible 3 Trailer AnalysisKhendle Christie
 
Tues treaty of versailles
Tues treaty of versaillesTues treaty of versailles
Tues treaty of versaillesTravis Klein
 
Euskal Herriko hiriburuak
Euskal Herriko hiriburuakEuskal Herriko hiriburuak
Euskal Herriko hiriburuakiranjulienara
 
Tutorial per pintar_les_ungles_amb_l_estelada
Tutorial per pintar_les_ungles_amb_l_esteladaTutorial per pintar_les_ungles_amb_l_estelada
Tutorial per pintar_les_ungles_amb_l_esteladamgonellgomez
 
Target audience research
Target audience researchTarget audience research
Target audience researchharryronchetti
 
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014EMC
 
2011 Treasurer's Report
2011 Treasurer's Report2011 Treasurer's Report
2011 Treasurer's Reportcccathedraltx
 

Destaque (20)

White Paper: EMC Isilon OneFS Operating System
White Paper: EMC Isilon OneFS Operating System  White Paper: EMC Isilon OneFS Operating System
White Paper: EMC Isilon OneFS Operating System
 
White Paper: EMC Isilon OneFS — A Technical Overview
White Paper: EMC Isilon OneFS — A Technical Overview   White Paper: EMC Isilon OneFS — A Technical Overview
White Paper: EMC Isilon OneFS — A Technical Overview
 
What must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace cultureWhat must a leader bear in mind when attempting to change workplace culture
What must a leader bear in mind when attempting to change workplace culture
 
Improving Online Campaign Effectiveness in a Fragmented Digital World
Improving Online Campaign Effectiveness in a Fragmented Digital WorldImproving Online Campaign Effectiveness in a Fragmented Digital World
Improving Online Campaign Effectiveness in a Fragmented Digital World
 
Formulario devoluciones
Formulario devolucionesFormulario devoluciones
Formulario devoluciones
 
Stomp presentation v1.5.1
Stomp presentation v1.5.1Stomp presentation v1.5.1
Stomp presentation v1.5.1
 
Remembering God- Your Future Depends on It
Remembering God- Your Future Depends on ItRemembering God- Your Future Depends on It
Remembering God- Your Future Depends on It
 
Sunsilk gog
Sunsilk gogSunsilk gog
Sunsilk gog
 
UP THERE, EVERYWHERE Partners
UP THERE, EVERYWHERE PartnersUP THERE, EVERYWHERE Partners
UP THERE, EVERYWHERE Partners
 
Misson Impossible 3 Trailer Analysis
Misson Impossible 3 Trailer AnalysisMisson Impossible 3 Trailer Analysis
Misson Impossible 3 Trailer Analysis
 
Tues treaty of versailles
Tues treaty of versaillesTues treaty of versailles
Tues treaty of versailles
 
Euskal Herriko hiriburuak
Euskal Herriko hiriburuakEuskal Herriko hiriburuak
Euskal Herriko hiriburuak
 
Hunt+5
Hunt+5Hunt+5
Hunt+5
 
Job Definition
Job DefinitionJob Definition
Job Definition
 
Tutorial per pintar_les_ungles_amb_l_estelada
Tutorial per pintar_les_ungles_amb_l_esteladaTutorial per pintar_les_ungles_amb_l_estelada
Tutorial per pintar_les_ungles_amb_l_estelada
 
Presentation2michaelcollins
Presentation2michaelcollinsPresentation2michaelcollins
Presentation2michaelcollins
 
50 states
50 states50 states
50 states
 
Target audience research
Target audience researchTarget audience research
Target audience research
 
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
EMC Big Data | Hadoop Starter Kit | EMC Forum 2014
 
2011 Treasurer's Report
2011 Treasurer's Report2011 Treasurer's Report
2011 Treasurer's Report
 

Semelhante a VNX with the Cloud Tiering Appliance

EMC VNX FAST VP
EMC VNX FAST VP EMC VNX FAST VP
EMC VNX FAST VP EMC
 
EMC FAST VP for Unified Storage Systems
EMC FAST VP for Unified Storage Systems EMC FAST VP for Unified Storage Systems
EMC FAST VP for Unified Storage Systems EMC
 
DATA STORAGE REPLICATION aCelera and WAN Series Solution Brief
DATA STORAGE REPLICATION aCelera and WAN Series Solution BriefDATA STORAGE REPLICATION aCelera and WAN Series Solution Brief
DATA STORAGE REPLICATION aCelera and WAN Series Solution Brief Array Networks
 
EMC FAST Cache
EMC FAST Cache EMC FAST Cache
EMC FAST Cache EMC
 
White Paper: EMC FAST Cache — A Detailed Review
White Paper: EMC FAST Cache — A Detailed Review   White Paper: EMC FAST Cache — A Detailed Review
White Paper: EMC FAST Cache — A Detailed Review EMC
 
Sample_Blueprint-Fault_Tolerant_NAS
Sample_Blueprint-Fault_Tolerant_NASSample_Blueprint-Fault_Tolerant_NAS
Sample_Blueprint-Fault_Tolerant_NASMike Alvarado
 
Intel_Datagres_WhiitePaper_120315-1
Intel_Datagres_WhiitePaper_120315-1Intel_Datagres_WhiitePaper_120315-1
Intel_Datagres_WhiitePaper_120315-1Michele Hunter
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics
 
Net App Unified Storage Architecture
Net App Unified Storage ArchitectureNet App Unified Storage Architecture
Net App Unified Storage Architecturenburgett
 
Net App Unified Storage Architecture
Net App Unified Storage ArchitectureNet App Unified Storage Architecture
Net App Unified Storage ArchitectureSamantha_Roehl
 
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...EMC
 
New Features For Your Software Defined Storage
New Features For Your Software Defined StorageNew Features For Your Software Defined Storage
New Features For Your Software Defined StorageDataCore Software
 
SAN vs NAS vs DAS: Decoding Data Storage Solutions
SAN vs NAS vs DAS: Decoding Data Storage SolutionsSAN vs NAS vs DAS: Decoding Data Storage Solutions
SAN vs NAS vs DAS: Decoding Data Storage SolutionsMaryJWilliams2
 
Deduplication Solutions Are Not All Created Equal: Why Data Domain?
Deduplication Solutions Are Not All Created Equal: Why Data Domain?Deduplication Solutions Are Not All Created Equal: Why Data Domain?
Deduplication Solutions Are Not All Created Equal: Why Data Domain?EMC
 
Why is Virtualization Creating Storage Sprawl? By Storage Switzerland
Why is Virtualization Creating Storage Sprawl? By Storage SwitzerlandWhy is Virtualization Creating Storage Sprawl? By Storage Switzerland
Why is Virtualization Creating Storage Sprawl? By Storage SwitzerlandINFINIDAT
 
Hyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQsHyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQsSpringpath
 

Semelhante a VNX with the Cloud Tiering Appliance (20)

How fast works in symmetrix vmax
How fast works in symmetrix vmaxHow fast works in symmetrix vmax
How fast works in symmetrix vmax
 
EMC VNX FAST VP
EMC VNX FAST VP EMC VNX FAST VP
EMC VNX FAST VP
 
EMC FAST VP for Unified Storage Systems
EMC FAST VP for Unified Storage Systems EMC FAST VP for Unified Storage Systems
EMC FAST VP for Unified Storage Systems
 
DATA STORAGE REPLICATION aCelera and WAN Series Solution Brief
DATA STORAGE REPLICATION aCelera and WAN Series Solution BriefDATA STORAGE REPLICATION aCelera and WAN Series Solution Brief
DATA STORAGE REPLICATION aCelera and WAN Series Solution Brief
 
EMC FAST Cache
EMC FAST Cache EMC FAST Cache
EMC FAST Cache
 
White Paper: EMC FAST Cache — A Detailed Review
White Paper: EMC FAST Cache — A Detailed Review   White Paper: EMC FAST Cache — A Detailed Review
White Paper: EMC FAST Cache — A Detailed Review
 
Sample_Blueprint-Fault_Tolerant_NAS
Sample_Blueprint-Fault_Tolerant_NASSample_Blueprint-Fault_Tolerant_NAS
Sample_Blueprint-Fault_Tolerant_NAS
 
Intel_Datagres_WhiitePaper_120315-1
Intel_Datagres_WhiitePaper_120315-1Intel_Datagres_WhiitePaper_120315-1
Intel_Datagres_WhiitePaper_120315-1
 
Tombolo
TomboloTombolo
Tombolo
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
 
Challenges in Managing IT Infrastructure
Challenges in Managing IT InfrastructureChallenges in Managing IT Infrastructure
Challenges in Managing IT Infrastructure
 
Net App Unified Storage Architecture
Net App Unified Storage ArchitectureNet App Unified Storage Architecture
Net App Unified Storage Architecture
 
Net App Unified Storage Architecture
Net App Unified Storage ArchitectureNet App Unified Storage Architecture
Net App Unified Storage Architecture
 
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
 
New Features For Your Software Defined Storage
New Features For Your Software Defined StorageNew Features For Your Software Defined Storage
New Features For Your Software Defined Storage
 
8 Strategies For Building A Modern DataCenter
8 Strategies For Building A Modern DataCenter8 Strategies For Building A Modern DataCenter
8 Strategies For Building A Modern DataCenter
 
SAN vs NAS vs DAS: Decoding Data Storage Solutions
SAN vs NAS vs DAS: Decoding Data Storage SolutionsSAN vs NAS vs DAS: Decoding Data Storage Solutions
SAN vs NAS vs DAS: Decoding Data Storage Solutions
 
Deduplication Solutions Are Not All Created Equal: Why Data Domain?
Deduplication Solutions Are Not All Created Equal: Why Data Domain?Deduplication Solutions Are Not All Created Equal: Why Data Domain?
Deduplication Solutions Are Not All Created Equal: Why Data Domain?
 
Why is Virtualization Creating Storage Sprawl? By Storage Switzerland
Why is Virtualization Creating Storage Sprawl? By Storage SwitzerlandWhy is Virtualization Creating Storage Sprawl? By Storage Switzerland
Why is Virtualization Creating Storage Sprawl? By Storage Switzerland
 
Hyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQsHyperconvergence Facts and FAQs
Hyperconvergence Facts and FAQs
 

Mais de EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

Mais de EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Último

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Último (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

VNX with the Cloud Tiering Appliance

  • 1. White Paper VNX WITH THE CLOUD TIERING APPLIANCE A Detailed Review Abstract This paper describes the EMC Cloud Tiering Appliance (CTA). The CTA enables NAS data tiering, allowing administrators to move inactive data from high-performance storage to less-expensive archival storage, thus enabling cost-effective use of file storage. The CTA also facilitates data migration which moves data to new shares or exports. July 2012
  • 2. Copyright © 2012 EMC Corporation. All Rights Reserved. EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. The information in this publication is provided “as is.” EMC Corporation makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any EMC software described in this publication requires an applicable software license. For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. VMware and VMware ESX are registered trademarks or trademarks of VMware, Inc. in the United States and/or other jurisdictions. All other trademarks used herein are the property of their respective owners. Part Number h10777 VNX with the Cloud Tiering Appliance 2
  • 3. Table of Contents Executive summary ........................................................................................... 4 Business case .................................................................................................................... 4 Solution overview ............................................................................................................... 4 Introduction ..................................................................................................... 5 Scope ................................................................................................................................. 5 Audience ............................................................................................................................ 5 Terminology........................................................................................................................ 5 Archiving ......................................................................................................... 6 Overview ............................................................................................................................ 6 Hierarchical Storage Management ...................................................................................... 6 FileMover............................................................................................................................ 7 Archive Policies .................................................................................................................. 7 Providing data to the CTA.................................................................................................... 8 Scheduler ........................................................................................................................... 8 Simulation.......................................................................................................................... 8 Recall ................................................................................................................................. 8 Recall using CTA-HA ............................................................................................................ 9 Archive requirements for the source NAS server.................................................................. 9 Archive requirements for the target repository server ........................................................ 10 Compression and encryption ........................................................................................ 10 Multi-tiered archive .......................................................................................................... 11 CTA database ................................................................................................................... 11 Stub scanner jobs............................................................................................................. 12 Orphans ........................................................................................................................... 12 Reporting.......................................................................................................................... 13 Migration ....................................................................................................... 13 Overview .......................................................................................................................... 13 Migration source............................................................................................................... 13 Migration targets .............................................................................................................. 14 The migration process ...................................................................................................... 14 Other CTA interactions with VNX ....................................................................................... 15 Miscellaneous topics........................................................................................................ 16 Summary .......................................................................................................................... 16 VNX with the Cloud Tiering Appliance 3
  • 4. Executive summary The EMC® Cloud Tiering Appliance (CTA) optimizes primary NAS storage by automatically archiving inactive files to less-expensive secondary storage. The secondary storage can be of lower cost, such as an NL-SAS or SATA disk on a NAS device, or it can consist of public or private clouds platforms. After a file is archived, a small stub file remains on the primary storage, so that the file appears to the user as if it were in its original location. File tiering dramatically improves storage efficiency, and shortens the time to back up and restore data. In addition to archiving data from primary to secondary storage, the Cloud Tiering Appliance can also permanently migrate files from a source to a destination without leaving a stub, as when NAS server hardware requires a technology refresh. Business case From the early days of computing, storage has existed in tiers or levels, with differences in cost, performance, availability, or redundancy, and so forth. For example, newer flash storage outperforms older storage which consists of NL-SAS or SATA, but costs more per gigabyte. NAS data also exists in tiers and does not all have the same value. Typically, as data ages, users access it less frequently. To optimize use of a normal tiered-NAS-storage environment, a customer must ensure that the less-valuable, less-frequently- accessed data does not consume high-speed, expensive storage resources. High- speed storage should be reserved for the active, important data, while the less-active data should reside on cheaper storage. In addition, the storage tiering process must work automatically, so that it does not add to the storage administration overhead. Solution overview The CTA employs Hierarchical Storage Management (HSM), which has been a staple of the mainframe world for decades. Hierarchical Storage Management moves a file from primary storage to lower-cost, secondary storage, and leaves a small stub pointer file in the file’s original location. This process of relocating data and stubbing describes archiving or tiering. The CTA acts as a policy engine by interacting with a VNX share or export, and identifying files that fit predefined criteria. For these files, the CTA initiates movement to a lower tier repository, for example NAS, Centera, or cloud, and leaves a stub file on the VNX share. When a client that is accessing the VNX share or export tries to read the stub, the CTA recalls the original file from the repository tier. To the user, the file appears to be in its original location on high-performance VNX storage. However, instead of the space required to store the entire file, only an 8 KB stub file is on the primary tier. If the storage administrator wants to move data in the share or export to another location, for example, to replace on old Celerra with a new VNX, the CTA can help. The VNX with the Cloud Tiering Appliance 4
  • 5. CTA migration feature relocates multiprotocol data including stub files, from one share or export to another. When used for archiving or tiering, the CTA will automatically move inactive data to lower tiers of storage. This allows more efficient use of the most expensive, highest- performing NAS storage. When used for file migration, the CTA enables relocation of NAS data, within a NAS server, across NAS servers, and across NAS servers from different vendors. Introduction Scope This paper outlines CTA features, how CTA functions, and the business problems that CTA helps to solve. In a technical overview of CTA, this paper also describes how to manage CTA and implement solutions in a VNX NAS environment. Audience This white paper is intended for users who have a basic understanding of VNX Unified Storage or, a general grasp of NAS storage concepts. Terminology Archive repository – Lower tier storage than the NAS storage that is accessible to the CIFS or NFS clients. The repository is the target of a file archival process. In an archiving operation, CTA moves data from the primary or source tier to the repository and leaves a stub file on the primary tier. The stub points to the file in the repository. A repository tier is a NAS share/export, an EMC Centera, an EMC Atmos cloud, or the Amazon S3 cloud. File archiving – A primary CTA function that scans a NAS file server for files that meet defined criteria, and moves the files to a lower tier of storage. CTA replaces the file on the NAS server with a stub file that points to the real file on the archive repository. File migration – The movement of files from one export or share to another as when replacing a NAS server. FileMover –VNX Data Movers include FileMover software. FileMover or DHSM enables stubbing and recall of archived files, and provides an API that the CTA uses for both archiving and migration. To use file archiving with a Celerra or VNX file system, export, or share, enable FileMover for the Data Mover. File tiering – See “File archiving.” FPolicy –NetApp Filers include FPolicy software. FPolicy enables stubbing and recall of archived files, and provides an API that the CTA uses for both archiving and migration. The CTA uses the FPolicy interface to archive files from NetApp servers. Orphan file – When a file has been archived, a stub on the source NAS server points to the archived file. Deleting a stub does not automatically delete the archived file. VNX with the Cloud Tiering Appliance 5
  • 6. Instead, the archived file becomes an orphan or a repository file without a stub pointing to it. To delete the orphan, the CTA user runs an orphan delete job on the repository. Policy – Rules for migration, or a rules and one or more repository destinations for archiving and tiering. For example, an archiving policy can send a file that has not been accessed in 1 year, to a company’s private Atmos cloud server and a that file has not been accessed in 2 years, to a public Amazon S3 cloud. Primary storage – The storage tier that CIFS and NFS clients mount on the VNX. Source tier – See “Primary storage.” Archiving Overview The CTA provides two primary functions: archiving or tiering, and migration. Archiving moves inactive data to a lower tier of storage and leaves behind a stub file. Hierarchical Storage Management Historically, mainframe computer users implemented a tiered storage system because disk storage was expensive and tape storage was inexpensive. To maintain the limited free disk space on the mainframe, the user would manually move less- important data to tape. To retrieve data later, the user had to record the tape volume where the data was stored. Mainframe system developers began to consider solutions to the problem of how to move data to cheaper, external storage, without requiring the user to remember where it was stored and to manually run commands to recall the data when it was needed. The solution was Hierarchical Storage Management (HSM). This system would scan the data, find files that had not been accessed for some time, and automatically move them to a lower tier of storage. In place of the file, the system would store a small stub that contained an internal pointer to the file. The user would see the stub as if it were the actual file, but when trying to access it, the system would tell the user to wait while it automatically retrieved the file and restored it to the original location on the user’s disk. HSM is the basis for the CTA archiving function. The concept is straightforward, robust, and time-tested. The CTA supports archiving to disk storage (cloud, CAS, or NAS). The existence of storage tiers in most customer environments make the HSM tiering solution as important today as it was decades ago, during the mainframe era. VNX with the Cloud Tiering Appliance 6
  • 7. FileMover FileMover is NFS and CIFS file services software for the VNX and Celerra file system that allows HSM-style archiving. On the VNX, the primary FileMover command is DHSM, which shows its HSM roots. At a basic level, FileMover intercepts client access to data, and takes action before the client accesses the data. CTA is an external system that can direct FileMover. The CTA user defines a task that directs FileMover to perform a series of actions, for example: • To scan a share for files that are more than 60 days old • To move the files to the archive • To replace the files with stubs • To activate the CIFS offline bit on the stubs Once the stubs are in place, FileMover monitors the stubs. When a client tries to read or write to the file, FileMover will intercept that access, and recall the data using information contained in the stub. Every VNX or Celerra that functions as an archive source must have the FileMover API enabled and configured. The configuration is part of the CTA setup. FPolicy is an API for the NetApp that is similar to FileMover. CTA needs an API similar to FileMover or FPolicy to archive from a NAS system. Because these types of APIs are not available on other platforms such as Linux or Windows, CTA can only archive data from VNX, Celerra, and NetApp. Archive Policies A policy in CTA archiving or tiering context consists of rules and one or more destinations. An example of a simple policy is “if this file has not been accessed in six months, send it to the Atmos cloud, and replace it with a stub.” The one-rule, one- destination is common, and many CTA users will use this type of policy on their data. However, CTA rules are flexible. You can create more complex rules, that archive to multiple tiers. For example, a single policy could be: “If any file has not been accessed in more than one year, and is larger than 1 MB in size, send it to my Isilon, unless it’s a PDF file, in which case don’t archive it. Then when these files have not been accessed for two years, move them from the Isilon to the Atmos cloud, and update the stub file to point to their new location.” Policy rules are based on attributes such as access time, modify time, inode change time, file size, file name, or directory name. The archive policy action is to “archive” or “don’t archive.” A single expression or a combination of expressions define the archive policy. VNX with the Cloud Tiering Appliance 7
  • 8. A policy does not contain a share name. You can define one policy to evaluate shares A, B, and C, but define another policy to evaluate share D. Or you can define several different policies to evaluate a single share. Providing data to the CTA Administrators usually direct a CTA archive policy to evaluate a file system, CIFS share, or NFS export. The CTA scans the files, and applies the policy rules to each file, one at a time. If there are multiple rules in the file, the CTA contines to apply the rules until a rule evaluates to “true.” It then takes the action associated with the rule, such as archive, or don’t archive, and moves to the next file. There is another way to provide files to a CTA policy. Instead of directing the CTA to scan the files in a share, the CTA administrator imports a list of filenames, and the CTA only scans and applies the archive policy to the files in that list. This feature, called “file ingest,” is primarily for third-party vendors with software products that have their own scanning systems, but want to use the CTA archive engine. The Cloud Tiering Appliance Imported File List Archive Task Technical Notes document describes the file ingest feature and is available from Powerlink. Scheduler You use the scheduler to set the job start time. For example, a CTA administrator schedules a batch job to start at 2:00 a.m. on Saturday, scan share01, and evaluate the files with a policy for archiving. An administrator usually schedules a job to run weekly, every other week, or monthly. The first time the archiving job runs, the policy will often select and archive at least half of the data. So the first archive job can require a long time to run and can move a lot of data. Future jobs will move incremental amounts of data. Simulation The CTA can simulate archive jobs. You can schedule an archive job with a policy, but run it in simulation mode. The CTA will scan the source share and apply the policy rules against each file, but not take any archive action. Instead, the CTA tracks the number of files and amount of data it would have archived, and at the end of the simulation, it displays a report. Simulation is a good way to test the effectiveness of a policy and to edit the policy rules, before running a real archive job. Recall When a file has been archived to a repository, leaving a stub on the source NAS share, the NAS client expects the stub to look and behave like the original file. “File recall”is the process by which the user clicks on the stub file and quickly accesses the original file. The stub file contains all the information needed to find the actual file. The VNX set the offline bit on the stub when the file was archived. When a user attempts to read a stub file, FileMover interacts with CTA and intercepts the read request to begin the process of recalling the file from the repository. If the repository is on a CIFS or NFS VNX with the Cloud Tiering Appliance 8
  • 9. share, FileMover recalls the file using CIFS or NFS. If the repository is a CAS or cloud (such as Centera, Atmos, or Amazon), then the VNX sends the recall request to the CTA, which will effect the recall and pass the file to the VNX. After recalling the file, the VNX either: • Provides the file to the user, but leaves the stub in place, known as “passthrough recall.” • Writes the file back to its original location and deletes the stub, known as“full recall.” A FileMover command on the VNX Control Station sets the recall style, which can be set on a file system-by-file system basis. Recall using CTA-HA If an archive or migration job batch job fails, there is no loss of data, You would correct the problem and rerun the job. For this reason, the complexity of a High Availability (HA) configuration for archival or migration is not necessary or justified. However, recall is mission-critical, because it affects the ability of clients to access their data. For this reason, configurations where the CTA is in the recall path, such as archiving from VNX/Celerra to Centera, Atmos, or Amazon, or all archival from NetApp, require an HA. The HA configuration pairs the CTA-HA physical appliance or the CTA/VE-HA virtual appliance with one or more CTA or CTA/VE systems. The CTA-HA system is a recall- only version of the CTA. If the source NAS server cannot perform the recall, either the CTA recall host or its CTA-HA partner can perform the recall. By creating a DNS hostname that maps to the IP addresses of both the CTA and the CTA-HA, and by configuring the VNX to find the CTA using that hostname, the VNX can use both recall hosts alternately, in a round-robin fashion. This balances the recall load, and if one recall host fails, the other can perform recalls until the failed host returns to service. This configuration also allows maintenance of one host while the other continues to perform recalls. The CTA-HA also performs keystore replication for encryption keys that are generated on the CTA when the encryption feature is selected during archival to Atmos or Amazon clouds as described in Compression and encryption on page 10. Archive requirements for the source NAS server CTA can archive data from CIFS or NFS shares on VNX, Celerra, or NetApp NAS servers. FileMover for VNX or Celerra, and FPolicy for NetApp, both provide archiving services. To identify stub files, both FileMover and FPolicy read the offline bit on the stubs. The CIFS protocol supports offline bits, but NFS does not, so the VNX/Celerra will handle offline bits internally for NFS-only archival. The CTA communicates with VNX and Celerra using the DHSM API. Before archiving data from a VNX or Celerra, the CTA configuration must include the VNX or Celerra properties and the DHSM connections that link file systems to one or more repositories. The CTA and FileMover VNX with the Cloud Tiering Appliance 9
  • 10. automatically create the DHSM connections when needed. The Cloud Tiering Appliance Getting Started Guide provides the configuration procedure for the CTA with the VNX or Celerra. Deleting the DHSM connection on a VNX or Celerra Control Statio will optionally trigger a recall of all stubbed data from the repository linked to the VNX or Celerra file system that uses that connection. The CTA and CTA-HA must have full control of the source shares. If the source includes NFS exports, these exports must have root and read/write permission for the CTA and CTA-HA IP addresses. When archiving from CIFS shares, the source server must belong to a domain and the CTA configuration settings for that server require a username from that domain. The username must be in the local administrator’s group of the CIFS server associated with the source. Archive requirements for the target repository server The CTA can archive to three kinds of repositories: • NAS (CIFS or NFS), such as VNX, Celerra, VNXe, Data Domain, Isilon, Windows, or NetApp • CAS such as Centera • Cloud such as Atmos or Amazon S3 Each repository has slightly different configuration requirements. • Requirements for NAS repositories are similar to the requirements for source servers. A CIFS domain user must be in the local admin group of the CIFS server. NFS exports must have root and read/write permission for the CTA and CTA-HA IPs. • Centera configuration requires a PEA file or “anonymous.” • Cloud configuration requires a tenant user for Atmos or a bucket user for Amazon S3. One repository can serve as an archive target for multiple CTAs. One CTA can archive to multiple repositories. A CTA repository migration job will move all the archived data from one repository to another, and update the stubs to point to the new location. Only the CTA or CTA-HA can have access to the CTA repositories. The NAS share that serves as the repository is visible, but the layout of archived data is proprietary. Changes to the repository can render archived data unrecallable. Compression and encryption Compression and encryption are options when archiving to either of the two cloud repository tiers, Atmos, or Amazon S3. A compression style such as fast or strong is a policy option. Encryption is also a policy option, but encryption prerequisites are: VNX with the Cloud Tiering Appliance 10
  • 11. 1. You must configure keystore replication between the CTA and a CTA-HA machine. 2. You must generate a key using the CTA GUI. The CTA stores the key in the keystore and replicates it to the CTA-HA. Every archive task that uses a policy with encryption uses the key. If CTA generates and replicates a new key, it applies the key to new encrypted archive tasks. The old keys that remain in the keystore continue to apply for files encrypted using the old keys. Keystore replication will be sufficient for normal outages, but after generating a new key, back up the CTA configuration to preserve the keystore. Multi-tiered archive Consider the following example: “Find NAS files that have not been accessed in six months, and archive them to my private cloud storage. Find NAS files that have not been accessed in one year, and send them to the public cloud such as Amazon S3 or AT&T’s Atmos-based cloud. And if any files archived on the private cloud have not been accessed for one year, move the files from the private to the public cloud, and update the stub files to point to the new location.” This scenario describes CTA’s multi-tiered archiving feature. By creating a multi-tiered policy type with several rules, each with a different repository, you can design an archiving scheme to fit this example. This multi-tiered archiving policy would have the following rules: • if atime > 1 year, archive to Amazon S3 cloud • if atime > 6 months, archive to private Atmos cloud The order of the rules is important. If a policy has several rules, the rules are applied one at a time. When the first rule evaluates as true, CTA takes the “archive” or “don’t archive” action. CTA does not apply the subsequent rules, and the policy moves on to the next file. In this example, if the order of rules were reversed, you would get an unintended result. The 6-month old rule would be applied first, and the 1-year old rule would never be applied, because any file older than 1 year is also older than 6 months. All data older than 6 months would be archived to the private Atmos cloud, and no files would be archived to the Amazon S3. CTA database When a file is archived to a repository, the stub on the source NAS tier points to the file location in the repository. However, the file in the repository has no pointer back to the source. The repository files have no connection to the source. The CTA database solves this problem. Each time a file is archived, an entry in the CTA database records the file’s original location on the source, and the file’s location in the repository. VNX with the Cloud Tiering Appliance 11
  • 12. CTA does not use the database for recalls, because the stub on the source includes the information necessary to locate the file in the repository. However, the database inlcudes entries for every archived file. It contains statistical data and orphan information as described in Orphans on page 12. To protect the database, schedule regular CTA backups. If a CTA fails, import the most recent backup into a new CTA. Stub scanner jobs For every scheduled archive job, the CTA automatically schedules a monthly stub scanner job. The stub scanner is a utility that reads the stubs in a share and compares them to the entries in the CTA database. If stubs move to different locations or orphans appear, the stub scanner will ensure that the CTA database is kept current. Because a stub on the source has the information necessary to recall a file from the repository, CTA does not need to query stub and repository file locations in the CTA database. However, CTA can manage repository storage more efficiently if information in the database matches the stub and repository file locations on the system, so stub scanner jobs run every 30 days by default. Orphans If stub files are deleted from the source repository, the actual files in the repository become orphans, and are not automatically deleted for the following reason. Generally a storage administrator will back up stubs when backing up CTA-archived shares. Many NDMP-based backup programs back up stubs by default. With protected repositories and small stubs, a NAS server that employs CTA benefits greatly from faster backups and smaller backup windows. However, when a backup is restored, the stubs need to point to something. If the CTA had deleted archived files when the stubs were deleted, restoring the backup would restore stubs that point to nothing. To delete orphan files, and recover space on the repository, run an “orphan_delete” job periodically. Do not delete orphans until you are certain that you will not restore stubs that point to the orphans. For example, if you keep backups for six months, then define the orphan deletion job to delete files that have been orphans for at least eight months. The CTA database and the stub scanner play important roles in the management of orphan files. Every time the stub scanner sees a stub, the CTA records a “last seen” time in the database. If the stub is deleted, the stub scanner identifies the file in the repository that was linked to the stub as an orphan. Because the “last seen” time is in the database, the CTA knows how long the file has been an orphan. CTA uses the orphan age to determine which orphans to delete. If the CTA database is lost, the location and age of orphan files in the repository is lost. CTA database backup is therefore an important process. VNX with the Cloud Tiering Appliance 12
  • 13. Reporting CTA generates reports on the files it archives or migrates. For archived files, CTA reports will display the size, number of files archived, and breakdown by file types, but the CTA does not give a detailed profile of the data in the file system. You can run archive simulations to obtain information on file ages. For example, multiple simulations, filtering for access times of various ages, can yield an age profile for the files in the file system. Migration Overview The CTA provides two primary functions: archiving (or tiering), and migration. Migration moves files from one share or export to another. NAS migration copies data from one share or export on one system to a another. Migration is useful when replacing servers. Administrators use CTA to move data from old to new servers with minimum disruption to the NAS client users. The CTA can perform multi-protocol, incremental, stub-aware, cross-vendor migrations, which can greatly reduce the effort and complexity of implementing new NAS technology. A CTA migration is a batch job that moves data from a source NAS share to a target. The target share must be sufficiently large to hold the data that will move from the source share. The target share does not need to be the same size or layout as the source share. CTA supports CIFS, NFS, or multi-protocol CIFS/NFS file systems. The supported source platforms are: VNX, Celerra, NetApp, Windows. The supported target platforms are: VNX, VNXe, Celerra, Isilon. CTA migrates data from VNX, Celerra, or NetApp source to any target. When CTA migrates data from a Windows source server, only VNX or VNXe can be used as targets. Migration source CTA migration uses NDMP as its file transfer engine when migrating from VNX, Celerra, and NetApp. CTA uses EMCopy when migrating from Windows. • The NDMP-style migrations are policy-based. You would first create a policy, similar to the archive policies. For example, to migrate all the data in share1 to newshare1, but do not wish to include PDF files that are more than three years old, you can create a rule to omit these files from the migration. If you want to copy everything without filtering, you would create a trivial policy with a rule such as: if size >= 0 bytes, then migrate • The EMCopy-based migration is not policy-based. When migrating from Windows to VNX, CTA copies all the data. VNX with the Cloud Tiering Appliance 13
  • 14. The CTA uses the FileMover or DHSM API to migrate files from VNX or Celerra. The CTA creates snapshots on the source file system by making API calls. Snapshots enable users to continue to access and write to the source share while the migration is taking place. For migration tasks, the CTA needs the same kind of access permissions on the source and target shares and exports as for archive tasks. For a CIFS source server, a CIFS domain user must be in the local admin group of the CIFS server. For an NFS source server, NFS exports must have root and read/write permission for the CTA IPs. Multi-protocol migrations require both CIFS and NFS access permissions. NOTE: File migration does not use the CTA-HA. An NDMP user must exist on the source server for NDMP-style migrations. The server configuration on the CTA requires the NDMP userid and password. The Cloud Tiering Appliance Getting Started Guide provides details. To perform migrations using a Windows source , install the EMWS copy agent on the Windows server. CTA supports Windows 2003 and 2008 as migration source platforms. Migration targets Before starting a migration task, target shares or exports must exist on target servers. The CTA will not create them automatically. Migrating files to the root level in these shares requires that the shares be empty, except for file system files such as .lost+found. However, you can migrate files to any empty directory in a non-empty share. The CTA needs the same permissions on target file systems as on the source. For CIFS targets, a CIFS domain user must be in the local admin group of the CIFS server. For NFS targets, NFS exports must have root and read/write permission for the CTA IPs. Multi-protocol migrations require both CIFS and NFS credentials. The protocol of the target share or export (CIFS, NFS, or mixed protocol) must match the protocol of the source. For the NDMP-style migrations, an NDMP user must exist on the target server. The server configuration on the CTA requires the NDMP userid and password. The migration process Migrations run as scheduled batch jobs. Schedule the migration task on the CTA Schedule page or through the CLI. Specify: • a source share • a policy for NDMP-style migrations (except for Windows-to-VNX EMCopy-style migrations, which don’t accept policies) • an empty target share or directory VNX with the Cloud Tiering Appliance 14
  • 15. When the job begins, CTA creates a snapshot on the source. Then CTA copies all the data to the target, applying the policy, if applicable. The migration can begin anywhere in the source share (for example at the top level or in a subdirectory), and can move files to any empty directory on the target. After the first pass completes, the target will have a copy of the data in from the snapshot on the source. However, if users have continued updating the source during migration, the source share will probably be different from the snapshot. So, you can run a second pass to pick up those incremental changes. The CTA will create another snapshot, and compare the second to the first, to pick up changes such as new files, deleted files, metadata changes, and so forth. You can continue running passes to pick up changes. For the final pass, put the source in offline or in read-only mode to the clients to ensure that that the target is identical to the source. To complete the migration, transfer the client mounts to the target share. The target has copies of all CIFS and/or NFS ACLs or permissions. After transferring the clients, you can delete the source share and recover its storage space on the source NAS system. If you wish to throttle CTA migration tasks, you can specify a maximum MB/sec rate when you create the task schedule, so that the migration comsumes less network bandwidth. You can also use a SID translation tool that CTA provides to create a SID mapping of local to domain SIDs. CTA applies the mapping at migration time, to ensure that the SIDs on the target will be correct. CTA can run migration tasks in a continuous mode. The migration task has the option to run incremental migrations until a files-moved threshold is reached. For example, “keep running incremental migrations until fewer than 1000 files are moved in one run. Then stop and notify the administrator.” Customers typically perform incremental runs during scheduled off hour periods over the course of several evenings, with the final locked cutover also during off hours. CTA migration is not only unique because it can be asymmetrical, for example between file systems of different size, different disk layouts, across different platforms. CTA is also unique because it provides profile-filtering, multi-protocol migration, for example, from NetApp “multi” to VNX “multi”. CTA also migrates stubs. If the source and target both support CTA stubs, CTA migrates the stubs without recalling the archived data, and the stubs will continue to function after migration. • If the source is VNX or Celerra and the target is Isilon or VNXe, which do not support stubs, CTA migrates the actual files. Stubs are not migrated. Other CTA interactions with VNX The CTA supports VNX File-Level Retention (FLR). When archiving to a VNX, Celerra, or Centera, you can specify retention times on the archived data. You can also specify retention times on stub files, but FileMover manages the stub retention on the source. CTA does not support FLR-enabled source file systems. VNX with the Cloud Tiering Appliance 15
  • 16. Historically, the access times during a migration could not be preserved. NDMP always set access times equal to modification times when moving the data. However, with the release of VNX 7.1, access times during a file migration can be preserved when VNX 7.1 is the migration source. De-duplication on the repository shares of a VNX greatly enhance the efficiency of repository storage. The use of de-duplication for source shares is not recommended. Stub files do not benefit from de-duplication. Miscellaneous topics In addition to archiving and migration, CTA can perform other tasks. “Delete stub with policy” is the only task type that deletes stubs, deletes the repository file that the stubs point to, and removes the reference to stubs from the CTA database. Use the “delete stub with policy task” with caution. “Multi-tier stub with policy” is similar to the “Multi-tier archive,” except that it only scans stubs. The task moves archived data that matches the multi_tier_stub policy from one repository to another. Like the “delete stub with policy” task, it only scans stub files. Summary The Cloud Tiering Appliance (CTA) enables NAS data tiering, allowing administrators to move inactive data off of high-performance storage to less-expensive archival storage, thereby enabling more cost-effective use of file storage. The CTA also enables relocation of NAS data to new shares or exports, on the same or different servers, even from different vendors. VNX with the Cloud Tiering Appliance 16