IBM ProtecTIER Deduplication for z/OS
John Webster
March 04, 2010




Technology Insight Series

Evaluator Group
Copyright 2010 Evaluator Group, Inc.  All rights reserved. 

 


                                                        
 
ProtecTIER® Deduplication Gateway for System z®



Announcement Summary 
The many data deduplication technologies available today have the ability to dramatically lower 
enterprise IT costs for both storing and moving data. “Dedupe” has become widely used in open systems 
environments to, in some cases, significantly reduce storage capacity that would otherwise be required 
for backup, archive, and even primary storage supporting critical applications. Until now, the only way 
mainframe storage administrators could take advantage of this increasingly popular technology was to 
insert an ESCON/FICON‐to‐TCP/IP channel emulation device into a data stream between a mainframe 
channel and an open systems virtual tape library (VTL) that supports data deduplication (Bus‐Tech MDL 
100V + FalconStor VTL for example).    

    With the announcement of the IBM System Storage TS7680 ProtecTIER Deduplication Gateway for 
    System z (TS7680), IBM now offers its mainframe customers an advanced data deduplication solution 
    that can be used for a number of application scenarios including backup and other data stream‐intensive 
    applications where data is first streamed to tape for subsequent processing. One of the results of 
    implementing data deduplication on System z is that a variety of disk platforms, both current generation 
    and legacy, can now be considered as a cost‐effective storage platform for these types of applications as 
    compared to tape. 

IBM acquired Diligent Technologies Corporation in April, 2008. IBM subsequently introduced a number 
of IBM branded products including the IBM System Storage TS7650 Appliance and TS7650G Gateway 
based on ProtecTIER with HyperFactor® (discussed in more detail below; see footnote 1) and has installed under the 
IBM logo more than 600 ProtecTIER solutions for open systems. With the announcement of the TS7680, 
IBM extended its portfolio of enterprise‐class data deduplication solutions by providing one for System z 
environments based on proven technology.  


Dedupe addresses capacity, performance, and bandwidth issues 
Data deduplication is able to dramatically decrease the amount of disk space required for backup data 
when disk is used as a backup target, while retaining the significant performance improvements that 
disk based backup devices have over tape.  Thus, data deduplication should be considered for any IT 
environment looking to contain storage costs associated with backup, while preserving the delivery of 
required service levels for data protection. Some storage administrators have decided to replace tape 
with disk for applications requiring rapid access to data precisely because the cost per GB of 
deduplicated data on disk made it more affordable to maintain tape data on disk.   

Business continuance and disaster recovery‐related data replication processes within and outside of a 
system can also take significant amounts of time depending on the volume of data and the size of the 
interconnecting data “pipe.” Deduplicating the data objects within these replication streams, in many 
cases to a small fraction of their original size, allows them to be moved in much less time. Reduced 
bandwidth requirements could also translate into lower communications costs between sites for 
replication-related data transfers. 
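
The effect of deduplication on replication time can be estimated with simple arithmetic. The sketch below is illustrative only: the function name, the 5 TB data volume, the 1 Gbit/s link, and the 10:1 ratio are assumptions chosen for the example, not figures from IBM, and protocol overhead is ignored.

```python
def transfer_hours(data_tb: float, link_mbps: float, dedupe_ratio: float = 1.0) -> float:
    """Hours needed to send data_tb terabytes over a link of link_mbps megabits/s.

    dedupe_ratio is the assumed reduction factor (10.0 means 10:1).
    Decimal units are used: 1 TB = 1e12 bytes, 1 Mbit = 1e6 bits.
    """
    if link_mbps <= 0 or dedupe_ratio <= 0:
        raise ValueError("link_mbps and dedupe_ratio must be positive")
    bits_to_send = data_tb * 1e12 * 8 / dedupe_ratio
    seconds = bits_to_send / (link_mbps * 1e6)
    return seconds / 3600


if __name__ == "__main__":
    full = transfer_hours(5, 1000)            # hypothetical: 5 TB over 1 Gbit/s
    reduced = transfer_hours(5, 1000, 10.0)   # same data at an assumed 10:1 ratio
    print(f"undeduplicated: {full:.1f} h, deduplicated: {reduced:.1f} h")
```

Because transfer time scales linearly with the amount of data sent, any achieved deduplication ratio shortens the transfer by the same factor under this simplified model.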


                                                                
1
      See also Evaluator Group Announcement Summary of IBM’s TS7650 VTL Systems published February 9, 2009. 


Post-process vs. Inline 
The storage industry’s approach to data deduplication has evolved to the point where today there are 
essentially two different processes that yield deduplicated data objects. Real‐time or streaming data 
deduplication is known as “in‐line” while data deduplication that occurs later is commonly referred to as 
“post‐process” deduplication.  The in‐line process deduplicates data “in flight” and in real time as it is 
being sent to a backup device for example. Post‐processing refers to data deduplication performed at 
some point in time after the data has been sent to a storage device—a Virtual Tape Library (VTL) for 
example that runs deduplication after data has been stored.   

As with most options, the optimal method to use depends upon the goals the storage administrator has 
in mind. Consider the backup process. Storage administrators looking to simply minimize the backup 
window often choose the post‐process method. The potential advantage is that, because the 
deduplication process is not in the path of the data stream, there will be no performance impact during 
the write operation and therefore no elongation of the backup window (see footnote 2). That is, backup data is sent to 
a temporary holding area within the disk array to negate potential performance impact. Once the 
backup job completes, the data is later examined for duplicates, with duplicate data removed at a later 
“post‐process” time.  The disadvantage of this method is that additional storage space is required when 
compared with the in‐line process.   

An alternative to deduplicating after a backup is to perform deduplication “in‐line” as data is being sent 
to the backup device.  The first advantage with this method is that no extra disk space is required. The 
data stored to disk is in deduplicated form right from the start. Second, no additional processing step to 
deduplicate the data is required. Another advantage of in-line processing is that once the data is 
deduplicated and stored, it may be replicated immediately to off-site storage. As a result, 
the time to complete the entire business continuance process, including backup, is reduced, and as 
mentioned earlier, the bandwidth and/or the time required to replicate is also reduced. As noted above, 
in some implementations in-line processing impacts performance and therefore backup time. IBM 
claims “negligible” performance impact, which it attributes to a lightweight index of no more than 4GB 
(see footnote 3) that maps to the contents of a data repository of up to 1PB. 
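
The difference in disk space between the two methods can be illustrated with a simplified model. The sketch below assumes that post-processing must hold the full-size backup in a landing area while the deduplicated copy is being built, and that in-line processing stores only the deduplicated copy. The function name and the 20 TB / 10:1 figures are hypothetical.

```python
def peak_disk_tb(backup_tb: float, dedupe_ratio: float, inline: bool) -> float:
    """Rough peak disk space (TB) needed to take one backup of backup_tb terabytes.

    Simplified model, not a vendor sizing rule:
      in-line      -> only the deduplicated copy is ever written
      post-process -> full-size landing area plus the deduplicated copy
    """
    if backup_tb < 0 or dedupe_ratio < 1:
        raise ValueError("backup_tb must be >= 0 and dedupe_ratio >= 1")
    deduped = backup_tb / dedupe_ratio
    if inline:
        return deduped
    return backup_tb + deduped


if __name__ == "__main__":
    # Hypothetical 20 TB backup at an assumed 10:1 ratio
    print("in-line     :", peak_disk_tb(20, 10, inline=True), "TB")
    print("post-process:", peak_disk_tb(20, 10, inline=False), "TB")
```

In practice the landing area can be released once post-processing completes, so the extra space in this model is a peak requirement rather than a permanent one.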

 

 

 

 

 




                                                            
2
   Depending on implementation, a second backup may not be able to start until the post processing de‐duplication 
completes.  
3
   EGI has not yet been able to validate this claim with ProtecTIER users. 


The TS7680 for IBM’s System z 
The TS7680 is implemented as a gateway to disk arrays within a System z ESCON® or FICON® channel.   




                                                                                                                      
                  Figure 1: Data Deduplication for System z (Source: IBM and Evaluator Group) 

Figure 1 shows a typical deployment of a TS7680 system providing data deduplication and 
offline tape storage in a System z environment. Figure 2 depicts how 
the ProtecTIER TS7680 system operates between the System z host and the disk cache. 




                                                                                 
           Figure 2: IBM TS7680 ProtecTIER Host Connectivity (Source: IBM and Evaluator Group) 

Key points to bear in mind when evaluating the TS7680 include: 
       •    Deduplication is performed in‐line as described above. 
       •    Components within the TS7680 solution include a single frame containing two clustered 
            ProtecTIER servers for failover redundancy, FICON interfaces, and the ProtecTIER software. No 
            System z host‐resident software is required. 
       •    Maximum capacity of the back-end disk array storage is 1PB, meaning that the TS7680 supports 
            up to 1PB of disk for storage of deduplicated data. If a deduplication ratio of 10:1 is assumed, 
            one could expect to store 10PB of normally formatted data within this 1PB space after 
            deduplication. Deduplication ratios can vary widely, however, depending on the amount of data 
            redundancy encountered by the system. It is misleading to translate deduplication ratios seen in 
            open systems environments to System z. It is also the case that data deduplication ratios can 
            increase over time as the system processes an increasing amount of data, and consequently 
            encounters more redundancy. 
     •    Backend disk is Fibre Channel‐attached and can be IBM System Storage DS8000®, IBM XIV® 
          Storage Systems (SATA disk), IBM System Storage DS5000®/4000® for mid-range System z 
          environments, and any combination of third‐party disk arrays already supported for attachment 
          to IBM’s TS7650G. 
     •    The TS7680 emulates an automated tape library with IBM System Storage 3592 Model J1A tape 
          drives and supporting MEDIA5 (3592 JA) cartridges. 
     •    From the perspective of the storage administrator, the TS7680 is managed transparently using 
          system‐managed tape (SMStape) facilities. No host application, tape management, or JCL 
          changes are required. Virtual tapes are returned to scratch processing after deletion. Alerts are 
          sent to the administrator if available capacity is running low. 
     •    Backend tape attachment is not supported. Data objects that need to be migrated to tape must 
          first be “rehydrated” i.e. returned to normal format and then sent via the System z host to a 
          tape device.  
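
The capacity arithmetic in the list above (1PB of disk at an assumed 10:1 ratio holding 10PB of normally formatted data) can be written as a one-line calculation. The sketch below is a minimal illustration; the function name is hypothetical, and ratios other than 10:1 are shown only to make the sensitivity to the ratio visible.

```python
def effective_capacity_pb(physical_pb: float, dedupe_ratio: float) -> float:
    """Nominal (pre-deduplication) data, in PB, that fits on physical_pb of disk."""
    if physical_pb < 0 or dedupe_ratio < 1:
        raise ValueError("physical_pb must be >= 0 and dedupe_ratio >= 1")
    return physical_pb * dedupe_ratio


if __name__ == "__main__":
    # 10:1 is the ratio assumed in the text; the other ratios are illustrative.
    for ratio in (2, 5, 10, 25):
        nominal = effective_capacity_pb(1.0, ratio)
        print(f"{ratio:>2}:1 -> {nominal:.0f} PB nominal on 1 PB of disk")
```

Since achieved ratios depend on the redundancy in the data, a sizing exercise would normally run this calculation across a range of ratios rather than a single value.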


The HyperFactor Process 
Storage vendors now offer a variety of ways to deduplicate data. As mentioned, the process can occur 
in‐line or run sometime after data is stored. In addition, there are differing deduplication processes that 
can be applied. File level deduplication has been available for a number of years. Deduplication using 
hashing algorithms to generate a code that represents stored data objects is more recent, and now 
more common. 

ProtecTIER’s HyperFactor uses a series of algorithms to identify elements within a data stream that have 
been previously stored by ProtecTIER. Once similar elements have been found, HyperFactor compares 
the new data to the similar data already stored and writes only the byte‐level changes to disk. 
HyperFactor uses a memory resident index of no more than 4GB to identify similar data. A copy of the 
index is maintained on TS7680‐attached disk. IBM reports a maximum measured throughput of 500 
MB/s using HyperFactor’s data deduplication in‐line processing. 
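
The general idea described above (find similar data already stored, then write only the differences) can be sketched in a few lines. This is not IBM's HyperFactor algorithm, whose internals are not described here. It is a toy illustration that assumes a small in-memory signature per stored element for the similarity search and uses Python's `difflib` for the byte-level comparison. All class and function names are hypothetical.

```python
import difflib
import hashlib


def signature(data: bytes, window: int = 8, keep: int = 16) -> frozenset:
    """Compact similarity signature: the `keep` smallest hashes over all byte windows."""
    hashes = {
        hashlib.sha1(data[i:i + window]).digest()[:4]
        for i in range(max(1, len(data) - window + 1))
    }
    return frozenset(sorted(hashes)[:keep])


def rebuild(base: bytes, delta: list) -> bytes:
    """Reconstruct an element from its base element and a delta."""
    return b"".join(base[op[1]:op[2]] if op[0] == "copy" else op[1] for op in delta)


class SimilarityStore:
    """Toy store: finds a similar earlier element, then records only the differences."""

    def __init__(self) -> None:
        self.elements = []      # full contents, kept in memory only to keep the sketch short
        self.index = []         # one small signature per stored element
        self.bytes_written = 0  # literal bytes a real system would have to write to disk

    def write(self, data: bytes):
        sig = signature(data)
        # Similarity search: pick the stored element sharing the most signature hashes.
        best, overlap = None, 0
        for pos, other in enumerate(self.index):
            shared = len(sig & other)
            if shared > overlap:
                best, overlap = pos, shared
        base = self.elements[best] if best is not None else b""
        # Byte-level comparison against the similar element (empty if none was found).
        matcher = difflib.SequenceMatcher(None, base, data, autojunk=False)
        delta = []
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            if tag == "equal":
                delta.append(("copy", i1, i2))       # reference to bytes already stored
            elif j2 > j1:
                delta.append(("data", data[j1:j2]))  # new bytes that must be written
        self.bytes_written += sum(len(op[1]) for op in delta if op[0] == "data")
        self.elements.append(data)
        self.index.append(sig)
        return best, delta


if __name__ == "__main__":
    store = SimilarityStore()
    first = b"".join(f"record {i:05d};".encode() for i in range(100))
    second = first[:650] + b"CHANGED!" + first[658:]
    store.write(first)
    base, delta = store.write(second)
    assert rebuild(first, delta) == second
    print(f"logical bytes: {len(first) + len(second)}, written: {store.bytes_written}")
```

The signature list plays the role of a small index that is searched instead of the stored data itself, which is the property the text attributes to the memory-resident index. The linear scan and in-memory element list here would not scale and are used only for brevity.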


Comparing the TS7680 to Other IBM System z Virtual Tape Solutions 
IBM Virtualization Engine™ TS7700 Family 
Although the TS7680 leverages disk storage capabilities, it nevertheless emulates IBM’s 3592 tape 
and should be compared first to other IBM virtualized tape subsystems. While both the TS7720 and 
TS7740 offer compression, they do not deliver the reduction in storage capacity that data 
deduplication is capable of. The TS7700 offerings do provide “Grid” replication functionality, which 
supports the replication of tape data between up to four sites. In addition, Grid supports capabilities 
such as access to state-consistent tape volumes from any site. By contrast, IBM is planning a less 
sophisticated two-site replication capability for the TS7680 in a future release expected early next year. 

 
Feature                          TS7680               TS7740                        TS7720
Max. disk capacity (raw)         1PB                  14TB or 56TB w/ 4-way grid    70TB or 280TB w/ 4-way grid
Max. number of virtual drives    256                  256 or 1024 (4-way grid)      256 or 1024 (4-way grid)
Max. number of virtual volumes   1M                   1M                            1M
Direct tape attachment           No                   Yes (grid)                    Yes when configured in TS7740 grid
Deduplication                    Yes                  No                            No
Device-to-device replication     Future (see below)   Yes                           Yes

        Table 1: Comparison of IBM ProtecTIER® TS7680 and IBM Virtualization Engine™ Family 

IBM VTF™ Mainframe 
VTF Mainframe is based on software acquired in the Diligent Technologies acquisition. VTF Mainframe is 
z/OS® host‐resident software that provides emulation of IBM and IBM‐compatible cartridge devices and 
tape volumes and redirects tape‐targeted data streams to ESCON/FICON channel‐attached disk. It does 
not support HyperFactor deduplication, but it does support remote mirroring between storage devices 
and could be considered along with the TS7680 when there is a need to reduce the time required to run 
batch jobs that are heavy users of tape. Also unlike the TS7680, VTF Mainframe supports multiple 
concurrent access to a single tape data set (Parallel Access Tape). 
                                  





 
    Feature                             TS7680                                VTF Mainframe
    Supported disk                      IBM DS Series, IBM XIV, and/or any    Any ESCON/FICON 3380 or
                                        disk supported for attachment to      3390-compatible
                                        ProtecTIER TS7650G
    Deduplication                       Yes                                   No
    Max. number of virtual drives       256                                   256 per LPAR
    Max. theoretical factoring ratio    25:1 (HyperFactor deduplication;      2:1 (standard compression)
                                        see footnote 4)
    Native tape attachment              No                                    N/A (runs as z/OS-resident software
                                                                              which directs tape data stream to disk)
    DFSMS support                       Yes                                   Yes
    Replication                         Future: TS7680 to TS7680              Yes: between ESCON/FICON-attached
                                        (see below)                           3380/3390-compatible disk subsystems
    Maximum physical disk               1PB                                   No limit other than that imposed
    capacity/system                                                           by z/OS
    Parallel Access Tape                No                                    Yes
    Tape stacking support               Yes                                   Yes

                         Table 2: Comparison of IBM ProtecTIER® TS7680 and IBM VTF™ Mainframe 


Replication as a Future Deliverable 
As part of this announcement, IBM also announced planned support for TS7680 device-to-device 
replication. This will be a significant enhancement to the TS7680 product set in that it will deliver the 
benefits of deduplication to business continuance and disaster recovery planners. During replication, 
only the deduplicated data will be sent from a primary site to a secondary site over the communications 
link between the two, be it LAN, MAN, or WAN. This capability could reduce the overall cost of a robust 
business continuance plan—one that also includes disaster recovery capabilities. Indeed, the ability to 
send deduplicated data between sites could put a more robust DR plan within reach of organizations 
that cannot now afford one. 

Replication will be configured at the tape volume level i.e. the smallest data unit that will be sent 
between primary and secondary sites will be a tape volume. Replication can proceed before the volume 
is unloaded. Volumes will be visible to one active site at a time. 

The trade-off here will be in determining whether to use the significant reduction in data 
transmitted between sites to reduce the cost of a DR-related communications link by reducing the 
bandwidth required, or to improve on recovery time objectives by maintaining the communications link 
already in place. Under the right circumstances, a storage administrator could also consider eliminating 
the need to send physical tapes off-site. 

4
  Ratio highly dependent on the amount of time data resides within the target storage device and the degree of 
variability in the data stream. Some data streams dedupe better than others. 
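
The bandwidth-versus-recovery-time trade-off can be made concrete with a small calculation. The sketch below is a simplified model under stated assumptions: the function names, the 5 TB replication volume, the 8-hour window, and the 10:1 ratio are all hypothetical, and link overhead and latency are ignored.

```python
def required_mbps(data_tb: float, window_hours: float, dedupe_ratio: float = 1.0) -> float:
    """Link speed (Mbit/s) needed to replicate data_tb terabytes within window_hours."""
    if window_hours <= 0 or dedupe_ratio <= 0:
        raise ValueError("window_hours and dedupe_ratio must be positive")
    bits = data_tb * 8e12 / dedupe_ratio          # decimal TB -> bits, after reduction
    return bits / (window_hours * 3600) / 1e6


def tradeoff(data_tb: float, window_hours: float, dedupe_ratio: float) -> dict:
    """Compare the two ways of spending the reduction in transmitted data."""
    return {
        # Link sized for undeduplicated replication within the window
        "current_link_mbps": required_mbps(data_tb, window_hours),
        # Option A: keep the window, buy a smaller link
        "smaller_link_mbps": required_mbps(data_tb, window_hours, dedupe_ratio),
        # Option B: keep the link, finish replication sooner
        "faster_hours": window_hours / dedupe_ratio,
    }


if __name__ == "__main__":
    for name, value in tradeoff(5, 8, 10).items():
        print(f"{name}: {value:.1f}")
```

Option A lowers recurring communications cost, while option B shortens the time until a consistent copy exists at the secondary site. Intermediate choices between the two are equally possible.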


Conclusion 
IBM’s TS7680 delivers a form of data deduplication that is consistent with mainframe production 
environments. The inline deduplication process implemented here should have minimal impact on 
performance when data is written to TS7680-attached disk. The fact that deduplicated data is immediately 
available for replication (once this capability is delivered) means that there is no impact on disaster 
recovery or other processes needing to use the replicated copies. 

The TS7680 gives mainframe administrators another tool to improve service levels with disk based tape 
processing while repurposing tape for other longer‐term storage requirements. The fact that the TS7680 
supports some legacy disk arrays means that previous generation disk can now be used in place of tape 
to accelerate application performance.  

The open systems environment has enjoyed the benefits of deduplication for some time now. Mainframe 
customers looking to leverage those same benefits now have an IBM option to 
evaluate. 

     




        Copyright 2010, Evaluator Group, Inc.                                                         Page 7 of 7     
 




 

More Related Content

What's hot

Virtualization And Disk Performance
Virtualization And Disk PerformanceVirtualization And Disk Performance
Virtualization And Disk PerformanceDiskeeper
 
Chapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage SystemsChapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage SystemsWayne Jones Jnr
 
Big in memory file system
Big in memory file systemBig in memory file system
Big in memory file systemMahesh Gupta
 
Mass storage device
Mass storage deviceMass storage device
Mass storage deviceRaza Umer
 
Introduction to Storage technologies
Introduction to Storage technologiesIntroduction to Storage technologies
Introduction to Storage technologiesKaivalya Shah
 
b tree file system report
b tree file system reportb tree file system report
b tree file system reportDinesh Gupta
 
17. Recovery System in DBMS
17. Recovery System in DBMS17. Recovery System in DBMS
17. Recovery System in DBMSkoolkampus
 
Flash-Specific Data Protection
Flash-Specific Data ProtectionFlash-Specific Data Protection
Flash-Specific Data ProtectionEMC
 
Basics of storage Technology
Basics of storage TechnologyBasics of storage Technology
Basics of storage TechnologyLopamudra Das
 
Storage Area Networks Unit 2 Notes
Storage Area Networks Unit 2 NotesStorage Area Networks Unit 2 Notes
Storage Area Networks Unit 2 NotesSudarshan Dhondaley
 
Storage Training July 10
Storage Training July 10Storage Training July 10
Storage Training July 10Fiaz27
 
Memory Management in Windows 7
Memory Management in Windows 7Memory Management in Windows 7
Memory Management in Windows 7Naveed Qadri
 
I-Sieve: An inline High Performance Deduplication System Used in cloud storage
I-Sieve: An inline High Performance Deduplication System Used in cloud storageI-Sieve: An inline High Performance Deduplication System Used in cloud storage
I-Sieve: An inline High Performance Deduplication System Used in cloud storageredpel dot com
 
Btrfs: Design, Implementation and the Current Status
Btrfs: Design, Implementation and the Current StatusBtrfs: Design, Implementation and the Current Status
Btrfs: Design, Implementation and the Current StatusLukáš Czerner
 

What's hot (20)

Virtualization And Disk Performance
Virtualization And Disk PerformanceVirtualization And Disk Performance
Virtualization And Disk Performance
 
Chapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage SystemsChapter 12 - Mass Storage Systems
Chapter 12 - Mass Storage Systems
 
Big in memory file system
Big in memory file systemBig in memory file system
Big in memory file system
 
Mass storage device
Mass storage deviceMass storage device
Mass storage device
 
Data storage csc
Data storage cscData storage csc
Data storage csc
 
Introduction to Storage technologies
Introduction to Storage technologiesIntroduction to Storage technologies
Introduction to Storage technologies
 
Ch10
Ch10Ch10
Ch10
 
Ch11
Ch11Ch11
Ch11
 
b tree file system report
b tree file system reportb tree file system report
b tree file system report
 
Massstorage
MassstorageMassstorage
Massstorage
 
17. Recovery System in DBMS
17. Recovery System in DBMS17. Recovery System in DBMS
17. Recovery System in DBMS
 
Flash-Specific Data Protection
Flash-Specific Data ProtectionFlash-Specific Data Protection
Flash-Specific Data Protection
 
Basics of storage Technology
Basics of storage TechnologyBasics of storage Technology
Basics of storage Technology
 
Storage Area Networks Unit 2 Notes
Storage Area Networks Unit 2 NotesStorage Area Networks Unit 2 Notes
Storage Area Networks Unit 2 Notes
 
Storage Training July 10
Storage Training July 10Storage Training July 10
Storage Training July 10
 
Memory Management in Windows 7
Memory Management in Windows 7Memory Management in Windows 7
Memory Management in Windows 7
 
Mass storage systemsos
Mass storage systemsosMass storage systemsos
Mass storage systemsos
 
ch11
ch11ch11
ch11
 
I-Sieve: An inline High Performance Deduplication System Used in cloud storage
I-Sieve: An inline High Performance Deduplication System Used in cloud storageI-Sieve: An inline High Performance Deduplication System Used in cloud storage
I-Sieve: An inline High Performance Deduplication System Used in cloud storage
 
Btrfs: Design, Implementation and the Current Status
Btrfs: Design, Implementation and the Current StatusBtrfs: Design, Implementation and the Current Status
Btrfs: Design, Implementation and the Current Status
 

Viewers also liked

Happy New Year Orz
Happy New Year  OrzHappy New Year  Orz
Happy New Year OrzIlo Kvetna
 
I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...
I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...
I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...Paul Cripps
 
Sa tourlin kppt
Sa tourlin kpptSa tourlin kppt
Sa tourlin kpptsrasunil
 
Wrapping e decorazione scooter
Wrapping e decorazione scooterWrapping e decorazione scooter
Wrapping e decorazione scooteromegawraproma
 
19727 thivolle 2011_archivage_1_
19727 thivolle 2011_archivage_1_19727 thivolle 2011_archivage_1_
19727 thivolle 2011_archivage_1_Salsa Nina
 
Corp Profile Access Cad
Corp Profile  Access CadCorp Profile  Access Cad
Corp Profile Access Cadaccesscad
 
Universal marine global aquaculture
Universal marine global aquacultureUniversal marine global aquaculture
Universal marine global aquacultureuniversalnets
 
Consumententrends 2010 2011
Consumententrends 2010 2011Consumententrends 2010 2011
Consumententrends 2010 2011secondsight
 
Unibilt
UnibiltUnibilt
Unibiltak5823
 
Apresentação: Padrões de Projetos para Persistência de Dados
Apresentação: Padrões de Projetos para Persistência de DadosApresentação: Padrões de Projetos para Persistência de Dados
Apresentação: Padrões de Projetos para Persistência de DadosLuan Lima
 

Viewers also liked (12)

Happy New Year Orz
Happy New Year  OrzHappy New Year  Orz
Happy New Year Orz
 
I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...
I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...
I’m on the train; shall I email you my coordinates…? Mobile Geographic Inform...
 
Sa tourlin kppt
Sa tourlin kpptSa tourlin kppt
Sa tourlin kppt
 
Wrapping e decorazione scooter
Wrapping e decorazione scooterWrapping e decorazione scooter
Wrapping e decorazione scooter
 
19727 thivolle 2011_archivage_1_
19727 thivolle 2011_archivage_1_19727 thivolle 2011_archivage_1_
19727 thivolle 2011_archivage_1_
 
Corp Profile Access Cad
Corp Profile  Access CadCorp Profile  Access Cad
Corp Profile Access Cad
 
WBG
WBGWBG
WBG
 
Universal marine global aquaculture
Universal marine global aquacultureUniversal marine global aquaculture
Universal marine global aquaculture
 
Consumententrends 2010 2011
Consumententrends 2010 2011Consumententrends 2010 2011
Consumententrends 2010 2011
 
Unibilt
UnibiltUnibilt
Unibilt
 
Apresentação: Padrões de Projetos para Persistência de Dados
Apresentação: Padrões de Projetos para Persistência de DadosApresentação: Padrões de Projetos para Persistência de Dados
Apresentação: Padrões de Projetos para Persistência de Dados
 
Revista
RevistaRevista
Revista
 

Similar to IBM ProtecTIER Deduplication for z/OS

IBM System Storage TS7650G ProtecTIER Deduplication Gateway
IBM System Storage TS7650G ProtecTIER Deduplication GatewayIBM System Storage TS7650G ProtecTIER Deduplication Gateway
IBM System Storage TS7650G ProtecTIER Deduplication GatewayIBM India Smarter Computing
 
IBM TS7610 ProtecTIER Deduplication Appliance Express – Enterprise Level Tech...
IBM TS7610 ProtecTIER Deduplication Appliance Express – Enterprise Level Tech...IBM TS7610 ProtecTIER Deduplication Appliance Express – Enterprise Level Tech...
IBM TS7610 ProtecTIER Deduplication Appliance Express – Enterprise Level Tech...IBM India Smarter Computing
 
Study notes for CompTIA Certified Advanced Security Practitioner
Study notes for CompTIA Certified Advanced Security PractitionerStudy notes for CompTIA Certified Advanced Security Practitioner
Study notes for CompTIA Certified Advanced Security PractitionerDavid Sweigert
 
Software defined storage rev. 2.0
Software defined storage rev. 2.0 Software defined storage rev. 2.0
Software defined storage rev. 2.0 TTEC
 
03 Data Recovery - Notes
03 Data Recovery - Notes03 Data Recovery - Notes
03 Data Recovery - NotesKranthi
 
Study notes for CompTIA Certified Advanced Security Practitioner (ver2)
Study notes for CompTIA Certified Advanced Security Practitioner  (ver2)Study notes for CompTIA Certified Advanced Security Practitioner  (ver2)
Study notes for CompTIA Certified Advanced Security Practitioner (ver2)David Sweigert
 
Spectrum Scale final
Spectrum Scale finalSpectrum Scale final
Spectrum Scale finalJoe Krotz
 
TechDay - Toronto 2016 - Hyperconvergence and OpenNebula
TechDay - Toronto 2016 - Hyperconvergence and OpenNebulaTechDay - Toronto 2016 - Hyperconvergence and OpenNebula
TechDay - Toronto 2016 - Hyperconvergence and OpenNebulaOpenNebula Project
 
Cohesity Data Platform One Pager
Cohesity Data Platform One PagerCohesity Data Platform One Pager
Cohesity Data Platform One PagerdcVAST
 
IBM Cloud Object Storage Point of View
IBM Cloud Object Storage Point of View IBM Cloud Object Storage Point of View
IBM Cloud Object Storage Point of View Philippe Ponti
 
Webinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsWebinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsStorage Switzerland
 
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GRGlusterFS Presentation FOSSCOMM2013 HUA, Athens, GR
GlusterFS Presentation FOSSCOMM2013 HUA, Athens, GRTheophanis Kontogiannis
 
Understanding the Windows Server Administration Fundamentals (Part-2)
Understanding the Windows Server Administration Fundamentals (Part-2)Understanding the Windows Server Administration Fundamentals (Part-2)
Understanding the Windows Server Administration Fundamentals (Part-2)Tuan Yang
 
Techgate's Business Cloud Backup
Techgate's Business Cloud BackupTechgate's Business Cloud Backup
Techgate's Business Cloud BackupTechgate plc
 
White paper whitewater-datastorageinthecloud
White paper whitewater-datastorageinthecloudWhite paper whitewater-datastorageinthecloud
White paper whitewater-datastorageinthecloudAccenture
 
S100298 pendulum-swings-orlando-v1804a
S100298 pendulum-swings-orlando-v1804aS100298 pendulum-swings-orlando-v1804a
S100298 pendulum-swings-orlando-v1804aTony Pearson
 
Preventing Possible PVS Performance Pain Points
Preventing Possible PVS Performance Pain PointsPreventing Possible PVS Performance Pain Points
Preventing Possible PVS Performance Pain PointsAndrew Wood
 

IBM ProtecTIER Deduplication for z/OS

John Webster
March 04, 2010
Technology Insight Series
Evaluator Group

ProtecTIER® Deduplication Gateway for System z®

Announcement Summary

The many data deduplication technologies available today have the ability to dramatically lower enterprise IT costs for both storing and moving data. “Dedupe” has become widely used in open systems environments to reduce, in some cases significantly, the storage capacity that would otherwise be required for backup, archive, and even primary storage supporting critical applications. Until now, the only way mainframe storage administrators could take advantage of this increasingly popular technology was to insert an ESCON/FICON-to-TCP/IP channel emulation device into a data stream between a mainframe channel and an open systems virtual tape library (VTL) that supports data deduplication (Bus-Tech MDL 100V + FalconStor VTL, for example).

With the announcement of the IBM System Storage TS7680 ProtecTIER Deduplication Gateway for System z (TS7680), IBM now offers its mainframe customers an advanced data deduplication solution that can be used for a number of application scenarios, including backup and other data stream-intensive applications where data is first streamed to tape for subsequent processing. One result of implementing data deduplication on System z is that a variety of disk platforms, both current generation and legacy, can now be considered as cost-effective storage platforms for these types of applications as compared to tape.

IBM acquired Diligent Technologies Corporation in April 2008. IBM subsequently introduced a number of IBM-branded products, including the IBM System Storage TS7650 Appliance and TS7650G Gateway based on ProtecTIER with HyperFactor® (discussed in more detail below) [1], and has installed more than 600 ProtecTIER solutions for open systems under the IBM logo. With the announcement of the TS7680, IBM extended its portfolio of enterprise-class data deduplication solutions by providing one for System z environments based on proven technology.

[1] See also Evaluator Group Announcement Summary of IBM’s TS7650 VTL Systems, published February 9, 2009.

Dedupe addresses capacity, performance, and bandwidth issues

Data deduplication can dramatically decrease the amount of disk space required for backup data when disk is used as a backup target, while retaining the significant performance improvements that disk-based backup devices have over tape. Thus, data deduplication should be considered for any IT environment looking to contain storage costs associated with backup while preserving the delivery of required service levels for data protection. Some storage administrators have decided to replace tape with disk for applications requiring rapid access to data precisely because the cost per GB of deduplicated data on disk made it more affordable to maintain tape data on disk.

Business continuance and disaster recovery-related data replication processes within and outside of a system can also take significant amounts of time, depending on the volume of data and the size of the interconnecting data “pipe.” Deduplicating the data objects within these replication streams, in many cases to a small fraction of their original size, allows them to be moved in much less time. The reduced bandwidth requirement could also translate into lower communications costs between sites for replication-related data transfers.
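
The cost-per-GB argument in this section comes down to simple arithmetic. The following is a minimal sketch of it; the function names, the disk price, and the 10:1 ratio used in the example are illustrative assumptions and are not figures taken from IBM or Evaluator Group.

```python
def effective_cost_per_gb(raw_cost_per_gb, dedupe_ratio):
    """Cost of holding one GB of pre-deduplication data on disk.

    raw_cost_per_gb: price of one GB of physical disk (hypothetical input).
    dedupe_ratio:    N in an N:1 deduplication ratio (hypothetical input).
    """
    if dedupe_ratio <= 0:
        raise ValueError("dedupe_ratio must be positive")
    return raw_cost_per_gb / dedupe_ratio


def physical_gb_needed(logical_gb, dedupe_ratio):
    """Physical disk consumed by logical_gb of backup data after deduplication."""
    if dedupe_ratio <= 0:
        raise ValueError("dedupe_ratio must be positive")
    return logical_gb / dedupe_ratio


if __name__ == "__main__":
    # Placeholder numbers chosen only to show the calculation.
    print(effective_cost_per_gb(raw_cost_per_gb=2.00, dedupe_ratio=10.0))
    print(physical_gb_needed(logical_gb=500.0, dedupe_ratio=10.0))
```

Because real deduplication ratios vary with the data stream, a calculation like this is best run across a range of ratios rather than a single assumed value.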

Post-process vs. Inline

The storage industry’s approach to data deduplication has evolved to the point where today there are essentially two different processes that yield deduplicated data objects. Real-time or streaming data deduplication is known as “in-line,” while data deduplication that occurs later is commonly referred to as “post-process” deduplication. The in-line process deduplicates data “in flight” and in real time, for example as it is being sent to a backup device. Post-processing refers to data deduplication performed at some point after the data has been sent to a storage device, for example a Virtual Tape Library (VTL) that runs deduplication after data has been stored.

As with most options, the optimal method depends on the goals the storage administrator has in mind. Consider the backup process. Storage administrators looking simply to minimize the backup window often choose the post-process method. The potential advantage is that, because the deduplication process is not in the path of the data stream, there is no performance impact during the write operation and therefore no elongation of the backup window. [2] That is, backup data is sent to a temporary holding area within the disk array to negate potential performance impact. Once the backup job completes, the data is examined for duplicates, and duplicate data is removed at a later “post-process” time. The disadvantage of this method is that additional storage space is required when compared with the in-line process.

An alternative to deduplicating after a backup is to perform deduplication “in-line” as data is being sent to the backup device. The first advantage of this method is that no extra disk space is required: the data stored to disk is in deduplicated form right from the start. Second, no additional processing step to deduplicate the data is required. Another advantage of in-line processing is that once the data is deduplicated and stored, it may be replicated immediately to off-site storage. As a result, the time to complete the entire business continuance process, including backup, is reduced, and as mentioned earlier, the bandwidth and/or the time required to replicate is also reduced. As noted above, in some implementations in-line processing impacts performance and therefore backup time. IBM claims “negligible” performance impact due to its use of a lightweight index of no more than 4GB [3] that maps to the contents of a data repository supporting up to 1PB.

[2] Depending on implementation, a second backup may not be able to start until the post-processing deduplication completes.
[3] EGI has not yet been able to validate this claim with ProtecTIER users.
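
The disk-space difference between the two methods described above can be sketched as follows. This is a simplified model under a stated assumption: in the post-process case the full backup sits in the holding area while its deduplicated form is being written, so both coexist at the peak. Real products may reclaim the holding area incrementally, and the function name and numbers are hypothetical.

```python
def peak_disk_gb(backup_gb, dedupe_ratio, inline):
    """Rough peak disk usage for one backup job.

    inline=True:  data lands already deduplicated.
    inline=False: data lands in full in a holding area and is deduplicated
                  later (assumed worst case: both copies exist at once).
    """
    if dedupe_ratio <= 0:
        raise ValueError("dedupe_ratio must be positive")
    deduplicated_gb = backup_gb / dedupe_ratio
    if inline:
        return deduplicated_gb
    return backup_gb + deduplicated_gb


if __name__ == "__main__":
    # Hypothetical 1000 GB backup with an assumed 10:1 ratio.
    print("in-line:     ", peak_disk_gb(1000.0, 10.0, inline=True))
    print("post-process:", peak_disk_gb(1000.0, 10.0, inline=False))
```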

The TS7680 for IBM’s System z

The TS7680 is implemented as a gateway to disk arrays within a System z ESCON® or FICON® channel.

Figure 1: Data Deduplication for System z (Source: IBM and Evaluator Group)

Figure 1 shows a typical deployment of a TS7680 system to provide data deduplication and offline tape storage in a System z environment. Figure 2 depicts how the ProtecTIER TS7680 system operates between the System z host and the disk cache.

Figure 2: IBM TS7680 ProtecTIER Host Connectivity (Source: IBM and Evaluator Group)

Key points to bear in mind when evaluating the TS7680 include:

  • Deduplication is performed in-line as described above.
  • Components within the TS7680 solution include a single frame containing two clustered ProtecTIER servers for failover redundancy, FICON interfaces, and the ProtecTIER software. No System z host-resident software is required.
  • Maximum capacity of the back-end disk array storage is 1PB, meaning that the TS7680 supports up to 1PB of disk for storage of deduplicated data. If a deduplication ratio of 10:1 is assumed, one could expect to store 10PB of normally formatted data within this 1PB space after deduplication. Deduplication ratios can vary widely, however, depending on the amount of data redundancy encountered by the system. It is misleading to translate deduplication ratios seen in open systems environments to System z. Data deduplication ratios can also increase over time as the system processes an increasing amount of data and consequently encounters more redundancy.
  • Backend disk is Fibre Channel-attached and can be IBM System Storage DS8000®, IBM XIV® Storage Systems (SATA disk), IBM System Storage DS5000®/4000® for mid-range System z environments, and any combination of third-party disk arrays already supported for attachment to IBM’s TS7650G.
  • The TS7680 emulates an automated tape library with IBM System Storage 3592 Model J1A tape drives and supporting MEDIA5 (3592 JA) cartridges.
  • From the perspective of the storage administrator, the TS7680 is managed transparently using system-managed tape (SMStape) facilities. No host application, tape management, or JCL changes are required. Virtual tapes are returned to scratch processing after deletion. Alerts are sent to the administrator if available capacity is running low.
  • Backend tape attachment is not supported. Data objects that need to be migrated to tape must first be “rehydrated,” i.e. returned to normal format, and then sent via the System z host to a tape device.

The HyperFactor Process

Storage vendors now offer a variety of ways to deduplicate data. As mentioned, the process can occur in-line or run sometime after data is stored. In addition, there are differing deduplication processes that can be applied. File-level deduplication has been available for a number of years. Deduplication using hashing algorithms to generate a code that represents stored data objects is more recent, and now more common.
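
To make the hash-based approach mentioned above concrete, here is a minimal, generic sketch: data is cut into fixed-size chunks, each chunk is identified by its SHA-256 digest, and a chunk is stored only the first time its digest is seen. This illustrates the general technique only. It is not ProtecTIER's HyperFactor, and the class name, chunk size, and in-memory dictionary are assumptions made for the example.

```python
import hashlib


class ChunkStore:
    """Toy hash-based deduplicating store (illustrative only)."""

    def __init__(self, chunk_size=4096):
        self.chunk_size = chunk_size
        self.chunks = {}  # digest -> chunk bytes, each unique chunk kept once

    def write(self, data):
        """Store data and return its recipe: the ordered list of chunk digests."""
        recipe = []
        for offset in range(0, len(data), self.chunk_size):
            chunk = data[offset:offset + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # Only chunks with a digest not seen before consume new space.
            self.chunks.setdefault(digest, chunk)
            recipe.append(digest)
        return recipe

    def read(self, recipe):
        """Rebuild ("rehydrate") the original data from a recipe."""
        return b"".join(self.chunks[digest] for digest in recipe)

    def stored_bytes(self):
        return sum(len(chunk) for chunk in self.chunks.values())


if __name__ == "__main__":
    store = ChunkStore(chunk_size=4)
    recipe = store.write(b"AAAABBBBAAAA")  # the chunk b"AAAA" occurs twice
    assert store.read(recipe) == b"AAAABBBBAAAA"
    print(len(recipe), "chunks referenced,", store.stored_bytes(), "bytes stored")
```

In a store of this kind the index holds one entry per unique chunk, so it grows with the amount of unique data retained.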
ProtecTIER’s HyperFactor uses a series of algorithms to identify elements within a data stream that have been previously stored by ProtecTIER. Once similar elements have been found, HyperFactor compares the new data to the similar data already stored and writes only the byte-level changes to disk. HyperFactor uses a memory-resident index of no more than 4GB to identify similar data. A copy of the index is maintained on TS7680-attached disk. IBM reports a maximum measured throughput of 500 MB/s using HyperFactor’s in-line data deduplication processing.

Comparing the TS7680 to Other IBM System z Virtual Tape Solutions

IBM Virtualization Engine™ TS7700 Family

Although the TS7680 leverages disk storage capabilities, it nevertheless emulates IBM’s 3592 tape and should be compared first to other IBM virtualized tape subsystems. While both the TS7720 and TS7740 offer compression, they do not deliver the reduction in storage capacity that data deduplication is capable of. The TS7700 offerings do provide “Grid” replication functionality, which supports the replication of tape data between up to four sites. In addition, Grid supports capabilities such as access to state-consistent tape volumes from any site. For the TS7680, however, IBM is planning a less sophisticated two-site replication capability in a future release expected early next year.

Feature | TS7680 | TS7740 | TS7720
Max. disk capacity (raw) | 1PB | 14TB or 56TB w/ 4-way grid | 70TB or 280TB w/ 4-way grid
Max. number of virtual drives supported | 256 | 256 or 1024 (4-way grid) | 256 or 1024 (4-way grid)
Max. number of virtual volumes supported | 1M | 1M | 1M
Direct tape attachment | No | Yes (grid) | Yes when configured in TS7740 grid
Deduplication | Yes | No | No
Device-to-device Replication | Future (see below) | Yes | Yes

Table 1: Comparison of IBM ProtecTIER® TS7680 and IBM Virtualization Engine™ Family

IBM VTF™ Mainframe

VTF Mainframe is based on software acquired in the Diligent Technologies acquisition. VTF Mainframe is z/OS® host-resident software that provides emulation of IBM and IBM-compatible cartridge devices and tape volumes and redirects tape-targeted data streams to ESCON/FICON channel-attached disk. It does not support HyperFactor deduplication, but it does support remote mirroring between storage devices and could be considered along with the TS7680 when there is a need to reduce the time required to run batch jobs that are heavy users of tape. Also unlike the TS7680, VTF Mainframe supports multiple concurrent access to a single tape data set (Parallel Access Tape).

Feature | TS7680 | VTF Mainframe
Supported disk | IBM DS Series, IBM XIV, and/or any disk supported for attachment to ProtecTIER TS7650G | Any ESCON/FICON 3380 or 3390-compatible
Deduplication | Yes | No
Max. number of virtual drives supported | 256 | 256 per LPAR
Max. Theoretical Factoring Ratio | 25:1 [4] (HyperFactor deduplication) | 2:1 (standard compression)
Native tape attachment | No | N/A (runs as z/OS-resident software which directs tape data stream to disk)
DFSMS Support | Yes | Yes
Replication | Future: TS7680 to TS7680 (see below) | Yes, between ESCON/FICON-attached 3380/3390-compatible disk subsystems
Maximum Physical Disk Capacity/System | 1PB | No limit other than that imposed by z/OS
Parallel Access Tape | No | Yes
Tape stacking support | Yes | Yes

Table 2: Comparison of IBM ProtecTIER® TS7680 and IBM VTF™ Mainframe

Replication as a Future Deliverable

As part of this announcement, IBM also announced planned support for TS7680 device-to-device replication. This will be a significant enhancement to the TS7680 product set in that it will deliver the benefits of deduplication to business continuance and disaster recovery planners. During replication, only the deduplicated data will be sent from a primary site to a secondary site over the communications link between the two, be it LAN, MAN, or WAN. This capability could reduce the overall cost of a robust business continuance plan, one that also includes disaster recovery capabilities. Indeed, the ability to send deduplicated data between sites could put a more robust DR plan within reach of organizations that cannot now afford one.

Replication will be configured at the tape volume level, i.e. the smallest data unit that will be sent between primary and secondary sites will be a tape volume. Replication can proceed before the volume is unloaded. Volumes will be visible to one active site at a time.
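
Because only deduplicated data crosses the link, the effect on replication can be estimated with a short calculation. The sketch below assumes decimal units (1 GB = 8,000 megabits), a constant deduplication ratio, and a link with no protocol overhead. The function names and example figures are hypothetical and do not describe measured TS7680 behavior.

```python
MEGABITS_PER_GB = 8 * 1000  # decimal units: 1 GB = 1000 MB = 8000 megabits


def replication_hours(logical_gb, dedupe_ratio, link_mbps):
    """Hours to replicate logical_gb of data when only its deduplicated form is sent."""
    sent_gb = logical_gb / dedupe_ratio
    seconds = sent_gb * MEGABITS_PER_GB / link_mbps
    return seconds / 3600


def required_link_mbps(logical_gb, dedupe_ratio, window_hours):
    """Link speed (megabits/s) needed to finish replication within window_hours."""
    sent_gb = logical_gb / dedupe_ratio
    return sent_gb * MEGABITS_PER_GB / (window_hours * 3600)


if __name__ == "__main__":
    # Hypothetical nightly volume of 4500 GB and an assumed 10:1 ratio.
    print(replication_hours(4500.0, 10.0, link_mbps=1000.0))       # keep the link, finish sooner
    print(required_link_mbps(4500.0, 10.0, window_hours=10.0))     # keep the window, shrink the link
```

The two functions answer the same question from opposite directions: hold the link fixed and see how much time is saved, or hold the time window fixed and see how small a link would suffice.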

The trade-off here will be in determining whether to use the significant reduction in data transmitted between sites to reduce the cost of a DR-related communications link by reducing the bandwidth required, or to improve on recovery time objectives by maintaining the communications link already in place. Under the right circumstances, a storage administrator could also consider eliminating the need to send physical tapes off-site.

[4] Ratio highly dependent on the amount of time data resides within the target storage device and the degree of variability in the data stream. Some data streams dedupe better than others.

Conclusion

IBM’s TS7680 delivers a form of data deduplication that is consistent with mainframe production environments. The in-line deduplication process implemented here should have minimal impact on performance when data is written to TS7680-attached disk. The fact that deduplicated data is immediately available for replication (once this capability is delivered) means that there is no impact to disaster recovery or other processes needing to use the replicated copies.

The TS7680 gives mainframe administrators another tool to improve service levels with disk-based tape processing while repurposing tape for other longer-term storage requirements. The fact that the TS7680 supports some legacy disk arrays means that previous-generation disk can now be used in place of tape to accelerate application performance.

The open systems environment has enjoyed the benefits of deduplication for some time now. Mainframe customers looking to leverage those same benefits now have an IBM option to evaluate.