Virtualizing Big Data in
the Cloud
and everything in between
Technical Intelligence Manager for the Office of CTO,
Delphix
Kellyn Pot’Vin-Gorman
2© 2017 Delphix Corporation
Kellyn Pot’Vin-Gorman
Technical Intelligence Manager for the Office
of CTO, Delphix
• 2 decades of experience as a multi-platform DBA
(Oracle, MSSQL, MySQL, Sybase,
Postgres, ...)
• Oracle ACE Director (Alumni)
• Oak Table Network
• APEX Women in Technology Award, CTA 2014
• STEM education with Raspberry Pi and Python
• Liaison for Denver SQL Server User Group
• Rocky Mountain Oracle Conference Director
and Board Director
• Author, blogger, (http://dbakevlar.com)
Cloud and Big Data
• Big data is often dependent upon relational and
other legacy data stores.
• Multiple data sources and complex, often home-grown
environments lead to human error when left without automation.
• Difficult if not already in the cloud; often built in the
cloud because of easy access to cloud resources.
• Open source leads to open discussion.
Cloud Trends
• 85% of enterprises have a multi-cloud strategy
• 77% are hybrid cloud (different from the 2017 cloud survey)
• Workloads are being run in the cloud:
- 41% in public clouds
- 38% in private clouds
• Enterprise companies are choosing cloud:
- 65% want public cloud
- 63% want private cloud solutions
- 93% will be hybrid
State of the Cloud Survey, RightScale
This trend will only increase in the next five years as cloud
continues to overtake the industry
Big Data Project Types through 2026
[Chart: Billions of Dollars in Big Data Projects per year, 2017-2025; series: Big Data Pro, Big Data HW, Big Data SW]
Cloud Adoption and How It’s Changing
[Chart: adoption percentage for Public Cloud, Private Cloud, Hybrid Cloud and Any Cloud; series: 2016, 2017, 2018]
http://www.rightscale.com/blog/cloud-industry-insights/cloud-computing-trends-2017-state-cloud-survey
How does virtualization fit into this?
• Big data is…well, BIG and built out of necessity.
• Companies report spending more time gathering & distributing big data than
analyzing it.
• Only 59% of big data is easily accessible and 37% report it takes more than a
day to access new data.
• Another 59% report that legacy data storage systems are still hindering big
data initiatives.
http://www.cioinsight.com/it-strategy/big-data/slideshows/big-datas-biggest-challenges.html
Virtualization Adoption Trend
[Chart: Percentage of Virtualized Workloads by year, 2013-2018. Source: Gartner Forecasts]
Our Use Case
• Uses an online project on flat files for big data by Chris Wilson of Time
magazine, based on publicly available datasets.
• Uses flat files and Ajax to produce workable datasets from the highly anticipated
Open Payments data on cms.gov.
https://source.opennews.org/articles/case-flat-files-big-data-projects/
History of the Data
https://www.cms.gov/openpayments/
Value of the Data
[Chart: Average Payment and Percentile for Physicians from Drug Companies Per Medication; payments from $0 to $600 across the 10th to 90th percentiles]
Delivery Method- A Real Decision
• The data is large (VLDB), whether or not it sits in a big data solution.
• Often 90% of the data is consistent between environments, with the differences coming from data appends.
• How often is a solution chosen based on the skill set of those in place, and how will
that choice support future growth?
• Do you want to pay for licensing of database and client servers, as stated in our
use case example?
• The goal was no need to patch, upgrade, etc.: just lock down file permissions and
maintain. This resonated with many customer scenarios.
Should vs. Did
• Although an RDBMS with JSON would have been the preferred method to
deliver the data, the author’s team made a different choice…
This…
“Presented a technical challenge, because our small [team] is
more comfortable with client-side web development than we
are with administering servers and databases. So we decided
to make the whole thing searchable using only flat files and
Ajax requests.”
I’ve heard similar stories before.
My Virtualization Demo Environment
• Each zip file was under 1 GB (NOT big data); 16 GB uncompressed.
• Unstructured, so it was cumbersome to work with.
• Gave an excellent example of network bottlenecks when transferring to the source.
Per the Author-
“The juiciest file, which contained information on payments that
medical companies made to physicians for things like meals,
travel, and consulting fees, was 2.6 million lines in a single 1.44 GB file.”
https://source.opennews.org/articles/case-flat-files-big-data-projects/
The Solution-
• Files hosted on Amazon S3
• CloudFront layer for high availability
• 350K flat files to be migrated to Amazon
• Over 30 seconds per file to upload.
Now do this for development and test, then rinse and repeat until everything is
finally produced and released.
To Paraphrase Hippocrates…
Databases are short, files are long.
How Can This Scenario Be Enhanced?
• Remove the bottleneck and duplication of flat
files with virtualization.
• Ease the ability to access and migrate to a cloud
platform from on-prem.
• In a real environment, which commonly includes legacy
data sources, applications and access points,
containerize and simplify delivery.
Virtualization Options for Big Data
• Partitioning- Because many big data environments are partitioned resources on a single
physical system, virtualizing is often easy with modern virtualization products.
• Isolation- Many big data environments may already be on VMs; creating a
virtualized dataset could eliminate the extensive storage requirements of duplicate
data.
• Package- Collect all tiers and dependencies for a big data solution and
containerize them, making development, testing and delivery simple and automated.
Introduction to Virtualization for Flat Files
Flat files can be virtualized
individually or as part of a
“container” and can
eliminate duplication of
immense files that are part
of big data environments.
What is a Delphix vFile?
• A feature for “Unstructured Files”
• A directory tree of files for Delphix to manage.
• Can be:
- Linked from an existing dataset on a source server into a dSource
- Projected using NFS to a target server.
• A small part of a bigger “Swiss Army knife”, since Delphix can also virtualize
relational databases, applications, etc.
Prerequisites for Cloud Environments
• Ensure NFS mounts are an option in the
cloud environment.
• No clustered environments (MSFC,
Oracle RAC, etc.).
• Ensure credentials are set up
correctly, per the documentation.
Prepare Cloud Host (if not pre-installed)
• For our demo (Amazon): install the NFS client
• Red Hat-based Linux
sudo yum -y install nfs-utils
• Ubuntu
sudo apt-get -y install nfs-common
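When many cloud hosts have to be prepared, picking the right install command can be scripted. The sketch below is a minimal illustration, not part of any Delphix tooling. It assumes the host exposes the usual `ID=` field in `/etc/os-release`, and the distro-to-command table is my own assumption built around the two commands on this slide. It only returns the command as a string and does not run anything.

```python
import shlex

# Install commands from the slide. The distro IDs are an assumption based on
# the ID field commonly found in /etc/os-release.
NFS_CLIENT_INSTALL = {
    "rhel": "sudo yum -y install nfs-utils",
    "centos": "sudo yum -y install nfs-utils",
    "amzn": "sudo yum -y install nfs-utils",
    "ubuntu": "sudo apt-get -y install nfs-common",
    "debian": "sudo apt-get -y install nfs-common",
}


def parse_os_release(text: str) -> dict:
    """Parse KEY=value lines (os-release format) into a dict, removing quotes."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        parts = shlex.split(value)
        fields[key] = parts[0] if parts else ""
    return fields


def nfs_install_command(os_release_text: str) -> str:
    """Return the NFS client install command for the distro described in the text."""
    distro = parse_os_release(os_release_text).get("ID", "").lower()
    if distro not in NFS_CLIENT_INSTALL:
        raise ValueError(f"no NFS client install command known for distro {distro!r}")
    return NFS_CLIENT_INSTALL[distro]


# Hypothetical os-release content for illustration.
sample = 'NAME="Ubuntu"\nID=ubuntu\nVERSION_ID="16.04"\n'
print(nfs_install_command(sample))
```

On a real host the text would come from reading `/etc/os-release`. Unknown distros raise an error rather than guessing a package manager.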
Create A New Source for DataSet
Allow For Initial Sync
Notice the Length of
Time for 16G to
Delphix Engine
Limitations of vFiles Datasets
• No interval refresh capability from the
parent (must be recreated).
• Pulled directly from the dataset’s parent,
so take care with network and I/O
performance.
• Existing vFiles can still rewind, take
snapshots, etc.
Just checking if you’re paying
attention
Now Copy Source Files Into Mounted vFile Source
Take a snapshot to mark updates to Files:
Provision to our Target Host
• The NFS mount is ready on the
LinuxTarget host.
• Choose to provision to the
LinuxTarget.
• Click Next and keep the defaults;
you could also add scripts as
“hooks” to make changes, etc.
Provisioning a vFile
• My Datasets
• Select source to provision from
• Select a snapshot
• Click Provision
• Provision vFiles and update the
default to the correct mount point.
• Select a target environment and
add filter.
Verification- Clone Is Successful!
Disable Source Mount
Enabled Once More!
vFiles Manageability
• Can enable or disable from the CLI or
interface.
• Configure from the same.
• Easily populate all targets from
one source.
• Automate via DevOps as part of
scripting or Jenkins/Chef jobs.
Rewinding a vFile or “File Version Control”
• Highlight the vFiles
• On Timeflow, select the snapshot
• Click on Rewind
No need to SCP, FTP
or recreate!
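The snapshot-and-rewind idea can be shown with a toy in-memory model. This is only a conceptual sketch under my own assumptions (the class and method names are hypothetical) and says nothing about how the Delphix engine stores snapshots internally. It shows why rewinding to a labeled point removes the need to re-copy files.

```python
class VersionedFiles:
    """Toy in-memory model of snapshot and rewind for a set of files."""

    def __init__(self):
        self.files = {}      # path -> content (the live, writable view)
        self.snapshots = []  # list of (label, frozen copy of the files dict)

    def write(self, path, content):
        self.files[path] = content

    def snapshot(self, label):
        # File contents are immutable strings, so a shallow copy freezes this state.
        self.snapshots.append((label, dict(self.files)))

    def rewind(self, label):
        # Search newest-first so a reused label resolves to its latest snapshot.
        for name, state in reversed(self.snapshots):
            if name == label:
                self.files = dict(state)
                return
        raise KeyError(f"no snapshot named {label!r}")


vfiles = VersionedFiles()
vfiles.write("payments.csv", "v1")
vfiles.snapshot("initial load")

# A bad update arrives after the snapshot.
vfiles.write("payments.csv", "v2 (bad update)")
vfiles.write("extra.csv", "should not exist")

# Rewind restores the earlier state without copying anything back in.
vfiles.rewind("initial load")
print(vfiles.files)
```

After the rewind, only `payments.csv` with its original content remains, and the file added after the snapshot is gone.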
Rewind vFile
Files Returned To Previous Version/Status
Refresh vFile
• Refresh from Source
• Update target with
new files
• Return to original files
if catastrophic
situation.
Virtualization and Cloud
An overall trend in companies that includes autonomic computing,
in which the IT environment is able to manage itself based on
perceived activity, and resource-shared computing, in which computer
processing power is utilized and/or paid for only as needed. The goal of
virtualization is to centralize administrative tasks while improving
scalability and workloads via the cloud.
Data Virtualization On-Prem
Dataset on source
server
Targets
“Projects”
thin copies
via NFS
The Delphix engine:
• Is a software appliance (VM)
• Can be hosted on any hardware
• Tracks continual changes at regular intervals
• Uses native technology
• Uses a source (dSource) to track changes
• Can rewind changes to a flat file
• Uses little to no storage during the life of a vFile
Data Virtualization: linking to a source
Source File
Delphix Virtualization Engine
8TB storage
Rsync (UNIX/Linux)
Robocopy (Windows)
Data Virtualization: provisioning a virtual database to a
target
Delphix Virtualization Engine
8 TB storage
NFS
iSCSI
Provisioning a vFile to Target NFS Mounts
[Diagram: the Delphix Virtualization Engine (8 TB storage) connected to two target NFS mounts over NFS/iSCSI]
What if Changes are Made?
[Diagram: the Delphix Virtualization Engine (8 TB storage) connected to three target NAS mounts over NFS/iSCSI]
Data Virtualization: provisioning a virtual database to a
target
[Diagram: a source NFS mount linked via Rsync (UNIX/Linux) or Robocopy (Windows) to the Delphix Virtualization Engine (8 TB storage), which connects to three target NAS mounts over NFS/iSCSI]
Now “Containerize” for Ease of Delivery
[Diagram: Create “Container”. A source linked via Rsync (UNIX/Linux) or Robocopy (Windows) to the Delphix Virtualization Engine (8 TB storage), which connects to DevOps, Testing and Reporting targets over NFS/iSCSI]
Robust and Full Read and Write Files
• As many read/write copies as development, test, reporting, etc. require.
• The Delphix engine tracks the changes to each of the flat files.
• Lets you create containers to isolate data sources, files, applications and other
parts of big data environments, and deliver as many as each project requires.
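The idea of many full read/write copies that track only their own changes can be sketched as a copy-on-write overlay. The example is a toy model under my own assumptions (the `ThinClone` class is hypothetical), not a description of Delphix internals. Each clone reads through to a shared base and stores only the files it has modified.

```python
class ThinClone:
    """Toy read/write copy that stores only the files it has changed."""

    def __init__(self, base):
        self._base = base   # shared source files; a clone never modifies them
        self._changes = {}  # path -> content written by this clone only

    def read(self, path):
        # Prefer this clone's own version, else fall through to the shared base.
        if path in self._changes:
            return self._changes[path]
        if path in self._base:
            return self._base[path]
        raise FileNotFoundError(path)

    def write(self, path, content):
        self._changes[path] = content

    def private_bytes(self):
        """Bytes of storage that belong to this clone alone."""
        return sum(len(content) for content in self._changes.values())


# Hypothetical shared source with two 1,000-byte files.
base = {"a.csv": b"x" * 1000, "b.csv": b"y" * 1000}

dev = ThinClone(base)
test = ThinClone(base)

dev.write("a.csv", b"patched")  # only dev sees this change

print(dev.read("a.csv"), dev.private_bytes())
print(len(test.read("a.csv")), test.private_bytes())
```

Here `dev` holds 7 private bytes and `test` holds none, even though both present the full set of files. That is the storage argument for thin copies in miniature.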
Moving to the Cloud with Standard Methods
On-Prem NFS
Mount
Cloud Storage NFS Mount
http://docs.aws.amazon.com/efs/latest/ug/mounting-fs.html
Standard Cloud Migration Limitations
Even if only development or test has been migrated to the cloud…
• Data is migrated, but this doesn’t account for ongoing data loads or
application connectivity across the network.
• Refreshes are time-consuming and complex.
• Archaic methods are often used to refresh, or replication is required.
• Differences in cost structures between on-prem configurations and the cloud are
rarely taken into consideration in cloud migration projects.
Cost Estimates for Cloud Vendors
On-demand EC2 instances
• Range from $0.0065/hr to $8.184/hr
• Reserved instances provide discounts from 29% to 75% over on-demand
Storage in Elastic Block Store (EBS)
• EBS General Purpose SSD (gp2) volumes
- $0.10 per GB-month of provisioned storage
• EBS Provisioned IOPS SSD (io1) volumes
- $0.125 per GB-month of provisioned storage
- $0.065 per provisioned IOPS-month
• EBS Throughput Optimized HDD (st1) volumes
- $0.045 per GB-month of provisioned storage
Data transfer in EC2
• Data in from the internet is free, but transfer within EC2 can cost $0.01/GB
Source: https://aws.amazon.com/ec2/pricing/, Sep 2016
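The listed prices make it easy to estimate what duplicate copies cost in storage alone. The sketch below uses only the Sep 2016 per-GB-month and per-IOPS-month figures from this slide. The 8 TB dataset size (borrowed from the diagrams in this deck) and the three-copy scenario are illustrative assumptions, and instance, transfer and snapshot charges are ignored.

```python
# Prices per GB-month copied from the slide (AWS list prices, Sep 2016).
EBS_PRICE_PER_GB_MONTH = {"gp2": 0.10, "io1": 0.125, "st1": 0.045}
IO1_PRICE_PER_IOPS_MONTH = 0.065


def monthly_storage_cost(size_gb, volume_type, copies=1, provisioned_iops=0):
    """Estimate monthly EBS storage cost in USD for `copies` full copies of a dataset."""
    cost = size_gb * EBS_PRICE_PER_GB_MONTH[volume_type]
    if volume_type == "io1":
        # io1 volumes are also billed per provisioned IOPS.
        cost += provisioned_iops * IO1_PRICE_PER_IOPS_MONTH
    return cost * copies


# Assumption for illustration: an 8 TB dataset, counted as 8 * 1024 GB.
size_gb = 8 * 1024

single = monthly_storage_cost(size_gb, "gp2")
three_copies = monthly_storage_cost(size_gb, "gp2", copies=3)

print(f"One full copy on gp2:    ${single:,.2f} per month")
print(f"Three full copies (gp2): ${three_copies:,.2f} per month")
```

With these assumptions one copy costs about $819.20 per month and three full copies cost about $2,457.60. The difference is the part that sharing unchanged data between environments aims to avoid.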
Virtualize Flat Files into the Cloud
On Prem NFS
Mount
Delphix Virtualization Engine
8 TB storage
NFS Mount on
Cloud
And From the Command Line
A full and robust CLI allows for mass
provisioning and scripting options that aren’t
feasible with a GUI.
• Manage environment
• Simple and complex provisioning
• User management
• Storage management
• DevOps automation for Jenkins, Chef and
scripting.
Delphix Command Line Examples
Set Environment:
snapshot list database=Vvfiles_546
timeflow "dexample" timeflowRanges;
commit
Provision a new VDB and use the defaults:
delphix> database provision
After each command, note that the CLI relates the hierarchy:
delphix database provision> defaults
Using the defaults, an example is used for the deployment:
VDB Configuration
delphix database provision *> set sourceConfig.type=OracleSIConfig
delphix database provision *> set sourceConfig.databaseName=VEM_833
delphix database provision *> set sourceConfig.uniqueName=VEM_833
Setting Snapshots and Group
Set the appropriate information just as
you would to create a database
deployment from the GUI:
delphix database provision defaults *> set location=DEFAULT_SNAPSHOT
delphix database provision *> set container.group="Dev Copies"
Confidential data
All IaaS solutions provide encryption in flight and encryption at rest
• But encryption doesn’t protect data as much as it needs to be protected.
Europe already requires data masking, not just data encryption, for any
confidential data:
http://ec.europa.eu/justice/data-protection/article-29/documentation/opinion-recommendation/files/2014/wp216_en.pdf
Encryption is Different than Masking/Obfuscation
Data Masking
GDPR and Our Future with Data Protection
https://en.wikipedia.org/wiki/General_Data_Protection_Regulation
The General Data Protection Regulation (GDPR) (Regulation (EU)
2016/679) is a regulation by which the European Parliament, the
Council of the European Union and the European Commission intend
to strengthen and unify data protection for all individuals within the
European Union (EU).
Flat File Masking
Big data is often pulled from various data sources.
The ability to mask this flat file data has incredible potential.
The capability to mask flat files is powerful and a reason for “Agile Masking”.
https://docs.delphix.com/docs/delphix-masking/masking-engine-user-guide/understand-and-manage-the-masking-engine/file-masking/flat-file-masking
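To make the difference between masking and encryption concrete, here is a minimal sketch of masking one column of a CSV flat file. It is not the Delphix masking engine or its algorithms. The keyed-hash approach, the function names, the column names and the sample rows are all assumptions chosen for illustration. The same input always maps to the same masked token, so joins and counts still work, but the token cannot be decrypted back to the original value.

```python
import csv
import hashlib
import hmac
import io


def mask_value(value: str, secret: str) -> str:
    """Replace a value with a stable, non-reversible token derived from a keyed hash."""
    digest = hmac.new(secret.encode("utf-8"), value.encode("utf-8"), hashlib.sha256).hexdigest()
    return f"MASKED_{digest[:10]}"


def mask_csv(text: str, columns: set, secret: str) -> str:
    """Return CSV text with the named columns masked and all other columns untouched."""
    reader = csv.DictReader(io.StringIO(text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        for column in columns:
            row[column] = mask_value(row[column], secret)
        writer.writerow(row)
    return out.getvalue()


# Fictional sample rows, invented for this example.
sample = (
    "physician_name,amount\n"
    "Jane Doe,125.00\n"
    "John Roe,80.50\n"
    "Jane Doe,42.10\n"
)

masked = mask_csv(sample, {"physician_name"}, secret="demo-secret")
print(masked)
```

Both "Jane Doe" rows receive the same token while the payment amounts are unchanged, which keeps non-production data usable for testing without exposing the names.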
Agile Masking and File Formats
https://docs.delphix.com/docs/delphix-masking/masking-engine-user-guide/understand-and-manage-the-masking-engine/file-masking/flat-file-masking
• Multi-record
• CSV
• XML
• Word
• Excel
• PowerPoint
• Unstructured
• EDI
• Installation: Agile masking installed with
valid license (file masking option)
• Characteristics: Description and type of
files
Connectors and Agile Masking of Flat Files
Free Text Redaction Algorithm – This algorithm masks or redacts free
text columns of files. It uses either a Whitelist or a Blacklist to determine
which words are masked. This algorithm may require
additional configuration to work in the manner you desire.
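The whitelist/blacklist behavior described above can be illustrated with a small sketch. This is my own simplified interpretation, not the product's algorithm. The `redact` function, its parameters and the `XXXX` mask are hypothetical. In blacklist mode the listed words are masked, and in whitelist mode every word that is not listed is masked.

```python
import re


def redact(text: str, words: set, mode: str = "blacklist", mask: str = "XXXX") -> str:
    """Mask words in free text according to a blacklist or a whitelist."""
    if mode not in ("blacklist", "whitelist"):
        raise ValueError("mode must be 'blacklist' or 'whitelist'")
    listed = {word.lower() for word in words}

    def replace(match):
        word = match.group(0)
        is_listed = word.lower() in listed
        # Blacklist: hide listed words. Whitelist: hide everything not listed.
        hide = is_listed if mode == "blacklist" else not is_listed
        return mask if hide else word

    # Treat runs of letters, digits and apostrophes as words; keep punctuation as-is.
    return re.sub(r"[A-Za-z0-9']+", replace, text)


note = "Dr. Smith met Jones for lunch"

print(redact(note, {"smith", "jones"}))
print(redact(note, {"dr", "met", "for", "lunch"}, mode="whitelist"))
```

Both calls produce "Dr. XXXX met XXXX for lunch". The whitelist is the safer default for unknown text because any word it has not seen is hidden.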
vFiles, Masking, the Cloud- The Whole Picture
[Diagram: a UNIX source (8 TB database), the Delphix Masking Engine, two Delphix Virtualization Engines (8 TB storage each) and a UNIX target]
Summary: Solutions for Cloud Migrations using
Virtualization
• Review large data sets, both in legacy data
sources and in structured, flat files for
opportunities to be delivered to the cloud.
• Consider virtualizing data sets to avoid
latency issues.
• “Containerize” environments for easy
delivery of complex builds.
• With GDPR and security a higher priority,
consider masking non-production
data/files, encrypting/securing production.
Thank you!
Please fill out the session survey.

Being Successful with ADHDBeing Successful with ADHD
Being Successful with ADHD
 
Azure DBA with IaaS
Azure DBA with IaaSAzure DBA with IaaS
Azure DBA with IaaS
 
Turning ADHD into "Awesome Dynamic Highly Dependable"
Turning ADHD into "Awesome Dynamic Highly Dependable"Turning ADHD into "Awesome Dynamic Highly Dependable"
Turning ADHD into "Awesome Dynamic Highly Dependable"
 
PASS Summit 2020
PASS Summit 2020PASS Summit 2020
PASS Summit 2020
 
DevOps in Silos
DevOps in SilosDevOps in Silos
DevOps in Silos
 
Azure Databases with IaaS
Azure Databases with IaaSAzure Databases with IaaS
Azure Databases with IaaS
 
How to Win When Migrating to Azure
How to Win When Migrating to AzureHow to Win When Migrating to Azure
How to Win When Migrating to Azure
 
Securing Power BI Data
Securing Power BI DataSecuring Power BI Data
Securing Power BI Data
 
Cepta The Future of Data with Power BI
Cepta The Future of Data with Power BICepta The Future of Data with Power BI
Cepta The Future of Data with Power BI
 
Pass Summit Linux Scripting for the Microsoft Professional
Pass Summit Linux Scripting for the Microsoft ProfessionalPass Summit Linux Scripting for the Microsoft Professional
Pass Summit Linux Scripting for the Microsoft Professional
 
Taming the shrew Power BI
Taming the shrew Power BITaming the shrew Power BI
Taming the shrew Power BI
 
PASS 24HOP Linux Scripting Tips and Tricks
PASS 24HOP Linux Scripting Tips and TricksPASS 24HOP Linux Scripting Tips and Tricks
PASS 24HOP Linux Scripting Tips and Tricks
 
Power BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle CloudPower BI with Essbase in the Oracle Cloud
Power BI with Essbase in the Oracle Cloud
 
ODTUG Leadership Talk- WIT and Sponsorship
ODTUG Leadership Talk-  WIT and SponsorshipODTUG Leadership Talk-  WIT and Sponsorship
ODTUG Leadership Talk- WIT and Sponsorship
 
DevOps and Decoys How to Build a Successful Microsoft DevOps Including the Data
DevOps and Decoys  How to Build a Successful Microsoft DevOps Including the DataDevOps and Decoys  How to Build a Successful Microsoft DevOps Including the Data
DevOps and Decoys How to Build a Successful Microsoft DevOps Including the Data
 

Último

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 

Último (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 

Data platforms 2017

  • 1. Virtualizing Big Data in the Cloud, and everything in between. Kellyn Pot’Vin-Gorman, Technical Intelligence Manager for the Office of CTO, Delphix
  • 2. Kellyn Pot’Vin-Gorman, Technical Intelligence Manager for the Office of CTO, Delphix (© 2017 Delphix Corporation)
      • Two decades of experience as a multi-platform DBA (Oracle, MSSQL, MySQL, Sybase, Postgres, …)
      • Oracle ACE Director (Alumni)
      • Oak Table Network
      • APEX Women in Technology Award, CTA 2014
      • STEM education with Raspberry Pi and Python
      • Liaison for Denver SQL Server User Group
      • Rocky Mountain Oracle Conference Director and Board Director
      • Author, blogger (http://dbakevlar.com)
  • 3. Cloud and Big Data
      • Big data often depends on relational and other legacy data stores.
      • Multiple data sources and complex, often home-grown environments lead to human error when left without automation.
      • Difficult if not already in the cloud; often built in the cloud because of the easy access to cloud resources.
      • Open source leads to open to discussion.
  • 4. Cloud Trends
      • 85% of enterprises have a multi-cloud strategy
      • 77% are hybrid cloud (different than the 2017 cloud survey)
      • Workloads are being run in the cloud:
          - 41% in public clouds
          - 38% in private clouds
      • Enterprise companies are choosing cloud:
          - 65% want public cloud
          - 63% want private cloud solutions
          - 93% will be hybrid
      Source: State of the Cloud Survey, RightScale
  • 5. This trend will only increase in the next five years as cloud continues to overtake the industry.
  • 6. Big Data Project Types through 2026 [chart: billions of dollars in big data projects per year, 2017 to 2025, scale 0 to 100; series: Big Data Pro, Big Data HW, Big Data SW]
  • 7. Cloud Adoption and How It’s Changing [chart: adoption percentage for Public Cloud, Private Cloud, Hybrid Cloud and Any Cloud; series: 2016, 2017, 2018] Source: http://www.rightscale.com/blog/cloud-industry-insights/cloud-computing-trends-2017-state-cloud-survey
  • 8. How does virtualization fit into this?
      • Big data is…well, BIG, and built out of necessity.
      • Companies report spending more time gathering and distributing big data than analyzing it.
      • Only 59% of big data is easily accessible, and 37% report it takes more than a day to access new data.
      • Another 59% report that legacy data storage systems are still hindering big data initiatives.
      Source: http://www.cioinsight.com/it-strategy/big-data/slideshows/big-datas-biggest-challenges.html
  • 9. Virtualization Adoption Trend [chart: percentage of virtualized workloads, 2013 to 2018; source: Gartner forecasts]
  • 10. Our Use Case
      • Uses an online project on flat files for big data by Chris Wilson of Time magazine, based on publicly available datasets.
      • The project uses flat files and Ajax to produce workable datasets from the highly anticipated open payments data published by cms.gov.
      Source: https://source.opennews.org/articles/case-flat-files-big-data-projects/
  • 11. History of the Data: https://www.cms.gov/openpayments/
  • 12. Value of the Data [chart: “Average Payment and Percentile for Physicians from Drug Companies”, per medication payments; payment scale $0 to $600, percentiles 10 to 90]
  • 13. Delivery Method: A Real Decision
      • Data is large, whether it sits in a big data solution or otherwise (VLDB).
      • 90% of data is often consistent between environments, with data appends.
      • How often is a solution chosen based on the skill set of those in place, and how will it support future growth?
      • Do you want to pay for licensing of database and client servers, as stated in our use case example?
      • No need to patch, upgrade, etc. The goal was just to lock down file permissions and maintain, and this resonated with many customer scenarios.
  • 14. Should vs. Did: Although an RDBMS with JSON would have been the preferred method to deliver the data, the author’s team made a different choice…
  • 15. This… “Presented a technical challenge, because our small [team] is more comfortable with client-side web development than we are with administering servers and databases. So we decided to make the whole thing searchable using only flat files and Ajax requests.” I’ve heard similar stories before.
  • 16. My Virtualization Demo Environment
      • Each zip file was under 1 GB (NOT big data); 16 GB uncompressed.
      • Unstructured, so it was cumbersome to work with.
      • Gave an excellent example of network bottlenecks when transferring to the source.
  • 17. “The juiciest file, which contained information on payments that medical companies made to physicians for things like meals, travel, and consulting fees, was 2.6 million lines in a single 1.44 GB file.” Per the author: https://source.opennews.org/articles/case-flat-files-big-data-projects/
  • 18. The Solution
      • Files hosted on Amazon S3
      • CloudFront layer for high availability
      • 350K flat files to be migrated to Amazon
      • Over 30 seconds per file to upload
      Now do this for development and test, then rinse and repeat, finally producing and releasing until complete.
  • 19. To Paraphrase Hippocrates… Databases are short, files are long.
  • 20. How Can This Scenario Be Enhanced?
      • Remove the bottleneck and duplication of flat files with virtualization.
      • Ease the ability to access and migrate to a cloud platform from on-prem.
      • In a real environment, which commonly includes legacy data sources, applications and access points, containerize and simplify delivery.
  • 21. Virtualization Options for Big Data
      • Partitioning: Because much big data consists of partitioned resources across a single physical system, virtualizing is often easy with modern virtualization products.
      • Isolation: Many big data environments may already be on VMs; creating a virtualized dataset could eliminate the extensive storage requirements of duplicate data.
      • Package: Collect all tiers and dependencies for a big data solution and containerize them, making development, testing and delivery simple and automated.
  • 22. Introduction to Virtualization for Flat Files: Flat files can be virtualized individually or as part of a “container”, which can eliminate duplication of the immense files that are part of big data environments.
  • 23. What is a Delphix vFile?
      • Feature for “Unstructured Files”
      • A directory tree of files for Delphix to manage.
      • Can be:
          - Linked from an existing dataset on a source server into a dSource
          - Projected using NFS to a target server
      • A small part of a bigger “Swiss Army knife”, as Delphix is also able to virtualize relational databases, applications, etc.
  • 24. Prerequisites for Cloud Environments
      • Ensure NFS mounts are an option in the cloud environment.
      • No clustered environments (MSFC, Oracle RAC, etc.).
      • Ensure credentials are set up correctly, per the documentation.
  • 25. Prepare Cloud Host (if not pre-installed)
      • For our demo (Amazon): install the NFS client
      • Linux or Red Hat: sudo yum -y install nfs-utils
      • Ubuntu: sudo apt-get -y install nfs-common
  • 26. Create A New Source for DataSet
  • 27. Allow For Initial Sync: notice the length of time needed to transfer 16 GB to the Delphix Engine.
  • 28. Limitations of vFiles Datasets
      • No interval refresh capability from the parent (must be recreated).
      • Pulled directly from the dataset’s parent, so take care with network and I/O performance.
      • Existing datasets can still rewind, take snapshots, etc.
      Just checking if you’re paying attention
  • 29. Now Copy Source Files Into Mounted vFile Source, then take a snapshot to mark updates to the files.
  • 30. Provision to our Target Host
      • The NFS mount is ready on the LinuxTarget host.
      • Choose to provision to the LinuxTarget.
      • Click Next and keep the defaults; scripts can be added as “hooks” to make changes as part of provisioning.
  • 31. Provisioning a vFile
      • My Datasets
      • Select source to provision from
      • Select a snapshot
      • Click Provision
      • Provision vFiles and update the default to the correct mount point.
      • Select a target environment and add filter.
  • 32. Verification: Clone Is Successful! [screenshot captions: “Disable Source Mount”; “Enabled Once More!”]
  • 33. vFiles Manageability
      • Can enable or disable from the CLI or interface.
      • Configure from the same.
      • Easily populate all targets from one source.
      • Automate via DevOps as part of scripting or Jenkins/Chef jobs.
  • 34. Rewinding a vFile or “File Version Control”
      • Highlight the vFiles
      • On Timeflow, select the snapshot
      • Click on Rewind
      No need to SCP, FTP or recreate!
  • 35. Rewind vFile
  • 36. Files Returned To Previous Version/Status
  • 37. Refresh vFile
      • Refresh from source
      • Update target with new files
      • Return to the original files in a catastrophic situation.
  • 38. Virtualization and Cloud: Virtualization is part of an overall trend in companies that includes autonomic computing, where the IT environment will be able to manage itself based on perceived activity, and resource shared computing, in which computer processing power is utilized and/or paid for only as needed. The goal of virtualization is to centralize administrative tasks while improving scalability and workloads via the cloud.
  • 39. Data Virtualization On-Prem [diagram labels: dataset on source server; targets; “projects” thin copies via NFS]
  • 40. The Delphix engine:
      • Is a software appliance (VM)
      • Can be hosted on any hardware
      • Tracks continual changes at regular intervals
      • Uses native technology
      • Uses a source (dSource) to track changes
      • Can rewind changes to a flat file
      • Uses little to no storage during the life of a vFile
  • 41. Data Virtualization: linking to a source [diagram labels: source file; Delphix Virtualization Engine, 8 TB storage; Rsync (UNIX/Linux) or Robocopy (Windows)]
  • 42. Data Virtualization: provisioning a virtual database to a target [diagram labels: Delphix Virtualization Engine, 8 TB storage; NFS/iSCSI]
  • 43. Provisioning a vFile to Target NFS Mounts [diagram labels: Delphix Virtualization Engine, 8 TB storage; two target NFS mounts, each over NFS/iSCSI]
  • 44. What if Changes are Made? [diagram labels: Delphix Virtualization Engine, 8 TB storage; three target NAS mounts, each over NFS/iSCSI]
  • 45. Data Virtualization: provisioning a virtual database to a target [diagram labels: source NFS mount, linked with Rsync (UNIX/Linux) or Robocopy (Windows); Delphix Virtualization Engine, 8 TB storage; three target NAS mounts, each over NFS/iSCSI]
  • 46. Now “Containerize” for Ease of Delivery [diagram labels: source linked with Rsync (UNIX/Linux) or Robocopy (Windows); Delphix Virtualization Engine, 8 TB storage; DevOps, Testing and Reporting targets, each over NFS/iSCSI; create “container”]
  • 47. Robust and Full Read and Write Files
      • As many as development, test, reporting, etc. require.
      • The Delphix engine tracks the changes to each of the flat files.
      • Containers can be created to isolate data sources, files, applications and other parts of big data environments, and delivered in whatever number each project requires.
  • 48. Moving to the Cloud with Standard Methods [diagram labels: on-prem NFS mount; cloud storage NFS mount] http://docs.aws.amazon.com/efs/latest/ug/mounting-fs.html
  • 49. Standard Cloud Migration Limitations. Even if only development or test has been migrated to the cloud…
      • Data is migrated, but this doesn’t account for ongoing data loads or application connectivity across the network.
      • Refreshes are time-consuming and complex.
      • Archaic methods are often used to refresh, or replication is required.
      • Differences in cost structures from on-prem configurations are rarely taken into consideration in cloud migration projects.
  • 50. Cost Estimates for Cloud Vendors (source: https://aws.amazon.com/ec2/pricing/, Sep 2016)
      • On-demand EC2 instances
          - Range from $0.0065/hr to $8.184/hr
          - Reserved instances provide discounts from 29% to 75% over on-demand
      • Storage in Elastic Block Store (EBS)
          - General Purpose SSD (gp2) volumes: $0.10 per GB-month of provisioned storage
          - Provisioned IOPS SSD (io1) volumes: $0.125 per GB-month of provisioned storage, plus $0.065 per provisioned IOPS-month
          - Throughput Optimized HDD (st1) volumes: $0.045 per GB-month of provisioned storage
      • Data transfer in EC2
          - Data in from the internet is free, but transfer internally within EC2 can be $0.01/GB
  • 51. Virtualize Flat Files into the Cloud [diagram labels: on-prem NFS mount; Delphix Virtualization Engine, 8 TB storage; NFS mount on cloud]
  • 52. And From the Command Line. A full and robust CLI allows for mass provisioning and scripting options that aren’t feasible with a GUI.
      • Manage environment
      • Simple and complex provisioning
      • User management
      • Storage management
      • DevOps automation for Jenkins, Chef and scripting.
  • 53. Delphix Command Line Examples
      • Set environment: snapshot list database=Vvfiles_546 timeflow "dexample" timeflowRanges; Commit
      • Provision a new VDB and use the defaults: delphix> database provision
      • After each command, note that the CLI shows the hierarchy: delphix database provision> defaults
      • Using the defaults, an example is used for the deployment:
  • 54. VDB Configuration
      delphix database provision *> set sourceConfig.type=OracleSIConfig
      delphix database provision *> set sourceConfig.databaseName=VEM_833
      delphix database provision *> set sourceConfig.uniqueName=VEM_833
  • 55. Setting Snapshots and Group. Set the appropriate information just as you would to create a database deployment from the GUI:
      delphix database provision defaults *> set location=DEFAULT_SNAPSHOT
      delphix database provision *> set container.group="Dev Copies"
  • 56. Confidential Data
      • All IaaS solutions provide encryption in flight and encryption at rest.
      • But encryption doesn’t protect data as much as it needs to be protected.
      • Europe already requires data masking, not just data encryption, for any confidential data: http://ec.europa.eu/justice/data-protection/article-29/documentation/opinion-recommendation/files/2014/wp216_en.pdf
  • 57. Encryption is Different than Masking/Obfuscation
  • 58. Data Masking
  • 59. GDPR and Our Future with Data Protection: The General Data Protection Regulation (GDPR) (Regulation (EU) 2016/679) is a regulation by which the European Parliament, the Council of the European Union and the European Commission intend to strengthen and unify data protection for all individuals within the European Union (EU). Source: https://en.wikipedia.org/wiki/General_Data_Protection_Regulation
  • 60. Flat File Masking
      • Big data is often pulled from various data sources.
      • The ability to mask this flat file data has incredible potential.
      • The capability to mask flat files is powerful and a reason for “Agile Masking”.
      https://docs.delphix.com/docs/delphix-masking/masking-engine-user-guide/understand-and-manage-the-masking-engine/file-masking/flat-file-masking
  • 61. Agile Masking and File Formats
      • File formats: Multi-record, CSV, XML, Word, Excel, PowerPoint, Unstructured, EDI
      • Installation: Agile Masking installed with a valid license (file masking option)
      • Characteristics: description and type of files
      https://docs.delphix.com/docs/delphix-masking/masking-engine-user-guide/understand-and-manage-the-masking-engine/file-masking/flat-file-masking
  • 62. Connectors and Agile Masking of Flat Files. Free Text Redaction Algorithm: this algorithm masks or redacts free text columns of files. It uses either a whitelist or a blacklist to determine which words are masked and which are not. It may require additional configuration to work in the manner you desire.
  • 63. vFiles, Masking, the Cloud: The Whole Picture [diagram labels: UNIX source, 8 TB database; Delphix Virtualization Engine, 8 TB storage; Delphix Masking Engine; second Delphix Virtualization Engine, 8 TB storage; Unix target]
  • 64. vFiles, Masking, the Cloud: The Whole Picture [diagram labels: Unix source; Delphix Virtualization Engine, 8 TB storage; Delphix Masking Engine; second Delphix Virtualization Engine, 8 TB storage; Unix target]
  • 65. Summary: Solutions for Cloud Migrations using Virtualization
      • Review large data sets, both in legacy data sources and in structured flat files, for opportunities to be delivered to the cloud.
      • Consider virtualizing data sets to avoid latency issues.
      • “Containerize” environments for easy delivery of complex builds.
      • With GDPR and security a higher priority, consider masking non-production data/files and encrypting/securing production.
  • 66. Thank you! Please fill out the session survey.
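
Worked example for slide 18 (upload time). The slide quotes 350K flat files at over 30 seconds per file. The sketch below turns those two figures into elapsed time. It is back-of-the-envelope arithmetic, not a measurement: it assumes strictly serial uploads and, purely for illustration, three environments (development, test, production). Parallel uploads would shorten the total considerably.

```python
# Figures from slide 18; ENVIRONMENTS is an illustrative assumption.
FILES = 350_000          # flat files to migrate
SECONDS_PER_FILE = 30    # the slide says "over 30 seconds", so this is a lower bound
ENVIRONMENTS = 3         # assumed: development, test, production

SECONDS_PER_DAY = 24 * 60 * 60


def serial_upload_days(files: int, seconds_per_file: float) -> float:
    """Days needed to upload every file one after another."""
    return files * seconds_per_file / SECONDS_PER_DAY


one_pass = serial_upload_days(FILES, SECONDS_PER_FILE)
all_passes = one_pass * ENVIRONMENTS

print(f"One environment: {one_pass:.1f} days")
print(f"{ENVIRONMENTS} environments: {all_passes:.1f} days")
```

Even as a lower bound, a single serial pass takes roughly four months, which is the duplication bottleneck the deck proposes to remove with virtualization.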
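
Worked example for slides 13, 46 and 50 (storage cost of full copies versus thin copies). This is a simplified sketch and not Delphix's actual storage accounting. It uses the gp2 price from slide 50 and the 8 TB size shown in the diagrams, and assumes three targets (DevOps, Testing, Reporting) that each change about 10% of the data, loosely based on the "90% of data is consistent" remark on slide 13. Compression, IOPS charges and engine overhead are ignored.

```python
GP2_USD_PER_GB_MONTH = 0.10   # slide 50, Sep 2016 pricing
DATASET_GB = 8 * 1024         # 8 TB, the size shown in the diagrams
COPIES = 3                    # assumed: DevOps, Testing, Reporting
CHANGED_FRACTION = 0.10       # assumed: each copy diverges on ~10% of the data


def full_copy_cost(dataset_gb: float, copies: int, price: float) -> float:
    """Monthly cost when every environment stores its own full copy."""
    return dataset_gb * copies * price


def thin_copy_cost(dataset_gb: float, copies: int,
                   changed_fraction: float, price: float) -> float:
    """Monthly cost with one shared base plus only the changed data per copy."""
    shared_base = dataset_gb
    per_copy_changes = dataset_gb * changed_fraction * copies
    return (shared_base + per_copy_changes) * price


full = full_copy_cost(DATASET_GB, COPIES, GP2_USD_PER_GB_MONTH)
thin = thin_copy_cost(DATASET_GB, COPIES, CHANGED_FRACTION, GP2_USD_PER_GB_MONTH)

print(f"Full copies: ${full:,.2f} per month")
print(f"Thin copies: ${thin:,.2f} per month")
```

Under these assumptions the shared-base model costs less than half as much per month, and the gap widens as more targets are added, because each extra target adds only its changed data.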
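
Illustration for slide 34 (rewind as "file version control"). The toy model below shows only the idea of taking a snapshot and rewinding to it. It is not how the Delphix engine works internally: the class name `VersionedFiles` and its methods are hypothetical, and the sketch copies a whole in-memory dictionary per snapshot rather than tracking changed blocks.

```python
class VersionedFiles:
    """Toy in-memory file set with snapshot and rewind (hypothetical names)."""

    def __init__(self) -> None:
        self.files: dict[str, str] = {}             # file name -> contents
        self._snapshots: list[dict[str, str]] = []  # saved states, oldest first

    def snapshot(self) -> int:
        """Save the current state and return its snapshot id."""
        self._snapshots.append(dict(self.files))
        return len(self._snapshots) - 1

    def rewind(self, snapshot_id: int) -> None:
        """Replace the current files with the state saved in a snapshot."""
        self.files = dict(self._snapshots[snapshot_id])


demo = VersionedFiles()
demo.files["payments.csv"] = "v1"
before_change = demo.snapshot()

demo.files["payments.csv"] = "v2 (bad load)"
demo.files["extra.csv"] = "added by mistake"

demo.rewind(before_change)
print(demo.files)
```

After the rewind the file set matches the snapshot again, with no SCP, FTP or re-creation of files, which is the point the slide makes.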
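
Illustration for slide 62 (free text redaction with a whitelist or blacklist). This is a minimal sketch of the general technique, not the Delphix masking algorithm. The function name `redact`, its parameters and the `XXXX` mask are assumptions made for this example. With a blacklist only the listed words are masked; with a whitelist every word except the listed ones is masked.

```python
import re

WORD = re.compile(r"[A-Za-z0-9']+")


def redact(text: str, words: set[str], mode: str = "blacklist",
           mask: str = "XXXX") -> str:
    """Mask words in free text using a blacklist or a whitelist."""
    if mode not in ("blacklist", "whitelist"):
        raise ValueError("mode must be 'blacklist' or 'whitelist'")
    listed = {w.lower() for w in words}

    def replace(match: re.Match) -> str:
        word = match.group(0)
        in_list = word.lower() in listed
        # Blacklist hides listed words; whitelist hides everything else.
        hide = in_list if mode == "blacklist" else not in_list
        return mask if hide else word

    return WORD.sub(replace, text)


note = "Dr. Smith received lunch"
print(redact(note, {"smith"}))
print(redact(note, {"dr", "received", "lunch"}, mode="whitelist"))
```

A whitelist is the more conservative choice for unstructured text, since any word nobody anticipated is masked by default, at the cost of needing a longer list to keep the text readable.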