SlideShare uma empresa Scribd logo
1 de 12
Baixar para ler offline
Expert Insights
A blueprint
for data in
a multicloud
world
Experts on this topic
Tony Giordano Tony has more than 25 years of global professional
services experience in performance marketing, big
Senior Partner and VP,
data and analytics, information governance, customer
CBDS Data Platform Services
relationship management, and program management,
Global Leader, IBM Services
with industry experience in financial services, life
linkedin.com/in/
sciences, retail, and automotive sectors. He has
tony-anthony-giordano-3111091
authored two books on Information Management.
tonygior@us.ibm.com
Mehdi Charafeddine Mehdi is a technical leader with a track record for
delivering large complex projects that require a new
Associate Partner,
way of looking at problems and solving them. As a
Executive Solutions Architect,
strong Open Source advocate, Mehdi experiments with
IBM Services
new technologies with a value potential to clients. This
linkedin.com/in/mehdichara
requires a unique combination of a deep understanding
mchara@ibm.com
of software versus the risk/return ratio for IBM clients.
Dan Sutherland Dan is an expert at transforming a client’s strategic
business vision into innovative solutions while
Distinguished Engineer and CTO,
exploiting leading edge technologies. Dan is
Data Platforms, IBM Services
comfortable resolving complex data architecture and
linkedin.com/in/
design issues and regularly advises on very large, high
dan-sutherland-4661a67
volume data-centric projects where sound data
dsutherl@us.ibm.com
architecture and design techniques are key
components to a successful solution implementation.
Becky Carroll Becky has over 25 years of experience helping
business leaders in large enterprises transform
Associate Partner,
traditional business processes to new business
Cognitive and Analytics
models. She has led projects covering big data and
Cognitive Business Decision
analytics, cognitive and AI, digital and social business
Support, IBM Services
strategy and analytics, CRM, and customer service.
linkedin.com/in/beckycarroll
Becky’s work spans many industries, including High
rlcarrol@us.ibm.com
Tech, Telecommunications, Entertainment & Media,
Financial Services, Airline, Automotive, Industrial
Products, Life Sciences, Gaming, Education. Becky
holds a BS in Electrical Engineering and Computer
Science and a MBA in Marketing Management.
A well-planned and executed
data strategy helps avoid
unwelcome surprises in a
multicloud environment.
Talking points
A data strategy and architecture
addressing applications and data
spread over multiple clouds are essential
for companies operating in today’s
multicloud environment.
A business-driven data strategy should
be fully integrated with a company’s
evolving multicloud architecture to keep
data from becoming siloed and, therefore,
not easily accessible to applications.
A strong data strategy will focus on
business transformation opportunities,
avoid “lock-in” to any particular cloud,
and use DataOps, a method to automate
preparatory work of data scientists, to
derive the most insight possible from
available data.
Moving data in
a multicloud world
As companies move beyond initial forays into the cloud,
many are experiencing the advantages—and the
challenges—of a multicloud environment. Benefits of
multicloud are manifest: increased ability to innovate,
enhanced products and services, and agile business
processes.1
In fact, according to a recent Institute for Business Value
(IBV) study, 85 percent of companies globally are already
operating in a multicloud environment.2
By 2021,
96 percent plan to be using multiple clouds.3
Companies are remaking themselves as “digital” and
making artificial intelligence (AI) central to their core
business processes. As digitization continues, the
movement to multiple clouds will increase, because it
often involves obtaining cloud-based services from
different cloud vendors. But the migration is unlikely to
be as simple as merely shifting a current architecture
to a new location in the cloud.
Each cloud vendor has its own architectural methods.
How each organizes data could be incompatible with
other clouds. If addressing such incompatibility is not
accounted for in a well-designed strategy, a company
may experience poor performance and higher-than-
expected cost.
At the root of the challenge is the nature of data and its
rapid growth. When companies expand the number of
services and applications they use, their data grows
exponentially. But it isn’t the amount of data that is the
real issue—it’s where that data resides. And where it
resides is often based on where it is collected and created.
1
A data fabric helps make
data visible—and usable—
across an entire enterprise.
Such dispersed data created by distinct business units or
functions is known as a “data silo.” Silos drastically
diminish the data’s usability. The data may be redundant,
but organized differently, and therefore difficult to
correlate. There may be discrepancies in the data because
of the way it was collected. And the data may be unknown
or otherwise inaccessible to applications in other parts of
the company.
Since almost all companies will soon be functioning in the
multicloud reality, negative effects of data siloed within a
company will only be heightened as companies spread
processes, applications, and pockets of data across
multiple clouds (see Figure 1).
Figure 1
Ninety-eight percent of surveyed organizations say they
plan to use multiple hybrid clouds within three years
0-1 Clouds
24%
2%
2-9 Clouds
68%
70%
10+ Clouds
9%
28%
Now In 3 years
Source: IBV 2018 Multicloud management survey, Q2 & Q3.
Q2 How many cloud services and platforms are you currently
using across your entire business?
Q3 How many cloud services and platforms are you planning
to use across your entire business in the next three years?
These new circumstances lead to an important question.
How can companies successfully navigate the complexity,
cost and latency issues operating in a multicloud
environment generates?
The answer? By properly defining a “data fabric”
or “blueprint for data” in a multicloud world.
A data fabric to
navigate multiple clouds
A data fabric is a conceptual representation or
architecture of how data assets will be organized. This
blueprint is a formal structure to define and view data
across an enterprise, and is independent of particular
infrastructure or cloud requirements.
A data fabric helps make data visible—and usable—
across an entire enterprise. It guides how the data will
be maintained and governed. It’s founded on the concept
that while business processes may change, the data
underpinning them should be stable.
Data is grouped into collections called “hubs” or “lakes”
for visibility and access. Data fabrics can help avoid or
mitigate data silos, low reliability and scalability, reliance
on legacy systems, and cost inefficiencies. But to work
in a multicloud environment, a data fabric needs to
accommodate different patterns of use.
Consider three examples of how a company might choose
to distribute its workloads, applications, and data across
different clouds.
2
Example 1
In the first instance, a company wants to run an analytics reside on one cloud, with only a specific analytics service
package offered on IBM Cloudpak, using data it already calling on that data from a different cloud (see Figure 2).
has on AWS. Here, its data ingestion and curation services
Figure 2
Managing data across a multicloud by architectural component
3
Content services
Data curation services
Data hub component
Consumption layer
Conformed layer
Raw ingest layer
Relational HDFS Parquet Hive Mongo Columner
Rapid ingestion services
Real time ingestion Batch ingestion
Deployment Platform
Management User experience services
Orchestration services
Insight services
Security
Platform
AWS
IBM Cloudpak
To work in a multicloud
environment, a data fabric
must accommodate
different patterns of use.
Example 2
In a second example, the company wants to take
advantage of attractive pricing on applications offered
on Azure, but keep other existing processes on AWS.
Its ingestion of raw data and its ability to conform and
consume data, then, are distributed over different clouds.
It also subscribes to an analytics service on IBM Cloudpak
(see Figure 3).
Figure 3
Managing data across a multicloud by data hub layer
Data curation services
Data hub component
Consumption layer
Conformed layer
Raw ingest layer
Relational HDFS Parquet Hive Mongo Columner
Rapid ingestion services
Real time ingestion Batch ingestion
Deployment Platform
Management
Platform
User experience services
Orchestration services
Content services
Insight services
IBM Cloudpak
AWS
Azure
Security
4
Example 3
In a third example, the company needs to have its various cross-business function such as security on Azure
business divisions functioning on different clouds because operating across all its other clouds (see Figure 4).
of their unique needs, but chooses to keep a critical
Figure 4
Managing data across multiple clouds
5
Deployment Platform
User experience services
Orchestration services
Content services
Data curation services
Consumption layer
Conformed layer
Raw ingest layer
Data hub component
Rapid ingestion services
Real time ingestion Batch ingestion
Relational HDFS Parquet Hive Mongo Columner
Insight services
Management
Platform
AWS
Google Cloud
Security
Azure
The right data strategy
will anticipate change to
help a company maintain
its flexibility.
As the three examples illustrate, this new multicloud
reality creates new options and opportunities. But it also
creates new challenges for IT leaders. Indeed, recent
market intelligence suggests that 82 percent of IT leaders
are concerned about how they will connect all of these
clouds with their traditional IT.4
Seventy-three percent say
they need better ways to move apps, workloads, and data
more effectively across these clouds. And 67 percent
worry about how they will manage this new mix of cloud
and environments in a consistent way, across vendors,
without introducing new security
and compliance risks.5
To face these challenges, we have developed three guiding
principles to help companies create and execute a data
strategy for moving to, and operating in, a multicloud
environment.
Principle one: Moving to the cloud is a time
for business-driven transformation
Our data shows that cost reduction continues to be of
strategic importance in investing in multicloud
environments. However, some organizations experience
difficulties and delays in capturing cost savings from their
multicloud investments. Based on our experience with
clients, many face cost increases—in some cases, by as
much as 300 percent. The cause?
Moving to the cloud magnifies existing faults in a legacy IT
landscape. Cloud vendors charge for data transfers and
network usage at a much higher rate than most internal
IT organizations. So siloed data—an inevitable reality for
most legacy systems—generates enormous cost and
performance issues as it moves from cloud to cloud to
be used by various applications.
Therefore, a move to the cloud is the time to examine
current business processes and data management,
evaluate how changes to both can help keep costs in
check, and best exploit the potential of the new multicloud
environment. This is not simply an IT discussion or a
restructuring of the current data. Rather, it should be a
close collaboration between the line of business and IT,
with business needs, value, and a line-of-business
executive leading the way.
A related challenge is the rapidly changing technology
landscape: today’s tools may not work tomorrow. The
transformation opportunity, then, also means that
moving the legacy implementation will likely require
new technologies, instead of continuing to reuse current
ones, such as Hadoop.
Principle two: Anticipate and plan for future
movement, changes and innovation
When migrating to a multicloud environment, companies
should avoid “vendor lock-in.” Cloud providers and
offerings are changing rapidly, as are the technologies
they support and offer. Which offering or service is best
for a company could likewise change quickly.
The right data strategy will anticipate change to help a
company maintain its flexibility. At some point, it’s likely
a company will want to move its data and apps from one
cloud to another, scale its storage and processing without
encumbrance, and store data in the optimal place for data
science and workloads.
To anticipate change, a company’s data strategy needs
three key components: containerization, serverless
capabilities, and pragmatic data design.
6
“Containerization” refers to a company packaging its data
applications into “containers” that are not dependent on a
single cloud implementation. Containerized applications
can run across clouds and operating environments. This
enables the organization to move their applications and
data as their ecosystem evolves, especially as processes
increasingly interact across clouds. The company will also
not be entirely reliant on a specific cloud to run certain
parts of its business.
Another aspect of change is the large volumes of data
generated by new applications and services. It’s
practically impossible to scale the teams responsible for
managing that data and the costs associated with its
storage and processing. A serverless strategy can help.
Instead of reserving a set resource allocation—for
example, 1,000 servers—a company’s cloud vendor
assumes responsibility for scaling resources required as
its data and usage needs rise gradually, spike, or ease.
The company pays for the resources it uses, and focuses
on the business case and the code necessary to support
it, rather than the IT resources required to run it. Together
with containerization, this reduces the risk of new
deployments in terms of complexity, skills, and costs.
This approach helps a company become not just cloud
vendor-agnostic, but also technology-agnostic.
A third component in anticipating change is proper data
structure design. When designing data structures such as
for traditional reporting, data science, digital, and
operational use cases, it’s important to keep the workload
in the right place relative to the data to reduce network
traffic and cost. Data design also should include data
policies for security, compliance, and data lifecycle
management across multiple cloud providers. These data
design factors can be integrated into a set of managed
and orchestrated data workflows that can span multiple
cloud providers.
Yara: Data and a multicloud
digital farming platform
Norway-based Yara, one of the world’s largest
fertilizer producers, wants to help fulfill the vision
of a sustainable world without hunger. To lead the
innovation of its core business models through
digitalization, Yara is building the world’s leading
digital farming platform.
In building the platform, Yara focused on creating
and realizing a cloud-agnostic strategy that enabled
consistent data governance and data security. It also
focused on DataOps—automating data functions
that allow its data scientists to focus on data models
and innovation.
The platform provides holistic digital services and
instant agronomic advice to farmers across the globe,
ultimately avoiding deforestation by increasing food
production on existing farmland. Together, the Yara
digital platform aims to cover 7 percent of all arable
land worldwide.
The cloud-agnostic digital farming data platform
follows a pay-as-you-go commercial model and
provides Yara with two Data Services: Weather Data
and Crop Yield as a Service. These accelerators are the
first of many; an Open Innovation layer will enable Yara
to create new ground-breaking algorithms providing
farmers knowledge and decision-making insights.
7
Principle three: Add DataOps to DevOps
Companies should take a lesson from DevOps and how it
has revolutionized application development over the past
decade or so. Automating aspects of deployment has
accelerated how developers test and prove their work.
Data scientists now need a similar revolution, and they are
finding it in DataOps: the automation of the kinds of work
that slow them down.
Today, data scientists spend an inordinate amount of time
preparing, validating, and cleansing data sources, then
training their data models on them. They spend a
surprisingly small part of their time designing the data
models themselves—the highest value work of a data
scientist. When you automate the data prep and training,
you free the data scientist to bring to the business more
insight and, ultimately, new value.
Data strategy,
culture, and people
Just as book-filled libraries would be useless without
a culture of reading, troves of data and the most
innovative data tools require people with the right
skills to put them to work, and the culture has to
support them. People need data and the insights
gleaned from it, and they need it in the specific
context of their work. The same data can mean
different things to different users. What insights
a scientist finds useful will likely differ from those
needed by a developer, a product manager, a marketer
or a process guru.
Especially when AI is applied to huge data sets,
meaning and context become critical. What are the
right questions to ask, and is this the right data to
provide the answers? If this is the right type of data,
can we trust it? Can we trust both the AI algorithms
operating on the data, as well as the training the AI
is receiving—including the actual training data it is
being fed?
A successful data strategy will include an inventory
of needed skills and training, in addition to a long-term
plan for cultivating a vibrant data culture: one where
people are energized to get the most from data, and
have the ability and organizational support to do so.
8
How to get started
Creating a data strategy integrated with your multicloud
plans and applying the principles outlined above might
seem challenging, but there are immediate steps you can
take to prepare.
1. Build a business transformation strategy that
explicitly factors in and takes advantage of multicloud
capabilities, and manages data to exploit those
capabilities.
2. Investigate technologies that keep data closer to
where it will be used, and make sure it is properly
governed across multicloud environments.
3. Identify any lower-value tasks consuming the majority
of your data scientists’ time and automate them. This
will free data scientists to focus on more valuable and
strategically important work.
Are you ready?
— Which core business processes
have you prioritized and explored
for transformation?
— What steps has your organization
taken to build a data culture adapted
to a multicloud environment?
— How will you navigate the world of
regulations associated with data
resident in different clouds?
9
About Expert Insights
Expert Insights represent the opinions of thought
leaders on newsworthy business and related technology
topics. They are based upon conversations with leading
subject matter experts from around the globe. For more
information, contact the IBM Institute for Business Value
at iibv@us.ibm.com.
Notes and sources
1 IBV survey Assembling your cloud orchestra: A field
guide to multicloud management (2018). Steve Cowley,
Lynn Kesterson-Townes, Arvind Krishna, Sangita Singh.
https://www.ibm.com/downloads/cas/EXLAL23W
2 Ibid.
3 Ibid.
4 IDC Cloud Forecast 2018-2020 – Market sizing + CAGR;
BCG & McKinsey Study conducted for IBM – Multicloud
+ Priority concerns.
5 Ibid.
6 Source: IBM Institute for Business Value hybrid cloud
survey (2016).
7 Internal IBM measurement, IBM Global Business
Services.
8 IBM client experience
© Copyright IBM Corporation 2019
IBM Corporation
New Orchard Road
Armonk, NY 10504
Produced in the United States of America
October 2019
IBM, the IBM logo, ibm.com and Watson are trademarks of
International Business Machines Corp., registered in many
jurisdictions worldwide. Other product and service names might
be trademarks of IBM or other companies. A current list of IBM
trademarks is available on the web at “Copyright and trademark
information” at: ibm.com/legal/copytrade.shtml.
This document is current as of the initial date of publication and
may be changed by IBM at any time. Not all offerings are available
in every country in which IBM operates.
THE INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”
WITHOUT ANY WARRANTY, EXPRESS OR IMPLIED, INCLUDING
WITHOUT ANY WARRANTIES OF MERCHANTABILITY, FITNESS
FOR A PARTICULAR PURPOSE AND ANY WARRANTY OR
CONDITION OF NON-INFRINGEMENT. IBM products are war-
ranted according to the terms and conditions of the agreements
under which they are provided.
This report is intended for general guidance only. It is not
intended to be a substitute for detailed research or the exercise
of professional judgment. IBM shall not be responsible for any
loss whatsoever sustained by any organization or person who
relies on this publication.
The data used in this report may be derived from third-party
sources and IBM does not independently verify, validate or audit
such data. The results from the use of such data are provided on
an “as is” basis and IBM makes no representations or warranties,
express or implied.
39028739USEN-00

Mais conteúdo relacionado

Mais procurados

Cloudonomics: The Economics of Cloud Computing
Cloudonomics: The Economics of Cloud ComputingCloudonomics: The Economics of Cloud Computing
Cloudonomics: The Economics of Cloud ComputingRackspace
 
Survivors guide to the cloud whitepaper
Survivors guide to the cloud whitepaperSurvivors guide to the cloud whitepaper
Survivors guide to the cloud whitepaperOnomi
 
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810Netmagic Solutions Pvt. Ltd.
 
Hybrid IT – A Winning Strategy
Hybrid IT – A Winning StrategyHybrid IT – A Winning Strategy
Hybrid IT – A Winning StrategyOneNeck
 
TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015
TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015
TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015sllongo3
 
Why adopt more than one cloud service?
 Why adopt more than one cloud service? Why adopt more than one cloud service?
Why adopt more than one cloud service?Abhishek Sood
 
Everything as a Service
Everything as a ServiceEverything as a Service
Everything as a ServiceCognizant
 
Ethopian Database Management system as a Cloud Service: Limitations and advan...
Ethopian Database Management system as a Cloud Service: Limitations and advan...Ethopian Database Management system as a Cloud Service: Limitations and advan...
Ethopian Database Management system as a Cloud Service: Limitations and advan...IOSR Journals
 
Developing a Business Case for Cloud
Developing a Business Case for CloudDeveloping a Business Case for Cloud
Developing a Business Case for CloudBooz Allen Hamilton
 
Preparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights sharedPreparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights sharedThe Economist Media Businesses
 
Factors that Can Contribute in Enhancing Hybrid Cloud Benefits
Factors that Can Contribute in Enhancing Hybrid Cloud BenefitsFactors that Can Contribute in Enhancing Hybrid Cloud Benefits
Factors that Can Contribute in Enhancing Hybrid Cloud BenefitsWeb Werks Data Centers
 
Cloud Computing Trends: at the Horizon\'s Watch
Cloud Computing Trends: at the Horizon\'s WatchCloud Computing Trends: at the Horizon\'s Watch
Cloud Computing Trends: at the Horizon\'s WatchJocelynDG
 
Analytics as a Service in SL
Analytics as a Service in SLAnalytics as a Service in SL
Analytics as a Service in SLSkylabReddy Vanga
 
Influence of Hadoop in Big Data Analysis and Its Aspects
Influence of Hadoop in Big Data Analysis and Its Aspects Influence of Hadoop in Big Data Analysis and Its Aspects
Influence of Hadoop in Big Data Analysis and Its Aspects IJMER
 
Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569Kun Le
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Kun Le
 
Cloud Computing Presentation
Cloud Computing PresentationCloud Computing Presentation
Cloud Computing Presentationmhalcrow
 
"How CenturyLink is Setting the standard for the Next Generation of Cloud Ser...
"How CenturyLink is Setting the standard for the Next Generation of Cloud Ser..."How CenturyLink is Setting the standard for the Next Generation of Cloud Ser...
"How CenturyLink is Setting the standard for the Next Generation of Cloud Ser...Lillian Hiscox
 

Mais procurados (20)

Cloudonomics: The Economics of Cloud Computing
Cloudonomics: The Economics of Cloud ComputingCloudonomics: The Economics of Cloud Computing
Cloudonomics: The Economics of Cloud Computing
 
Survivors Guide To The Cloud
Survivors Guide To The CloudSurvivors Guide To The Cloud
Survivors Guide To The Cloud
 
Survivors guide to the cloud whitepaper
Survivors guide to the cloud whitepaperSurvivors guide to the cloud whitepaper
Survivors guide to the cloud whitepaper
 
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
 
Host your Cloud – Netmagic Solutions
Host your Cloud – Netmagic SolutionsHost your Cloud – Netmagic Solutions
Host your Cloud – Netmagic Solutions
 
Hybrid IT – A Winning Strategy
Hybrid IT – A Winning StrategyHybrid IT – A Winning Strategy
Hybrid IT – A Winning Strategy
 
TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015
TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015
TierPoint White Paper_When_Virtualization_Meets_Infrastructure_2015
 
Why adopt more than one cloud service?
 Why adopt more than one cloud service? Why adopt more than one cloud service?
Why adopt more than one cloud service?
 
Everything as a Service
Everything as a ServiceEverything as a Service
Everything as a Service
 
Ethopian Database Management system as a Cloud Service: Limitations and advan...
Ethopian Database Management system as a Cloud Service: Limitations and advan...Ethopian Database Management system as a Cloud Service: Limitations and advan...
Ethopian Database Management system as a Cloud Service: Limitations and advan...
 
Developing a Business Case for Cloud
Developing a Business Case for CloudDeveloping a Business Case for Cloud
Developing a Business Case for Cloud
 
Preparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights sharedPreparing for next-generation cloud: Lessons learned and insights shared
Preparing for next-generation cloud: Lessons learned and insights shared
 
Factors that Can Contribute in Enhancing Hybrid Cloud Benefits
Factors that Can Contribute in Enhancing Hybrid Cloud BenefitsFactors that Can Contribute in Enhancing Hybrid Cloud Benefits
Factors that Can Contribute in Enhancing Hybrid Cloud Benefits
 
Cloud Computing Trends: at the Horizon\'s Watch
Cloud Computing Trends: at the Horizon\'s WatchCloud Computing Trends: at the Horizon\'s Watch
Cloud Computing Trends: at the Horizon\'s Watch
 
Analytics as a Service in SL
Analytics as a Service in SLAnalytics as a Service in SL
Analytics as a Service in SL
 
Influence of Hadoop in Big Data Analysis and Its Aspects
Influence of Hadoop in Big Data Analysis and Its Aspects Influence of Hadoop in Big Data Analysis and Its Aspects
Influence of Hadoop in Big Data Analysis and Its Aspects
 
Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569Architecting a-big-data-platform-for-analytics 24606569
Architecting a-big-data-platform-for-analytics 24606569
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
 
Cloud Computing Presentation
Cloud Computing PresentationCloud Computing Presentation
Cloud Computing Presentation
 
"How CenturyLink is Setting the standard for the Next Generation of Cloud Ser...
"How CenturyLink is Setting the standard for the Next Generation of Cloud Ser..."How CenturyLink is Setting the standard for the Next Generation of Cloud Ser...
"How CenturyLink is Setting the standard for the Next Generation of Cloud Ser...
 

Semelhante a A blueprint for data in a multicloud world

BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONijmnct
 
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdfData Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdfShailendra Mruthyunjayappa
 
Hybrid Cloud - Key Benefits & Must Have Requirements
Hybrid Cloud - Key Benefits & Must Have RequirementsHybrid Cloud - Key Benefits & Must Have Requirements
Hybrid Cloud - Key Benefits & Must Have RequirementsJamcracker Inc
 
Big Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docxBig Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docxVENKATAAVINASH10
 
Top 10 Cloud Trends for 2017
Top 10 Cloud Trends for 2017Top 10 Cloud Trends for 2017
Top 10 Cloud Trends for 2017Tableau Software
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunitiesBigdata Meetup Kochi
 
Public cloud 101,One individual does not hold all the keys to the kingdom. Co...
Public cloud 101,One individual does not hold all the keys to the kingdom. Co...Public cloud 101,One individual does not hold all the keys to the kingdom. Co...
Public cloud 101,One individual does not hold all the keys to the kingdom. Co...Samuel K. Itotia
 
Going to the SP2013 Cloud - what does a business need to make it successful?
Going to the SP2013 Cloud - what does a business need to make it successful?Going to the SP2013 Cloud - what does a business need to make it successful?
Going to the SP2013 Cloud - what does a business need to make it successful?Matt Groves
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017Jeremy Maranitch
 
Cloud computing-overview
Cloud computing-overviewCloud computing-overview
Cloud computing-overviewshraddhaudage
 
Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...LindaWatson19
 
Hybrid & Multi-cloud Environment.pdf
Hybrid & Multi-cloud Environment.pdfHybrid & Multi-cloud Environment.pdf
Hybrid & Multi-cloud Environment.pdfmanoharparakh
 
Tailoring hybrid cloud exec report
Tailoring hybrid cloud exec reportTailoring hybrid cloud exec report
Tailoring hybrid cloud exec reportJose Pena
 
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...Vasu S
 

Semelhante a A blueprint for data in a multicloud world (20)

Data Analytics.pptx
Data Analytics.pptxData Analytics.pptx
Data Analytics.pptx
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
 
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdfData Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdf
 
Hybrid Cloud - Key Benefits & Must Have Requirements
Hybrid Cloud - Key Benefits & Must Have RequirementsHybrid Cloud - Key Benefits & Must Have Requirements
Hybrid Cloud - Key Benefits & Must Have Requirements
 
Big Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docxBig Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docx
 
Riding the Cloud
Riding the Cloud Riding the Cloud
Riding the Cloud
 
Cloud Analytics Playbook
Cloud Analytics PlaybookCloud Analytics Playbook
Cloud Analytics Playbook
 
4aa5-6541enw
4aa5-6541enw4aa5-6541enw
4aa5-6541enw
 
Top 10 Cloud Trends for 2017
Top 10 Cloud Trends for 2017Top 10 Cloud Trends for 2017
Top 10 Cloud Trends for 2017
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Public cloud 101,One individual does not hold all the keys to the kingdom. Co...
Public cloud 101,One individual does not hold all the keys to the kingdom. Co...Public cloud 101,One individual does not hold all the keys to the kingdom. Co...
Public cloud 101,One individual does not hold all the keys to the kingdom. Co...
 
Going to the SP2013 Cloud - what does a business need to make it successful?
Going to the SP2013 Cloud - what does a business need to make it successful?Going to the SP2013 Cloud - what does a business need to make it successful?
Going to the SP2013 Cloud - what does a business need to make it successful?
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017
 
Cloud computing-overview
Cloud computing-overviewCloud computing-overview
Cloud computing-overview
 
Cloud Computing Overview | Torry Harris Whitepaper
Cloud Computing Overview | Torry Harris WhitepaperCloud Computing Overview | Torry Harris Whitepaper
Cloud Computing Overview | Torry Harris Whitepaper
 
Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...
 
FINAL PRINTED VER - 29102014
FINAL PRINTED VER - 29102014FINAL PRINTED VER - 29102014
FINAL PRINTED VER - 29102014
 
Hybrid & Multi-cloud Environment.pdf
Hybrid & Multi-cloud Environment.pdfHybrid & Multi-cloud Environment.pdf
Hybrid & Multi-cloud Environment.pdf
 
Tailoring hybrid cloud exec report
Tailoring hybrid cloud exec reportTailoring hybrid cloud exec report
Tailoring hybrid cloud exec report
 
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
 

Último

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 

Último (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

A blueprint for data in a multicloud world

  • 1. Expert Insights A blueprint for data in a multicloud world
  • 2. Experts on this topic Tony Giordano Tony has more than 25 years of global professional services experience in performance marketing, big Senior Partner and VP, data and analytics, information governance, customer CBDS Data Platform Services relationship management, and program management, Global Leader, IBM Services with industry experience in financial services, life linkedin.com/in/ sciences, retail, and automotive sectors. He has tony-anthony-giordano-3111091 authored two books on Information Management. tonygior@us.ibm.com Mehdi Charafeddine Mehdi is a technical leader with a track record for delivering large complex projects that require a new Associate Partner, way of looking at problems and solving them. As a Executive Solutions Architect, strong Open Source advocate, Mehdi experiments with IBM Services new technologies with a value potential to clients. This linkedin.com/in/mehdichara requires a unique combination of a deep understanding mchara@ibm.com of software versus the risk/return ratio for IBM clients. Dan Sutherland Dan is an expert at transforming a client’s strategic business vision into innovative solutions while Distinguished Engineer and CTO, exploiting leading edge technologies. Dan is Data Platforms, IBM Services comfortable resolving complex data architecture and linkedin.com/in/ design issues and regularly advises on very large, high dan-sutherland-4661a67 volume data-centric projects where sound data dsutherl@us.ibm.com architecture and design techniques are key components to a successful solution implementation. Becky Carroll Becky has over 25 years of experience helping business leaders in large enterprises transform Associate Partner, traditional business processes to new business Cognitive and Analytics models. She has led projects covering big data and Cognitive Business Decision analytics, cognitive and AI, digital and social business Support, IBM Services strategy and analytics, CRM, and customer service. linkedin.com/in/beckycarroll Becky’s work spans many industries, including High rlcarrol@us.ibm.com Tech, Telecommunications, Entertainment & Media, Financial Services, Airline, Automotive, Industrial Products, Life Sciences, Gaming, Education. Becky holds a BS in Electrical Engineering and Computer Science and a MBA in Marketing Management.
  • 3. A well-planned and executed data strategy helps avoid unwelcome surprises in a multicloud environment. Talking points A data strategy and architecture addressing applications and data spread over multiple clouds are essential for companies operating in today’s multicloud environment. A business-driven data strategy should be fully integrated with a company’s evolving multicloud architecture to keep data from becoming siloed and, therefore, not easily accessible to applications. A strong data strategy will focus on business transformation opportunities, avoid “lock-in” to any particular cloud, and use DataOps, a method to automate preparatory work of data scientists, to derive the most insight possible from available data. Moving data in a multicloud world As companies move beyond initial forays into the cloud, many are experiencing the advantages—and the challenges—of a multicloud environment. Benefits of multicloud are manifest: increased ability to innovate, enhanced products and services, and agile business processes.1 In fact, according to a recent Institute for Business Value (IBV) study, 85 percent of companies globally are already operating in a multicloud environment.2 By 2021, 96 percent plan to be using multiple clouds.3 Companies are remaking themselves as “digital” and making artificial intelligence (AI) central to their core business processes. As digitization continues, the movement to multiple clouds will increase, because it often involves obtaining cloud-based services from different cloud vendors. But the migration is unlikely to be as simple as merely shifting a current architecture to a new location in the cloud. Each cloud vendor has its own architectural methods. How each organizes data could be incompatible with other clouds. If addressing such incompatibility is not accounted for in a well-designed strategy, a company may experience poor performance and higher-than- expected cost. At the root of the challenge is the nature of data and its rapid growth. When companies expand the number of services and applications they use, their data grows exponentially. But it isn’t the amount of data that is the real issue—it’s where that data resides. And where it resides is often based on where it is collected and created. 1
  • 4. A data fabric helps make data visible—and usable— across an entire enterprise. Such dispersed data created by distinct business units or functions is known as a “data silo.” Silos drastically diminish the data’s usability. The data may be redundant, but organized differently, and therefore difficult to correlate. There may be discrepancies in the data because of the way it was collected. And the data may be unknown or otherwise inaccessible to applications in other parts of the company. Since almost all companies will soon be functioning in the multicloud reality, negative effects of data siloed within a company will only be heightened as companies spread processes, applications, and pockets of data across multiple clouds (see Figure 1). Figure 1 Ninety-eight percent of surveyed organizations say they plan to use multiple hybrid clouds within three years 0-1 Clouds 24% 2% 2-9 Clouds 68% 70% 10+ Clouds 9% 28% Now In 3 years Source: IBV 2018 Multicloud management survey, Q2 & Q3. Q2 How many cloud services and platforms are you currently using across your entire business? Q3 How many cloud services and platforms are you planning to use across your entire business in the next three years? These new circumstances lead to an important question. How can companies successfully navigate the complexity, cost and latency issues operating in a multicloud environment generates? The answer? By properly defining a “data fabric” or “blueprint for data” in a multicloud world. A data fabric to navigate multiple clouds A data fabric is a conceptual representation or architecture of how data assets will be organized. This blueprint is a formal structure to define and view data across an enterprise, and is independent of particular infrastructure or cloud requirements. A data fabric helps make data visible—and usable— across an entire enterprise. It guides how the data will be maintained and governed. It’s founded on the concept that while business processes may change, the data underpinning them should be stable. Data is grouped into collections called “hubs” or “lakes” for visibility and access. Data fabrics can help avoid or mitigate data silos, low reliability and scalability, reliance on legacy systems, and cost inefficiencies. But to work in a multicloud environment, a data fabric needs to accommodate different patterns of use. Consider three examples of how a company might choose to distribute its workloads, applications, and data across different clouds. 2
  • 5. Example 1 In the first instance, a company wants to run an analytics reside on one cloud, with only a specific analytics service package offered on IBM Cloudpak, using data it already calling on that data from a different cloud (see Figure 2). has on AWS. Here, its data ingestion and curation services Figure 2 Managing data across a multicloud by architectural component 3 Content services Data curation services Data hub component Consumption layer Conformed layer Raw ingest layer Relational HDFS Parquet Hive Mongo Columner Rapid ingestion services Real time ingestion Batch ingestion Deployment Platform Management User experience services Orchestration services Insight services Security Platform AWS IBM Cloudpak
  • 6. To work in a multicloud environment, a data fabric must accommodate different patterns of use. Example 2 In a second example, the company wants to take advantage of attractive pricing on applications offered on Azure, but keep other existing processes on AWS. Its ingestion of raw data and its ability to conform and consume data, then, are distributed over different clouds. It also subscribes to an analytics service on IBM Cloudpak (see Figure 3). Figure 3 Managing data across a multicloud by data hub layer Data curation services Data hub component Consumption layer Conformed layer Raw ingest layer Relational HDFS Parquet Hive Mongo Columner Rapid ingestion services Real time ingestion Batch ingestion Deployment Platform Management Platform User experience services Orchestration services Content services Insight services IBM Cloudpak AWS Azure Security 4
  • 7. Example 3 In a third example, the company needs to have its various cross-business function such as security on Azure business divisions functioning on different clouds because operating across all its other clouds (see Figure 4). of their unique needs, but chooses to keep a critical Figure 4 Managing data across multiple clouds 5 Deployment Platform User experience services Orchestration services Content services Data curation services Consumption layer Conformed layer Raw ingest layer Data hub component Rapid ingestion services Real time ingestion Batch ingestion Relational HDFS Parquet Hive Mongo Columner Insight services Management Platform AWS Google Cloud Security Azure
  • 8. The right data strategy will anticipate change to help a company maintain its flexibility. As the three examples illustrate, this new multicloud reality creates new options and opportunities. But it also creates new challenges for IT leaders. Indeed, recent market intelligence suggests that 82 percent of IT leaders are concerned about how they will connect all of these clouds with their traditional IT.4 Seventy-three percent say they need better ways to move apps, workloads, and data more effectively across these clouds. And 67 percent worry about how they will manage this new mix of cloud and environments in a consistent way, across vendors, without introducing new security and compliance risks.5 To face these challenges, we have developed three guiding principles to help companies create and execute a data strategy for moving to, and operating in, a multicloud environment. Principle one: Moving to the cloud is a time for business-driven transformation Our data shows that cost reduction continues to be of strategic importance in investing in multicloud environments. However, some organizations experience difficulties and delays in capturing cost savings from their multicloud investments. Based on our experience with clients, many face cost increases—in some cases, by as much as 300 percent. The cause? Moving to the cloud magnifies existing faults in a legacy IT landscape. Cloud vendors charge for data transfers and network usage at a much higher rate than most internal IT organizations. So siloed data—an inevitable reality for most legacy systems—generates enormous cost and performance issues as it moves from cloud to cloud to be used by various applications. Therefore, a move to the cloud is the time to examine current business processes and data management, evaluate how changes to both can help keep costs in check, and best exploit the potential of the new multicloud environment. This is not simply an IT discussion or a restructuring of the current data. Rather, it should be a close collaboration between the line of business and IT, with business needs, value, and a line-of-business executive leading the way. A related challenge is the rapidly changing technology landscape: today’s tools may not work tomorrow. The transformation opportunity, then, also means that moving the legacy implementation will likely require new technologies, instead of continuing to reuse current ones, such as Hadoop. Principle two: Anticipate and plan for future movement, changes and innovation When migrating to a multicloud environment, companies should avoid “vendor lock-in.” Cloud providers and offerings are changing rapidly, as are the technologies they support and offer. Which offering or service is best for a company could likewise change quickly. The right data strategy will anticipate change to help a company maintain its flexibility. At some point, it’s likely a company will want to move its data and apps from one cloud to another, scale its storage and processing without encumbrance, and store data in the optimal place for data science and workloads. To anticipate change, a company’s data strategy needs three key components: containerization, serverless capabilities, and pragmatic data design. 6
  • 9. “Containerization” refers to a company packaging its data applications into “containers” that are not dependent on a single cloud implementation. Containerized applications can run across clouds and operating environments. This enables the organization to move their applications and data as their ecosystem evolves, especially as processes increasingly interact across clouds. The company will also not be entirely reliant on a specific cloud to run certain parts of its business. Another aspect of change is the large volumes of data generated by new applications and services. It’s practically impossible to scale the teams responsible for managing that data and the costs associated with its storage and processing. A serverless strategy can help. Instead of reserving a set resource allocation—for example, 1,000 servers—a company’s cloud vendor assumes responsibility for scaling resources required as its data and usage needs rise gradually, spike, or ease. The company pays for the resources it uses, and focuses on the business case and the code necessary to support it, rather than the IT resources required to run it. Together with containerization, this reduces the risk of new deployments in terms of complexity, skills, and costs. This approach helps a company become not just cloud vendor-agnostic, but also technology-agnostic. A third component in anticipating change is proper data structure design. When designing data structures such as for traditional reporting, data science, digital, and operational use cases, it’s important to keep the workload in the right place relative to the data to reduce network traffic and cost. Data design also should include data policies for security, compliance, and data lifecycle management across multiple cloud providers. These data design factors can be integrated into a set of managed and orchestrated data workflows that can span multiple cloud providers. Yara: Data and a multicloud digital farming platform Norway-based Yara, one of the world’s largest fertilizer producers, wants to help fulfill the vision of a sustainable world without hunger. To lead the innovation of its core business models through digitalization, Yara is building the world’s leading digital farming platform. In building the platform, Yara focused on creating and realizing a cloud-agnostic strategy that enabled consistent data governance and data security. It also focused on DataOps—automating data functions that allow its data scientists to focus on data models and innovation. The platform provides holistic digital services and instant agronomic advice to farmers across the globe, ultimately avoiding deforestation by increasing food production on existing farmland. Together, the Yara digital platform aims to cover 7 percent of all arable land worldwide. The cloud-agnostic digital farming data platform follows a pay-as-you-go commercial model and provides Yara with two Data Services: Weather Data and Crop Yield as a Service. These accelerators are the first of many; an Open Innovation layer will enable Yara to create new ground-breaking algorithms providing farmers knowledge and decision-making insights. 7
  • 10. Principle three: Add DataOps to DevOps Companies should take a lesson from DevOps and how it has revolutionized application development over the past decade or so. Automating aspects of deployment has accelerated how developers test and prove their work. Data scientists now need a similar revolution, and they are finding it in DataOps: the automation of the kinds of work that slow them down. Today, data scientists spend an inordinate amount of time preparing, validating, and cleansing data sources, then training their data models on them. They spend a surprisingly small part of their time designing the data models themselves—the highest value work of a data scientist. When you automate the data prep and training, you free the data scientist to bring to the business more insight and, ultimately, new value. Data strategy, culture, and people Just as book-filled libraries would be useless without a culture of reading, troves of data and the most innovative data tools require people with the right skills to put them to work, and the culture has to support them. People need data and the insights gleaned from it, and they need it in the specific context of their work. The same data can mean different things to different users. What insights a scientist finds useful will likely differ from those needed by a developer, a product manager, a marketer or a process guru. Especially when AI is applied to huge data sets, meaning and context become critical. What are the right questions to ask, and is this the right data to provide the answers? If this is the right type of data, can we trust it? Can we trust both the AI algorithms operating on the data, as well as the training the AI is receiving—including the actual training data it is being fed? A successful data strategy will include an inventory of needed skills and training, in addition to a long-term plan for cultivating a vibrant data culture: one where people are energized to get the most from data, and have the ability and organizational support to do so. 8
  • 11. How to get started Creating a data strategy integrated with your multicloud plans and applying the principles outlined above might seem challenging, but there are immediate steps you can take to prepare. 1. Build a business transformation strategy that explicitly factors in and takes advantage of multicloud capabilities, and manages data to exploit those capabilities. 2. Investigate technologies that keep data closer to where it will be used, and make sure it is properly governed across multicloud environments. 3. Identify any lower-value tasks consuming the majority of your data scientists’ time and automate them. This will free data scientists to focus on more valuable and strategically important work. Are you ready? — Which core business processes have you prioritized and explored for transformation? — What steps has your organization taken to build a data culture adapted to a multicloud environment? — How will you navigate the world of regulations associated with data resident in different clouds? 9
  • 12. About Expert Insights Expert Insights represent the opinions of thought leaders on newsworthy business and related technology topics. They are based upon conversations with leading subject matter experts from around the globe. For more information, contact the IBM Institute for Business Value at iibv@us.ibm.com. Notes and sources 1 IBV survey Assembling your cloud orchestra: A field guide to multicloud management (2018). Steve Cowley, Lynn Kesterson-Townes, Arvind Krishna, Sangita Singh. https://www.ibm.com/downloads/cas/EXLAL23W 2 Ibid. 3 Ibid. 4 IDC Cloud Forecast 2018-2020 – Market sizing + CAGR; BCG & McKinsey Study conducted for IBM – Multicloud + Priority concerns. 5 Ibid. 6 Source: IBM Institute for Business Value hybrid cloud survey (2016). 7 Internal IBM measurement, IBM Global Business Services. 8 IBM client experience © Copyright IBM Corporation 2019 IBM Corporation New Orchard Road Armonk, NY 10504 Produced in the United States of America October 2019 IBM, the IBM logo, ibm.com and Watson are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the web at “Copyright and trademark information” at: ibm.com/legal/copytrade.shtml. This document is current as of the initial date of publication and may be changed by IBM at any time. Not all offerings are available in every country in which IBM operates. THE INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS” WITHOUT ANY WARRANTY, EXPRESS OR IMPLIED, INCLUDING WITHOUT ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND ANY WARRANTY OR CONDITION OF NON-INFRINGEMENT. IBM products are war- ranted according to the terms and conditions of the agreements under which they are provided. This report is intended for general guidance only. It is not intended to be a substitute for detailed research or the exercise of professional judgment. IBM shall not be responsible for any loss whatsoever sustained by any organization or person who relies on this publication. The data used in this report may be derived from third-party sources and IBM does not independently verify, validate or audit such data. The results from the use of such data are provided on an “as is” basis and IBM makes no representations or warranties, express or implied. 39028739USEN-00