IBM Cloud Pak for Data is a unified platform that simplifies data collection, organization, and analysis through an integrated cloud-native architecture. It allows enterprises to turn data into insights by unifying various data sources and providing a catalog of microservices for additional functionality. The platform addresses challenges organizations face in leveraging data due to legacy systems, regulatory constraints, and time spent preparing data. It provides a single interface for data teams to collaborate and access over 45 integrated services to more efficiently gain insights from data.
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
IBM Cloud pak for data brochure
1. Forrester Wave Leader
Enterprise Insight Platform Q1 2019
Cloud Pak for Data
Let’s simplify your information architecture
and put your data to work
2. Don’t be burdened by
your data.
Rely on your data.
IBM Cloud Pak for Data is a single unified platform which helps to unify
and simplify the collection, organization and analysis of data. Enterprises
can turn data into insights through an integrated cloud-native
architecture. IBM Cloud Pak for Data is extensible, easily customized to
unique client data and AI landscapes through an integrated catalog of
IBM, open source and third-party microservices add-ons.
2Cloud Pak for Data
3. It’s becoming more challenging and complex
to establish data driven practices – and data
driven practices are necessary.
The Challenge
Every company must be data driven in
today’s ecosystem.
In the data explosion era the globe is now
creating 2.5 quintillion bytes of data every
day (link).
When it comes to leveraging data though, on
average roughly 70% of data produced
within an enterprise goes completely
unused (link).
Leading 79.4% of executives fearful of
disruption by data-driven startups or
companies and only 7.3% confident in their
future data strategy (link).
It makes sense why most
enterprises are challenged to
be truly data driven.
1. A majority of enterprises have some presence
of now legacy systems making it difficult to
connect and utilize all data sources.
2. Industries are dealing with stronger regulatory
constraints to protect data.
3. Digital transformation has forced enterprises
to strategically think differently than they have in
the past.
4. Highly paid data teams are spending 50-80%
of their time (multiple sources) simply finding,
preparing, and governing data sets before any
business insights work can begin. Growth in team
working silos and work complexity also contribute
to inefficiency.
5. A primary reason why 97.2% of executives say
they’re building or launching Big Data and AI
initiatives is that within the past decade nearly
three-quarters of Fortune 1,000 companies have
been replaced (link). Many are being replaced by
companies like Facebook and Amazon who have
reached Expert levels when it comes to operating
as a data driven company; rather than using data
and AI to cut costs or grow their business they are
reshaping entire industries.
The Factors
We aren’t leveraging the
data we create.
there is
no AI
without IA
Let’s start by
understanding the
situation
3Cloud Pak for Data
4. A cohesive modern data strategy is necessary to
achieve modern AI and analytics results.
Our approach is based on three key take-aways: data fuels digital transformation, AI
unlocks the value of data, and hybrid cloud democratizes data. These guide our over-
arching strategy for achieving real business value from data and AI.
The Journey to AI
Infuse
Operationalize AI with trust and transparency
Analyze
Scale insights with AI everywhere
Organize
Create a trusted analytics foundation
Collect
Make data simple and accessible
The Future of AI is Flexible
It all starts with a hybrid multi-cloud approach.
Keep your head
above the clouds
4Cloud Pak for Data
Our Approach
5. “Simplify your information architecture
and put your data to work”
Find, connect to, govern, and leverage your data
across multiple sources without needing to move
or replicate.
Automate many mundane and repeatable tasks
like cleaning, matching, and metadata creation to
reduce data prep time by 80%.
Leverage deployment flexibility amongst any
Cloud, Hybrid Cloud or Private Cloud environment
and provider with Red Hat OpenShift.
Eliminate working silos with a single unified
experience allowing all data users to collaborate
and connect to multiple analytics applications and
models.
Centralize your teams’ workflow and operations
management with an ecosystem of 45+ integrated
services.
Enable your highly skilled and paid data teams to
spend more of their time on business value
generating innovations in big data and analytics.
What we mean by
How to simplify your
information architecture “IA”
How to get data
working for you
5Cloud Pak for Data
6. INFRASTRUCTURE
LAYER
KUBERNETES
LAYER
PLATFORM
INTERFACE
LAYER
SERVICES
LAYER
On
Premise
Avoid lock-in and leverage all
cloud infrastructure with our
Any Cloud mentality.
Leverage the leading hybrid
cloud, enterprise container
platform for an innovative
and fast deployment strategy.
5. ANY CLOUD
4.
At a click, access and deploy
an ecosystem of 45+
analytics services and
templates from IBM and third
parties. More on page 8
1. SERVICES ECOSYSTEM
Query across multiple data
sources fast and easy without
moving your data.
More on page 7
2. DATA VIRTUALIZATION
Complete yet simple.
Speed time to value with a
single platform that
integrates data management,
data governance and
analysis for greater efficiency
and improved use of
resources.
3. PLATFORM INTERFACE
High level view
of the…
Check out details at
page numbers
referenced above
Explore how we’ve constructed the Forrester Wave’s Leading
enterprise insights platform.
6Cloud Pak for Data
The Platform
7. Integrate your data and teams without needing to
overhaul existing infrastructure.
READ THE REPORT
Data warehouses
and data marts
Relational
databases No SQL
Spreadsheets and
text files
Big data; Hadoop
Ecosystem
Data Virtualization
and Caching Layer
A unified data asset catalog,
lineage and provenance
Access control and security
policies
Data silos are very good at holding potential insights from data tightly within their barriers. Leaving the tedious task of
searching through, moving, and governing those data resources to highly paid and skilled data teams. Often that work
takes 80% of the time dedicated to a single initiative.
Data virtualization connects those data silos to make them appear as if they were a single data set on your desktop. It
also leverages servers where data does sit by performing analytics queries and then simply returning the results to the
original application.
No data is copied. It exists only at the source.
7Cloud Pak for Data
Removing Data Silos
8. Collect
Premium Add-ons,
Accelerators or Existing
License Trade-ups
Cloud Pak for Data
Base Capabilities
à la carte
Let the integrated end-to-end analytics services grow
and scale with you on your journey to AI. Deployment
is easy and premium add-ons have flexible licensing
models. Explore the capabilities that are ready at your
fingertips.
Additional licensing
Data Virtualization
•Query Anything, Anywhere (virtualized data across
multiple sources) •Auto-discovery of data source &
metadata with built-in governance •Distributed Parallel
Processing
Db2 Warehouse
•In-memory optimized columnar engine •SQL, Spatial,
XML, and JSON support •Scales to peta-bytes, portable
and compatible with multiple DBs
Db2 AESE
Db2 Advanced Enterprise Server Edition is suitable for
transactional, warehouse, and mixed workloads
PostgreSQL
Open source object-relational database designed for
developers
Streams
Develop and run applications that process in-flight data
with the IBM Streams add-on. IBM Streams enables
continuous and fast analysis of massive volumes of
moving data to help improve the speed of business
insight and decision making.
Db2 Event Store
Memory-optimized database designed to rapidly ingest
and analyze streamed data for event-driven applications
MongoDB
A cross-platform document-oriented database program.
Analyze & Infuse on
next page
Organize
Data Discovery
Includes services from Information Analyzer
•Default Quality Rules •Quality Score •Ability to Sample
Data •Create Connections •Assign Terms, Rules •Auto
Term Assignment •Review, Approval Process
Data Integration
Includes services from DFD & Datastage
•Create, Update, Delete Jobs •Create, Update, Delete
Connections •Compile Jobs •Job Logs
Data Catalog
Includes services from Information Governance Catalog
& Watson Knowledge Catalog
•Import Business Terms (UI) •Create Policies & Rules
•Import Policies & Rules (UI) •Asset Explorer •Search
Assets •Graph Explorer •Comments, Ratings
Infosphere Information Regulatory
Accelerator
Designed to reduce costs and complexity by:
Extracting selected key terms, available
definitions, policy and controls from the
regulatory taxonomy using Machine Learning,
thereby reducing the manual effort involved
in this process.
Infosphere DataStage Edition
An ETL tool and part of the IBM Information
Platforms Solutions suite and IBM InfoSphere
Watson Knowledge Catalog Pro
A data catalog that is tightly integrated with an enterprise
data governance platform.
8Cloud Pak for Data
Integrated Services Menu
9. Analyze
Infuse
Watson Studio
•Environments (Jupyter, RStudio, Zeppelin, etc.)
•Scripting & Job Automation •Machine Learning
Frameworks & Spark •Image & Packet Management
•Model Management & Deployment •Git Version Control
Cognos Dashboard
Integrates reporting, modeling, analysis, exploration,
dashboards, stories, and event management so you can
understand your organization's data, and make effective
business decisions.
Watson API Kit
•Watson Knowledge Studio •Natural Language
Understanding •Speech to Text •Text to Speech
Watson Assistant
Building conversational interfaces into any
application, device, or channel
Watson OpenScale
Open platform to operate and automate AI across its
lifecycle
Cognos Analytics
Business intelligence and analytics solution that
makes it easy to visualize, analyze and share insights
about your business
Watson Discovery
•Watson Knowledge Studio •Watson Explorer
•Watson Discovery
Watson Studio Premium
•SPSS Modeler, Data Refinery •Decision
Optimization •Model Builder (AutoML)
•Hadoop Services for SPSS / Notebook,
SPSS SQL Pushback •WML Advanced
Training (batch training, HPO, Distributed
Deep Learning) •Continuous Learning
Services
Ask your IBM representative about ways to get
started with Cloud Pak for Data today.
Enterprise
Edition
Supported on any
cloud provider.
Cloud Native
Edition
Cloud Pak for Data
System
Supported on any
cloud provider.
Software + Hardware;
optimized and tested
hyper-converged
system.
9Cloud Pak for Data