Making Self-Service BI a Reality in the Enterprise

Alex Gutow | Dir. Product Marketing, Cloudera
Romain Rigaux | Engineering Lead, Cloudera

© Cloudera, Inc. All rights reserved.
TRENDS DRIVING ANALYTIC MODERNIZATION
• Self-Service Flexibility
• Real-time Analysis
• Hybrid Cloud
• Converged Workloads

MOTIVATIONS FOR SELF-SERVICE
Easing the relationship between IT and business
BUSINESS USERS
• Faster access and iteration
• Support for canned reports and exploration
• Quickly onboard new users and use cases
• Use existing skills and tools
INFRASTRUCTURE TEAM
• Empower business and remove bottlenecks
• Ensure priority SLAs are met
• Contain resources/costs based on system limitations
• Meet enterprise requirements

HOW ARE END-USERS ACCESSING AND ANALYZING DATA TODAY?
Select all that apply
• Pre-canned reports and dashboards
• Self-service analytics through BI tools
• Direct SQL queries
• Python/R
• Other

TRADITIONAL BI
Trustworthy but too rigid for exploration
[Diagram: Traditional Approach. Axes: Centralized to Distributed; Context-free to Context-rich. Flow: Acquire, Curate, Use]
Strong for traditional BI and reporting
• Highly curated data
• Strict data preparation
• Restricted access
• Highly governed
Constrained self-service agility
• Limited to structured data
• Limited compute types/SQL-only
• Long latency from ingest to use, reducing data freshness

EXPLORATORY BI
Flexible but lacks trust
[Diagram: Pure Exploration. Axis: Context-free to Context-rich. Flow: Acquire, Use]
Strong for self-service agility
• Data is immediately available
• Flexible, iterative wrangling/preparation
• Support for multiple compute types
• Unrestricted access
Constrained for traditional BI and reporting
• Difficult to comprehend data semantics (3 V's)
• Limited governance
• Uncontrolled curation
• Limited trust (tribal knowledge)
• Dispersed storage with redundant data sets due to distributed wrangling

CONVERGE THE BEST OF TRADITIONAL & EXPLORATORY
[Diagram: Governance + Self-Service. Axis: Context-free to Context-rich. Stages: Acquire, Use, Curate]
Leveraging governance artifacts for self-service agility
• Immediate discovery
• As-it-happens trust and governance
• Flexible curation and preparation
• Automations for scale

CLOUDERA DATA WAREHOUSE
Decoupled architecture on-prem & cloud-native
• Analytic Workbench: SQL Developers
• Preferred BI Tools: Analysts
• Workload 360: Migrate, Analyze, Optimize, Scale
• Navigator: Trust & Stewardship
• Query Engine: Apache Impala
• ETL Processing: Hive-on-Spark
• Local Storage: Apache Kudu | HDFS
• Object Storage: AWS S3 | Azure ADLS

WHAT BI/ANALYTICS TOOLS ARE YOU USING?
Select all that apply
• Tableau
• Qlik
• Zoomdata
• Arcadia
• PowerBI
• Microstrategy
• Cognos
• SAS
• Hue
• SQL Shell
• Other

BRIDGING THE GAP
For collaborative governance
IT/OPS | STEWARD | DBA | SQL DEV | ANALYST

EMPOWER THE BUSINESS
with trust and efficiency
• Flexibility: Converge all data from all sources
• Trusted discovery: Understand what to trust across cloud and on-prem
• Query assistance and recommendations: Based on popular values and best practices
• Share and go beyond SQL: Extend to third-party tools or data science teams

DEMO

CUSTOMER 360 ANALYSIS
Analytic Workbench Demo
Discover
• Browse all available data
• Search in data catalog (data, queries, documents)
• Import new data in minutes
Query
• Built-in best practices
• In-editor popular values, risk alerts, and recommendations
• Query builder and dynamic dashboards
Share
• Navigate, Copy, Download, Export
• Share data, queries, and results
• Parameterized presentations
• Integrated with BI tools & platform

CUSTOMER 360 ANALYSIS
Understand support costs, product usage, time-to-resolution, marketing channel activities
Sources: Salesforce | Usage Logs | Marketing Database
Data sets (fields shown):
• Opportunities: revenue
• Cases: start, end
• Account History: id
• Usage: ts
• Contact Activities: type, duration
Shared Storage: S3 | ADLS | Kudu | HDFS
Shared Metadata, Security, Governance
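
As a rough illustration of the kind of query this Customer 360 analysis implies, the sketch below computes average time-to-resolution and total revenue per account. It is not the demo's actual code. It uses Python's built-in sqlite3 module as a stand-in for the warehouse query engine. Only the data set names and the fields revenue, start, end, and id come from the slide. The account_id join key, the sample rows, and the function names are assumptions made for illustration.

```python
import sqlite3

# Hypothetical schema: table names and the fields revenue, start, end, and id
# follow the slide; the account_id join key is an assumption.
SCHEMA = """
CREATE TABLE account_history (id INTEGER PRIMARY KEY);
CREATE TABLE cases (account_id INTEGER, "start" TEXT, "end" TEXT);
CREATE TABLE opportunities (account_id INTEGER, revenue REAL);
"""

# Per account: number of support cases, average days from case start to case
# end, and total opportunity revenue. Accounts with no cases keep a NULL average.
SUMMARY_SQL = """
WITH case_stats AS (
    SELECT account_id,
           COUNT(*) AS n_cases,
           AVG(julianday("end") - julianday("start")) AS avg_days_to_resolve
    FROM cases
    GROUP BY account_id
),
revenue_stats AS (
    SELECT account_id, SUM(revenue) AS total_revenue
    FROM opportunities
    GROUP BY account_id
)
SELECT a.id,
       COALESCE(c.n_cases, 0),
       c.avg_days_to_resolve,
       COALESCE(r.total_revenue, 0)
FROM account_history AS a
LEFT JOIN case_stats AS c ON c.account_id = a.id
LEFT JOIN revenue_stats AS r ON r.account_id = a.id
ORDER BY a.id
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a few made-up sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO account_history (id) VALUES (?)",
                     [(1,), (2,), (3,)])
    conn.executemany("INSERT INTO cases VALUES (?, ?, ?)", [
        (1, "2018-01-01", "2018-01-03"),
        (1, "2018-01-10", "2018-01-14"),
        (2, "2018-02-01", "2018-02-02"),
    ])
    conn.executemany("INSERT INTO opportunities VALUES (?, ?)",
                     [(1, 1000.0), (1, 500.0), (2, 200.0)])
    conn.commit()
    return conn


def customer_360_summary(conn: sqlite3.Connection) -> list:
    """Return (account id, case count, avg days to resolve, total revenue) rows."""
    return conn.execute(SUMMARY_SQL).fetchall()


if __name__ == "__main__":
    for row in customer_360_summary(build_demo_db()):
        print(row)
```

Pre-aggregating cases and opportunities before joining them to accounts avoids double-counting revenue when an account has several cases. The same pattern would extend to the Usage and Contact Activities data sets.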

EPSILON
Agility Harmony
Headline metrics (values not shown): structured + unstructured data | annual operational savings
Self-service campaign and segmentation development
Gives marketers the freedom and flexibility to use customer data how and when they want, without roadblocks
• Consolidate cross-enterprise data sources
• Complete customer 360 with secured data
• Easy-to-use frontend
• Build segments and campaigns in minutes, not hours
Industry-leading SLAs on amount of data and segmentation speed
• Processing time down from 6 hrs to 10 min
• 4000 lines of SQL against millions of profiles and billions of transactions
• Guarantee 10M onboarded profiles/hr and 40-50M transactions/hr

GLOBAL PHARMACEUTICAL
R&D Information Platform
Balance curated usage with agile discovery
• Consistent, shared data access through BI tools
• Interactive query speeds
• OLAP capabilities
• HIPAA compliant
Results:
• Reduced cost and time to identify clinical trial groups
• Accelerated new drug development
• First-time metrics and monitoring on compliance data
Analytic Users: 70% Execs/Managers | 25% Analysts | 5% Data Scientists
Headline metrics (values not shown): structured + unstructured data | reduction in silos | use cases

NOVANTAS
Metriscape for customer journey analytics
Headline metrics (values not shown): deposits in cross-bank data set | savings for every $1B in deposits | faster time to monetization with hybrid cloud
Analyzes 1000s of business metrics for each customer
Built-in governance for trust in agile analytics
• Meet regulatory requirements
• Auto-generated and user-driven
• Includes unstructured call center data
• Single platform for training/testing models, interactive queries, and visualizations
Results:
• 50% decrease in marketing execution costs
• Limit retention promotional expenses by 10% with only 3% change in churn
• Low TCO with hybrid cloud and transient resources

THE DATA MANAGEMENT INFRASTRUCTURE MODEL
Source: Solve Your Data Challenges With the Data Management Infrastructure Model, Gartner, 2017

MULTI-USER, MULTI-FUNCTION
Support all skills and curate based on known and new data
• ANALYTIC DATABASE: Discovery (raw)
• ANALYTIC DATABASE: Exploration (curated)
• DATA ENGINEERING: Prep - New Report
• ANALYTIC DATABASE: BI/New Reporting
• DATA SCIENCE: Model Build/Test
• DATA ENGINEERING: Prep - Known
• ANALYTIC DATABASE: Regular Reporting
Shared Storage (S3, ADLS, Kudu, HDFS)
Shared Metadata, Security, Governance

CLOUDERA ENTERPRISE
The modern platform for machine learning and analytics optimized for the cloud
ANALYTIC DATABASE | DATA SCIENCE | OPERATIONAL DATABASE | DATA ENGINEERING | EXTENSIBLE SERVICES
Core Services: SECURITY | GOVERNANCE | WORKLOAD MANAGEMENT | INGEST & REPLICATION | DATA CATALOG
Storage Services: Amazon S3 | Microsoft ADLS | HDFS | KUDU

QUESTIONS?
Try the Free Demo:
cloudera.com/products/open-source/apache-hadoop/hue.html#demo
Get Started in the Cloud:
cloudera.com/products/altus/altus-analytic-db.html
