O slideshow foi denunciado.
Utilizamos seu perfil e dados de atividades no LinkedIn para personalizar e exibir anúncios mais relevantes. Altere suas preferências de anúncios quando desejar.

Data Driven With the Cloudera Modern Data Warehouse 3.19.19

554 visualizações

Publicada em

In this session, we will cover how to move beyond structured, curated reports based on known questions on known data, to an ad-hoc exploration of all data to optimize business processes and into the unknown questions on unknown data, where machine learning and statistically motivated predictive analytics are shaping business strategy.

Publicada em: Tecnologia
  • Seja o primeiro a comentar

Data Driven With the Cloudera Modern Data Warehouse 3.19.19

  1. 1. © Cloudera, Inc. All rights reserved. Data Driven With the Cloudera Data Warehouse David Dichmann | ddichmann@cloudera.com
  2. 2. © Cloudera, Inc. All rights reserved. 2© Cloudera, Inc. All rights reserved. What’s YOUR Data Strategy?
  3. 3. © Cloudera, Inc. All rights reserved. 3 OUTCOMES • Curated Data and Agile Discovery with HIPAA compliance • Accelerated new Drug Development NEW PRODUCT DEVELOPMENT GLOBAL PHARMACEUTICAL Use Cases Users Fewer Silos Diverse Data
  4. 4. © Cloudera, Inc. All rights reserved. 4 OUTCOMES • LoB Data Analysts access all data • Saved $4M+ in deposit fraud FRAUD PREVENTION LARGE NORTH AMERICAN BANK Terabytes Users Databases Queries / Month
  5. 5. © Cloudera, Inc. All rights reserved. 5 OUTCOMES • $10 M new revenue • $30 M+ price optimization • $100K+ weather correlation BUSINESS OPTIMIZATION MAJOR TELCO MANUFACTURER Query Responses New Sources Data Sets Users
  6. 6. © Cloudera, Inc. All rights reserved.6 © Cloudera, Inc. All rights reserved. Quickly enable business analytics by sharing petabytes of verified data across thousands of users while surpassing demands of SLAs and costs Massive, Diverse Data Security, Governance User Profiles, Use Cases Self Service EverythingAutomation, Consistency Experiments, Time To Value
  7. 7. © Cloudera, Inc. All rights reserved. 7 TRADITIONAL CHANGES MODERN Users Internal Transparency +External Curation Planned ETLs Flexibility On-Demand ELTs Exploration Constrained Self-Service Freeform Volume Finite Correlations Virtually Infinite
  8. 8. © Cloudera, Inc. All rights reserved. 8© Cloudera, Inc. All rights reserved. TRADITIONAL DATA WAREHOUSE Structured Data Sources (ERP, CRM, SCM) Transformations EDW Advanced Analytics Dashboards Ad Hoc Canned Reports Staging Data Marts Seceral Months Master Schema ETLODS 2 3 4 1 5 Struggle to handle volume and variety Limited Access
  9. 9. © Cloudera, Inc. All rights reserved. 9© Cloudera, Inc. All rights reserved. MODERN DATA WAREHOUSE Advanced Analytics Dashboards Ad Hoc Canned Reports Data Store Within Days Data Marts 1 2 Ingest & Store All Data At Scale Self-service / On-demand Variety of Data Sources/Types
  10. 10. © Cloudera, Inc. All rights reserved. 10© Cloudera, Inc. All rights reserved. MODERN DATA WAREHOUSE Fixed Reports DATA SOURCES Flexible Reporting Advanced Analytics Self-Service BI/Ad Hoc Dashboards/ Analytic Apps EDW COMPLIMENTING A TRADITIONAL EDW
  11. 11. © Cloudera, Inc. All rights reserved. 11© Cloudera, Inc. All rights reserved. CLOUD NATIVE WITH ALTUS DW Multi-Cloud PaaS for Agile Analytics ● Quick time to value for analytics - no software or clusters to manage ● Bring the warehouse to the data with zero copy simplicity ● Use your security policies with your data - no proprietary stacks ● Apply enterprise governance to transient workloads ● Shared data experience with SDX, for analytic workloads ● Optimized for Azure & AWS DATA WAREHOUSE GOVERNANCESECURITY ALTUS CONTROL PLANE LIFECYCLE MANAGEMENT MULTI-CLOUD Amazon S3 Microsoft ADLS
  12. 12. © Cloudera, Inc. All rights reserved. 12© Cloudera, Inc. All rights reserved. Traditional Data Warehouse Optimization Transform Status Quo TRANSFORMATIONAL AREAS OF DATA WAREHOUSING Operations & Events Data Warehouse Run Business Better Research & Discovery Data Warehouse Change the Culture
  13. 13. © Cloudera, Inc. All rights reserved. 13 DRIVERS FOR MODERNIZATION Deeper Business Insights Grow • Customer Sentiment • Fault Prevention • Improve Product Quality • New Revenue Streams Experimentation and collaboration at scale Protect • Proactive Fraud Prevention • Keep up with Regulatory Compliance • Preempt Cyberthreats Real-time response on massive data volume and variety Connect • Improve Operational Efficiency • Support Internet of Things (IoT) New analytics techniques democratized to all users
  14. 14. © Cloudera, Inc. All rights reserved. 14 CHALLENGES OF A MODERN DATA WAREHOUSE Extreme Speed and Scale More Data • Massive amounts handled faster at scale • More variety from new sources (social media, IoT) • Insight within minutes of new data arrival Performance and flexibility at scale More Workloads • 100’s of production grade deployments • Enterprise grade dependability • Strict security and governance On-demand scale out, discovery, collaboration More People • 1,000’s of new users and new user types • 1,000’s of new use cases • All skill levels: Analytics, Data Science, and Machine Learning All workloads with a shared data experience
  15. 15. © Cloudera, Inc. All rights reserved. 15 Optimize Core Processes ● Versatile Solution ● Broaden Data Reach ● Reduce IT Burden or Costs Dynamic Consumption ● Transient, Short-lived, Long-lived ● Public, Private, Hybrid Multi-Cloud ● Adaptive Compute & Storage Self-Service Everything ● Resource Provisioning ● Workload Development ● Optimizing & Troubleshooting CLOUDERA MODERN DATA WAREHOUSE Optimize Processes, Consumption and Costs https://www.cloudera.com/about/customers/xl-axiata.html https://blog.cloudera.com/blog/2018/03/automated-provisioning- of-cdh-in-the-cloud-with-cloudera-director-and-ansible/ https://www.cloudera.com/about/customers/komatsu-mining.html
  16. 16. © Cloudera, Inc. All rights reserved. 16© Cloudera, Inc. All rights reserved. Financial Services Telecom Government Healthcare Manufacturing Customer 360 Personalized Medicine Supply Chain Analysis Operational Efficiencies Network Quality Analysis Equipment Health (IoT) Fraud Compliance Cyber Threat Analysis Regulatory Reporting TOP 10 DATA WAREHOUSE USE CASES BY INDUSTRYGROWCONNECTPROTECT
  17. 17. © Cloudera, Inc. All rights reserved.17 A MODERN DATA WAREHOUSE FROM CLOUDERA HYBRID Storage Preferred BI & ELT ToolsHue Analytic Workbench, Superset Dashboards, CDSW Workload XM, Data Analytics Studio Navigator & Sentry, Atlas & Ranger Impala / Hive LLAP Query Engine Hive on Tez / Spark ELT Processing KUDU | HDFS | Druid Local Storage AWS S3 | ADLS Object Storage Shared Data Experience (SDX) Optimized File Formats (ORC, Parquet, Avro, JSON) Solr Search Analytics Cloudera Manager, Ambari, Altus, Data Plane HYBRID Controls HYBRID Compute HYBRID Storage HYBRID Reporting
  18. 18. © Cloudera, Inc. All rights reserved.18 © Cloudera, Inc. All rights reserved. EXTREME SPEED & SCALE Fastest ELT at Scale for Data Engineers ● Fast data with distributed, in-memory processing ● Curated data, metadata instantly available Fastest Self-Service BI at Scale for Analysts & Developers ● Interactive multi-user queries without rigid modeling for exploration ● Elastic scalability for more users/data Impala LLAP
  19. 19. © Cloudera, Inc. All rights reserved. 19 EXTENSIVE PARTNER ECOSYSTEM System Integrators ISV IHV Alliances Cloud Alliances OEM Alliances Market Expansion
  20. 20. © Cloudera, Inc. All rights reserved.20 © Cloudera, Inc. All rights reserved. CLOUDERA DW - PARTING THOUGHTS Hybrid Optimized Shared Data ExperiencePerformance @Scale Shared Data Exponential Use Cases, Successful Outcomes
  21. 21. © Cloudera, Inc. All rights reserved. THANK YOU

×