O slideshow foi denunciado.
Utilizamos seu perfil e dados de atividades no LinkedIn para personalizar e exibir anúncios mais relevantes. Altere suas preferências de anúncios quando desejar.

Building a Data Hub that Empowers Customer Insight (Technical Workshop)

737 visualizações

Publicada em

We have seen the evolution with the Bi and Data Science fields from the structured data warehouse to data lake and finally, to the data hub. This session will cover the key steps required to building a data hub, examining how best to align and engage stakeholders and develop architectural sanction to enable your organisations to realise new customer insights and better enable you to achieve business objectives.

Publicada em: Software
  • Seja o primeiro a comentar

Building a Data Hub that Empowers Customer Insight (Technical Workshop)

  1. 1. 1© Cloudera, Inc. All rights reserved. The Big Data Journey from unknown Data to Business Value A technical demonstration of Cloudera’s Enterprise Data Hub Mahdi Askari – System Engineer
  2. 2. 2© Cloudera, Inc. All rights reserved. Trusted Data Secure Data Architectural Overview Data Landing Zone - Logical Design Data Sources Enterprise Data Hub Enterprise Data Warehouse Data Discovery, Visualization & Analytics Raw Data Keyed Data Refined Data Metadata & Governance
  3. 3. 3© Cloudera, Inc. All rights reserved. Cloudera Enterprise Making Hadoop Fast, Easy, and Secure A new kind of data platform: • One place for unlimited data • Unified, multi-framework data access Cloudera makes it: • Fast for business • Easy to manage • Secure without compromise OPERATIONS DATA MANAGEMENT STRUCTURED UNSTRUCTURED PROCESS, ANALYZE, SERVE UNIFIED SERVICES RESOURCE MANAGEMENT SECURITY FILESYSTEM RELATIONAL NoSQL STORE INTEGRATE BATCH STREAM SQL SEARCH SDK OPERATIONS Cloudera Manager Cloudera Director DATA MANAGEMENT Cloudera Navigator Encrypt and KeyTrustee Optimizer STRUCTURED Sqoop UNSTRUCTURED Kafka, Flume PROCESS, ANALYZE, SERVE UNIFIED SERVICES RESOURCE MANAGEMENT YARN SECURITY Sentry, RecordService FILESYSTEM HDFS RELATIONAL Kudu NoSQL HBase STORE INTEGRATE BATCH Spark, Hive, Pig MapReduce STREAM Spark SQL Impala SEARCH Solr SDK Kite
  4. 4. 4© Cloudera, Inc. All rights reserved. DataCo’s Big Data Journey
  5. 5. 5© Cloudera, Inc. All rights reserved. Retail Company DataCo’s Big Data Journey Overview Imaginary Company with challenges around unstructured data, different data types, and solution scalability Questions DataCo is looking to answer: • “What are the most popular products by category?” • “What products generate the most revenue?” • “Are the top viewed products on the Web Site generating the most revenue?” Demonstration Goals: • Ingest, Transform, and Analyze DataCo’s Data • Leverage Impala for interactive Tableau Dashboards • Answer DataCo’s Data Questions
  6. 6. 6© Cloudera, Inc. All rights reserved. Connect to the data sources and profile fields • Hadoop User Experience (HUE) • MySQL remote Database with retail data • Seamless integration using Hue DB Query Tool
  7. 7. 7© Cloudera, Inc. All rights reserved. Cloudera’s Enterprise Data Hub Sqoop HDFS Put Retail DB Web Server Logs Sqoop ingest structured data
  8. 8. 8© Cloudera, Inc. All rights reserved. Cloudera’s Enterprise Data Hub Sqoop HDFS Put Retail DB Web Server Logs Real-time access to data in HDFS Impala Query Editor
  9. 9. 9© Cloudera, Inc. All rights reserved. Cloudera’s Enterprise Data Hub Sqoop HDFS Put Retail DB Web Server Logs Ingest web server logs
  10. 10. 10© Cloudera, Inc. All rights reserved. Cloudera Hadoop Sqoop HDFS Put Retail DB Web Server Logs Create schema on read tables with Hive
  11. 11. 11© Cloudera, Inc. All rights reserved. Impala Queries to Correlate Structured data with Unstructured data Structured Sales Data Unstructured Web Clickstream Data Are the top viewed products on the Web Site generating the most revenue?
  12. 12. 12© Cloudera, Inc. All rights reserved. Self-Service and data discovery visualization capabilities • Cloudera Impala allows access using ODBC/JDBC • Tableau and the popular BI tools in this space support Impala connectivity for interactive Visualizations and Dashboard creations Cloudera’s Enterprise Data Hub
  13. 13. 13© Cloudera, Inc. All rights reserved. Data Visualization with Tableau Types of connections supported: JDBC/ODBC and NFS Impala allows near real time SQL access to all of your data inside the EDH
  14. 14. 14© Cloudera, Inc. All rights reserved. Thank you!

×