O slideshow foi denunciado.
Seu SlideShare está sendo baixado. ×

Global Data Management – a practical framework to rethinking enterprise, operational, and analytical data

Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Anúncio
Carregando em…3
×

Confira estes a seguir

1 de 39 Anúncio

Global Data Management – a practical framework to rethinking enterprise, operational, and analytical data

Baixar para ler offline

Global data management is not a newly coined term. However, what it stands for is actually widening in scope particularly around data-in-motion and data-at-rest. Significant technology trends such as IoT, cloud, AI/ML, blockchain, and streaming data have given rise to excessive data volumes and also innovative use cases. The scope for global data management now extends all the way from ingestion, processing, storage, governance, security to analysis. With a good number of endpoints served through the cloud and major application footprints remaining on-premisess, it is pertinent to have a global data management strategy that supports hybrid models and more specifically, a multi-cloud model.

Many modern businesses struggle to balance the demands of rapidly innovating through new technologies like machine learning with the need to keep data safe and secure, all while responding to a constantly changing regulatory landscape. This puts data stewards, data engineers, architects, data scientists, and analysts under intense pressure as they must contend with existing and new applications, multiple logical and physical data stores and sources, diverse data types, and data spread across several deployment environments.

Attend this session led by Matt Aslett, Research Director at 451 Research and Dinesh Chandrasekhar, Director, Hortonworks to learn more about creating a framework for your enterprise that offers guidance on how to think about global data management—priorities, responsibilities, key stakeholders, compliance, and growth.

Speakers
Dinesh Chandrasekhar, Hortonworks, Director Product Marketing
Matt Aslett, 451 Research, Research Director, Data platforms and Analytics

Global data management is not a newly coined term. However, what it stands for is actually widening in scope particularly around data-in-motion and data-at-rest. Significant technology trends such as IoT, cloud, AI/ML, blockchain, and streaming data have given rise to excessive data volumes and also innovative use cases. The scope for global data management now extends all the way from ingestion, processing, storage, governance, security to analysis. With a good number of endpoints served through the cloud and major application footprints remaining on-premisess, it is pertinent to have a global data management strategy that supports hybrid models and more specifically, a multi-cloud model.

Many modern businesses struggle to balance the demands of rapidly innovating through new technologies like machine learning with the need to keep data safe and secure, all while responding to a constantly changing regulatory landscape. This puts data stewards, data engineers, architects, data scientists, and analysts under intense pressure as they must contend with existing and new applications, multiple logical and physical data stores and sources, diverse data types, and data spread across several deployment environments.

Attend this session led by Matt Aslett, Research Director at 451 Research and Dinesh Chandrasekhar, Director, Hortonworks to learn more about creating a framework for your enterprise that offers guidance on how to think about global data management—priorities, responsibilities, key stakeholders, compliance, and growth.

Speakers
Dinesh Chandrasekhar, Hortonworks, Director Product Marketing
Matt Aslett, 451 Research, Research Director, Data platforms and Analytics

Anúncio
Anúncio

Mais Conteúdo rRelacionado

Diapositivos para si (20)

Semelhante a Global Data Management – a practical framework to rethinking enterprise, operational, and analytical data (20)

Anúncio

Mais de DataWorks Summit (20)

Mais recentes (20)

Anúncio

Global Data Management – a practical framework to rethinking enterprise, operational, and analytical data

  1. 1. Copyright (C) 2018 451 Research LLC Global Data Management A practical framework for rethinking enterprise, operational, and analytical data Matt Aslett, Research Director, Data Platforms & Analytics Dinesh Chandrasekhar, Director, Product Marketing
  2. 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved The New Way of Business Is Fueled By Connected Data • Connected Customers, Vehicles, Devices • Socially crowd-sourced requirements • Digital design and analysis • Digital prototypes and tests (simulations) • Connected Factories, Sensors, Devices • Human-robotic interaction • 3D-printing on demand • Connected Trucks, Inventory • Location, traffic, weather-aware distribution • Real-time inventory visibility • Dynamic rerouting • Connected Customers, Devices • Omni- channel demand sensing • Real-Time Recommendations • Connected Assets • Remote service monitoring & delivery • Predictive maintenance • OTA Updates DEVELOPMENT MANUFACTURING DISTRIBUTION MARKETING/SALES SERVICE
  3. 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved Digital businesses today Search Recommendations Reviews Summon my service Frictionless commerce Instant feedback Immersive Ambient Physical meets digital
  4. 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved Today’s Digital Enterprises RFID TRACKERS AND NANO-DEVICES to give you visibility into movement of your goods MOBILE NOTIFICATIONS to inform you of shipment delay from a supplier BLOCKCHAINS to give complete trust and provenance in your supply chain VIRTUAL ASSISTANTS to enhance your customer experience AI-POWERED CHATBOTS to improve your customer support functions ELECTRONIC B2B EXCHANGES to streamline order processing with partners
  5. 5. 5 © Hortonworks Inc. 2011–2018. All rights reserved So, What Is Stopping You from Digital Transformation? Cannot get a 360 VIEW of your customer? DROWNING in data lakes? TOO MUCH DATA coming in from TOO MANY SOURCES and devices? New business initiatives leading to EXCESSIVE IT COSTS?` MOST IMPORTANTLY… Don’t have the right data at the right time to make the right decision?
  6. 6. 6 © Hortonworks, Inc. 2011-2018. All rights reserved. Modern Data Architecture DATA CENTER Machine Learning/ Artificial Intelligence Telemetry – Connected Devices Time Series Databases Stream Analytics Deep Historical Analysis Exception Monitoring Legacy/ Operational Data Sensors, Control Systems Cyber Security Edge Analytics Social Mobile IoT IoT CLOUD Geo Location
  7. 7. 7 © Hortonworks, Inc. 2011-2018. All rights reserved. G L O B A L D ATA M A N A G E M E N T DATA SOURCES DATA CENTER CLOUD EDGE Exception Monitoring 360 View of Operations Cyber Security Telemetry – Connected Devices Time Series Sensors, Control Systems Telemetry – Connected Devices Sensors, Control Systems Time Series Exception Monitoring Cyber Security Legacy/ Operational Data Global Data Management Enables Modern Data Architecture
  8. 8. 8 © Hortonworks, Inc. 2011-2018. All rights reserved. Data Management Challenges • Dealing with multi-clouds • Avoiding cloud/ vendor lock-in • Future proofing your architecture • Common view of security, governance • Manage all data, regardless of type or location • Maximize data re-use for multiple workloads DATA SOURCES DATA CENTER CLOUD EDGE Exception Monitoring 360 View of Operations Cyber Security Telemetry – Connected Devices Time Series Sensors, Control Systems
  9. 9. 9 © Hortonworks, Inc. 2011-2018. All rights reserved. Global Data Management Platform DATA SOURCES DATA CENTER CLOUD EDGE Exception Monitoring 360 View of Operations Cyber Security Telemetry – Connected Devices Time Series Sensors, Control Systems DATA-IN-MOTION DATA-AT-REST MANAGE, SECURE, GOVERN, CONSUME
  10. 10. 10 © Hortonworks, Inc. 2011-2018. All rights reserved. Global Data Management - Powering Innovation MODERN DATA USE CASES EDW OPTIMIZATION CYBERSECURITY DATA SCIENCE ADVANCED ANALYTICS IOT/ STREAMING ANALYTICS DATA SOURCES DATA CENTER CLOUD EDGE Exception Monitoring 360 View of Operations Cyber Security Telemetry – Connected Devices Time Series Sensors, Control Systems DATA-IN-MOTION DATA-AT-REST MANAGE, SECURE, GOVERN, CONSUME
  11. 11. 11 © Hortonworks Inc. 2011–2018. All rights reserved Offload ETL processing from EDW Load cold archive data in Hadoop Enrich data with machine/ sensor data Run interactive analytics on Hadoop Centralized management and security Fine grained authentication and access control End to end data lineage Detailed audit trails Enterprise AI Platform, open languages Improve Data Science Productivity Improve Data Science learning and collaboration Organize Your Data EDW Modernization & Offload Govern Your Data Governed Data Lakes Drive Intelligence from Data Data Science and AI Global Data Management enables Digital Transformation
  12. 12. Copyright (C) 2018 451 Research LLC Global Data Management A practical framework for rethinking enterprise, operational, and analytical data Matt Aslett, Research Director, Data Platforms & Analytics
  13. 13. Copyright (C) 2018 451 Research LLC 451 Research is a leading IT research & advisory company 13 Founded in 2000 300+ employees, including over 120 analysts 2,000+ clients: Technology & Service providers, corporate advisory, finance, professional services, and IT decision makers 70,000+ IT professionals, business users and consumers in our 451 Alliance research community Over 52 million data points published each quarter and 4,500+ reports published each year 3,000+ technology & service providers under coverage 451 Research and its sister company, Uptime Institute, are the two divisions of The 451 Group Headquartered in New York City, with offices in London, Boston, San Francisco, Washington DC, Mexico, Costa Rica, Brazil, Spain, UAE, Russia, Taiwan, Singapore and Malaysia Research & Data Advisory Events Go 2 Market
  14. 14. Copyright (C) 2018 451 Research LLC 14 Copyright (C) 2018 451 Research LLC
  15. 15. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018 Q1. Which of the following types of cloud or hosted services, if any, does your organization currently use? Please select all that apply. 15 68% 46% 37% 32% 31% 29% 7% Software as a service (SaaS) and hosted applications Infrastructure as a service (IaaS)/public cloud On-premises private cloud Hosted private cloud Platform as a service (PaaS) Hosted, non-cloud infrastructure None % of respondents (n = 707) Types of cloud or hosted services currently in use All respondents
  16. 16. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018 Q4. Which of the following IaaS features is your organization using in connection with your IaaS/public cloud deployment? Please select all that apply. 16 45% 42% 41% 37% 33% 30% 25% 23% 22% 21% 16% 14% 5% 8% Relational database Data/business analytics Containers Auto-scaling Data warehouse Serverless compute/function as a service NoSQL database Real-time/streaming data processing Machine learning Mobile services IoT platform Large-scale/batch data transfer Other None % of respondents (n = 322) Public cloud features currently in use IaaS/public cloud users
  17. 17. Source: 451 Research, Voice of the Enterprise: Digital Pulse, Workloads and Key Projects 2018 Q15. You indicated that the majority of your organization’s workloads/applications will be deployed in off-premises cloud/hosted IT environments two years from now. What are the drivers behind this? Please select up to 3. 17 38% 37% 35% 32% 30% 27% 27% 19% 16% 1% Reduce IT costs Enhance IT systems agility Improved access to new technology resources/capabilities/features Modernize IT infrastructure Deploy new applications/features faster Enhance application performance and resiliency Move from capital-intensive IT to an operating expense model Improve security Overcome lack of in-house IT staff/expertise Other % of respondents (n=448) Drivers of Deploying Workloads/Applications in Off-Premises Environments Base: Respondents Plan To Deploy Majority Workloads/Applications in Off-Premises IT Environments
  18. 18. Migration patterns 18 Copyright (C) 2018 451 Research LLC
  19. 19. Migration patterns 44% 18% 14% 12% 11% % of respondents (n=1049) Source: 451 Research, Voice of the Enterprise: Digital Pulse, Workloads and Key Projects 2018 Q7. Which of the following best describes your organization’s overall IT infrastructure approach to mission-critical legacy applications and workloads going forward? 19 Modernize Retain existing applications on- premises but move to more modern application and infrastructure architectures. Refactor and shift Re-architect or redesign existing applications using cloud-native frameworks and deploy in off-premises cloud environments. Retain Keep current applications unchanged on existing on-premises infrastructure. Repurchase and shift Replace current on-premises applications with SaaS or off-premises hosted versions of the applications. Lift and shift Migrate applications to off- premises/cloud with minimal changes to the application code or business logic.
  20. 20. Copyright (C) 2018 451 Research LLC Multiple Execution Venues 20
  21. 21. Copyright (C) 2018 451 Research LLC A bridge to the cloud 21 Copyright (C) 2018 451 Research LLC
  22. 22. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018 Q3. Which of the following vendors is your organization currently using for IaaS/public cloud? Please select all that apply. 22 71% 69% 23% 10% 8% 7% 3% 3% 3% 3% 3% 10% Amazon Web Services (AWS) Microsoft Azure Google Cloud Platform Oracle Cloud IBM Cloud Rackspace CenturyLink Cloud Digital Ocean Alibaba Cloud NTT OVH (VMware vCloud Air) Other % of respondents (n = 319) IaaS/public cloud vendors currently in use IaaS/public cloud users • Azure access also comes bundled with a Microsoft EA license • 59% of organizations using public cloud use more than one provider • 64% of Azure users also use AWS, 18% use Google Cloud • 62% of AWS users also use Azure, 22% use Google Cloud • 82% of Google Cloud users also use AWS, 65% use Azure
  23. 23. Copyright (C) 2018 451 Research LLC It’s a multi-cloud world (increasingly deliberately) 23 Which Flavor of Multi-Cloud? + Q. Has your organization configured any of the following cloud deployments for interoperability for the seamless delivery of a business function? Source: 451 Research, Voice of the Enterprise: Cloud Transformation, Vendor Evaluations, 2016
  24. 24. Copyright (C) 2018 451 Research LLC It’s a multi-cloud world (increasingly deliberately) 24 Which Flavor of Multi-Cloud? Q. Has your organization configured any of the following cloud deployments for interoperability for the seamless delivery of a business function? Source: 451 Research, Voice of the Enterprise: Cloud Transformation, Vendor Evaluations, 2016 +
  25. 25. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018 Q20. Is your organization currently using multiple infrastructure environments in the operation of (Workload)? 25 47% 6% 18% 29% Yes No, but we plan to No, but we are considering it No, and we don’t plan to % of respondents (n = 621) Multiple infrastructure environments in the operation of a specific workload Respondents identifying workloads as currently in place
  26. 26. Source: 451 Research, Voice of the Enterprise: Cloud, Hosting & Managed Services, Workloads and Key Projects 2018 Q21. Which, if any, of the following best describe your organization’s reasons for using multiple infrastructure environments to operate (Workload)? Please select all that apply. 26 47% 40% 33% 33% 33% 32% 26% 6% 1% Improving performance/availability Optimizing for cost Isolating sensitive business data Migrating between infrastructure environments Meeting regulatory or data sovereignty requirements Adding new functions to existing deployments Adding geographic diversity/reducing latency Other None of the above % of respondents (n = 419) Reasons for multiple infrastructure environments in the operation of workload Multiple infrastructure environment user/potential user
  27. 27. Copyright (C) 2018 451 Research LLC The trouble with data processing in a globally distributed, multi-location environment 27 EDW In theory: one data warehouse, at the heart of the enterprise ENTERPRISE
  28. 28. Copyright (C) 2018 451 Research LLC The trouble with data processing in a globally distributed, multi-location environment 28 EDW In practice: data marts/ departmental data warehouses/ data lakes ENTERPRISE
  29. 29. Copyright (C) 2018 451 Research LLC The trouble with data processing in a globally distributed, multi-location environment 29 In theory: cloud EDW ENTERPRISE
  30. 30. Copyright (C) 2018 451 Research LLC The trouble with data processing in a globally distributed, multi-location environment 30 In practice: multiple clouds, multiple database services EDW ENTERPRISE
  31. 31. Copyright (C) 2018 451 Research LLC The trouble with data processing in a globally distributed, multi-location environment 31 Data processing at the edge EDW ENTERPRISE
  32. 32. Copyright (C) 2018 451 Research LLC Standardization – uniformity across multiple clouds 32 EDW ENTERPRISE
  33. 33. Copyright (C) 2018 451 Research LLC Catalog – index and discovery 33 Data Catalog EDW ENTERPRISE
  34. 34. Copyright (C) 2018 451 Research LLC Schrodinger’s data (credit Chris Hallenbeck, SAP SVP database and data management) How do you know the data is identifiable/to be forgotten without looking at it? Identification of PII data 34Source: Jie Qi on Flickr https://www.flickr.com/photos/jieq/2371783536
  35. 35. Copyright (C) 2018 451 Research LLC Amid the potential complexity – the data fabric 35 EDW The structure of anything; the manner in which the parts of a thing are united • A material made of fibers, a textile or cloth. More intangible - The framework underlying a structure • The fabric of space and time • The fabric of society
  36. 36. Copyright (C) 2018 451 Research LLC Data fabric – interconnective tissue 36 A consistent user experience across multiple environments (both on-premises and cloud) The ability to (increasingly automatically) identify data in multiple environments (both on-premises and cloud) The ability to manage data in multiple environments (both on-premises and cloud) – data lifecycle, data governance, data security The ability to manage data across multiple environments as if it was a single globally distributed architecture (work in progress)
  37. 37. Copyright (C) 2018 451 Research LLC Building blocks for global data management 37 EDW Data Catalog
  38. 38. 38 © Hortonworks Inc. 2011 – 2018. All Rights Reserved Capture streaming data Deliver perishable insights Combine new & old data Store data forever Access a multi-tenant data lake Model with machine learning DATA AT RESTDATA IN MOTION ACTIONABLE INTELLIGENCE Perishable Insights Historical Insights A Connected Data Strategy Solves for All Data GLOBAL DATA MANAGEMENT Manage, Secure, Govern MULTIPLE CLUSTERS AND SOURCES MULTIHYBRID
  39. 39. Copyright (C) 2018 451 Research LLC Thank You! matthew.aslett@451research.com @maslett www.451research.com dchandrasekhar@hortonworks.com @AppInt4All www.hortonworks.com

×