6. Data Warehouse Optimization
[Diagram: Data Sources (ERP, CRM, CDR, Logs, Other Data); Data Warehouse (Master & Transactional Data) with Analytic Data Mart(s); Big Data Architecture (Raw Data, Parsed Data, Analytic Datasets, Master Data); Tape Archive]
8. Data Architecture and Integration Challenges
[Diagram: Data Sources (ERP, CRM, CDR, Logs, Other Data); Data Warehouse (Master & Transactional Data) with Analytic Data Mart(s); Big Data Architecture (Raw Data, Parsed Data, Analytic Datasets, Master Data); NoSQL]
9. Data Architecture and Integration Challenges
[Diagram: Data Sources (ERP, CRM, CDR, Logs, Other Data); Data Warehouse (Master & Transactional Data) with Analytic Data Mart(s); Big Data Architecture (Raw Data, Parsed Data, Analytic Datasets, Master Data); NoSQL; Extract, Transform, Load; Orchestration & Integration; MR]
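The Extract, Transform, Load and MR (MapReduce) labels in the diagram can be illustrated with a minimal, single-process sketch. It is not the deck's implementation and does not use Hadoop itself: the log format, the field names, and the in-memory list standing in for a data mart table are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical raw web-server log lines (the "Raw Data" box in the diagram).
RAW_LOGS = [
    "2013-01-05 10:00:01 GET /home 200",
    "2013-01-05 10:00:02 GET /cart 500",
    "malformed line",
    "2013-01-05 10:00:03 POST /cart 200",
]


def extract(lines):
    """Extract: parse each raw line into a record, skipping malformed lines."""
    for line in lines:
        parts = line.split()
        if len(parts) != 5:
            continue  # assumed rule: a valid line has exactly five fields
        date, _time, _method, path, status = parts
        yield {"date": date, "path": path, "status": int(status)}


def map_phase(records):
    """Map: emit one (key, 1) pair per record, keyed by (path, status)."""
    for rec in records:
        yield (rec["path"], rec["status"]), 1


def reduce_phase(pairs):
    """Shuffle and reduce: group the pairs by key and sum their counts."""
    totals = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(totals)


def load(dataset, target):
    """Load: append rows to a list that stands in for a data mart table."""
    for (path, status), hits in sorted(dataset.items()):
        target.append((path, status, hits))


analytic_table = []
load(reduce_phase(map_phase(extract(RAW_LOGS))), analytic_table)
print(analytic_table)
```

In a real cluster the map and reduce steps would run in parallel across nodes; here they run in sequence only to show how raw data becomes parsed data and then an analytic dataset.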
10. Data Architecture and Integration Challenges
[Diagram: Data Sources (ERP, CRM, CDR, Logs, Other Data); Data Warehouse (Master & Transactional Data) with Analytic Data Mart(s); Big Data Architecture (Raw Data, Parsed Data, Analytic Datasets, Master Data); NoSQL; Extract, Transform, Load; Orchestration & Integration; MR]
13. Global Marketing
“Well we are ready, but how will the Hardware Team know how to size and design the Hadoop cluster...?”
“I don’t know... and it may take a long time to build the Hadoop cluster”
“Time is a critical factor, we need to get this project moving”
14. Global Marketing
Reduce time to Cluster Sizing, Design & Deployment
Faster time to productive operations
Optimize and adapt for your needs
Deliver the best return on investment
Reduce risk & increase flexibility with Dell
Dell.com/Crowbar
15. Global Marketing
Dell | Hadoop Solution
“Dell … was one of the first of the hardware vendors to grasp the fact that cloud is about provisioning services, not about the hardware.”
Maxwell Cooter, Cloud Pro
Excels at supporting complex big data analyses across large collections of structured and unstructured data
• Hadoop handles a variety of workloads, including search, log processing, data warehousing, recommendation systems and video/image analysis
• Work on the most modern scale-out architectures using a clean-sheet design data framework
• Without vendor lock-in
Apache Hadoop software
Crowbar software framework with a Hadoop barclamp
PowerEdge C8000 Series, C6220, R720, R720XD
Force10 or PowerConnect switches
Reference Architecture
Deployment Guide
Joint Service and Support
Proven solutions
Proven components
Partner Ecosystem
16. Global Marketing
Crowbar
• Accelerates multi-node deployments
• Simplifies maintenance
• Streamlines ongoing updates
Built with DevOps
• Provides an operational model for managing big data clusters and cloud
Field-proven technologies
• Built on locally deployed Chef Server
• Raw servers to full cluster in <2 hours
• Hardened with more than a year of deployments
Apache 2 open source
• Multi-apps (Hadoop & OpenStack)
• Multi-OS (Ubuntu, RHEL, CentOS, SUSE)
NOT limited to Dell hardware
Crowbar Software Framework
A modular, open source framework
17. Global Marketing
Deploy a Hadoop cluster in ~2 hours
Reduce software licensing fees: 100%
Use Crowbar to:
• Automate the deployment and configuration of a Hadoop cluster
• Quickly provision bare-metal servers from box to cluster with minimal intervention
• Maintain, upgrade and evolve your Hadoop cluster over time
• Leverage an open source framework backed by a growing global developer ecosystem
Reduce development time: 4-6 mo.
Crowbar software framework
Evolve to meet your needs over time with built-in DevOps
19. Global Marketing
Leverage developer expertise worldwide
Download the open source software:
https://github.com/dellcloudedge/crowbar
Participate in an active community
http://lists.us.dell.com/mailman/listinfo/crowbar
Get resources on the Wiki:
https://github.com/dellcloudedge/crowbar/wiki
Visit Dell.com/Crowbar, Crowbar@Dell.com
TAKE-AWAYS
Pentaho provides complete integrated DI+BI for every leading big data platform.
The company decided to invest in Hadoop and ingest the raw CDR data into it along with other data. This frees DW capacity for high-value transactional data while also lowering cost and meeting compliance requirements. And since the data is rescued from tape, it becomes available for reporting and analysis.
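To make the point about raw CDR data becoming available for reporting more concrete, here is a minimal sketch of turning raw call detail records into parsed records and then a small analytic summary. The CDR layout (caller, callee, start time, duration in seconds), the sample values, and the function names are hypothetical; real CDR formats vary by vendor, and at scale this work would run as jobs on the cluster instead of in one process.

```python
import csv
import io
from collections import Counter

# Hypothetical raw CDR extract: caller, callee, start time, duration in seconds.
RAW_CDR = """\
15550001,15550002,2013-01-05T10:00:00,120
15550001,15550003,2013-01-05T11:30:00,60
15550002,15550001,2013-01-05T12:00:00,300
"""


def parse_cdr(text):
    """Turn raw comma-separated CDR lines into a list of typed records."""
    reader = csv.reader(io.StringIO(text))
    return [
        {"caller": caller, "callee": callee, "start": start, "seconds": int(seconds)}
        for caller, callee, start, seconds in reader
    ]


def seconds_per_caller(records):
    """Aggregate total call duration per caller for reporting."""
    totals = Counter()
    for rec in records:
        totals[rec["caller"]] += rec["seconds"]
    return dict(totals)


parsed = parse_cdr(RAW_CDR)
usage = seconds_per_caller(parsed)
print(usage)
```

Once records are parsed like this, a summary such as `usage` is the kind of analytic dataset that reporting tools can query directly.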
Delivered as a hardware, software, and services Reference Architecture (RA) which can scale from 6 nodes up to 720 nodes
Currently utilizes PowerEdge C2100/C6100/C6105, R720, R720XD servers and PowerConnect 6248 or Force10 switches
Dell Crowbar
• Automated solution deployment and configuration (Bare metal, OS, Solution Stack, and Monitoring)
CDH3 Enterprise
• Cloudera Hadoop Distribution
• Cloudera Management Tools
• Cloudera Support
Partner Ecosystem
• Software and services capabilities to address broader customer needs around Hadoop
• Enabling non-technical business users to leverage Hadoop
• Simplify getting data into Hadoop
• Intuitive analytics reporting and dashboards
Solution Provided via
• Reference Architecture
• Deployment Guide
• Dell Digital Locker
• Dell Deployment Services
First OpenStack cloud solution provider in market
Pioneer OpenStack partner (only Day 1 hardware provider)
Most history with the OpenStack technology = expertise + RAs that have been tested longer and more fully than those of newcomers
Dell offers a deep partnership ecosystem
Single point of support and purchase to reduce the problem of dealing with multiple vendors
ONLY company providing automated software to do multi-node OpenStack provisioning: Crowbar
Dell developed software that we open sourced in the community.
OpenStack expertise