3. Microsoft Data Warehousing Vision
Make SQL Server the gold standard for data warehousing offering customers
Massive Scalability at Low Hardware Choice Improved Business Agility
Cost and Alignment
4. Approximate data volume
• managed by data warehouse
•
Today In 3 Years
•
21%
Less than 500 GB
5%
•
• 500 GB – 1 TB
12%
20%
21%
1 – 3 TB
18%
•
19%
• 3 – 10 TB
25%
17%
More than 10 TB
34%
2%
Don’t Know
6%
Source: TDWI Report – Next Generation DW
Microsoft Confidential—Preliminary Information Subject to Change 4
5. Data Warehouse Industry Trends
100%
Broad Commitment
Advanced
Centralized Analytics
EDW
Data
Quality
75%
Analytics
within EDW HA for DW 64-bit MDM
Analytics
Web Services
Outside EDW DBMS Built
for DW
Plan to Use
Real-time DW
Blades in
Security
MPP
Racks
50%
DBMS Built
for
DW Appliance Streaming
Data
Transactions Mixed Workloads SOA
Server
Virtualization Data Federation Low-Power
SMP
Hardware
Columnar DBMS
DW In-Memory DBMS
Bundles
25%
SaaS
Narrow Commitment
Open Source Open Source
OS Reporting
Software Open Source
Appliance Data Integration
Open Source DBMS
Public Cloud
0%
-50% -25% 0% 25% 50% 75% 100%
Decreasing Usage
Anticipated Growth in the next 3 Years Increasing Usage
Areas of strategic investment for Microsoft Source: TDWI
8. Software:
• SQL Server 2008
Enterprise
• Windows Server 2008
Configuration guidelines:
• Physical table structures
• Indexes
• Compression
• SQL Server settings
• Windows Server settings
• Loading
Hardware:
• Tight specifications for servers,
storage and networking
• ‘Per core’ building block
9. Reduces DBA effort; fewer indexes,
much higher level of sequential I/O
Dell, HP, Bull, EMC and IBM – more in
future
Commodity Hardware and value pricing;
Lower storage costs.
New reference architectures scale up to
48TB (assuming 2.5x compression)
Validated by Microsoft; better choice of
hardware; application of Best Practice
15. Fast Track vNext
Fast Track Data Warehouse 2.0 Future Partners to create new
Enterprise ETL Services Validated Reference
Star Join Query Optimizations New Reference Architectures from
IBM Architectures with Test Harness
Updated Configurations from HP,
Dell and Bull
EMC as a Service Partner for Fast
Track
2008 2009 2010 Beyond
Fast Track Data Warehouse New Test Harness for Partners
DW Reference Architectures Microsoft to create new Test
Predictable performance at low Harness for validation of new
cost Fast Track configurations
Faster time to solution NEC to validate new Reference
Architectures
Microsoft Confidential—Preliminary 16
18. Parallel Data Warehouse Appliance - Hardware
Architecture
Database Servers Storage Nodes
Control Nodes
SQL
Active / Passive
SQL
SQL
SQL
SQL
Management Servers
Dual Fiber Channel
SQL
Dual Infiniband
SQL
Landing Zone
SQL
SQL
SQL
Backup Node
SQL
Spare Database Server
Corporate Network Private Network
19. Parallel Data Warehouse demo at BI conference 2008
• Query
‐ Cache flushed
‐ Inner joins
• Report
‐ Retailer: day-part analysis
‐ Sales, Time, Date, Prod type
• Sample Results
‐ 625K rows returned in 11 seconds
from 1 trillion row table
‐ Final product will be even faster
20. Existing Current Madison
Environment Challenges Highlights
Hardware Data Load Speeds Improved by 300%
16 CPU HP 8620 Itanium
Hitachi Storage 27TB Raw
SATA 21 LUNS
Analytic Capacity 30TB/160 Cores
Software Analytic Speed Query Speeds 70X
Windows 2003 SP2 Improvement
SQLServer 2008
SSIS/SSRS
Mixed Workload Concurrency
Data Warehouse Mixed Workload
18 Terabytes
Star Schema Total Cost of TCO Lowered by
80 Fact Tables
500 + Dimensions
Ownership 50%
22. PDW vNext
Focus on continually lowering the
Microsoft Announce Intention to MTP Program Launched costs of high end DW, while
Acquire DATAllegro (July) Circa 10 Customers Provided with early increasing performance
Acquisition Closes (Sept) Madison Benchmark Additional Hardware Partners
150TB demo of DATAllegro on SQL Madison Named as SQL Server 2008 R2 Closer functional alignment with SQL
Server run at BI Conference (Oct) Parallel Data Warehouse Server
List Price at $57.5K per proc Better integration with SQL and tools
and technologies
2008 2009 2010 Beyond
Project “Madison”
MTP 2 Program to Launch (fully
Compatibility with DATAllegro v3 functional, fully performant)
MS BI integration TAP Program (on client site)
RTM in H1 2010
?
23.
24. Hub and Spoke – Flexible Business Alignment
EDW provides “single version of truth” but makes it difficult to support mixed
workloads and multiple user groups, each requiring SLAs
25. Hub and Spoke – Flexible Business Alignment
Departmental data marts enable mixed workloads, but make it difficult to
consolidate information across the enterprise
26. Hub and Spoke – Flexible Business Alignment
Parallel database copy Support user groups with
technology enables rapid very different SLAs:
data movement and Performance
consistency between hub Capacity
and spokes Loading
Concurrency
Create SQL Server 2008, Fast Track Data Warehouse, and SQL Server Analysis
Services spokes
A Hub and Spoke solution gives you the flexibility to add/change diverse workloads/user
groups, while maintaining data consistency across the enterprise
51. Faster time to solution
High scale: up to 48TB
Fast Track Low TCO with better price performance; industry standard hardware
Data Warehouse Better performance out of the box and predictable performance
offers customers Reduced risk through balanced hardware & Best practices
Integration with Madison Hub & Spoke Architecture
Twelve reference architectures from HP, Dell, Bull, EMC
SQL Server Fast Track Data and IBM
Warehouse has 2 components
System Integrators with industry solution templates –
Avanade, HP, Hitachi, Cognizant and EMC
52. • Fast Track Data Warehouse offers
−
−
−
−
−
• Parallel Data Warehouse offers
−
−
−
−
•
−
−
−