SlideShare uma empresa Scribd logo
1 de 46
Baixar para ler offline
DAT 205 - Amazon Redshift in Action
Enterprise, Big Data, and SaaS Use Cases

November 15, 2013

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Amazon Redshift

Fast, simple, petabyte-scale data warehousing for less than $1,000/TB/Year
Amazon Redshift architecture
• Leader Node
–
–
–

JDBC/ODBC

SQL endpoint
Stores metadata
Coordinates query execution

• Compute Nodes
–
–
–
–

10 GigE
(HPC)

Local, columnar storage
Execute queries in parallel
Load, backup, restore via Amazon S3
Parallel load from Amazon DynamoDB

• Single node version available

Ingestion
Backup
Restore
Amazon Redshift is priced to let you analyze all your data
Price Per Hour for
HS1.XL Single Node

Effective Hourly
Price per TB

Effective Annual
Price per TB

On-Demand

$ 0.850

$ 0.425

$ 3,723

1 Year Reservation

$ 0.500

$ 0.250

$ 2,190

3 Year Reservation

$ 0.228

$ 0.114

$

999

Simple Pricing
Number of Nodes x Cost per Hour
No charge for Leader Node
No upfront costs
Pay as you go
Data Warehousing for Capital Markets
Jason Timmes, AVP of Software Development, NASDAQ OMX
November 15, 2013

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Where innovation meets action
OUR TECHNOLOGY

WE OWN AND OPERATE

IS USED TO POWER MORE THAN

70 M ARKETPLACES

26 MARKETS
including

IN 50 COUNTRIES

3 CLEARINGHOUSES

1 MILLION
MESSAGES/SECOND
AT A MEDIAN SPEED OF
SUB-55 MICROSECONDS

POWER
1 IN 10

OF THE WORLD’S SECURITIES TRANSACTIONS

AND 5 CENTRAL

SECURITIES
OUR GLOBAL PLATFORM
CAN HANDLE MORE THAN

WE

D E P OS ITOR IE S

MORE THAN 5500
STRUCTURED PRODUCTS
ARE TIED TO OUR GLOBAL INDEXES
WITH THE NOTIONAL VALUE OF

AT LEAST $1 TRILLION

WE LIST ~3300
GLOBAL COMPANIES WORTH

$6 TRILLION
IN MARKET CAP REPRESENTING

DIVERSE INDUSTRIES AND

MANY OF THE WORLD’S
MOST WELL-KNOWN AND

INNOVATIVE BRANDS

6
What I do

New data and analytics platforms to store and
serve data to internal and external customers.
The Challenge
• Archiving Market Data
– classic “Big Data” problem

• Power Surveillance and Business
Intelligence/Analytics
• Minimize cost
– Not only infrastructure, but development/IT labor costs too

• Empower the business for self-service
SIP Total Monthly Message Volumes
OPRA, UQDF and CQS

Market
Data
Is Big
Data

Total Monthly Message Volume
Date
Aug-12
Sep-12
Oct-12
Nov-12
Dec-12
Jan-13
Feb-13
Mar-13
Apr-13
May-13
Jun-13
Jul-13
Aug-13

Charts courtesy of the
Financial Information
Forum

NASDAQ Exchange Daily Peak Messages
600,000,000
500,000,000
400,000,000
300,000,000
200,000,000
100,000,000
0

OPRA Annual Increase: 69%
CQS Annual Increase: 10%
UQDF Annual Decrease: 6%

Jan-13

Feb-13 Mar-13

Apr-13 May-13 Jun-13

Jul-13

Aug-13 Sep-13

Financial Information Forum, Redistribution without permission from FIF prohibited, email: fifinfo@fif.com

UQDF
2,317,804,321
1,948,330,199
1,016,336,632
2,148,867,295
2,017,355,401
2,099,233,536
1,969,123,978
2,010,832,630
2,447,109,450
2,400,946,680
2,601,863,331
2,142,134,920
2,188,338,764

CQS
8,241,554,280
7,452,279,225
7,452,279,225
9,552,313,807
8,052,399,165
7,474,101,082
7,531,093,813
7,896,498,260
9,805,224,566
9,430,865,048
11,062,086,463
8,266,215,553
9,079,813,726

Total Monthly
Message Volume
Date
OPRA
Aug-12
80,600,107,361
Sep-12
77,303,404,427
Oct-12
98,407,788,187
Nov-12
104,739,265,089
Dec-12
81,363,853,339
Jan-13
82,227,243,377
Feb-13
87,207,025,489
Mar-13
93,573,969,245
Apr-13
123,865,614,055
May-13
134,587,099,561
Jun-13
162,771,803,250
Jul-13
120,920,111,089
Aug-13
136,237,441,349

Combined
Average Daily
Volume
459,102,548
494,768,917
403,267,422
557,199,100
503,487,728
455,873,077
500,011,463
495,366,545
556,924,273
537,809,624
683,197,490
473,106,840
512,188,750
Average Daily
Volume
3,504,352,494
4,068,600,233
4,686,085,152
4,987,584,052
4,068,192,667
3,915,583,018
4,589,843,447
4,678,698,462
5,630,255,184
6,117,595,435
8,138,590,163
5,496,368,686
6,192,610,970
23
Our legacy solution
• On-premises MPP DB
– Relatively expensive, finite storage
– Required periodic additional expenses to add more storage
– Ongoing IT (administrative) human costs

• Legacy BI tool
– Requires developer involvement for new data sources, reports,
dashboards, etc.
New Solution: Amazon Redshift
• Cost Effective
– Redshift is 43% of the cost of legacy
• Assuming equal storage capacities

– Doesn’t include IT ongoing costs!

• Performance
– Easily outperforms our legacy BI/DB solution
– Insert 550K rows/second on a 2 node 8XL cluster

• Elastic
– Add additional capacity on demand, easy to grow our cluster
New Solution: Pentaho BI/ETL
• Amazon Redshift partner
– http://aws.amazon.com/redshift/par
tners/pentaho/

• Self Service
– Tools empower BI users to
integrate new data sources, create
their own analytics, dashboards,
and reports without requiring
development involvement

• Cost effective
Net Result
• New solution is cheaper, faster, and offers
capabilities that our business didn’t have before
– Empowers our business users to explore data like they never
could before
– Reduces IT and development as bottlenecks
– Margin improvement (expense reduction and supports business
decisions to grow revenue)
HauteLook + Amazon Redshift
A Case Study
Kevin Diamond, HauteLook
November 15, 2014

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Who am I? Kevin Diamond
• CTO of HauteLook, a Nordstrom Company
• Oversee all technology, infrastructure, data,
engineering, etc.

• Major focus on great customer experience and
the analytics to provide it
What is HauteLook?
• Private sale, members-only limited-time sale events
• Premium fashion and lifestyle brands at exclusive prices of
50-75% off
• Over 20 new sale events begin each morning at 8am PST

• Over 14 million members
• Acquired by Nordstrom in 2011
Why a Data Warehouse?
• Centralized storage of multiple data sources
• Singular reporting consistency for all departments
• Data model that supports analytics not transactions
• Operational reports vs. analytical reports
– Real-time vs. previous day
Why Amazon Redshift?
• Looked at some competitors:
– Ranged from $ to $$$
– All required Software, Implementation and BIG Hardware

• Skipped the RFP

• Jumped into the Public Beta of Amazon Redshift
and never looked back
How We Implemented Amazon Redshift
• ETL from MySQL and Microsoft SQL Server into AWS across a
Direct Connect line storing on S3
• Also used S3 to dump flat files (iTunes Connect Data, Web Analytics
dumps, log files, etc)
• Used AWS Data Pipeline for executing Sqoop and Hadoop running
on EC2 to load data into Amazon Redshift
• Redshift Data Model based on Star Schema which looks something
like …
Example of Star Schema
Usage with Business Intelligence
• Already selected a BI Tool
• Had difficulty deploying in the cloud
• But worked great on-premises
• Easily tied into Amazon Redshift using ODBC Drivers
• BUT, metadata for reports had to live in MSSQL
• Ported many SSIS/SSRS reports over
– But only the analytical reports!
And it all looks like this
Amazon Redshift Instances
• We use a little under 2TB
• Thought to use 2 - BIG 8XL instance to get great performance (in
passive failover mode)
• Cost us $$$
• Then we tested using 6 - XL instances in a cluster
• Performed better and allowed for more concurrency of queries in all
but a handful of cases that really needed the 8XL power
• Cost us $
• Duh! That’s why we do distributed everything else!!
Some First Hand Experience
• ETL was hardest part
• Amazon Redshift performs awesome
• Someone needs to make a great client SQL tool

• MicroStrategy works great on it (just wished it loved
running in EC2)
• Saving a ton, thanks to:
–

No hardware costs

–

No maintenance/overhead (rack + power)

–

Annual costs are equivalent to just the annual maintenance
of some of the cheaper DW on-premises options
Conclusion/Last Advice
•

Only use 8XL instances if you need >2TB of space
–

Otherwise distribute on a bunch of XL nodes

•

Buy reserved instances (we still need to do this!) since you likely will have this always on

•

Although we haven’t yet, the idea of a flexible scale-up/down DW is crazy awesome – maybe during
Holiday we will

•

Probably could have used Elastic MapReduce instead of Hadoop – wasn’t sure how it would play with Sqoop

•

Almost all BI tools play with Amazon Redshift now, so choose what is right for your business, and make sure it
works in EC2 before just putting it there

•

Communication between AWS and your DC is easy and fast, but I recommend a Direct Connect

•

Passed our rigorous information security standards, but used in a VPC
Amazon Redshift in Action:
Enterprise, Big Data, and SaaS Use Cases
Parag Thakker – VP, Roundarch Isobar
Colin McGuigan – Architect, Roundarch Isobar

November 15th, 2013

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
roundarch isobar
OUR SERVICES ACROSS BOUGHT, OWNED AND EARNED MEDIA

Strategies

Campaigns

Experiences

Platforms

Products

We digitally transform
business processes and
disrupt industries

We create, measure and
optimize digitally-focused
campaigns

We produce joyful
experiences that inspire
consumer interaction

We design and build
flexible and scalable
technology solutions

We invent digital
products that generate
new revenue streams

Audience insight

Research: competitive,
segmentation, persona
development, heuristics

Platforms: content
management, search,
portals, mobile, frontend technology,
internet-enabled
devices/wearables, social
apps, web services,
security, big data,
hosting

Digital products

Business planning:
competitive & industry
analysis, business cases,
maturity models,
roadmaps
Strategies: brand,
interactive, multichannel, social, content

27

Communications planning
Creative: advertising, visual
design, content creation,
studio production
Optimization: analytics,
monitoring, SEO, MVT,
media ROI analysis

Requirements and
specifications: content
analysis and specs,
functional requirements,
functional specifications
User experience design:
information architecture,
taxonomy and meta data,
interaction design, mobile

Digital product
extensions
Brand as a service
We have served the U.S. Air Force since 2001, building their enterprise portal and many
mission-critical applications
U.S. Air Force

Key metrics for our USAF work include:

• 900,000+ registered users

• Portal availability over 99.9% of time

• 700,000+ PK-E users

• 28 production enterprise services

• Response time worldwide: 3 seconds for 80% of all pages

• Over 300 applications available

• Over 1.2 million logins/week

• Public-facing and secure private instances (NIPR & SIPR)

• 124,000 unique daily users
28

• 4-5+ million pages daily (40-70 Mbit/sec)

• Portal support for over 5,000 “Communities of Interest”
Transforming in-stadium operations through a touch-screen command center
New York Jets
Our executive touch-screen environment provides real-time stadium
and game data, allowing the Jets owner, Woody Johnson, to monitor
the fan experience during game time and make operational
decisions that help maximize sales. The command center provides
summary-level and drill-down views of stadium operations such as
tickets, parking and concessions. It also creates predictive
algorithms that help identify pinch points and open revenue
opportunities.

29

“We brought the big picture close enough to
identify new, better ways to do business.”
Through a joint venture with Copia Capital, we created a new product offering for William
Blair
William Blair | Investment Research Management System
• Facilitates collaboration between
portfolio managers and analysts

Technology:

• Provides a holistic view of a
company/stock

• Uses Jquery,
JavascriptMVC, Less

– What is everything our
organization knows about
AAPL

• Digitizes PDF/Excel tools and
reports to enable rich, dynamic
interactions
• Simplifies content creation; e.g.,
comments, recommendation
reports, document upload

• Rich charting and visualization of
analytics

30

• JavaScript, HTML5, CSS3

• JSON Web Services

• Java, Spring, JPA, Mongo
DB
• User comment: “We love
how fast it is!”
What is the focus of your
CMO today?

Optimize marketing spend
across all channels (Bought, Earned
and Owned)
31
domain
marketing spend

billions

Web

Mobile
Display
Ads

Affiliate

Search

Sales

hundreds
data size

Email
TV
Radio

dozens
data sources

Print
Social

media channels

multiple terabytes
clients

multiple
32
marketing effectiveness stages
DLP

Scorecard

Sonar

AMNET

Compass

Optimize
Scorecard

Real-Time and Non-Real-Time

Learn
Analyze
• Centralized cross channel
Big Data Platform
• Standardized cross channel
reporting tools

• Discovery tools to identify
channel optimization
opportunities
• Modeling solutions

• Channel experience
enhancements
• Improved media buying,
planning & reporting functions
• Real time integration into DSP
• A/B testing based micro
segment adjustments
So what have we accomplished?
Built Marketing Analytics Platform - Radar
with 200+ in-time analytics, reporting andfrequency, granularity
forenable feeds (1TB/week) with various optimization
to scalable multi-tenant in 3 platform on Amazon
as multiple clients with customized metrics
with first launch SaaS months
and classification

34
scorecard dashboard

35
scorecard logical architecture
Media Team

Display

Paid
Search

Organic
Search

Digital
Video

Site
Metrics

Sales

Google
DFA

Google
Bing
Marin

Google
Bing

Custom

Google
Omniture

Client
Stakeholders

TBD

Scorecard App

Detailed Analytic
Reports

TV

Radio

Print
OOH

Earned
Social

DDS

DDS

DDS

Facebook
Twitter

Competit
ive
Custom

Paid
Social
Facebook
Twitter
Media Team

36

Planners

Client Team
data sources

DATA VOLUME

Voluminous Data

Digital
CRM
Research
- Surveys

- Demographics
- Campaigns

- Search
- Mobile
- Attribution
- Site
- Social
- Display

VARIETY and GRANULARITY
37

- Cookie Level
- UGC
- Geospatial
- Weather
- Sales
- Competitive
tech architecture
SaaS
Reporting
Platform

BI Tools

Analysts

Clients

Radio

WWW

Display
Ads

Search

S3

Redshift

Social

Feeds

38

Hadoop EMR

MySQL RDS

EC2

Beanstalk
ETL
Extract
Files loaded on Amazon S3/Amazon Glacier

Transform
Utilize Pig on Amazon EMR to cleanse,
standardize and validate the data

Radio

Glacier

Display
Ads

S3

Redshift

Search

Load
Use COPY to load Pig output

Social

Feeds
Hadoop EMR

39
data warehouse
Performance
Handles humongous aggregation quickly

Tableau,
BI Tools
Analysts

Cheap, fast, easily scalable

ODBC and JDBC access
For BI / adhoc analysis

Redshift

40
aggregation
Mapping

Radio

Join performance data with metadata
Display
Ads

Multi-step aggregation

SQL

Product,
Campaign

In Amazon Redshift using SQL
Search
Views, Clicks,
CTR, CPC etc

Load aggregates

Social

in MySQL for sub second web response
Aggregates

Redshift

41

MySQL RDS
data workflow
Jenkins for client+channel ETL
Job control dashboard

Jenkins

Ruby for provisioning, job flow
Data intake/extract
Amazon DynamoDB for state management

Ruby

DynamoDB

On demand, job-initiated
Amazon EMR clusters

S3

42

Hadoop EMR

Redshift

MySQL RDS
SaaS dashboard
Designed for redundancy
Hardware and location

Client1.com

Client2.com

ElastiCache

Multi-Tenant
Managed services

DNS

Automated stack provisioning
For clients
MySQL RDS

43

EC2
Beanstalk

Load
Balancing
AWS advantages
Innovate

US

Quickly with reduced risk

AMAZON

Time
To market

Java

Ruby
Python

Lower
Operational overhead

Highly
Scalable

44

Developers

DevOps

AWS Ops
learnings
Metadata is more important than the data
Design for scalability upfront
Always explore better ways to aggregate
Cost management is very important
Build Agile: Perform early end-to-end validation on smaller dataset
Separate data visualization, data cleansing, storage & data aggregation
Be smart about implementing data aggregation routines across multiple granularities

45
Please give us your feedback on this
presentation

DAT205
As a thank you, we will select prize
winners daily for completed surveys!

Mais conteúdo relacionado

Mais procurados

Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...
Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...
Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...Amazon Web Services
 
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...Amazon Web Services
 
Building AWS Redshift Data Warehouse with Matillion and Tableau
Building AWS Redshift Data Warehouse with Matillion and TableauBuilding AWS Redshift Data Warehouse with Matillion and Tableau
Building AWS Redshift Data Warehouse with Matillion and TableauLynn Langit
 
Powering Interactive Data Analysis at Pinterest by Amazon Redshift
Powering Interactive Data Analysis at Pinterest by Amazon RedshiftPowering Interactive Data Analysis at Pinterest by Amazon Redshift
Powering Interactive Data Analysis at Pinterest by Amazon RedshiftJie Li
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar SeriesGetting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar SeriesAmazon Web Services
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift Amazon Web Services
 
AWS Webcast - Data Integration into Amazon Redshift
AWS Webcast - Data Integration into Amazon RedshiftAWS Webcast - Data Integration into Amazon Redshift
AWS Webcast - Data Integration into Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon DynamoDB
Getting Started with Amazon DynamoDBGetting Started with Amazon DynamoDB
Getting Started with Amazon DynamoDBAmazon Web Services
 
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon RedshiftAmazon Web Services
 
(BDT316) Offloading ETL to Amazon Elastic MapReduce
(BDT316) Offloading ETL to Amazon Elastic MapReduce(BDT316) Offloading ETL to Amazon Elastic MapReduce
(BDT316) Offloading ETL to Amazon Elastic MapReduceAmazon Web Services
 
(DAT201) Introduction to Amazon Redshift
(DAT201) Introduction to Amazon Redshift(DAT201) Introduction to Amazon Redshift
(DAT201) Introduction to Amazon RedshiftAmazon Web Services
 
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & DataductAmazon Web Services
 
AWS Webcast - Redshift Overview and New Features
AWS Webcast - Redshift Overview and New Features AWS Webcast - Redshift Overview and New Features
AWS Webcast - Redshift Overview and New Features Amazon Web Services
 
Near Real-Time Data Analysis With FlyData
Near Real-Time Data Analysis With FlyData Near Real-Time Data Analysis With FlyData
Near Real-Time Data Analysis With FlyData FlyData Inc.
 
(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...
(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...
(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...Amazon Web Services
 
New Database Migration Services & RDS Updates
New Database Migration Services & RDS UpdatesNew Database Migration Services & RDS Updates
New Database Migration Services & RDS UpdatesAmazon Web Services
 
AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...
AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...
AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...Amazon Web Services
 

Mais procurados (20)

Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...
Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...
Production NoSQL in an Hour: Introduction to Amazon DynamoDB (DAT101) | AWS r...
 
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
 
Building AWS Redshift Data Warehouse with Matillion and Tableau
Building AWS Redshift Data Warehouse with Matillion and TableauBuilding AWS Redshift Data Warehouse with Matillion and Tableau
Building AWS Redshift Data Warehouse with Matillion and Tableau
 
Powering Interactive Data Analysis at Pinterest by Amazon Redshift
Powering Interactive Data Analysis at Pinterest by Amazon RedshiftPowering Interactive Data Analysis at Pinterest by Amazon Redshift
Powering Interactive Data Analysis at Pinterest by Amazon Redshift
 
Masterclass - Redshift
Masterclass - RedshiftMasterclass - Redshift
Masterclass - Redshift
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift
 
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar SeriesGetting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift
 
AWS Webcast - Data Integration into Amazon Redshift
AWS Webcast - Data Integration into Amazon RedshiftAWS Webcast - Data Integration into Amazon Redshift
AWS Webcast - Data Integration into Amazon Redshift
 
Getting Started with Amazon DynamoDB
Getting Started with Amazon DynamoDBGetting Started with Amazon DynamoDB
Getting Started with Amazon DynamoDB
 
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
 
(BDT316) Offloading ETL to Amazon Elastic MapReduce
(BDT316) Offloading ETL to Amazon Elastic MapReduce(BDT316) Offloading ETL to Amazon Elastic MapReduce
(BDT316) Offloading ETL to Amazon Elastic MapReduce
 
(DAT201) Introduction to Amazon Redshift
(DAT201) Introduction to Amazon Redshift(DAT201) Introduction to Amazon Redshift
(DAT201) Introduction to Amazon Redshift
 
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
 
AWS Webcast - Redshift Overview and New Features
AWS Webcast - Redshift Overview and New Features AWS Webcast - Redshift Overview and New Features
AWS Webcast - Redshift Overview and New Features
 
AWS Big Data Platform
AWS Big Data PlatformAWS Big Data Platform
AWS Big Data Platform
 
Near Real-Time Data Analysis With FlyData
Near Real-Time Data Analysis With FlyData Near Real-Time Data Analysis With FlyData
Near Real-Time Data Analysis With FlyData
 
(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...
(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...
(BDT310) Big Data Architectural Patterns and Best Practices on AWS | AWS re:I...
 
New Database Migration Services & RDS Updates
New Database Migration Services & RDS UpdatesNew Database Migration Services & RDS Updates
New Database Migration Services & RDS Updates
 
AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...
AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...
AWS re:Invent 2016: Workshop: Converting Your Oracle or Microsoft SQL Server ...
 

Destaque

Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Amazon Web Services
 
Implementation of linear regression and logistic regression on Spark
Implementation of linear regression and logistic regression on SparkImplementation of linear regression and logistic regression on Spark
Implementation of linear regression and logistic regression on SparkDalei Li
 
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)Amazon Web Services
 
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5Cloudera, Inc.
 
AWS Road Trip 2013 - Presentation
AWS Road Trip 2013 - PresentationAWS Road Trip 2013 - Presentation
AWS Road Trip 2013 - PresentationAmazon Web Services
 
Automating Backup & Archiving with AWS and CommVault
Automating Backup & Archiving with AWS and CommVaultAutomating Backup & Archiving with AWS and CommVault
Automating Backup & Archiving with AWS and CommVaultAmazon Web Services
 
AWS Customer Presentation - ORbyte
AWS Customer Presentation - ORbyteAWS Customer Presentation - ORbyte
AWS Customer Presentation - ORbyteAmazon Web Services
 
Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...
Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...
Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...Amazon Web Services
 
STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012
STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012
STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012Amazon Web Services
 
AWS Summit Sydney 2014 | Understanding AWS Security
AWS Summit Sydney 2014 | Understanding AWS SecurityAWS Summit Sydney 2014 | Understanding AWS Security
AWS Summit Sydney 2014 | Understanding AWS SecurityAmazon Web Services
 
GOWAR - Virtual Wars Real Places. AWS Case Study
GOWAR - Virtual Wars Real Places. AWS Case StudyGOWAR - Virtual Wars Real Places. AWS Case Study
GOWAR - Virtual Wars Real Places. AWS Case StudyAmazon Web Services
 
DAT201 Migrating Databases to AWS - AWS re: Invent 2012
DAT201 Migrating Databases to AWS - AWS re: Invent 2012DAT201 Migrating Databases to AWS - AWS re: Invent 2012
DAT201 Migrating Databases to AWS - AWS re: Invent 2012Amazon Web Services
 
AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...
AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...
AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...Amazon Web Services
 
Webinar: Delivering Static and Dynamic Content Using CloudFront
Webinar: Delivering Static and Dynamic Content Using CloudFrontWebinar: Delivering Static and Dynamic Content Using CloudFront
Webinar: Delivering Static and Dynamic Content Using CloudFrontAmazon Web Services
 
AWS Summit - Brisbane 2014 - Keynote
AWS Summit - Brisbane 2014 - KeynoteAWS Summit - Brisbane 2014 - Keynote
AWS Summit - Brisbane 2014 - KeynoteAmazon Web Services
 
AWS Partner Presentation - Sonian
AWS Partner Presentation - SonianAWS Partner Presentation - Sonian
AWS Partner Presentation - SonianAmazon Web Services
 

Destaque (20)

Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
 
Implementation of linear regression and logistic regression on Spark
Implementation of linear regression and logistic regression on SparkImplementation of linear regression and logistic regression on Spark
Implementation of linear regression and logistic regression on Spark
 
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
 
Cost Optimisation on AWS
Cost Optimisation on AWSCost Optimisation on AWS
Cost Optimisation on AWS
 
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
 
Mobile apps and iot aws lambda
Mobile apps and iot aws lambdaMobile apps and iot aws lambda
Mobile apps and iot aws lambda
 
AWS Road Trip 2013 - Presentation
AWS Road Trip 2013 - PresentationAWS Road Trip 2013 - Presentation
AWS Road Trip 2013 - Presentation
 
Automating Backup & Archiving with AWS and CommVault
Automating Backup & Archiving with AWS and CommVaultAutomating Backup & Archiving with AWS and CommVault
Automating Backup & Archiving with AWS and CommVault
 
AWS Customer Presentation - ORbyte
AWS Customer Presentation - ORbyteAWS Customer Presentation - ORbyte
AWS Customer Presentation - ORbyte
 
Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...
Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...
Empowering Publishers - Unlocking the power of Amazon Web Services - May-15-2...
 
STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012
STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012
STP205 Making it Big Without Breaking the Bank - AWS re: Invent 2012
 
AWS Blackbelt NINJA Dojo
AWS Blackbelt NINJA DojoAWS Blackbelt NINJA Dojo
AWS Blackbelt NINJA Dojo
 
AWS Summit Sydney 2014 | Understanding AWS Security
AWS Summit Sydney 2014 | Understanding AWS SecurityAWS Summit Sydney 2014 | Understanding AWS Security
AWS Summit Sydney 2014 | Understanding AWS Security
 
GOWAR - Virtual Wars Real Places. AWS Case Study
GOWAR - Virtual Wars Real Places. AWS Case StudyGOWAR - Virtual Wars Real Places. AWS Case Study
GOWAR - Virtual Wars Real Places. AWS Case Study
 
DAT201 Migrating Databases to AWS - AWS re: Invent 2012
DAT201 Migrating Databases to AWS - AWS re: Invent 2012DAT201 Migrating Databases to AWS - AWS re: Invent 2012
DAT201 Migrating Databases to AWS - AWS re: Invent 2012
 
AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...
AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...
AWS Cloud Kata | Hong Kong - Getting to Scale on AWS, Customer Presentation b...
 
Webinar: Delivering Static and Dynamic Content Using CloudFront
Webinar: Delivering Static and Dynamic Content Using CloudFrontWebinar: Delivering Static and Dynamic Content Using CloudFront
Webinar: Delivering Static and Dynamic Content Using CloudFront
 
AWS Summit - Brisbane 2014 - Keynote
AWS Summit - Brisbane 2014 - KeynoteAWS Summit - Brisbane 2014 - Keynote
AWS Summit - Brisbane 2014 - Keynote
 
AWS Partner Presentation - Sonian
AWS Partner Presentation - SonianAWS Partner Presentation - Sonian
AWS Partner Presentation - Sonian
 
What's New
What's NewWhat's New
What's New
 

Semelhante a Amazon Redshift in Action: Enterprise, Big Data, and SaaS Use Cases (DAT205) | AWS re:Invent 2013

AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...Amazon Web Services
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftAmazon Web Services
 
Building a data warehouse with Amazon Redshift … and a quick look at Amazon ...
Building a data warehouse  with Amazon Redshift … and a quick look at Amazon ...Building a data warehouse  with Amazon Redshift … and a quick look at Amazon ...
Building a data warehouse with Amazon Redshift … and a quick look at Amazon ...Julien SIMON
 
How Glidewell Moves Data to Amazon Redshift
How Glidewell Moves Data to Amazon RedshiftHow Glidewell Moves Data to Amazon Redshift
How Glidewell Moves Data to Amazon RedshiftAttunity
 
Big Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightBig Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightAmazon Web Services LATAM
 
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013Amazon Web Services
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoAmazon Web Services LATAM
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantageAmazon Web Services
 
DoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics PlatformDoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics Platformmartinbpeters
 
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Precisely
 
Database and Analytics on the AWS Cloud
Database and Analytics on the AWS CloudDatabase and Analytics on the AWS Cloud
Database and Analytics on the AWS CloudAmazon Web Services
 
Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...
Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...
Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...Amazon Web Services
 
5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game ChangerCaserta
 
Amazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni VamvadelisAmazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni Vamvadelishuguk
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data AnalyticsAmazon Web Services
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스Amazon Web Services Korea
 

Semelhante a Amazon Redshift in Action: Enterprise, Big Data, and SaaS Use Cases (DAT205) | AWS re:Invent 2013 (20)

AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
 
Building a data warehouse with Amazon Redshift … and a quick look at Amazon ...
Building a data warehouse  with Amazon Redshift … and a quick look at Amazon ...Building a data warehouse  with Amazon Redshift … and a quick look at Amazon ...
Building a data warehouse with Amazon Redshift … and a quick look at Amazon ...
 
How Glidewell Moves Data to Amazon Redshift
How Glidewell Moves Data to Amazon RedshiftHow Glidewell Moves Data to Amazon Redshift
How Glidewell Moves Data to Amazon Redshift
 
Big Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightBig Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of Light
 
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
Understanding AWS Database Options (DAT201) | AWS re:Invent 2013
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 
DoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics PlatformDoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics Platform
 
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
 
Database and Analytics on the AWS Cloud
Database and Analytics on the AWS CloudDatabase and Analytics on the AWS Cloud
Database and Analytics on the AWS Cloud
 
Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...
Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...
Scaling your Analytics with Amazon Elastic MapReduce (BDT301) | AWS re:Invent...
 
5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer
 
Amazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni VamvadelisAmazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni Vamvadelis
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
ESGYN Overview
ESGYN OverviewESGYN Overview
ESGYN Overview
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
 

Mais de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Mais de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Último

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 

Último (20)

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 

Amazon Redshift in Action: Enterprise, Big Data, and SaaS Use Cases (DAT205) | AWS re:Invent 2013

  • 1. DAT 205 - Amazon Redshift in Action Enterprise, Big Data, and SaaS Use Cases November 15, 2013 © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 2. Amazon Redshift Fast, simple, petabyte-scale data warehousing for less than $1,000/TB/Year
  • 3. Amazon Redshift architecture • Leader Node – – – JDBC/ODBC SQL endpoint Stores metadata Coordinates query execution • Compute Nodes – – – – 10 GigE (HPC) Local, columnar storage Execute queries in parallel Load, backup, restore via Amazon S3 Parallel load from Amazon DynamoDB • Single node version available Ingestion Backup Restore
  • 4. Amazon Redshift is priced to let you analyze all your data Price Per Hour for HS1.XL Single Node Effective Hourly Price per TB Effective Annual Price per TB On-Demand $ 0.850 $ 0.425 $ 3,723 1 Year Reservation $ 0.500 $ 0.250 $ 2,190 3 Year Reservation $ 0.228 $ 0.114 $ 999 Simple Pricing Number of Nodes x Cost per Hour No charge for Leader Node No upfront costs Pay as you go
  • 5. Data Warehousing for Capital Markets Jason Timmes, AVP of Software Development, NASDAQ OMX November 15, 2013 © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 6. Where innovation meets action OUR TECHNOLOGY WE OWN AND OPERATE IS USED TO POWER MORE THAN 70 M ARKETPLACES 26 MARKETS including IN 50 COUNTRIES 3 CLEARINGHOUSES 1 MILLION MESSAGES/SECOND AT A MEDIAN SPEED OF SUB-55 MICROSECONDS POWER 1 IN 10 OF THE WORLD’S SECURITIES TRANSACTIONS AND 5 CENTRAL SECURITIES OUR GLOBAL PLATFORM CAN HANDLE MORE THAN WE D E P OS ITOR IE S MORE THAN 5500 STRUCTURED PRODUCTS ARE TIED TO OUR GLOBAL INDEXES WITH THE NOTIONAL VALUE OF AT LEAST $1 TRILLION WE LIST ~3300 GLOBAL COMPANIES WORTH $6 TRILLION IN MARKET CAP REPRESENTING DIVERSE INDUSTRIES AND MANY OF THE WORLD’S MOST WELL-KNOWN AND INNOVATIVE BRANDS 6
  • 7. What I do New data and analytics platforms to store and serve data to internal and external customers.
  • 8. The Challenge • Archiving Market Data – classic “Big Data” problem • Power Surveillance and Business Intelligence/Analytics • Minimize cost – Not only infrastructure, but development/IT labor costs too • Empower the business for self-service
  • 9. SIP Total Monthly Message Volumes OPRA, UQDF and CQS Market Data Is Big Data Total Monthly Message Volume Date Aug-12 Sep-12 Oct-12 Nov-12 Dec-12 Jan-13 Feb-13 Mar-13 Apr-13 May-13 Jun-13 Jul-13 Aug-13 Charts courtesy of the Financial Information Forum NASDAQ Exchange Daily Peak Messages 600,000,000 500,000,000 400,000,000 300,000,000 200,000,000 100,000,000 0 OPRA Annual Increase: 69% CQS Annual Increase: 10% UQDF Annual Decrease: 6% Jan-13 Feb-13 Mar-13 Apr-13 May-13 Jun-13 Jul-13 Aug-13 Sep-13 Financial Information Forum, Redistribution without permission from FIF prohibited, email: fifinfo@fif.com UQDF 2,317,804,321 1,948,330,199 1,016,336,632 2,148,867,295 2,017,355,401 2,099,233,536 1,969,123,978 2,010,832,630 2,447,109,450 2,400,946,680 2,601,863,331 2,142,134,920 2,188,338,764 CQS 8,241,554,280 7,452,279,225 7,452,279,225 9,552,313,807 8,052,399,165 7,474,101,082 7,531,093,813 7,896,498,260 9,805,224,566 9,430,865,048 11,062,086,463 8,266,215,553 9,079,813,726 Total Monthly Message Volume Date OPRA Aug-12 80,600,107,361 Sep-12 77,303,404,427 Oct-12 98,407,788,187 Nov-12 104,739,265,089 Dec-12 81,363,853,339 Jan-13 82,227,243,377 Feb-13 87,207,025,489 Mar-13 93,573,969,245 Apr-13 123,865,614,055 May-13 134,587,099,561 Jun-13 162,771,803,250 Jul-13 120,920,111,089 Aug-13 136,237,441,349 Combined Average Daily Volume 459,102,548 494,768,917 403,267,422 557,199,100 503,487,728 455,873,077 500,011,463 495,366,545 556,924,273 537,809,624 683,197,490 473,106,840 512,188,750 Average Daily Volume 3,504,352,494 4,068,600,233 4,686,085,152 4,987,584,052 4,068,192,667 3,915,583,018 4,589,843,447 4,678,698,462 5,630,255,184 6,117,595,435 8,138,590,163 5,496,368,686 6,192,610,970 23
  • 10. Our legacy solution • On-premises MPP DB – Relatively expensive, finite storage – Required periodic additional expenses to add more storage – Ongoing IT (administrative) human costs • Legacy BI tool – Requires developer involvement for new data sources, reports, dashboards, etc.
  • 11. New Solution: Amazon Redshift • Cost Effective – Redshift is 43% of the cost of legacy • Assuming equal storage capacities – Doesn’t include IT ongoing costs! • Performance – Easily outperforms our legacy BI/DB solution – Insert 550K rows/second on a 2 node 8XL cluster • Elastic – Add additional capacity on demand, easy to grow our cluster
  • 12. New Solution: Pentaho BI/ETL • Amazon Redshift partner – http://aws.amazon.com/redshift/par tners/pentaho/ • Self Service – Tools empower BI users to integrate new data sources, create their own analytics, dashboards, and reports without requiring development involvement • Cost effective
  • 13. Net Result • New solution is cheaper, faster, and offers capabilities that our business didn’t have before – Empowers our business users to explore data like they never could before – Reduces IT and development as bottlenecks – Margin improvement (expense reduction and supports business decisions to grow revenue)
  • 14. HauteLook + Amazon Redshift A Case Study Kevin Diamond, HauteLook November 15, 2014 © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 15. Who am I? Kevin Diamond • CTO of HauteLook, a Nordstrom Company • Oversee all technology, infrastructure, data, engineering, etc. • Major focus on great customer experience and the analytics to provide it
  • 16. What is HauteLook? • Private sale, members-only limited-time sale events • Premium fashion and lifestyle brands at exclusive prices of 50-75% off • Over 20 new sale events begin each morning at 8am PST • Over 14 million members • Acquired by Nordstrom in 2011
  • 17. Why a Data Warehouse? • Centralized storage of multiple data sources • Singular reporting consistency for all departments • Data model that supports analytics not transactions • Operational reports vs. analytical reports – Real-time vs. previous day
  • 18. Why Amazon Redshift? • Looked at some competitors: – Ranged from $ to $$$ – All required Software, Implementation and BIG Hardware • Skipped the RFP • Jumped into the Public Beta of Amazon Redshift and never looked back
  • 19. How We Implemented Amazon Redshift • ETL from MySQL and Microsoft SQL Server into AWS across a Direct Connect line storing on S3 • Also used S3 to dump flat files (iTunes Connect Data, Web Analytics dumps, log files, etc) • Used AWS Data Pipeline for executing Sqoop and Hadoop running on EC2 to load data into Amazon Redshift • Redshift Data Model based on Star Schema which looks something like …
  • 20. Example of Star Schema
  • 21. Usage with Business Intelligence • Already selected a BI Tool • Had difficulty deploying in the cloud • But worked great on-premises • Easily tied into Amazon Redshift using ODBC Drivers • BUT, metadata for reports had to live in MSSQL • Ported many SSIS/SSRS reports over – But only the analytical reports!
  • 22. And it all looks like this
  • 23. Amazon Redshift Instances • We use a little under 2TB • Thought to use 2 - BIG 8XL instance to get great performance (in passive failover mode) • Cost us $$$ • Then we tested using 6 - XL instances in a cluster • Performed better and allowed for more concurrency of queries in all but a handful of cases that really needed the 8XL power • Cost us $ • Duh! That’s why we do distributed everything else!!
  • 24. Some First Hand Experience • ETL was hardest part • Amazon Redshift performs awesome • Someone needs to make a great client SQL tool • MicroStrategy works great on it (just wished it loved running in EC2) • Saving a ton, thanks to: – No hardware costs – No maintenance/overhead (rack + power) – Annual costs are equivalent to just the annual maintenance of some of the cheaper DW on-premises options
  • 25. Conclusion/Last Advice • Only use 8XL instances if you need >2TB of space – Otherwise distribute on a bunch of XL nodes • Buy reserved instances (we still need to do this!) since you likely will have this always on • Although we haven’t yet, the idea of a flexible scale-up/down DW is crazy awesome – maybe during Holiday we will • Probably could have used Elastic MapReduce instead of Hadoop – wasn’t sure how it would play with Sqoop • Almost all BI tools play with Amazon Redshift now, so choose what is right for your business, and make sure it works in EC2 before just putting it there • Communication between AWS and your DC is easy and fast, but I recommend a Direct Connect • Passed our rigorous information security standards, but used in a VPC
  • 26. Amazon Redshift in Action: Enterprise, Big Data, and SaaS Use Cases Parag Thakker – VP, Roundarch Isobar Colin McGuigan – Architect, Roundarch Isobar November 15th, 2013 © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 27. roundarch isobar OUR SERVICES ACROSS BOUGHT, OWNED AND EARNED MEDIA Strategies Campaigns Experiences Platforms Products We digitally transform business processes and disrupt industries We create, measure and optimize digitally-focused campaigns We produce joyful experiences that inspire consumer interaction We design and build flexible and scalable technology solutions We invent digital products that generate new revenue streams Audience insight Research: competitive, segmentation, persona development, heuristics Platforms: content management, search, portals, mobile, frontend technology, internet-enabled devices/wearables, social apps, web services, security, big data, hosting Digital products Business planning: competitive & industry analysis, business cases, maturity models, roadmaps Strategies: brand, interactive, multichannel, social, content 27 Communications planning Creative: advertising, visual design, content creation, studio production Optimization: analytics, monitoring, SEO, MVT, media ROI analysis Requirements and specifications: content analysis and specs, functional requirements, functional specifications User experience design: information architecture, taxonomy and meta data, interaction design, mobile Digital product extensions Brand as a service
  • 28. We have served the U.S. Air Force since 2001, building their enterprise portal and many mission-critical applications U.S. Air Force Key metrics for our USAF work include: • 900,000+ registered users • Portal availability over 99.9% of time • 700,000+ PK-E users • 28 production enterprise services • Response time worldwide: 3 seconds for 80% of all pages • Over 300 applications available • Over 1.2 million logins/week • Public-facing and secure private instances (NIPR & SIPR) • 124,000 unique daily users 28 • 4-5+ million pages daily (40-70 Mbit/sec) • Portal support for over 5,000 “Communities of Interest”
  • 29. Transforming in-stadium operations through a touch-screen command center New York Jets Our executive touch-screen environment provides real-time stadium and game data, allowing the Jets owner, Woody Johnson, to monitor the fan experience during game time and make operational decisions that help maximize sales. The command center provides summary-level and drill-down views of stadium operations such as tickets, parking and concessions. It also creates predictive algorithms that help identify pinch points and open revenue opportunities. 29 “We brought the big picture close enough to identify new, better ways to do business.”
  • 30. Through a joint venture with Copia Capital, we created a new product offering for William Blair William Blair | Investment Research Management System • Facilitates collaboration between portfolio managers and analysts Technology: • Provides a holistic view of a company/stock • Uses Jquery, JavascriptMVC, Less – What is everything our organization knows about AAPL • Digitizes PDF/Excel tools and reports to enable rich, dynamic interactions • Simplifies content creation; e.g., comments, recommendation reports, document upload • Rich charting and visualization of analytics 30 • JavaScript, HTML5, CSS3 • JSON Web Services • Java, Spring, JPA, Mongo DB • User comment: “We love how fast it is!”
  • 31. What is the focus of your CMO today? Optimize marketing spend across all channels (Bought, Earned and Owned) 31
  • 33. marketing effectiveness stages DLP Scorecard Sonar AMNET Compass Optimize Scorecard Real-Time and Non-Real-Time Learn Analyze • Centralized cross channel Big Data Platform • Standardized cross channel reporting tools • Discovery tools to identify channel optimization opportunities • Modeling solutions • Channel experience enhancements • Improved media buying, planning & reporting functions • Real time integration into DSP • A/B testing based micro segment adjustments
  • 34. So what have we accomplished? Built Marketing Analytics Platform - Radar with 200+ in-time analytics, reporting andfrequency, granularity forenable feeds (1TB/week) with various optimization to scalable multi-tenant in 3 platform on Amazon as multiple clients with customized metrics with first launch SaaS months and classification 34
  • 36. scorecard logical architecture Media Team Display Paid Search Organic Search Digital Video Site Metrics Sales Google DFA Google Bing Marin Google Bing Custom Google Omniture Client Stakeholders TBD Scorecard App Detailed Analytic Reports TV Radio Print OOH Earned Social DDS DDS DDS Facebook Twitter Competit ive Custom Paid Social Facebook Twitter Media Team 36 Planners Client Team
  • 37. data sources DATA VOLUME Voluminous Data Digital CRM Research - Surveys - Demographics - Campaigns - Search - Mobile - Attribution - Site - Social - Display VARIETY and GRANULARITY 37 - Cookie Level - UGC - Geospatial - Weather - Sales - Competitive
  • 39. ETL Extract Files loaded on Amazon S3/Amazon Glacier Transform Utilize Pig on Amazon EMR to cleanse, standardize and validate the data Radio Glacier Display Ads S3 Redshift Search Load Use COPY to load Pig output Social Feeds Hadoop EMR 39
  • 40. data warehouse Performance Handles humongous aggregation quickly Tableau, BI Tools Analysts Cheap, fast, easily scalable ODBC and JDBC access For BI / adhoc analysis Redshift 40
  • 41. aggregation Mapping Radio Join performance data with metadata Display Ads Multi-step aggregation SQL Product, Campaign In Amazon Redshift using SQL Search Views, Clicks, CTR, CPC etc Load aggregates Social in MySQL for sub second web response Aggregates Redshift 41 MySQL RDS
  • 42. data workflow Jenkins for client+channel ETL Job control dashboard Jenkins Ruby for provisioning, job flow Data intake/extract Amazon DynamoDB for state management Ruby DynamoDB On demand, job-initiated Amazon EMR clusters S3 42 Hadoop EMR Redshift MySQL RDS
  • 43. SaaS dashboard Designed for redundancy Hardware and location Client1.com Client2.com ElastiCache Multi-Tenant Managed services DNS Automated stack provisioning For clients MySQL RDS 43 EC2 Beanstalk Load Balancing
  • 44. AWS advantages Innovate US Quickly with reduced risk AMAZON Time To market Java Ruby Python Lower Operational overhead Highly Scalable 44 Developers DevOps AWS Ops
  • 45. learnings Metadata is more important than the data Design for scalability upfront Always explore better ways to aggregate Cost management is very important Build Agile: Perform early end-to-end validation on smaller dataset Separate data visualization, data cleansing, storage & data aggregation Be smart about implementing data aggregation routines across multiple granularities 45
  • 46. Please give us your feedback on this presentation DAT205 As a thank you, we will select prize winners daily for completed surveys!