Next Generation
Data Warehouse
Development
with Lambda and
Redshift
Andras Gombosi
AMAZON REDSHIFT
Petabyte scale Massively Parallel
Data Warehouse
Exceptionally fast *
Horizontally Scalable
Massive Storage capacity
Attractive and transparent pricing
SQL interface
AWS ecosystem
Secure *
Challenges – the cost of greatness
SLAs
Security
Audit & Compliance
Solution:
Empower the Database
Developer and DBA
communities with DevOps
methodologies!
AWS Cloud
Lambda
Trigger
CD Pipeline for DB code
Developer
And DBA
Communities
Git Push
Task Router
DEPLOYER APP
Data Modeller SDLC: Forward Engineering
CD PIPELINE
Data Modeling
Tool
Edge ETL Use Cases: AWS Billing Data Load
SQL Forwarder
CD PIPELINE
Deployer functionality
Email
SDLC changes and effects
UPTIME and
REPUTATION
CODE QUALITY
AND SKILLS
AUTOMATIC,
PROCESS ENFORCED
DATA SECURITY
Push changes to repo instead of execute!
Well Architected : Security Pillar
Endpoints
Well Architected : Other Pillars
Security Reliability
AWS Well Architected Framework
Performance
Efficiency
Cost
Optimisation
Operational
Excellence
Conclusion &
Questions
• AWS Ecosystem
• Cloud-native
• Not just for Redshift
• Not “outside the box”, use
multiple boxes!

Next Generation Data Warehouse Development with Lambda and Redshift

Editor's Notes

  1. Hello, my name is Andras Gombosi. I am a senior Data and Database Engineer at TerraAlto. We are a well-established, Dublin-based technical consultancy focusing solely on AWS. We are an AWS Advanced Consulting Partner and also an AWS Managed Services Provider, one of an elite group of 126 companies worldwide holding this competency. We serve clients of all sizes, from start-ups to truly global enterprises, and we have countless migrations under our belt in Europe, Asia and AWS China. We also work on projects in Big Data, IoT, Data Lakes and Blockchain-based track-and-trace solutions. One of our core operating principles is automation. The topic I have brought today is automation in a place where it is not yet widespread: Data Warehousing and BI development.
  2. The ongoing rumour is that Redshift was named to mark that we are moving away from something Red… I worked with those Red technologies for nearly a decade, but I have since made a shift too. In physics, redshift happens when light undergoes an increase in wavelength. This phenomenon is directly related to the expansion of space, the expansion of the universe. Redshift is an exceptionally good service for corporate data warehousing, both as a standalone DWH and as a SQL-compatible extension of a corporate data lake. As usual, AWS does most of the heavy lifting, but data security and cluster performance still require great care and attention from the customer side as well.
  3. Redshift's capacity to grow lets organisations have a single, true enterprise Data Warehouse, typically queried, developed and modified by multiple, often geographically distributed teams and processes, in some cases with hundreds having some sort of access to it. Developers and Data Engineers modify data and change structure in Data Marts. Data Analysts query data directly. DBAs change Data Security (grants and revokes) and do housekeeping (VACUUM, ANALYZE). ETL processes (Glue, EMR, Matillion, Informatica) constantly insert and update. Front-end BI tools (QuickSight, Tableau, MicroStrategy, Spotfire) query data through data marts. Control? The challenges are not new, just a bit amplified, again because of the size and because of the open source origins of Redshift: open source solutions are typically surrounded by a tooling ecosystem, which is not present on Redshift out of the box right now. Challenges: SLAs on Data Availability and Uptime of data marts or other data sources for the upstream consumers. That means ETL/ELT jobs run in a timely and performant manner, and BI teams and other upstream consumer tools can connect and query without any disruption. Security, in this case security of the Data itself: who has access to what? Audit and Compliance: who changed what exactly, and when? In a complicated environment it is vital to have formal, automated processes without human intervention; otherwise, due to the sheer scale, properly managing these challenges becomes very time-consuming and sometimes near impossible.
  4. One possible solution is a “DevOps”-style governance framework. Yes, bringing database changes under the DevOps umbrella is an increasingly popular topic. There are many tools and many ways to build a pipeline: some of them pricey, some complicated, some only work with specific DB engines, and some are all three. Nevertheless, the principles are the same for a Redshift CD pipeline too. A code repository for version control and audit is the entry point, triggering an event-driven, automatic, intelligent Continuous Deployment capability. Ideally this is accompanied by an in-cluster, database- and schema-based User and Privilege Management Framework which controls access via user groups, dedicated service users and default privileges. The solution I brought today is a BASIC, practically free, cloud- and AWS-native way to get going. It uses nothing but AWS and Python.
  5. CodeCommit is the starting point. Multiple communities use separate repositories, and different branches are set up. Some branches are protected and cannot be pushed into directly, only via Pull Requests and merging. Code pushed or merged to the appropriate branch triggers a task router. Task Router: this can be CodePipeline with a Lambda as a custom action in the Build stage to execute anything on a database. For most organizations Lambda might be the more suitable option. Your mileage may vary. We use Lambda for this step too. The task router understands information about the commit and evaluates requests: the commit message, for example, can carry the order in which you want to run your SQL files, or the routing information, i.e. you flag your commit if it has a big task, to route it towards a container instead of Lambda (the limitation here is Lambda's 15-minute execution time). There are two major types of long-running executions. ETL: COPYs, UNLOADs and CREATE TABLE ASs are usually done by an ETL / ELT tool (Glue, Matillion, Informatica). Housekeeping: VACUUM / ANALYZE. Some ETL tools are also capable of scheduling these operations. The Big Job deployer is entirely optional in most cases, depending on what other tools are already available in-house.
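The routing decision in note 5 can be sketched roughly as follows. This is only an illustration under assumptions: the `[big-job]` flag, the `run:` line and the `Route` / `route_commit` names are a hypothetical commit-message convention invented here, not the convention used in the talk.

```python
from dataclasses import dataclass, field


@dataclass
class Route:
    target: str                                   # "lambda" or "container"
    sql_files: list = field(default_factory=list)  # files in execution order


def route_commit(message: str) -> Route:
    """Decide where a commit should run, based on its commit message.

    Hypothetical convention (an assumption, not from the talk):
      - "[big-job]" anywhere in the message routes to a container,
        because the job may exceed Lambda's 15-minute limit.
      - a line "run: a.sql, b.sql" lists the SQL files in execution order.
    """
    target = "container" if "[big-job]" in message.lower() else "lambda"
    sql_files = []
    for line in message.splitlines():
        stripped = line.strip()
        if stripped.lower().startswith("run:"):
            names = stripped[len("run:"):].split(",")
            sql_files = [name.strip() for name in names if name.strip()]
    return Route(target, sql_files)


if __name__ == "__main__":
    print(route_commit("Add sales mart\nrun: 01_tables.sql, 02_views.sql"))
    print(route_commit("Nightly housekeeping [big-job]\nrun: vacuum.sql"))
```

Keeping the routing rule in the commit message means developers need no tool other than Git to steer a deployment, at the price of having to agree on and enforce the message format.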
  6. A few examples of possible use cases apart from normal development work. Anything which can commit a SQL file to a repository can utilize the framework. Automatic, controlled, central deployment of generated scripts forward-engineered from a database modelling tool, be it a full new schema deployment or incremental deltas to structure. No more “Oops” situations where someone has accidentally dropped a few views and broken a few others on Production instead of Dev, just because they started work before the third coffee. DBSchema, Aqua, Aginity, whatever your weapon of choice is. If the tool has Git integration, it will work seamlessly with the pipeline. If a database team reaches a higher capability maturity level and the company can justify purchasing more complicated and potentially pricey Database Release Management software, the SDLC might change again, but up until then…
  7. What we frequently see is that more and more customers want almost real-time visibility of their AWS costs. AWS provides a neat extract mechanism which dumps the billing data into an S3 bucket, hourly if required. But good guy AWS not only dumps the raw data, it also dumps the SQL commands and manifest files for loading the raw CSVs into Redshift. A trigger on the appropriate S3 put can start a function which picks up the event, makes minor changes to the loader SQL file (adding the Redshift target schema, for example), and commits the modified SQL to a repository monitored by a pipeline. Similar approaches can work very well even in certain Data Lake scenarios, or if you make the loader SQL and manifest part of your interface contract between systems, a deliverable shipped alongside the data.
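One possible shape for the "minor change" in note 7, as a sketch only: instead of rewriting table names inside the generated loader SQL, the forwarder could prepend a `SET search_path` statement so that unqualified tables resolve to the target schema. The function name `target_schema` and the choice of `search_path` are assumptions made for this example; the talk does not say how the schema is added.

```python
import re

# Conservative pattern for an unquoted SQL identifier.
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def target_schema(loader_sql: str, schema: str) -> str:
    """Return the loader SQL with a search_path statement prepended.

    The schema name is validated first, because it is interpolated into
    SQL text that will later be executed by the pipeline.
    """
    if not _IDENTIFIER.fullmatch(schema):
        raise ValueError(f"unsafe schema name: {schema!r}")
    return f"SET search_path TO {schema};\n{loader_sql}"


if __name__ == "__main__":
    # The COPY line below is a made-up placeholder, not real AWS output.
    print(target_schema("copy billing_lines from 's3://example-bucket/manifest';",
                        "aws_billing"))
```

The returned text is what the function would then commit to the repository watched by the pipeline.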
  8. TRIGGER: For a newly pushed commit, the following information is automatically forwarded to Lambda in the trigger event: who, when, which repo, which branch, and the commit ID. EXECUTOR: Most of the work is done by a Lambda function written in Python. Boto is an incredibly convenient and elegant tool for integrating AWS services. The function retrieves commit details and code from CodeCommit based on the commit ID. It retrieves additional config from DynamoDB, such as hooks for Slack or Teams, the Redshift host, and the target database and schema. It retrieves the appropriate secrets from Secrets Manager; you will need a naming convention in place, and a [repo-branch] combo works fine. It executes the code against the database. It initiates notifications: Slack hook, MS Teams hook, basically anything supporting curl / HTTP hooks, or email. The exact setup depends on the networking setup (including Lambda networking), client preferences, and existing messaging platform usage and integration capabilities. It logs everything in CloudWatch.
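As a rough illustration of the trigger step in note 8, the sketch below pulls the who / when / repo / branch / commit ID fields out of a CodeCommit trigger event and derives a secret name from the [repo-branch] convention. The event layout follows the sample CodeCommit event in the AWS Lambda documentation; the function name, the returned keys and the exact `repo-branch` string format are assumptions for this example.

```python
def parse_trigger_event(event: dict) -> dict:
    """Extract deployment metadata from a CodeCommit trigger event."""
    record = event["Records"][0]
    reference = record["codecommit"]["references"][0]

    # The repository name is the last segment of the event source ARN.
    repo = record["eventSourceARN"].split(":")[-1]

    # Branch refs look like "refs/heads/<branch>"; keep only the branch part.
    ref = reference["ref"]
    prefix = "refs/heads/"
    branch = ref[len(prefix):] if ref.startswith(prefix) else ref

    return {
        "who": record["userIdentityARN"],
        "when": record["eventTime"],
        "repo": repo,
        "branch": branch,
        "commit_id": reference["commit"],
        # Assumed naming convention for the Secrets Manager entry.
        "secret_name": f"{repo}-{branch}",
    }


if __name__ == "__main__":
    sample_event = {
        "Records": [{
            "eventSourceARN": "arn:aws:codecommit:eu-west-1:123456789012:dwh-marts",
            "eventTime": "2019-03-12T20:58:25.400+0000",
            "userIdentityARN": "arn:aws:iam::123456789012:user/developer",
            "codecommit": {"references": [
                {"commit": "5e493c6f3067653f3d04eca608b4901eb227078", "ref": "refs/heads/dev"}
            ]},
        }]
    }
    print(parse_trigger_event(sample_event))
```

In a real executor the next steps would presumably be boto3 calls such as `get_commit` on the CodeCommit client and `get_secret_value` on the Secrets Manager client, using the `commit_id` and `secret_name` values returned here.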
  9. It will be a change, especially for teams at the low end of the Capability Maturity Model, but a crucial change, and that is exactly the point! Improved code quality: a lot of tools try to differentiate themselves with automatic code review capability. In the real, complicated world things are not always simple enough to be codified; otherwise DBA work would not have to be black magic! And there are other options, such as the new Redshift Recommendations, or clever monitoring of certain STL and STV views, sometimes in combination with alerting on a Kibana dashboard. Skills: Pull Requests and protected branches give you four-eye checks. The console provides easy access to relatively advanced Git features, which is important because database development teams are traditionally a little behind in DevOps experience. Human-to-human knowledge transfer is built into the deployment process, which automatically encourages growth in team maturity, in individual developer skills, and in Redshift performance. Quality: many SQL statements can be scripted in an idempotent way, so many scripts will be re-runnable. Uptime: the main effect is much improved, undisturbed availability of data for end-user-facing BI tools. Breaking a Data Mart via an incorrect VIEW definition is now much harder. This leads to increased customer satisfaction and trust in the IT team. Overall, data security is enforced by automatic processes on every level, including auditing and traceability.
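The point about idempotent scripts in note 9 can be shown with a small, self-contained sketch. It uses Python's built-in `sqlite3` purely as a local stand-in so the example runs anywhere; it is not Redshift, and the table and view names are made up. The same pattern (`CREATE TABLE IF NOT EXISTS`, `DROP VIEW IF EXISTS` before re-creating a view) is available in Redshift SQL as well.

```python
import sqlite3

# Every statement is safe to run again on a database where it already ran.
IDEMPOTENT_SCRIPT = """
CREATE TABLE IF NOT EXISTS sales (id INTEGER PRIMARY KEY, amount REAL);
DROP VIEW IF EXISTS v_big_sales;
CREATE VIEW v_big_sales AS SELECT id, amount FROM sales WHERE amount > 100;
"""


def deploy(conn: sqlite3.Connection, script: str) -> None:
    """Run a multi-statement SQL script against the connection."""
    conn.executescript(script)


conn = sqlite3.connect(":memory:")
deploy(conn, IDEMPOTENT_SCRIPT)
deploy(conn, IDEMPOTENT_SCRIPT)  # re-running succeeds and leaves the same objects

objects = sorted(row[0] for row in conn.execute("SELECT name FROM sqlite_master"))
print(objects)
```

A script written this way can be pushed again after a partial failure or a retry without manual clean-up, which is what makes automatic redeployment from the pipeline practical.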
  10. Multi-layer security is present. VPC (this was yesterday; re:Invent happened while I was sleeping): a closed VPC with Service Endpoints wherever possible (an ENI or NAT setup might be required, but a seasoned SA should breeze through these). The executor Lambda runs in a closed VPC which has S3 and Secrets Manager endpoints, and Redshift enhanced VPC routing is on. CodeCommit has no VPC endpoints available yet, and in AWS China there is no CodeCommit. Companies with very strict security requirements, such as data (including code) not being allowed to travel on the open internet even if encrypted, still have choices: hosting Git on an EC2 instance within a closed VPC. Triggering the executors might then require manual setup of the hooks. IAM: IAM provides full lock-down capabilities at the infrastructure, service and resource levels, with bespoke execution / resource roles for Lambda and any other service, and bespoke CodeCommit users and groups for engineers and for the senior / approver group. Directory Service: works in federation with IAM for Single Sign-On console access, for example to facilitate Pull Request reviews and merges. The client controls access levels via AD groups. Redshift: an in-database user management framework with service users that the Lambda executors utilise, and pre-configured upstream user groups. Many of our clients DO NOT even have credentials for any Redshift user accounts with elevated privileges, such as schema owners or superusers.
  11. Reliability: Lambda scales horizontally, with an automatic burst of 500 – 3000 (bigger regions, such as Ireland). Scaling on CodeCommit and Secrets Manager is managed by AWS, just as on Fargate and ECS. Operational Excellence: deployment of CD pipelines via parameterized CloudFormation templates, infrastructure as code. Lambda: retry functionality and Dead Letter Queues, optionally AWS Step Functions for an extra layer of state management. CloudWatch and X-Ray. Notifications on functional DB code failures go to Dev teams via Slack / Teams notifications. Notifications and alerting on infra-level problems go to SysOps teams via CloudWatch and DLQ. Performance Efficiency: rightly sized Lambdas and rightly sized, configured containers for the infra. Power users use repositories which are connected to dedicated Redshift users with access to Superuser / dedicated WLM queues. DynamoDB: autoscaling might be overkill, depending on the size of the dev teams and the number of branches to manage, but the main thing is that the load is measurable and the functionality is there to auto-scale if required. Cost Optimization: the beauty is that this is practically free to run once you build it; the cost is insignificant, if there is any. Lambda, CodeCommit, triggering, CloudFormation: all the nice tools are available free of charge or very cheaply. There is a minimal cost associated with the “Big Job” deployer. The SQL code itself runs on the Redshift clusters!
  12. Also, not just for Redshift. I believe the power of the AWS ecosystem is evident: multiple cloud-native services working perfectly in concert to create an automated, event-driven, efficient, secure and scalable solution to a challenge. AWS is the perfect place for thinking “outside the box”.