4. Agenda
• The sizing scenarios/objective
• The general sizing workflow
– Extract
– Visualize
– Model
– Project
• Putting it all together: Real Sizing Scenarios
www.enkitec.com 4
6. The sizing scenarios/objective
• Consolidation, HW refresh, platform migration
– How many can fit?
– Can I combine A + B + ½ of C?
– What's the ideal hardware to buy? ("right sizing")
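The "Can I combine A + B + ½ of C?" question boils down to arithmetic against a box's CPU budget. A minimal sketch in Python; the instance names, peak CPU numbers, and the 75% headroom threshold are illustrative assumptions, not figures from the deck:

```python
# Hypothetical peak CPU demand per workload, in CPU seconds per hour
# (in practice these come from the extracted AWR workload data).
peak_cpu_sec_per_hr = {"A": 40000, "B": 25000, "C": 30000}

def capacity_sec_per_hr(cores):
    """One core supplies 3600 CPU seconds per hour."""
    return cores * 3600

def fits(workloads, cores, max_util=0.75):
    """Return (utilization, fits?) for combined workloads on one box,
    keeping utilization under a chosen headroom threshold."""
    demand = sum(peak_cpu_sec_per_hr[w] * share for w, share in workloads)
    util = demand / capacity_sec_per_hr(cores)
    return util, util <= max_util

# Can I combine A + B + half of C on a 32-core node?
util, ok = fits([("A", 1.0), ("B", 1.0), ("C", 0.5)], cores=32)
print(f"utilization: {util:.0%}, fits: {ok}")
```

The same check generalizes to "how many can fit?": keep adding workload shares until the headroom threshold is breached.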
7. The sizing workflow
– Extract
• Workload data
– Visualize
• Consolidated peak workload
– Model
• Provisioning plan
– Project
• “Headroom”
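The Project step ("Headroom") can be sketched as a trend projection: fit a line to historical peak utilization and ask when it crosses a utilization ceiling. A hedged Python sketch; the utilization series and the 75% ceiling are invented for illustration, not data from the deck:

```python
def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
            sum((x - mx) ** 2 for x in xs)
    return slope, my - slope * mx

# Month index vs. peak CPU utilization (fraction of capacity) -- sample data
months = [0, 1, 2, 3, 4, 5]
util   = [0.40, 0.43, 0.47, 0.50, 0.54, 0.57]

slope, intercept = linear_fit(months, util)

ceiling = 0.75  # assumed utilization level where response time degrades
months_left = (ceiling - (intercept + slope * months[-1])) / slope
print(f"~{months_left:.1f} months of headroom left at the current growth rate")
```

A straight-line fit is the simplest possible model; seasonal or step-change workloads would need something richer, but the headroom question stays the same.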
24. Summary
• The sizing scenarios/objective
• The 4 points of the sizing workflow
25. References
• Where did my CPU go? (webinar) http://www.youtube.com/watch?v=WXktSUbE4AU (paper) http://goo.gl/qP1xqr
• Book: Computer Architecture: A Quantitative Approach, 5th Ed., Chapter 1, Section 1.10 "Putting It All Together: Performance, Price and Power" http://goo.gl/MXigAQ
• Book: The Art of Scalability, Ch. 11 "Headroom" http://theartofscalability.com
• Viz Example: CPU sizing 15 vs 60 mins snap interval http://goo.gl/rOJ9M4
• Viz Example: Different views of IO performance http://goo.gl/i660CZ
• Exadata Provisioning Worksheet http://www.slideshare.net/karlarao/paperkaraoconsolidation-successstory
karl.arao@enkitec.com
karlarao.wordpress.com
karlarao.tiddlyspot.com
@karlarao
Editor's Notes
Outline:
Ultimate Exadata IO monitoring – Flash, HardDisk , & Write back cache overhead http://www.kylehailey.com/oaktable-world/agenda/
I’ll do a session highlighting a very write-intensive OLTP Exadata environment. I’ll discuss the different ways to monitor IO from the database and storage-layer perspectives, and correlate it back to the application by mining the dba_hist_sqlstat data. I’ll also touch on utilizing the OEM12c Metric Extensions and BI Publisher integration to ultimately scale the monitoring to a bunch of Exadata environments. It’s going to be a fun hacking session.
> discuss the capacity doodle
> the variables
> monitoring
> the reclaim
> highlight issue on very write intensive OLTP environment
> monitoring problem
on OEM perf page > show IO perf page not accounting the flash IOs
** partly because some people in the team have access to only limited view of things
** or they have difficulty interpreting the numbers, they need simple stuff
on OEM12c storage grid perf > although 12c has Exadata IO monitoring,
I'd like to get the IOPS numbers separated by flash and disk
> wbfc patent
> write back cache http://goo.gl/2WCmw
> exadata oltp optimizations
> discuss about the basic architecture
> discuss different ways to monitor IO (email to randy) http://goo.gl/i660CZ
Different views of IO performance
SECTION 1: USER IO wait class and cell single block reads latency with curve fitting
SECTION 2: Small IOPS vs Large IOPS
SECTION 3: Flash vs HD IOPS
SECTION 4: Flash vs HD IOPS with read/write breakdown
SECTION 5: IO throughput read/write MB/s
SECTION 6: Drill down on smart scans affecting cell single block latency on 24hour period
> IO workload correlate up to the topevents and sqlstat data
> causal links - produce analysis which relates database load to application processing, creating a strong front-to-back understanding as an enabler to ‘fix’
> feedback loop on what is working and what is not
> track IO config changes - IORM (topevents data)
> basic, auto, low latency... and when it is applicable
> scaling it!
> metrics extension
> BIP
> show data model
> email everyday
Just a brief introduction of myself..
And this is what the tar files look like: just a simple CSV output of AWR data
And what makes Tableau really interesting is that it automatically creates “dimensions” out of those CSV files
My objective in this image is to quickly see the CPU utilization if I combine particular instances. I can do that by just pulling the Total Oracle CPU seconds metric onto the graph; that’s the boxed line chart at the bottom, which is the sum of Total Oracle CPU seconds of the instances selected on the right-hand side of the graph.
So let’s say I want to consolidate the 3 instances on a single 24-core compute node (24 cores x 3600 seconds = 86,400 CPU seconds of capacity per hour). From the workload trend I can tell that they will fit on that box, and the highest CPU utilization I expect is about 69% (60,000 / 86,400).
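That calculation written out directly, where 60,000 is the peak combined "Total Oracle CPU seconds" per hour read off the chart in the note above:

```python
cores = 24
capacity = cores * 3600          # 86,400 CPU seconds available per hour
peak_demand = 60000              # peak combined CPU seconds per hour

utilization = peak_demand / capacity
print(f"{utilization:.0%}")      # prints "69%"
```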
And you can also right click on this and do a “View Data”
So how it works is: whatever SNAP_IDs of the selected instances fall within a specific hour dimension get summed. The tool therefore automatically takes care of the databases’ differing snap intervals, which is tedious to do manually.
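What the tool automates can be sketched roughly as an hour-bucket aggregation. This is an assumption about the mechanics, not Tableau's actual implementation, and the sample rows and column layout are invented rather than the real AWR CSV format:

```python
# Snapshots from instances with different snap intervals (15-minute vs
# 60-minute here) are bucketed into a common hour dimension and summed.
from collections import defaultdict
from datetime import datetime

rows = [
    # (instance, snapshot end time, CPU seconds in that snap) -- sample data
    ("A", "2013-06-01 10:15", 3000),   # instance A: 15-minute snaps
    ("A", "2013-06-01 10:30", 3200),
    ("A", "2013-06-01 10:45", 2900),
    ("A", "2013-06-01 11:00", 3100),
    ("B", "2013-06-01 11:00", 9500),   # instance B: 60-minute snaps
]

hourly = defaultdict(int)
for instance, snap_end, cpu_sec in rows:
    # Snapshots landing in the same hour get summed, regardless of interval.
    hour = datetime.strptime(snap_end, "%Y-%m-%d %H:%M").replace(minute=0)
    hourly[hour] += cpu_sec

for hour, total in sorted(hourly.items()):
    print(hour, total)
```

The point is that once everything lands in a shared hour dimension, instances with 15-minute and 60-minute snap intervals can be summed together without manual alignment.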