SlideShare a Scribd company logo
1 of 42
Download to read offline
ANTI 

PATTERNS
DATA
Updated!
ines @ Engine Yard.com
@Randommood
And I’m a
happy dog!
I N E S
S O M B R A
I work with
Databases
I can get a little
ragey
sometimes
Disclaimer #1
I’m sorry.

!
Ask me to
slow down.
Disclaimer #2
ZOMG, the
horror!
.BACKUPS
yes, we are going there
“I know you.
You know
you. And I
know you
know that I
know you”
White Goodman

Boring Definition #1
Backups
Copy and archiving
of data
Goal is to restore
the state of a DB
Many types - blah
Anti-Pattern #1
Taking too
many
backups
Not free, they
requires resources
Full backup every
hour, really? 

What about backup
retention?
Anti-Pattern #2
Taking too
few
backups
Enough to minimize
the risk of data loss
due to corrupted
backup files
yes,
 this
 totes
 happens!
The untested backup
Anti-Pattern #3
Doing backups right
Logically
test
backups
Errorless restore
is not enough.
Test logical data
too
Doing backups right
Know your
types 
tools
Take logical and
binary backups
Continuous archiving
 hot backup utilities
Doing backups right
Practice
restores
Backups alone do not
constitute DR. Have a
plan  practice it
Server extensions and
configuration matter
when restoring
“I want a
ridiculously
good
looking
Database”
Derek Zoolander

(honestly, Ben Stiller rules)
Obvious statement #1
Many DB
choices
Cargo
culting your
database
Anti-Pattern #4
Failure to
understand use
case, strengths 
weaknesses of a
new database
RDBMS for
Session
Data
Anti-Pattern #5 Often means at least
one write per request
Any DB issue/task may
cause app to hang
Tables have a
tendency to bloat
Modeling, it’s all the same
Anti-Pattern #6
Data Model
Consistency needs
Availability needs
Scaling needs
Operational story 
cost
Doing it right
Know your
needs
Doing it right
Spike it,
forealsies
Spike it with your
data and traffic. 

Best way to gain
operational
experience
Doing it right
Leverage
new
features
Relational databases
are getting quite
versatile
Evaluate clustered
MySQL options
Are we
doing
ok?
We have a cloud deployment!
Happy team on shipping day, lmfao if you don’t celebrate like this
Cloud-based
databases,
they are real
Obvious statement #2
Databases can live in
the cloud quite well
Many IaaS, PaaS, 
DBaaS options
Easy to get started 
may be economical
Where did my instance go?
Anti-Pattern #7
Anti-Pattern #8
Cloud, it’s
just like
hardware
It’s not. Cloud
resources are
virtualized
Capacity planning
and monitoring
matter. A lot
Anti-Pattern #9
Shit doesn’t
happen
You are not
immune to
infrastructure
failures. 

Plan for it
Anti-Pattern #10
Storage is
the same
Instance storage is
not persisted (unless
you use EBS)
Data locality matters
Don’t run your cloud
DBs too hot!
Doing cloud right
Know your
cloud
deployments
Replication in the
cloud is a must-have
Put DB master 
replicas in different
AZs
Doing cloud right
Learn high
availability 
disaster
recovery
Get good at replica
promotions (some
work involved)
Understand and
invest in DR/HA.
Know your options
Doing cloud right
Know your
system
Invest in monitoring
Know your data
distribution 
querying patterns
Know baseline
behavior
And there’s
more!
Boring Definition #2
Indexes

(or indices, I prefer indexes)
Improves speed of
data retrieval
Used in random 
ordered lookups
Imply additional
writes  storage
Anti-Pattern #12
Too few
Indexes
Room for query
optimization 
increased speed
Analyze, slow logs 
monitoring tools are
your friends
Anti-Pattern #13
Too many
Indexes
They are not free.
Your DB maintains
them.
Too many will
impact your write
throughput.
Doing Indexes right
Many index
types
Many types. Learn
how your DB does
them.
You want the right
amount.
Doing Indexes right
Postgres
Indexes
summarized
B-TR EE S 

D E FAULT. NU MER IC, 

TE XT, NU LL
HA SH 

EQUA L I T Y. D ON’ T USE
G IN 

A RRAY VA LUE S  F TS
G I ST 

GEOME TRIC DATA  F TS
They Can Be
Created
Concurrently!

More Related Content

Similar to Data antipatterns NYC Devops - 2014

scale_perf_best_practices
scale_perf_best_practicesscale_perf_best_practices
scale_perf_best_practices
webuploader
 
Database Maintenance Optimization Brad Mc Gehee
Database Maintenance Optimization   Brad Mc GeheeDatabase Maintenance Optimization   Brad Mc Gehee
Database Maintenance Optimization Brad Mc Gehee
Pratik joshi
 

Similar to Data antipatterns NYC Devops - 2014 (20)

Nuts and bolts
Nuts and boltsNuts and bolts
Nuts and bolts
 
Karen Lopez 10 Physical Data Modeling Blunders
Karen Lopez 10 Physical Data Modeling BlundersKaren Lopez 10 Physical Data Modeling Blunders
Karen Lopez 10 Physical Data Modeling Blunders
 
Daniel Abadi: VLDB 2009 Panel
Daniel Abadi: VLDB 2009 PanelDaniel Abadi: VLDB 2009 Panel
Daniel Abadi: VLDB 2009 Panel
 
DataFirst approach to coding
DataFirst approach to codingDataFirst approach to coding
DataFirst approach to coding
 
Apache Con 2008 Top 10 Mistakes
Apache Con 2008 Top 10 MistakesApache Con 2008 Top 10 Mistakes
Apache Con 2008 Top 10 Mistakes
 
PhpTek Ten Things to do to make your MySQL servers Happier and Healthier
PhpTek Ten Things to do to make your MySQL servers Happier and HealthierPhpTek Ten Things to do to make your MySQL servers Happier and Healthier
PhpTek Ten Things to do to make your MySQL servers Happier and Healthier
 
Top 10 Scalability Mistakes
Top 10 Scalability MistakesTop 10 Scalability Mistakes
Top 10 Scalability Mistakes
 
MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...
MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...
MongoDB World 2019: Finding the Right MongoDB Atlas Cluster Size: Does This I...
 
Data science unit2
Data science unit2Data science unit2
Data science unit2
 
data science chapter-4,5,6
data science chapter-4,5,6data science chapter-4,5,6
data science chapter-4,5,6
 
Introduction to Redis and its features.pptx
Introduction to Redis and its features.pptxIntroduction to Redis and its features.pptx
Introduction to Redis and its features.pptx
 
scale_perf_best_practices
scale_perf_best_practicesscale_perf_best_practices
scale_perf_best_practices
 
2014 nicta-reproducibility
2014 nicta-reproducibility2014 nicta-reproducibility
2014 nicta-reproducibility
 
Database Maintenance Optimization Brad Mc Gehee
Database Maintenance Optimization   Brad Mc GeheeDatabase Maintenance Optimization   Brad Mc Gehee
Database Maintenance Optimization Brad Mc Gehee
 
BigQuery is not a Data Strategy.pdf
BigQuery is not a Data Strategy.pdfBigQuery is not a Data Strategy.pdf
BigQuery is not a Data Strategy.pdf
 
Ledingkart Meetup #4: Data pipeline @ lk
Ledingkart Meetup #4: Data pipeline @ lkLedingkart Meetup #4: Data pipeline @ lk
Ledingkart Meetup #4: Data pipeline @ lk
 
IS L03 - Database Management
IS L03 - Database ManagementIS L03 - Database Management
IS L03 - Database Management
 
Nevera Dul Moment
Nevera Dul MomentNevera Dul Moment
Nevera Dul Moment
 
What is causing my database to grow?
What is causing my database to grow?What is causing my database to grow?
What is causing my database to grow?
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 

More from Ines Sombra

More from Ines Sombra (16)

Architectural Patterns of Resilient Distributed Systems
 Architectural Patterns of Resilient Distributed Systems Architectural Patterns of Resilient Distributed Systems
Architectural Patterns of Resilient Distributed Systems
 
We hear you like papers
We hear you like papersWe hear you like papers
We hear you like papers
 
Testing & Integration (The Remix)
 Testing & Integration (The Remix) Testing & Integration (The Remix)
Testing & Integration (The Remix)
 
From 0 to Capacity Planning
From 0 to Capacity PlanningFrom 0 to Capacity Planning
From 0 to Capacity Planning
 
Agile, Rugged, and Lean - The Paper Edition
Agile, Rugged, and Lean - The Paper EditionAgile, Rugged, and Lean - The Paper Edition
Agile, Rugged, and Lean - The Paper Edition
 
Computational Patterns of the Cloud - QCon NYC 2014
Computational Patterns of the Cloud - QCon NYC 2014Computational Patterns of the Cloud - QCon NYC 2014
Computational Patterns of the Cloud - QCon NYC 2014
 
How the Cloud is changing the world
How the Cloud is changing the worldHow the Cloud is changing the world
How the Cloud is changing the world
 
NoSQL Databases in the Cloud - Great Wide Open 2014
NoSQL Databases in the Cloud - Great Wide Open 2014NoSQL Databases in the Cloud - Great Wide Open 2014
NoSQL Databases in the Cloud - Great Wide Open 2014
 
Relational Databases in the Cloud - Great Wide Open 2014
Relational Databases in the Cloud - Great Wide Open 2014Relational Databases in the Cloud - Great Wide Open 2014
Relational Databases in the Cloud - Great Wide Open 2014
 
Hello data
Hello dataHello data
Hello data
 
Ricon east
Ricon eastRicon east
Ricon east
 
PgPyDay
PgPyDayPgPyDay
PgPyDay
 
Getting started with Riak in the Cloud
Getting started with Riak in the CloudGetting started with Riak in the Cloud
Getting started with Riak in the Cloud
 
Riak at Engine Yard Cloud
Riak at Engine Yard CloudRiak at Engine Yard Cloud
Riak at Engine Yard Cloud
 
Postgres Open
Postgres OpenPostgres Open
Postgres Open
 
North Bay Ruby Meetup 101911
North Bay Ruby Meetup 101911North Bay Ruby Meetup 101911
North Bay Ruby Meetup 101911
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Data antipatterns NYC Devops - 2014