Best Practice for Designing and Implementing MySQL Geographic Distributed, High Availability Solutions
Marco Tusa
Percona
About Me
• Open source enthusiast
• MySQL tech lead; principal architect
• Working in the DB/development world for over 33 years (yes, I am that old)
• Open source developer and community contributor
Why Are We Talking About Geographic Distribution?
The need to have geographic distribution for…
• Disaster Recovery
• Data close to consumer/user
• Security by regulations/laws
What is Geographic Distribution?
● What is geographic distribution?
○ Near → High Availability
○ Far → Disaster Recovery
What is HA and What is DR?
• Geographic distribution in High Availability covers service availability within a location
○ 10 Gb Ethernet, best-case scenario: 400 metre maximum distance
• Geographic distribution in Disaster Recovery assures that we can restore service in a geographically distributed location
○ Real speed may vary
○ Linear distance ~1,000 km
The Impact of Distance on Geographic Distribution
A Real-Life Example
I have worked on many cases where a customer had two data centers (DCs).
I will use information from one of those cases as an example.
The DCs were approximately 400 km apart, connected by a “fiber channel”.
Server 1 and Server 2 were hosted in the same DC, while Server 3 was in the secondary DC.
The ping to Server 3 was ~3 ms. Not bad at all, right?
We decided to perform some serious tests, running multiple sets of tests with netperf over many days and collecting data. We also used the data to perform additional fine-tuning on the TCP/IP layer and at the network provider.
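The kind of measurement described above can be approximated with a few lines of Python. This is only a sketch and not a replacement for netperf: it times how long a payload of a given size takes to be sent and acknowledged over TCP. The helper names (`serve_acks`, `measure`, `run_local_demo`) and the length-prefixed protocol are assumptions made for illustration, and the demo runs against a loopback peer, so the numbers it prints say nothing about a real WAN link.

```python
import socket
import struct
import threading
import time


def recv_exactly(sock, size):
    """Read exactly `size` bytes, or raise if the peer closes early."""
    buf = bytearray()
    while len(buf) < size:
        chunk = sock.recv(min(size - len(buf), 65536))
        if not chunk:
            raise ConnectionError("peer closed the connection early")
        buf.extend(chunk)
    return bytes(buf)


def serve_acks(listener):
    """Hypothetical test peer: read length-prefixed payloads, ack each with 1 byte."""
    conn, _ = listener.accept()
    with conn:
        while True:
            try:
                (size,) = struct.unpack("!I", recv_exactly(conn, 4))
                recv_exactly(conn, size)
            except ConnectionError:
                break  # client finished and closed the connection
            conn.sendall(b"\x01")


def measure(host, port, payload_sizes, repeats=5):
    """Return {payload_size: [elapsed_ms, ...]} for send-then-ack exchanges."""
    results = {}
    with socket.create_connection((host, port)) as sock:
        sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)
        for size in payload_sizes:
            message = struct.pack("!I", size) + b"x" * size
            timings = []
            for _ in range(repeats):
                start = time.perf_counter()
                sock.sendall(message)
                recv_exactly(sock, 1)  # wait for the 1-byte acknowledgement
                timings.append((time.perf_counter() - start) * 1000.0)
            results[size] = timings
    return results


def run_local_demo(payload_sizes=(48 * 1024, 512 * 1024), repeats=5):
    """Start the test peer on loopback and measure against it."""
    listener = socket.create_server(("127.0.0.1", 0))
    port = listener.getsockname()[1]
    server = threading.Thread(target=serve_acks, args=(listener,), daemon=True)
    server.start()
    try:
        return measure("127.0.0.1", port, payload_sizes, repeats)
    finally:
        server.join(timeout=5)
        listener.close()


if __name__ == "__main__":
    for size, timings in run_local_demo().items():
        print(f"{size // 1024:>4} KB  best={min(timings):.3f} ms  worst={max(timings):.3f} ms")
```

To measure a real link, `serve_acks` would run on the remote server and `measure` would be pointed at its address. Collecting the worst case as well as the best case matters, because the spikes are what hurt a tightly coupled cluster.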
Observations
A latency of 37 ms is not very high. If that had been the upper limit, it would have worked.
But it was not.
Even with the optimized channel, fiber and so on, when the tests hit heavy traffic, the congestion was enough to compromise the data transmitted.
Latency to Server 3 exceeded 200 ms. Note that those were spikes, but with a tightly coupled database cluster, such events can become failures in applying the data and can create a lot of instability.
Facts About Server 3
The connection between the two DCs was fiber.
Distance: ~400 km (~800 km); we need to double it because of the round trip, since we also receive packets.
Theoretical time at light speed = 2.66 ms (both ways)
Ping = 3.10 ms (signal traveling at ~80% of light speed), as if the signal had traveled ~930 km (the full round trip is 800 km)
TCP/IP best at 48K = 4.27 ms (~62% of light speed), as if the signal had traveled ~1,281 km
TCP/IP best at 512K = 37.25 ms (~2.6% of light speed), as if the signal had traveled ~11,175 km
Given the above, we lose from ~20%-~40% up to ~97% of the theoretical transmission rate.
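The arithmetic behind these figures can be sketched as follows. The function names are illustrative. The only inputs are the 800 km round trip and the latencies quoted above, and the speed of light in vacuum is used as the reference. The ratios are computed directly from those latencies, so they may not match the rounded percentages quoted above exactly.

```python
SPEED_OF_LIGHT_KM_S = 299_792.458  # speed of light in vacuum


def theoretical_round_trip_ms(round_trip_km):
    """Time light needs to cover the full round trip, in milliseconds."""
    return round_trip_km / SPEED_OF_LIGHT_KM_S * 1000.0


def equivalent_distance_km(latency_ms):
    """Distance light would cover in the measured time."""
    return latency_ms / 1000.0 * SPEED_OF_LIGHT_KM_S


def fraction_of_light_speed(round_trip_km, latency_ms):
    """Effective signal speed as a fraction of the speed of light."""
    return round_trip_km / equivalent_distance_km(latency_ms)


if __name__ == "__main__":
    round_trip_km = 800.0  # ~400 km each way
    print(f"theoretical: {theoretical_round_trip_ms(round_trip_km):.2f} ms")
    measurements = {"ping": 3.10, "TCP/IP 48K": 4.27, "TCP/IP 512K": 37.25}
    for label, latency_ms in measurements.items():
        km = equivalent_distance_km(latency_ms)
        pct = fraction_of_light_speed(round_trip_km, latency_ms) * 100.0
        print(f"{label:<12} {latency_ms:>6.2f} ms  ~{km:,.0f} km  {pct:.1f}% of light speed")
```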
TCP Encapsulation
Max transportable payload: 1500 (MTU) – IP header – TCP header = 1500 – ~40 = 1460 bytes
TCP Sliding Window
Some Numbers
• With 8KB we need 6 IP Frames
• With 40KB we need 28 IP Frames
• With 387KB we need 271 IP Frames
• With 1MB we need 718 IP Frames
• With 4MB we need ~2,800 IP Frames
All this assumes we use the full TCP capacity
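A minimal sketch of how such frame counts can be derived, assuming a 1500-byte MTU, ~40 bytes of IP and TCP headers, and 1 KB = 1024 bytes (the function name is illustrative). Rounding up to whole frames gives counts close to the figures above, which appear to be rounded.

```python
import math

MTU_BYTES = 1500
HEADER_BYTES = 40  # assumed IP header + TCP header, without options


def frames_needed(payload_bytes, mtu=MTU_BYTES, headers=HEADER_BYTES):
    """Number of full-size frames needed to carry the payload."""
    per_frame = mtu - headers  # 1460 bytes of payload per frame
    return math.ceil(payload_bytes / per_frame)


if __name__ == "__main__":
    kb = 1024
    for label, size in [("8KB", 8 * kb), ("40KB", 40 * kb), ("387KB", 387 * kb),
                        ("1MB", 1024 * kb), ("4MB", 4096 * kb)]:
        print(f"{label:>6}: {frames_needed(size)} frames")
```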
Which Solutions Are Available?
2 Models
Tightly coupled database clusters
• Data-centric approach (single state of the data, distributed commit)
• Data is consistent in time across nodes
• Replication requires a high-performance link
• Geographic distribution is forbidden
• DR is not supported
Loosely coupled database clusters
• Single node approach (local commit)
• Data state differs by node
• Single node state does not affect the cluster
• Replication link doesn’t need to be high performance
• Geographic distribution is allowed
• DR is supported
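The difference between the two models can be illustrated with a deliberately simplified latency model. It assumes that a tightly coupled commit waits for the slowest peer to acknowledge, while a loosely coupled commit completes locally and replicates afterwards. Real implementations differ in the details, and the 1 ms local commit cost and 0.2 ms same-DC round trip are hypothetical values; only the 37.25 ms and 200 ms figures come from the example above.

```python
def commit_latency_ms(local_commit_ms, peer_round_trips_ms, tightly_coupled):
    """Estimated latency of one commit under the simplified model."""
    if tightly_coupled and peer_round_trips_ms:
        # Assumption: the commit waits for the slowest peer to acknowledge.
        return local_commit_ms + max(peer_round_trips_ms)
    # Loosely coupled: commit locally, replicate asynchronously afterwards.
    return local_commit_ms


def max_serial_commits_per_second(latency_ms):
    """Upper bound on back-to-back commits issued by a single connection."""
    return 1000.0 / latency_ms


if __name__ == "__main__":
    local_ms = 1.0  # hypothetical local commit cost
    scenarios = {
        "all nodes in one DC": [0.2, 0.2],
        "remote node, typical": [0.2, 37.25],
        "remote node, spike": [0.2, 200.0],
    }
    for label, round_trips in scenarios.items():
        tight = commit_latency_ms(local_ms, round_trips, tightly_coupled=True)
        loose = commit_latency_ms(local_ms, round_trips, tightly_coupled=False)
        print(f"{label:<22} tight={tight:7.2f} ms "
              f"({max_serial_commits_per_second(tight):6.1f}/s)  loose={loose:.2f} ms")
```

Under these assumptions a single latency spike on the remote link stalls every commit in the tightly coupled cluster, whereas the loosely coupled node keeps committing and only its replication lag grows.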
Replicate Data is the Key - Sync vs Async
• Tightly coupled: 1 data state
• Loosely coupled: 3 different data states
Which Implementations?: Primary - Secondary
Replication by internal design
Positive things:
• Each node is independent
• Read scale
• Low network impact
Negative things:
• Stale reads
• Low HA
• Consistent only in the Primary
• Each node has its own data state
Loosely coupled
Which Implementations?: PXC
Replication by internal design
Positive things:
• Highly available
• Read scale
• Data is almost in sync all of the time
Negative things:
• Doesn’t scale writes
• More nodes = more internal overhead
• The network has a very large impact
• 1 Writer only (PLEASE!!!!!)
Tightly coupled
Which Implementations?: PS – Group Replication
Replication by standard MySQL
Positive things:
• Highly available
• Read scale
• Data is almost in sync all the time
• More network tolerant
Negative things:
• Doesn’t scale writes
• 1 Writer only (PLEASE!!!!!)
Tightly coupled
Implementations by HA Level and Tight/Loose Relation
Architectures: What Should NOT Be Done 1
• Locations: London West, London East, Frankfurt
• Nodes: Node 1, Node 2, Node 3, Slave
• Link legend: Sync high perf link; Sync Internet link; Async Internet link; Sync high perf Internet link
Architectures: What Should NOT Be Done 2
• Locations: London West, London East, Frankfurt
• Nodes: Node 1, Node 2, Node 3, Secondary, Slave
• Link legend: Sync high perf link; Sync Internet link; Async Internet link; Sync high perf Internet link
Architectures: What Can Be Done
• Locations: London West, London West (400 m limit), Frankfurt
• Nodes: Node 1, Node 2, Node 3, S-Node1, S-Node2, S-Node3, Slave
• Link legend: Sync high perf link; Sync Internet link; Async Internet link; Sync high perf Internet link
Architectures: What You Should Do
• Locations: London, Frankfurt
• Nodes: Node 1, Node 2, Node 3, S-Node1, S-Node2, S-Node3, Slave
• Link legend: Sync high perf link; Sync Internet link; Async Internet link; Sync high perf Internet link
Conclusion - a Healthy Solution
Must have a business continuity plan and cover at least:
● HA
● DR (RTO)
● Backup/restore (RPO)
● Load distribution
● Correct monitoring/alerting
Conclusion - a Healthy Solution*
*I am showing PXC/Galera replication for convenience, but it could also be Percona Server with Group Replication
Conclusion - a Healthy Solution
Must have a business continuity plan and cover at least:
● HA → (PXC OR PS-GR)
● DR → (PXC OR PS-GR) with Asynchronous replication and RMP
(replication manager for PXC/PS-GR)
● Backup/restore → Backup/restore policy
● Load distribution → ProxySQL with Query rules
● Correct monitoring/alerting → Percona Monitoring and
Management (PMM)
Useful References
● https://www.percona.com/blog/2018/11/15/mysql-high-availability-on-premises-a-geographically-distributed-scenario/
● https://dev.mysql.com/doc/mysql-ha-scalability/en/ha-overview.html
● https://www.percona.com/blog/2014/11/17/typical-misconceptions-on-galera-for-mysql/
● http://galeracluster.com/documentation-webpages/limitations.html
● http://tusacentral.net/joomla/index.php/mysql-blogs/170-geographic-replication-and-quorum-calculation-in-mysqlgalera.html
● http://tusacentral.net/joomla/index.php/mysql-blogs/167-geographic-replication-with-mysql-and-galera.html
● http://tusacentral.net/joomla/index.php/mysql-blogs/164-effective-way-to-check-the-network-connection-when-in-need-of-a-geographic-distribution-replication-.html
● http://tusacentral.net/joomla/index.php/mysql-blogs/183-proxysql-percona-cluster-galera-integration.html
● https://github.com/sysown/proxysql/wiki
● https://www.percona.com/blog/2018/11/15/how-not-to-do-mysql-high-availability-geographic-node-distribution-with-galera-based-replication-misuse/
● https://github.com/y-trudeau/Mysql-tools/tree/master/PXC
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 

Último (20)

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 

Best practice-high availability-solution-geo-distributed-final

  • 1. Best Practice for Designing and Implementing MySQL Geographic Distributed, High Availability Solutions Marco Tusa Percona
  • 2. About Me • Open source enthusiast • MySQL tech lead; principal architect • Working in the DB/development world for over 33 years (yes, I am that old) • Open source developer and community contributor
  • 7. Why Are We Talking About Geographic Distribution? The need to have geographic distribution for… • Disaster Recovery • Data close to consumer/user • Security by regulations/laws
  • 8. What is Geographic Distribution? ○ Near → High Availability ○ Far → Disaster Recovery
  • 9. What is HA and What is DR? • Geographic distribution in High Availability covers service availability within a location ○ 10 Gb Ethernet in the best-case scenario ○ 400 metre distance max
  • 10. What is HA and What is DR? • Geographic distribution in Disaster Recovery ensures we can restore service in a geographically separate location ○ Real speed may vary ○ Linear distance ~1,000 km
  • 11. The Impact of Distance on Geographic Distribution
  • 12. A Real-Life Example I worked on many cases where a customer had two data centers (DCs). I will use information from one of those cases as an example. The DCs were approximately 400 km apart, connected with a “fiber channel”. Server 1 and Server 2 were hosted in the same DC, while Server 3 was in the secondary DC. Their ping to Server 3 was ~3 ms. Not bad at all, right? We decided to perform some serious tests, running multiple sets of tests with netperf over many days and collecting data. We also used the data to perform additional fine-tuning on the TCP/IP layer and at the network provider.
  • 14. Observations 37 ms latency is not very high. If that had been the upper limit, it would have worked. But it was not. Even with the optimized channel (fiber and so on), when the tests were hitting heavy traffic, congestion was enough to compromise the data transmitted. Latency to Server 3 went above 200 ms. Note that those were spikes, but in a tightly coupled database cluster such events can become failures in applying the data and create a lot of instability.
  • 15. Facts About Server 3 The connection between the two DCs was fiber. Distance: ~400 km (~800 km round trip; we need to double it because we also receive packets). Theoretical time at light speed = 2.66 ms (both ways). Ping = 3.10 ms (signal traveling at ~80% of light speed), as if the signal had traveled ~930 km (the full round trip is 800 km). TCP/IP best at 48K = 4.27 ms (~62% of light speed), as if the signal had traveled ~1,281 km. TCP/IP best at 512K = 37.25 ms (~2.6% of light speed), as if the signal had traveled ~11,175 km. Given the above, we lose from ~20%-~40% up to ~97% of the theoretical transmission rate.
  • 16. TCP Encapsulation Max transportable payload: 1500 (MTU) − IP header − TCP header = 1500 − ~40 = ~1460 bytes
  • 18. Some Numbers • With 8KB we need 6 IP Frames • With 40KB we need 28 IP Frames • With 387KB we need 271 IP Frames • With 1MB we need 718 IP Frames • With 4MB we need ~2,800 Frames All this if we use the full TCP capacity
  • 19. Which Solutions Are Available? 2 Models. Tightly coupled database clusters: • Data-centric approach (single state of the data, distributed commit) • Data is consistent in time across nodes • Replication requires a high-performance link • Geographic distribution is forbidden • DR is not supported. Loosely coupled database clusters: • Single-node approach (local commit) • Data state differs by node • Single-node state does not affect the cluster • Replication link doesn’t need to be high performance • Geographic distribution is allowed • DR is supported
  • 20. Replicating Data is the Key - Sync vs Async. Tightly coupled: 1 data state. Loosely coupled: 3 different data states.
  • 21. Which Implementations?: Primary - Secondary (loosely coupled) Replication by internal design Positive things: • Each node is independent • Read scale • Low network impact Negative things: • Stale reads • Low HA • Consistent only in the Primary • Each node has its own data state
  • 22. Which Implementations?: PXC (tightly coupled) Replication by internal design Positive things: • Highly available • Read scale • Data is almost in sync all of the time Negative things: • Doesn’t scale writes • More nodes = more internal overhead • Network is a very impacting factor • 1 Writer only (PLEASE!!!!!)
  • 23. Which Implementations?: PS – Group Replication (tightly coupled) Replication by standard MySQL Positive things: • Highly available • Read scale • Data is almost in sync all of the time • More network tolerant Negative things: • Doesn’t scale writes • 1 Writer only (PLEASE!!!!!)
  • 24. Implementations by HA Level and Tight/Loose Relation
  • 26. Architectures: What Should NOT Be Done 1. Diagram labels: London West, London East, Frankfurt; Node 1, Node 2, Node 3. Link legend: Sync high perf link; Sync internet link; Async internet link; Sync high perf internet link
  • 28. Architectures: What Should NOT Be Done 2. Diagram labels: London West, London East, Frankfurt; Node 1, Node 2, Node 3; Slave, Secondary. Link legend: Sync high perf link; Sync internet link; Async internet link; Sync high perf internet link
  • 30. Architectures: What Can Be Done. Diagram labels: London West, London West (400 m limit), Frankfurt; Node 1, Node 2, Node 3; S-Node1, S-Node2, S-Node3; Slave. Link legend: Sync high perf link; Sync internet link; Async internet link; Sync high perf internet link
  • 32. Architectures: What You Should Do. Diagram labels: London, Frankfurt; Node 1, Node 2, Node 3; S-Node1, S-Node2, S-Node3; Slave. Link legend: Sync high perf link; Sync internet link; Async internet link; Sync high perf internet link
  • 33. Conclusion - a Healthy Solution Must have a business continuity plan and cover at least: ● HA ● DR (RTO) ● Backup/restore (RPO) ● Load distribution ● Correct monitoring/alerting
  • 34. Conclusion - a Healthy Solution* *I am showing PXC/Galera replication for convenience, but it could also be Percona Server with Group Replication
  • 35. Conclusion - a Healthy Solution Must have a business continuity plan and cover at least: ● HA → (PXC OR PS-GR) ● DR → (PXC OR PS-GR) with Asynchronous replication and RMP (replication manager for PXC/PS-GR) ● Backup/restore → Backup/restore policy ● Load distribution → ProxySQL with Query rules ● Correct monitoring/alerting → Percona Monitoring and Management (PMM)
  • 36. Useful References ● https://www.percona.com/blog/2018/11/15/mysql-high-availability-on-premises-a-geographically-distributed-scenario/ ● https://dev.mysql.com/doc/mysql-ha-scalability/en/ha-overview.html ● https://www.percona.com/blog/2014/11/17/typical-misconceptions-on-galera-for-mysql/ ● http://galeracluster.com/documentation-webpages/limitations.html ● http://tusacentral.net/joomla/index.php/mysql-blogs/170-geographic-replication-and-quorum-calculation-in-mysqlgalera.html ● http://tusacentral.net/joomla/index.php/mysql-blogs/167-geographic-replication-with-mysql-and-galera.html ● http://tusacentral.net/joomla/index.php/mysql-blogs/164-effective-way-to-check-the-network-connection-when-in-need-of-a-geographic-distribution-replication-.html ● http://tusacentral.net/joomla/index.php/mysql-blogs/183-proxysql-percona-cluster-galera-integration.html ● https://github.com/sysown/proxysql/wiki ● https://www.percona.com/blog/2018/11/15/how-not-to-do-mysql-high-availability-geographic-node-distribution-with-galera-based-replication-misuse/ ● https://github.com/y-trudeau/Mysql-tools/tree/master/PXC
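
The arithmetic behind the "Facts About Server 3", "TCP Encapsulation" and "Some Numbers" slides can be reproduced with a few lines of Python. This is only a rough sketch, not the author's tooling: the function names are hypothetical, the speed of light is rounded to 300 km per millisecond, 1 KB is taken as 1024 bytes, and the ~40 bytes of overhead assume IP and TCP headers without options. Exact ceilings come out close to, but not always identical to, the rounded frame counts on the slide, and the theoretical round trip rounds to 2.67 ms where the slide shows 2.66 ms.

```python
import math

MTU_BYTES = 1500                 # Ethernet MTU quoted on the slide
HEADER_BYTES = 40                # assumption: ~20 bytes IP + ~20 bytes TCP, no options
PAYLOAD_BYTES = MTU_BYTES - HEADER_BYTES   # 1460 bytes of data per full segment

LIGHT_SPEED_KM_PER_MS = 300.0    # assumption: speed of light rounded to 300 km/ms


def segments_needed(size_bytes: int) -> int:
    """Number of full-size TCP segments needed to carry size_bytes of data."""
    return math.ceil(size_bytes / PAYLOAD_BYTES)


def theoretical_rtt_ms(one_way_km: float) -> float:
    """Round-trip time if the signal covered the distance at light speed."""
    return 2 * one_way_km / LIGHT_SPEED_KM_PER_MS


def equivalent_distance_km(rtt_ms: float) -> float:
    """Distance light would cover in the measured round-trip time."""
    return rtt_ms * LIGHT_SPEED_KM_PER_MS


if __name__ == "__main__":
    # Payload sizes taken from the "Some Numbers" slide.
    for label, size in [("8KB", 8 * 1024), ("40KB", 40 * 1024),
                        ("387KB", 387 * 1024), ("1MB", 1024 ** 2),
                        ("4MB", 4 * 1024 ** 2)]:
        print(f"{label}: {segments_needed(size)} segments")

    print(f"theoretical RTT for 400 km: {theoretical_rtt_ms(400):.2f} ms")

    # Measured round-trip times taken from the "Facts About Server 3" slide.
    for label, rtt in [("ping", 3.10), ("TCP 48K", 4.27), ("TCP 512K", 37.25)]:
        print(f"{label}: as if the signal had traveled "
              f"{equivalent_distance_km(rtt):.0f} km")
```

Running the script prints the segment counts and equivalent distances side by side, which makes it easy to see how quickly a large write set turns into hundreds or thousands of packets that each have to survive the long-distance link.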