2. Motivation
• Distributed Database
• collection of sites, each with own database
• each site processes local transactions
• local transactions can only access the local database
• Distributed transactions require co-ordination among sites
3. Advantages
• Distributed databases can improve
availability (especially if we are using
database replication)
• Parallel processing of sub-transactions at individual sites, rather than processing everything at a single site, improves performance
5. Main Issues
• Transparency: database provides abstraction layer above
data access, distributed databases should be accessed in the
same way
• Distributed Transactions: local transactions are only
processed at one site, global transactions need to preserve
ACID across multiple sites and provide distributed query
processing (eg: distributed join)
• Atomicity: all sites in a global transaction must commit or none do
• Consistency: all schedules must be conflict serializable (last
lecture!)
6. Failures
• Site failures: exactly the same as for local databases
(hardware failure, out of memory etc)
• Networking failures
• Failure of a network link: no hope of communicating
with other database site
• Loss of messages: network link might be fine, but
congested, packet loss, TCP timeouts
• Network partition: more relevant to replication, set of
replicas might be divided in two, updating only replicas
in their partition
7. Fragmentation
• Divide a relation into fragments which can be allocated to different sites, to optimise transaction processing (reduce processing time and network traffic overhead)
• Horizontal and vertical fragmentation
8.  Branch   Account no   Customer   Balance
    Euston   1234         Alice      200
    Euston   2345         Bob        100
    Euston   3456         Eve        5
    Harrow   4567         Richard    550
    Harrow   5678         Jane       75
    Harrow   6789         Graham     175
9.  Horizontal Fragmentation
    (in this case taking advantage of usage locality)

    Branch   Account no   Customer   Balance
    Euston   1234         Alice      200
    Euston   2345         Bob        100
    Euston   3456         Eve        5

    Branch   Account no   Customer   Balance
    Harrow   4567         Richard    550
    Harrow   5678         Jane       75
    Harrow   6789         Graham     175
10. Branch   Account no   Customer   Balance
    Euston   1234         Alice      200
    Euston   2345         Bob        100
    Euston   3456         Eve        5
    Harrow   4567         Richard    550
    Harrow   5678         Jane       75
    Harrow   6789         Graham     175
11. Vertical Fragmentation
    The additional Id column allows a join to recreate the original relation

    Branch   Customer   Id
    Euston   Alice      0
    Euston   Bob        1
    Euston   Eve        2
    Harrow   Richard    3
    Harrow   Jane       4
    Harrow   Graham     5

    Id   Account no   Balance
    0    1234         200
    1    2345         100
    2    3456         5
    3    4567         550
    4    5678         75
    5    6789         175
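Both fragmentation styles can be sketched in Python as a toy in-memory illustration (the relation and field names mirror the example tables; a real DBMS would do this with selections and projections):

```python
# Toy illustration of horizontal and vertical fragmentation
# of the example account relation.

accounts = [
    {"branch": "Euston", "account_no": 1234, "customer": "Alice",   "balance": 200},
    {"branch": "Euston", "account_no": 2345, "customer": "Bob",     "balance": 100},
    {"branch": "Euston", "account_no": 3456, "customer": "Eve",     "balance": 5},
    {"branch": "Harrow", "account_no": 4567, "customer": "Richard", "balance": 550},
    {"branch": "Harrow", "account_no": 5678, "customer": "Jane",    "balance": 75},
    {"branch": "Harrow", "account_no": 6789, "customer": "Graham",  "balance": 175},
]

# Horizontal fragmentation: select whole rows by a predicate (here, branch),
# so each branch's site stores only its own accounts.
euston = [r for r in accounts if r["branch"] == "Euston"]
harrow = [r for r in accounts if r["branch"] == "Harrow"]

# Vertical fragmentation: project columns, adding a tuple id so the
# original relation can be recreated with a join on "id".
frag1 = [{"id": i, "branch": r["branch"], "customer": r["customer"]}
         for i, r in enumerate(accounts)]
frag2 = [{"id": i, "account_no": r["account_no"], "balance": r["balance"]}
         for i, r in enumerate(accounts)]

# Join on the id to recreate the original relation.
rejoined = [{**a, **b} for a in frag1 for b in frag2 if a["id"] == b["id"]]
```

Dropping the `id` column from `rejoined` gives back exactly the original rows.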
12. Problem
• Now our data is split into fragments and
each fragment is at a separate site
• How do we access these sites using
transactions, whilst maintaining the ACID
properties?
13. 2-Phase Commit
• Distributed algorithm that permits all nodes in a distributed system to agree on whether to commit a transaction; the protocol results in all sites committing or all aborting
• Completes despite network or node failures
• Necessary to provide atomicity
14. 2-Phase Commit
• Voting Phase: each site is polled as to whether the transaction should commit (ie: whether its sub-transaction can commit)
• Decision Phase: if any site says “abort” or
does not reply, then all sites must be told
to abort
• Logging is performed for failure recovery
(as usual)
23. [Message diagram: client sends “start” to TC; TC sends “prepare” to sites A and B; A and B reply “ready”]
24. [Message diagram: as in 23, then TC sends “commit” to both A and B]
25. [Message diagram: as in 24, then TC replies “OK” to the client]
26. Voting Phase
• TC (transaction co-ordinator) writes
<prepare Ti> to log
• TC sends prepare message to all sites (A,B)
• Each site’s local DBMS decides whether to commit its part of the transaction or abort: if it can commit, it writes <ready Ti> to its log, otherwise <no Ti>
• A ready or abort message is sent back to the TC
27. Decision Phase
• After receiving all results from prepare messages (or after a timeout) the TC can decide whether the entire transaction should commit
• If any site replies “abort” or times out, the TC aborts the entire transaction by logging <abort Ti> and then sending the “abort” message to all sites
• If all sites reply with “ready”, the TC commits by logging <commit Ti> and sending a commit message to all sites
• Upon receipt of a commit message, each site logs
<commit Ti> and only then alters the database in memory
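The two phases above can be sketched as a toy, single-process simulation (no real networking, timeouts or durable logging; `Site`, `two_phase_commit` and the log-record strings are illustrative):

```python
# Toy single-process sketch of 2-phase commit: sites are objects and
# each "log" is a plain list of record strings.

class Site:
    def __init__(self, name, can_commit):
        self.name, self.can_commit, self.log = name, can_commit, []

    def prepare(self, tid):
        # Voting phase at the site: decide locally, log, then vote.
        self.log.append(f"<ready {tid}>" if self.can_commit else f"<no {tid}>")
        return "ready" if self.can_commit else "abort"

    def finish(self, tid, decision):
        # Decision phase at the site: log the outcome, then apply it.
        self.log.append(f"<{decision} {tid}>")

def two_phase_commit(tid, sites):
    tc_log = [f"<prepare {tid}>"]        # TC logs before sending prepare
    votes = [s.prepare(tid) for s in sites]
    # Any "abort" vote (or, in reality, a timeout) aborts the whole transaction.
    decision = "commit" if all(v == "ready" for v in votes) else "abort"
    tc_log.append(f"<{decision} {tid}>")  # TC logs the decision before telling sites
    for s in sites:
        s.finish(tid, decision)
    return decision
```

With two willing sites `two_phase_commit("T1", [Site("A", True), Site("B", True)])` returns `"commit"`; if any site is constructed with `can_commit=False`, every site ends up logging an abort instead.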
28. Failure Example 1
• One of the database sites (A,B) fails
• On recovery the log is examined:
• if log contains <commit Ti>, redo the changes of the
transaction
• if the log contains <abort Ti>, undo the changes
• if the log contains <ready Ti>, but not a commit, contact the TC for the outcome of transaction Ti; if the TC is down, ask the other sites
• if log does not contain ready, commit or abort then the failure
must have occurred before the receipt of “prepare Ti”, so TC
would have aborted the transaction
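The recovery rules above can be summarised as a small decision function (a sketch; the site's log is modelled as a list of record strings):

```python
# Sketch of site recovery: inspect the failed site's log for
# transaction tid and decide what to do.

def recover(log, tid):
    if f"<commit {tid}>" in log:
        return "redo"      # outcome was commit: redo the changes
    if f"<abort {tid}>" in log:
        return "undo"      # outcome was abort: undo the changes
    if f"<ready {tid}>" in log:
        return "ask TC"    # voted ready but saw no outcome: ask the TC
                           # (or the other sites if the TC is down)
    return "abort"         # never prepared: the TC will have aborted
```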
29. Failure Example 2
• The transaction coordinator (TC) fails (sites A and B are waiting for a commit/abort message)
• Each database site log is examined:
• if any site log contains <commit Ti> Ti must be committed at all
sites
• if any site log contains <abort Ti> or <no Ti> Ti must be aborted
at all sites
• if any site log does not contain <ready Ti>, the TC must have failed before the decision to commit, so the transaction can safely be aborted
• if none of the above apply then all active sites must have
<ready Ti> (but no additional commits or aborts), TC must be
consulted (when it comes back online)
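These rules can likewise be written as a decision function over the surviving sites' logs (a sketch; logs are lists of record strings, checked in the order the slide gives):

```python
# Sketch of recovery when the TC fails: the active sites pool their
# logs and apply the rules in order.

def tc_failure_outcome(site_logs, tid):
    if any(f"<commit {tid}>" in log for log in site_logs):
        return "commit"        # some site already committed Ti
    if any(f"<abort {tid}>" in log or f"<no {tid}>" in log
           for log in site_logs):
        return "abort"         # some site aborted or voted no
    if any(f"<ready {tid}>" not in log for log in site_logs):
        return "abort"         # TC failed before deciding to commit
    return "wait for TC"       # all ready, no outcome: consult the TC
```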
30. Network Faults
• Failure of the network
• From the perspective of entities on one
side of the network failure, entities on
the other side have failed
(apply previous strategies)
31. Locking (non-replicated system)
• Each local site has a lock manager
• administers lock requests for data items stored
at site
• when a transaction requires a data item to be locked, it requests a lock from the lock manager
• the lock manager blocks the request until the lock can be granted
• Problem: deadlocks in a distributed system, clearly
more complicated to resolve...
32. Locking (single co-ordinator)
• Have a single lock manager for the whole
distributed database
• manages locks at all sites
• locks for reading of any replica
• locks for writing of all replicas
• Simpler deadlock handling
• Single point of failure
• Bottleneck?
33. Locking (replicated system)
• Majority protocol where each local site has a lock
manager
• A transaction wants a lock on a data item that is replicated at n sites
• it must get a lock for that data item at more than n/2 sites
• the transaction cannot operate until it has locks on more than half of the replica sites (only one transaction at a time can achieve this)
• if a replica is written, all replicas must be updated...
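The majority rule can be sketched as follows (a toy model: each site's lock manager is a dict from item to lock holder, and `majority_lock` is an illustrative name, not a real API):

```python
# Sketch of the majority protocol: a transaction must acquire the lock
# for an item at more than n/2 of the n replica sites.

def majority_lock(item, tid, site_locks):
    n = len(site_locks)
    granted = []
    for locks in site_locks:
        if locks.get(item) in (None, tid):   # free, or already held by us
            locks[item] = tid
            granted.append(locks)
            if len(granted) > n // 2:        # strict majority reached
                return True
    # Could not reach a majority: release what we took and fail.
    for locks in granted:
        del locks[item]
    return False
```

Because two strict majorities of the same n sites must overlap, at most one transaction can hold a majority lock on an item at a time.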
34. Updating Replicas
• Replication makes reading more reliable (if there is probability p that a replica is unavailable, the probability that all n replicas are unavailable is p^n)
• Replication makes writing less reliable (the probability of all n replicas being available to be updated with a write is (1-p)^n)
• Writing must succeed even if not all replicas
are available...
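A quick worked example of the two formulas above (the numbers p = 0.1 and n = 3 are just an illustration):

```python
# Worked example: with per-replica unavailability p and n replicas,
# reading is impossible with probability p**n, while writing *all*
# replicas is possible only with probability (1 - p)**n.

p, n = 0.1, 3
read_unavailable    = p ** n        # ~0.001: some replica is almost always up
write_all_available = (1 - p) ** n  # ~0.729: all-replica writes often blocked
```

So replicating threefold makes a read almost always possible, yet more than a quarter of the time at least one replica is down and a naive write-all scheme would stall.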
35. Updating Replicas (2)
• Majority update protocol!
• Update more than half of the replicas (the
rest have “failed”, can be updated later), but
this time add a timestamp or version number
• To read a data item, read more than half of
the replicas and use the one with the most
recent timestamp
• Writing is more reliable, but reading is more complex!
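The majority update protocol can be sketched as follows (a toy model: replicas are dicts mapping item to a (version, value) pair, `available` lists the reachable replica indices, and all names are illustrative):

```python
# Sketch of the majority update protocol: write a versioned value to a
# majority of replicas; read a majority and take the highest version.

def quorum_write(item, value, replicas, available):
    # Fail unless a strict majority of the n replicas is reachable.
    if len(available) <= len(replicas) // 2:
        return False
    # New version: one past the highest version among the reachable sites.
    version = max(replicas[i].get(item, (0, None))[0] for i in available) + 1
    for i in available:
        replicas[i][item] = (version, value)
    return True

def quorum_read(item, replicas, available):
    # Read any strict majority; because writes also reached a majority,
    # at least one of these replicas holds the most recent version.
    if len(available) <= len(replicas) // 2:
        return None
    majority = available[: len(replicas) // 2 + 1]
    version, value = max((replicas[i].get(item, (0, None)) for i in majority),
                         key=lambda vv: vv[0])
    return value
```

The overlap between any write majority and any read majority is what guarantees the freshest version is always seen, even when a stale replica is included in the read.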