Beruflich Dokumente
Kultur Dokumente
1 Terabyte
10 MB/s
Ba
nd
wi
dt
h
1 Terabyte
Parallelism:
divide a big problem
into many smaller ones
to be solved in parallel.
Pipeline
Partition
Any
Sequential
Program
Sequential
Any
Sequential
Sequential
Program
Any
Sequential
Program
Any
Sequential
Program
Speed-Up
More resources means
proportionally less time
for given amount of data.
Scale-Up
If resources increased in
proportion to increase in
data size, time is
constant.
sec./transact
(response time)
Transact./sec.
(throughput)
Ideal
degree of parallelis
Ideal
degree of parallelis
Architecture
Shared Memory
(SMP)
CLIENTS
Shared Disk
Shared Nothing
(network)
CLIENTS
CLIENTS
Processors
Memory
Easy to program
Expensive to build
Difficult to scaleup
Hard to program
Cheap to build
Easy to scaleup
Intra-operator parallelism
get all machines working to compute a given
operation (scan, sort, join)
Inter-operator parallelism
each operator may run concurrently on a
different site (exploits pipelining)
Inter-query parallelism
different queries run on different sites
Automatic Data
Partitioning
Partitioning a table:
Range
Hash
Round Robin
Parallel Scans
Parallel Sorting
Idea:
Distributed Databases
Introduction
Types of Distributed
Databases
DBMS1
DBMS2
DBMS3
Distributed DBMS
Architectures
QUERY
Client-Server
CLIENT
CLIENT
Collaborating-Server
SERVER
SERVER
SERVER
QUERY
SERVER
Middleware systems
One database server capable of
managing queries and transactions
spanning multiple servers , the
remaining servers need to handle only
local queries.
TID
Storing Data
t1
t2
t3
t4
Fragmentation
Horizontal: Usually disjoint.
Vertical: Lossless-join; tids.
Replication
Gives increased availability.
Faster query evaluation.
Synchronous vs.
Asynchronous.
R1
R3
SITE A
SITE B
R1
R2
Distributed Catalog
Management
Distributed Queries
Delhi
Mumbai
Supp-part
Semijoin
Bloomjoin
Synchronous Replication
Cost of Synchronous
Replication
Asynchronous Replication
Peer-to-Peer Replication
Distributed Locking
Distributed Deadlock
Detection
Each site maintains a local waits-for graph.
T1
T2
SITE A
T1
T2
SITE B
T1
T2
GLOBAL
Distributed Recovery
Blocking