Beruflich Dokumente
Kultur Dokumente
Berkeley undergrad
Faculty since 1994
A systems person
Pioneered clusters, early search
engines, cloud computing
Why?
Why?
What?
Who?
How?
For instance?
Why?
everything
8/26/15
8/26/15
Updated
the internet?
http://flickr.com/photos/mricon/1836673/
Software Logs
RFID
GPS
IoT
Quantified self
Microphones
Cameras
...
http://www.flickr.com/photos/revsorg/1168346563/
8/26/15
information is knowledge
albert einstein
knowledge is power
sir francis bacon
SoWhy?
8/26/15
Why?
What?
Who?
How?
For instance?
8/26/15
What is a Database?
What is a DBMS?
What: Is an OS a DBMS?
What: Is an OS a DBMS?
8/26/15
Thought Experiment 1:
Thought Experiment 1:
b)
c)
d)
e)
partners
both
neither
???
Thought Experiment 2:
Youre updating a file.
The power goes out.
Which changes survive?
b)
c)
d)
e)
partners
both
neither
???
a) all
b) none
c) all since
last save
d) ???
Thought Experiment 1:
You and your project partner edit the same file.
You both save it at the same time.
Whose changes survive?
a) yours
Q: How
do you code against an API that guarantees you
??? ?
b) partners
c) both
A: Very carefully.
d) neither
e) ???
Thought Experiment 2:
a) all
b) none
c) all since
last save
d) ???
2SMOP:
8/26/15
What: Summing up
What: Summing up
What: Summing up
What: Summing up
everything
everything
Who?
Instructor
Prof. Eric Brewer
Cloud programmer
Data scientist
Data engineer
Machine Learning architect
TAs
Michelle Nguyen
Matthew Deng
Richard Liaw
Andrew Liu
Bryan Munar
Ajay Solanky
Evan Ye
Pete Yeh
8/26/15
How? Workload
45%: Projects:
use and implementation of a database
10%: Short weekly quizzes (vitamins)
45% Exams 3 midterms, but no final
Textbook
Database Management Systems, 3rd Edition
Ramakrishnan and Gehrke
Suggested
I wouldnt buy any more textbooks
Website has links to programming resources
Grading, hand-in policies, etc. are on Piazza
Cheating policy: zero tolerance
We have the technology
Why?
What?
Who?
How?
For instance?
1. Read something
1. Read something
2. Do something
2. Do something
3. Write something
3. Write something
4. GOTO 1
4. GOTO 1
CPU
CPU
Out-of-Core algorithms.
8/26/15
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
Scaling up
Simplifying Assumption
Streaming
Divide-and-Conquer
INPUT
OUTPUT
RAM
10
8/26/15
INPUT
Input
Buffer
f(x)
Output
Buffer
OUTPUT
Input
Buffer
f(x)
INPUT
Input
Buffer
RAM
Parallelize Me
INPUT
f(x)
Output
Buffer
OUTPUT
RAM
Parallelize Me
Output
Buffer
OUTPUT
INPUT
Input
Buffer
RAM
INPUT
INPUT
UNIX Pipes
Input
Buffer
Input
Buffer
f(x)
f(x)
f(x)
Output
Buffer
OUTPUT
RAM
Output
Buffer
OUTPUT
RAM
Output
Buffer
OUTPUT
RAM
head, tail, nl, tr, tail, tee, od, cut, grep, sed,
awk, split, csplit,
11
8/26/15
Rendezvous
Time-space Rendezvous
in the same place (RAM) at the same time
DISK 2
DISK 1
B-2
INPUT
OUTPUT
IN
OUT
Phase 1
Phase 2
DISK 2
DISK 1
DISK 2
DISK 1
B-2
B-2
INPUT
OUTPUT
IN
INPUT
OUTPUT
OUT
IN
IN
Parallelize Me?
Parallelize Me?
Phase 1
B-2
IN
IN
OUT
IN
OUT
IN
B-2
IN
OUT
IN
OUT
IN
OUT
IN
OUT
B-2
B-2
B-2
IN
B-2
B-2
OUT
OUT
B-2
B-2
OUT
12
8/26/15
Summing Up 1
Unordered collection model
Read in chunks to avoid fixed I/O costs
Summing Up 2
Streaming
Divide & Conquer for rendezvous
Up Next
13