Beruflich Dokumente
Kultur Dokumente
Part of a specialization
2. Regression
3. Classification
4. Clustering
& Retrieval
5. Recommender
Systems
6. Capstone
What is regression?
From features to predictions
Data
ML
Regression
Method
Intelligence
Input x:
relationship
Predict y:
continuous output or
response to input
2015
Emily
Fox
&
Carlos
Guestrin
hard work
Stock prediction
Predict the price of a stock (y)
Depends on x =
-Recent history of stock price
-News events
-Related commodities
Tweet popularity
How many people will retweet your tweet? (y)
Depends on x = # followers,
# of followers of followers,
features of text tweeted,
popularity of hashtag,
# of past retweets,
very happy
Inputs x are
brain region
intensities
Case Study:
Predicting house prices
ML
Regression
Method
$
$
$
(y)
+ house
attributes (x)
10
Intelligence
$ = ??
price ($)
Data
house size
2015
Emily
Fox
&
Carlos
Guestrin
Impact of regression
11
Course outline
house size
13
x
2015
Emily
Fox
&
Carlos
Guestrin
price ($)
14
intercept
slope
better fit
house size
Get estimated
parameters
-interpret
-use to form
predictions
2015
Emily
Fox
&
Carlos
Guestrin
price ($)
Incorporate
more inputs
price ($)
x[2]
house size
15
x[1]
-
-
-
-
-
-
Square feet
# bathrooms
# bedrooms
Lot size
Year built
price ($)
price ($)
house size x
T
I
F
R
VE size
Ohouse
Measures of error:
- Training
- Test
- True (generalization)
16
price ($)
price ($)
house size x
T
I
F
R
VE size
Ohouse
Bias-variance
tradeo
17
price ($)
price ($)
house size x
T
I
F
R
VE size
Ohouse
measure of
model complexity
bias-variance tradeo
18
measure of
model complexity
Cross validation
Valid
set
error2() (2)
19
20
Dishwasher
Garbage
disposal
Microwave
Range
/
Oven
Refrigerator
Washer
Dryer
Laundry
loca0on
Hea0ng
type
Je]ed
Tub
Deck
Fenced
Yard
Lawn
Garden
Sprinkler
System
Lot
size
Single
Family
Year
built
Last
sold
price
Last
sale
price/sqM
Finished
sqM
Unnished
sqM
Finished
basement
sqM
#
oors
Flooring
types
Parking
type
Parking
amount
Cooling
Hea0ng
Exterior
materials
Roof
type
Structure
style
price ($)
house size
22
$ = ???
f(x0)
0.5
0.5
$ = ???
23
0.1
0.2
0.3
0.4
x0
0.6
0.7
0.8
0.9
Algorithms
Concepts
24
Linear regression
Regularization: Ridge (L2), Lasso (L1)
Nearest neighbor and kernel regression
Gradient descent
Coordinate descent
Assumed background
25
Math background
Basic calculus
-Concept of derivatives
26
Programming experience
Basic Python used
-Can pick up along the way if
knowledge of other language
27
Assignments will:
1. Use GraphLab Create to
explore high-level concepts
2. Ask you to implement
all algorithms without GraphLab Create
Net result:
- learn how to code methods in Python
28
Computing needs
Basic 64-bit desktop or laptop
Access to internet
Ability to:
-Install and run Python (and GraphLab Create)
-Store a few GB of data
29