Beruflich Dokumente
Kultur Dokumente
Part 0 -- Introduction
1/15
Part 0 - Introduction
Part 0 -- Introduction
2/15
Email: wgreene@stern.nyu.edu
URL: http://www.stern.nyu.edu/~wgreene
http://www.stern.nyu.edu/~wgreene/Statistics/Outline.htm
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
3/15
Course Objectives
Pepperoni
21.8%
Sausage
5.8%
900000
800000
800000
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Normal - 95% CI
700000
900000
Mean
StDev
N
AD
P-Value
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
95
Listing
Meatball
Garlic 5.0%
2.3%
30000
32500
1000000
60
800000
40
Listing
Percent
Frequency
Listing
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
4/15
The favorable percentage is one of the lowest in more than two decades
of Pew surveys if not the lowest, the poll said. The previous low was
40% in January, but the result is not statistically significant because of
the margin of error.
(USA Today, 9/3/09, page 4)
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
5/15
Really?
To Get Rid of Hiccups, Have Someone Startle You.
The truth is: Most home remedies, like holding your breath or
drinking from a glass of water backward, haven't been medically
proven to be effective, says Pollack. However, you can try this trick
dating back to 1971, when it was published in The New England
Journal of Medicine: Swallow one teaspoon of white granulated
sugar. According to the study, this tactic resulted in the cessation of
hiccups in 19 out of 20 afflicted patients.
Posted August 31, 2010, cnn.com
http://www.cnn.com/2010/HEALTH/08/31/rs.12.health.myths/index.html?iref=allsearch
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
6/15
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
7/15
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
8/15
Course Prerequisites
Pepperoni
21.8%
Sausage
5.8%
900000
800000
800000
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
Meatball
Garlic 5.0%
2.3%
Percent
Frequency
Listing
Percent
Listing
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
9/15
Course Materials
Notes: Distributed in first class
Text: Hildebrand, Ott and Gray. Basic
Statistical Ideas for Managers, 2nd ed.
(Recommended, not required)
On the course website:
Miscellaneous notes and materials
Class slide presentations
Problem sets
http://www.stern.nyu.edu/~wgreene/Statistics/Outline.htm
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
10
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
10/15
Buy: Professional
Bookstore
Rent: www.e-academy.com
e-Store
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
11
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
11/15
Data Description
Pepperoni
21.8%
Sausage
5.8%
900000
800000
800000
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Normal - 95% CI
700000
900000
Mean
StDev
N
AD
P-Value
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
95
12
Listing
Meatball
Garlic 5.0%
2.3%
30000
32500
1000000
60
800000
40
Listing
Percent
Frequency
Listing
Types
Information content
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
12/15
How to
describe/summarize
them.
Data:
House
Price
Listings
and
Income
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
13
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
How to determine if
there is any
connection between
the two variables.
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
13/15
Pepperoni
21.8%
Sausage
5.8%
900000
800000
800000
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
14
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
Meatball
Garlic 5.0%
2.3%
Percent
Frequency
Listing
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
14/15
Hawaii. Outlier?
800000
Listing_1
700000
600000
500000
Model building
Understanding
covariation of more
than one variable.
400000
15000
17500
20000
22500
25000
IncomePC_1
27500
30000
32500
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
15
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
800000
800000
Percent
900000
Frequency
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Listing
Pepperoni
21.8%
Listing
Meatball
Garlic 5.0%
2.3%
Percent
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000
Part 0 -- Introduction
15/15
900000
800000
800000
600000
500000
400000
Mushroom
16.2%
Plain
32.5%
Normal - 95% CI
900000
Mean
StDev
N
AD
P-Value
95
16
700000
90
500000
400000
200000
100000
15000
800000
700000
60
50
40
30
20000
22500
25000
IncomePC
27500
30000
32500
e mc
200000
100000
15000
400000
600000
Listing
800000
1000000
17500
20000
22500
25000
IncomePC
27500
Mean
StDev
N
369687
156865
51
80
200000
Normal
10
300000
12
500000
400000
10
17500
Histogram of Listing
14
600000
70
20
300000
200000
369687
156865
51
0.994
0.012
80
600000
300000
100000
30000
32500
1000000
60
800000
40
Listing
Sausage
5.8%
900000
700000
Listing
Boxplot of Listing
Category
Pepperoni
Plain
Mushroom
Sausage
Pepper and Onion
Mushroom and Onion
Garlic
Meatball
Percent
Pepperoni
21.8%
Frequency
Meatball
Garlic 5.0%
2.3%
Listing
Percent
Statistical inference
Hypothesis testing: (Is the correlation large?
Could it actually be zero?)
Hypothesis tests for specific applications
Mean of a population: Is it a specific value?
Pair of means: Are they equal?
Applications in regression: Are the variables in
the model really related?
An application in marketing: Did the sales
promotion work? How would you find out?
Listing
20
600000
400000
0
200000
300000
400000
500000 600000
Listing
700000
800000
900000
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
10
20
30
40
50
60
70
80
90
Listing
200000
15000
20000
25000
IncomePC
30000