Beruflich Dokumente
Kultur Dokumente
Hien Tran
School of Engineering – Tan Tao University
What is statistics?
Statistics is the science of collecting, organizing,
summarizing, and analyzing information (DATA) to
draw conclusions or answer questions. In addition,
statistics is about providing a measure of confidence
in any conclusions.
A key aspect of data is that they vary. Is everyone in your
class the same height? No! Does everyone have the same
hair color? No! So, among individuals there is variability.
In fact, data vary when measured on ourselves as well.
Do you sleep the same number of hours every night? No!
Do you consume the same number of calories every day?
No!
One goal of statistics is to describe and understand sources
of variability.
The entire group of individuals to be studied
is called the population. An individual is a
person or object that is a member of the
population being studied. A sample is a
subset of the population that is being studied.
Descriptive statistics consist of organizing and
summarizing data. Descriptive statistics describe data
through numerical summaries, tables, and graphs. A
statistic is a numerical summary based on a sample.
Statistics
Lies, Damned lies, and Statistics
Availibility bias
i) Smoking can’t be all that bad. I know a lot of smokers who have lived
long and healthy lives.
ii) Cats must have a special ability to fall from great heights and
survive, because I’ve seen a lot of press reports about such events
(and I forget that I’ll rarely read a report about a cat falling and
dying).
iii) Violent criminals should be locked up for life because I’m always
reading newspaper articles about re-offenders (again, very unlikely
that I’ll read anything about non-re-offenders).
Lies, Damned lies, and Statistics
Wording of questions
Example: (Questions and Answers in Attitude Surveys, 1981,
p.277 by Schuman and Presser)
Q1: Do you think the United States should forbid public speeches
against democracy?
YES: 21.4% (i.e. NO: 78.6%)
Q2: Do you think the United States should allow public speeches
against democracy?
NO: 47.8% (i.e. YES: 52.2%)
Lies, Damned lies, and Statistics
Sampling technique
Harry S. Truman
versus
Thomas E. Dewey
Lies, Damned lies, and Statistics
Jumping to conclusion / bad interpretation
Lies, Damned lies, and Statistics
Misleading Averages
(a) (b)
Lies, Damned lies, and Statistics
Lurking variable/hidden factor
Simpson’s Paradox – Example: Kidney stone treatment
Treatment A Treatment B
Small Stones Group 1 Group 2
93% (81/87) 87% (234/270)
Large Stones Group 3 Group 4
73% (192/263) 69% (55/80)
Both 78% (273/350) 83% (289/350)