Company Confidential
Agenda
I. Introduction: What is Statistics?
A. Terms and Definitions
a. Types of Data
b. Population and Sample
II. Distributions & Summary Statistics
A. Describing the distribution:
a. Shape
1. Symmetry
2. Skewness
3. Modality
b. Center
1. Average/Mean
2. Median
3. Mode
c. Spread
1. Percentiles, Deciles and Quartiles
2. Range
3. Inter-quartile Range
4. Variance
5. Standard Deviation
Types of Data
Data Type Conversion
The tenure (in months) of Makati Analysts:
44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15, 49, 66, 49, 21, 110, 23,
50, 48, 50, 47, 45
Ordinal Nominal
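As a rough illustration of data type conversion, the sketch below turns the tenure values (ratio data) into ordinal categories and then into a nominal label. The cut-offs (24 and 60 months) and the category names are hypothetical, chosen only for this example; the categories used on the original slide are not recoverable from the text.

```python
# Tenure in months (ratio data), copied from the slide.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

def to_ordinal(months):
    """Bin a tenure value into an ordered category (hypothetical cut-offs)."""
    if months < 24:
        return "junior"
    if months < 60:
        return "mid"
    return "senior"

def to_nominal(months):
    """Collapse tenure into an unordered two-way label (hypothetical rule)."""
    return "new" if months < 24 else "tenured"

ordinal = [to_ordinal(m) for m in tenure]
nominal = [to_nominal(m) for m in tenure]

for label in ("junior", "mid", "senior"):
    print(label, ordinal.count(label))
```

The conversion only works in one direction: once the months are replaced by a category, the original values cannot be recovered from it.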
Data Type Drill
Determine the type of data for the following:
If possible, how can these be converted to other data types?
Population vs. Sample
Describing a Distribution
We describe a distribution by its:
1. Shape: usually described by
  Symmetry
  Modality
  Outliers
2. Center: the measure of the middle or expected value of the data set
  Mean
  Median
  Mode
3. Spread: also called variation; denotes the variability in a distribution
  Percentiles, Deciles, Quartiles
  Range
  Interquartile Range
  Standard Deviation and Variance
Shape: Symmetry
Symmetric
The left and right sides of the center are mirror images of each other.
Skewed
Skewed to the right (positively skewed): long tail to the right
Skewed to the left (negatively skewed): long tail to the left
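The direction of skew can also be checked numerically. The sketch below applies one common measure, the moment coefficient of skewness (third central moment divided by the 1.5 power of the second), to the Makati Analysts tenure data. The choice of this particular coefficient is an assumption of the example; the slides do not name a formula.

```python
from statistics import mean, median

# Tenure in months, copied from the slides.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

n = len(tenure)
center = mean(tenure)

# Second and third central moments (population form, divide by n).
m2 = sum((x - center) ** 2 for x in tenure) / n
m3 = sum((x - center) ** 3 for x in tenure) / n

# Positive: long tail to the right. Negative: long tail to the left.
skewness = m3 / m2 ** 1.5

print("skewness:", round(skewness, 2))
print("mean > median:", center > median(tenure))
```

A positive coefficient together with a mean above the median points to a distribution that is skewed to the right.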
Shape: Modality
Modality
Refers to the number of peaks in the distribution of a dataset: one peak is unimodal, two is bimodal, more than two is multimodal.
The mode is the most frequent value in a dataset.
Outliers
Outliers
Observations that deviate markedly from the rest of the data
Could result from special causes; may indicate bad data
Shape of the Tenure Data of Makati Analysts
[Figure: histogram of tenure in months. Vertical axis: Frequency. Bin upper limits: 24, 36, 48, 60, 72, 84, 96, More.]
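The bin counts behind this histogram can be recomputed from the tenure data. The sketch below assumes each bin includes its upper limit (the convention of spreadsheet histogram tools, which the "More" label suggests); if the original chart used a different rule, values that fall exactly on a limit would move to the next bin.

```python
from bisect import bisect_left

# Tenure in months, copied from the slides.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

# Bin upper limits read off the horizontal axis; the last bin is "More".
upper_limits = [24, 36, 48, 60, 72, 84, 96]
labels = [str(limit) for limit in upper_limits] + ["More"]

counts = [0] * len(labels)
for months in tenure:
    # bisect_left returns the index of the first limit >= months,
    # so a value equal to a limit is counted in that limit's bin.
    counts[bisect_left(upper_limits, months)] += 1

for label, count in zip(labels, counts):
    print(f"{label:>4} {'#' * count} ({count})")
```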
Measures of Central Tendency
Mean, Median, Mode
Mean
Ratio between the sum of values and the number of values
Arithmetic average
Median
Middle value of a set of data that has been put into rank order
Mode
Observation with the highest frequency
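Central tendency measures of the tenure data of Makati Analysts
Mean: 55.05
Median: 48.50
Mode: 21 (47, 49 and 50 also occur twice each, so the data has four tied modes)
[Figure: histogram of tenure in months with the mean and median marked. Vertical axis: Frequency. Bin upper limits: 24, 36, 48, 60, 72, 84, 96, More.]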
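These three measures can be reproduced with Python's standard `statistics` module, shown here as a quick check rather than as the tool used to build the slide. Note that `mode` reports only the first of several tied values (Python 3.8 and later), while `multimode` lists all of them.

```python
from statistics import mean, median, mode, multimode

# Tenure in months, copied from the slides.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

tenure_mean = round(mean(tenure), 2)   # sum of values / number of values
tenure_median = median(tenure)         # average of the 11th and 12th ranked values
tenure_mode = mode(tenure)             # first most-frequent value encountered
tenure_modes = multimode(tenure)       # every value tied for the highest frequency

print("mean:  ", tenure_mean)
print("median:", tenure_median)
print("mode:  ", tenure_mode, "of", tenure_modes)
```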
Measures of Spread
Percentiles, Deciles and Quartiles
Percentiles
Values that divide a set of observations into 100 equal parts
Denoted by P1, P2, ..., P99
These values are such that 1% of the data falls below P1, 2% falls below P2, ..., and 99% falls below P99.
Deciles
Values that divide a set of observations into 10 equal parts
Denoted by D1, D2, ..., D9
These values are such that 10% of the data falls below D1, 20% falls below D2, ..., and 90% falls below D9.
Quartiles
Values that divide a data set into 4 equal parts
Denoted by Q1, Q2, and Q3
These values are such that 25% of the data falls below Q1, 50% falls below Q2, and 75% falls below Q3.
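The relationship between the three sets of cut points can be seen by computing them side by side. The sketch below uses `statistics.quantiles` from the Python standard library with its default "exclusive" method; several interpolation conventions exist, so other tools may return slightly different values for the same data.

```python
from statistics import quantiles

# Tenure in months, copied from the slides.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

percentiles = quantiles(tenure, n=100)  # 99 cut points: P1 .. P99
deciles = quantiles(tenure, n=10)       # 9 cut points:  D1 .. D9
quartiles = quantiles(tenure, n=4)      # 3 cut points:  Q1 .. Q3

# List indexes are zero-based, so P50 is percentiles[49], D5 is deciles[4], etc.
print("P25, Q1:    ", percentiles[24], quartiles[0])
print("P50, D5, Q2:", percentiles[49], deciles[4], quartiles[1])
print("P75, Q3:    ", percentiles[74], quartiles[2])
```

P50, D5 and Q2 all coincide with the median, and P25 and P75 coincide with Q1 and Q3.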
How to compute percentiles
Compute the 50th percentile of the Makati Analysts tenure data:
44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15, 49, 66, 49, 21, 110, 23, 50, 48,
50, 47, 45
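One way to work this exercise by hand is to sort the data, locate the rank p/100 × (n + 1), and interpolate between the two neighbouring values. The sketch below follows those steps; the (n + 1) rank rule is an assumption of this example, since the method used on the original slide did not survive, and the function name `percentile` is illustrative.

```python
# Tenure in months, copied from the slide.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

def percentile(values, p):
    """Return the p-th percentile using the (n + 1) rank rule (an assumed convention)."""
    ordered = sorted(values)            # step 1: put the data in rank order
    n = len(ordered)
    position = p / 100 * (n + 1)        # step 2: 1-based rank of the percentile
    lower = int(position)               # whole part of the rank
    fraction = position - lower         # fractional part of the rank
    if lower < 1:                       # rank falls before the first value
        return ordered[0]
    if lower >= n:                      # rank falls at or after the last value
        return ordered[-1]
    # step 3: interpolate between the two neighbouring ranked values
    below, above = ordered[lower - 1], ordered[lower]
    return below + fraction * (above - below)

p50 = percentile(tenure, 50)
print("50th percentile:", p50)
```

For the 22 tenure values the rank is 0.50 × 23 = 11.5, halfway between the 11th value (48) and the 12th value (49).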
How to compute Percentiles, Deciles and Quartiles
The Range
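The range is the largest value minus the smallest. A minimal sketch using the Makati Analysts tenure data from the earlier slides:

```python
# Tenure in months, copied from the earlier slides.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

longest = max(tenure)
shortest = min(tenure)
tenure_range = longest - shortest   # range = maximum - minimum

print(f"range = {longest} - {shortest} = {tenure_range} months")
```

Because it depends only on the two extreme values, the range is very sensitive to outliers.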
Computing the IQR
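The interquartile range is Q3 minus Q1, the width of the middle 50% of the data. The sketch below computes it for the tenure data with `statistics.quantiles` under both of that function's interpolation methods. Which convention the original slide used is not known, so neither result should be read as the slide's answer.

```python
from statistics import quantiles

# Tenure in months, copied from the earlier slides.
tenure = [44, 90, 80, 135, 21, 53, 29, 128, 47, 11, 15,
          49, 66, 49, 21, 110, 23, 50, 48, 50, 47, 45]

def iqr(values, method):
    """Interquartile range: third quartile minus first quartile."""
    q1, _q2, q3 = quantiles(values, n=4, method=method)
    return q3 - q1

iqr_exclusive = iqr(tenure, "exclusive")   # treats the data as a sample
iqr_inclusive = iqr(tenure, "inclusive")   # treats the data as the whole population

print("IQR (exclusive):", iqr_exclusive)
print("IQR (inclusive):", iqr_inclusive)
```

The two results differ noticeably for a dataset this small, which is why it helps to state the quartile convention alongside any reported IQR.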