Sie sind auf Seite 1von 91

Tech-Pro Consultants

Six Sigma Basic Statistics

March 2005 Dr. K.S.Ravichandran

Basic Statistics

Tech-Pro Consultants

Objectives
Review & Enhance The Basic Statistical & Quality Terms Needed For Six Sigma Process Improvement Begin To Enhance Minitab Operating Skills

Politicians Promise: if elected, I'd make certain that everybody gets an above average income

Tech-Pro Consultants

What is Statistics?
Is the science that develops methods to effectively derive information from numerical data Statistics is a collection of scientific methods for collecting, organizing and interpreting data, usually with the goal of inferring certain properties of the population from a representative sample of the population science of collecting and classifying a group of facts according to their relative number and determining certain values that represent characteristics of the group
There are three kinds of Lies: Lie, Damned Lie and Statistics Mark Twain
Tech-Pro Consultants

Basic Statistics
Types of data Measures of the Center of the data Mean Median Mode Measures of the Spread of Data Range Variance Standard Deviation Normal Distribution and Normal Probabilities Process Stability and Process Capability

Ask a statistician for her phone number... and get an estimate with 95% confidence
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

What sorts of data do you see being collected around your area?
(List them below)
___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________ ___________________________________________________

In God we trust. All others must bring data.


Tech-Pro Consultants

Two General Kinds of Data


(but

3 families)

ATTRIBUTE DATA - The data is discrete (counted).


Results from using go/no-go gages, or from the inspection of visual defects, visual problems, missing parts, or from pass/fail or yes/no decisions.

VARIABLE DATA - The data is continuous


(measured). Results from the actual measuring of a characteristic such as impedance of a motor winding, tensile strength of steel, diameter of a pipe, flow rate of a pump, etc.
Statisticians do it discretely and continuously.
Tech-Pro Consultants

2 General Kinds of Data (but 3 families)

Type-I Attributes Data (Binomial)

Type-II Attributes Data (Poisson)

ATTRIBUTE DATA (Count Data) Just ask (#1) Number of Items in a Category (Count-Based Proportions) yourself, Am I Heads / Tails (i.e., counting # of Heads and # of Tails) counting Yes / No (Order Form Filled Out Accurately or Not) things, Pass / Fail; Good / Bad (Accurate Billing/Overcharged) here? (#2) Counts of Discrete Event Occurrences If yes, you # of Scratches on a Car Hood have attributes # of Errors on a Form data. # of Insulation Breaks in a Spool of Wire # of times customer hangs up before receiving response VARIABLE DATA (Continuous Measurement Scale) (#3) Continuous Data Decimal subdivisions are meaningful Ex: Time to answer the telephone ( Exact # of secs. per call)

Different Types Of Data Require Different Analysis Tools


Tech-Pro Consultants

Manufacturing Process: Making Sheets of Glass

3 Families of Data:
Sample#1 Sample#2 Sample#3 Sample#4

Am I Counting Things? (Discrete Data)

ATTRIBUTES DATA

TYPE-I

Any Bubbles?
(accept / reject the entire item)

Reject

Reject

Accept

Reject
Poisson Distribution Normal Distribution or Other

TYPE-II

Number of Bubbles? 3 2 0 4

(Continuous Data) (Measurement Data)

VARIABLES DATA

Glass Weight
Weight = 12.2 Weight = 12.4 Weight = 11.9 Weight = 12.1

Tech-Pro Consultants

Binomial Distribution

Transactional Process: Converting an expense account form into a reimbursement check

3 Families of Data:
Form#1 Form#2 Form#3 Form#4

Am I Counting Things? (Discrete Data)

ATTRIBUTES DATA

TYPE-I

Any Errors?
(accept / reject the entire item)

Reject

Reject

Accept

Reject
Poisson Distribution Normal Distribution or Other

TYPE-II

Number of Errors on Form? 3 2 0 4

(Continuous Data) (Measurement Data)

VARIABLES DATA

Time to Reimburse Employee

36.1 hrs

24.6 hrs

21.0 hrs

29.2 hrs

Tech-Pro Consultants

Binomial Distribution

PLANT p np DEPARTMENT OPERATION NUMBER AND NAME c u

PART NUMBER AND NAME

Pass/Fail Data Avg.-

Sample at 8:00am
UCLLCL-

Sample at 9:00am
DATE CONTROL LIMITS CALCULATED:

Sample at 10:00am
Average Sample Size: Frequency:

40% 30% 20% 10%


Sample (n) Number (np, c) Proportion (p,u) Date (Shift, Time, etc.)

8:00am 9:00am

ANY CHANGE IN PEOPLE, EQUIPMENT, MATERIALS, METHODS, ENVIRONMENT, OR MEASUREMENT just She tells you are just Average: never mind, she is SYSTEMS, SHOULD BE NOTED. THESE NOTES WILL HELP YOU TO TAKE CORRECTIVE OR PROCESS

being Tech-Pro Consultants Mean

PLANT p np DEPARTMENT OPERATION NUMBER AND NAME c u

PART NUMBER AND NAME

Number of Avg.Blemishes Data

UCL-

LCL-

8:00am 8:10am 8:20am

DATE CONTROL LIMITS CALCULATED:

8:50am Frequency: 9:00am 9:10am etc.

Average Sample Size:

8:30am
8:40am etc.
4 3 2 1
Sample (n) Number (np, c) Proportion (p,u) Date (Shift, Time, etc.)

8:00am8:10am Tech-Pro Consultants

ANY CHANGE IN PEOPLE, EQUIPMENT, MATERIALS, METHODS, ENVIRONMENT, OR MEASUREMENT SYSTEMS, SHOULD BE NOTED. THESE NOTES WILL HELP YOU TO TAKE CORRECTIVE OR PROCESS

Exercise: Which Type of Data Is It?


DIRECTIONS: For each of the following applications, identify the type of data you would be investigating (Attributes Type-I, Attributes Type-II, or Variables Data) ... AND EXPLAIN YOUR CHOICE

(1)
(2) (3) (4)

Percent defective parts in hourly production


Percent cream content in milk bottles (comes in four-bottle container sets) Amount of time it takes to respond to a request Number of blemishes per square yard of cloth, where pieces of cloth may be of variable size Daily test of water acidity (pH) Number of raisins per box of Raisin Bran Number of defective parts in lots of size 100 Length of screws in samples of size ten from production lots Number of errors on a purchase order
Tech-Pro Consultants

(5) (6) (7) (8) (9)

The Probability Test

What is the largest probability possible? _______ What does this mean? What is the smallest probability possible? _______ What does this mean? What does a probability of 0.50 mean? _______________ What is the probability you will be struck by lightning during your lifetime? _____________________ What are your chances of appearing on The Tonight Show? ___________________ What is the probability of being killed by terrorists overseas? ____________________ What are your chances of being killed by an American in Baltimore? _______________

Tech-Pro Consultants

The Probability Test


Answers

What is the largest probability possible? ___1.0 = 100%__ What does this mean? What is the smallest probability possible? ___0.0 = 0%__ What does this mean? What does a probability of 0.50 mean? 50% Just flip a coin What is the probability you will be struck by lightning during your lifetime? 0.000001667 = 1/600,000 What are your chances of appearing on The Tonight Show? 0.00000204 = 1/490,000 What is the probability of being killed by terrorists overseas? 0.000001538 = 1/650,000 What are your chances of being killed by an American in Baltimore? 0.00025 = 1/4,000
Instructor Page Tech-Pro Consultants

The Probability Test (cont.)

Roll a fair die once, what is Prob(a six)? ______ Roll a fair die twice, what is Prob(a six on the second roll)?__ Roll two fair dice, what is Prob(get two sixes)?____________ What do you think of the recent headline, Education research shows 49.5% of all American high school students fall below the national average!

Tech-Pro Consultants

Probability

The Customer Requirements


What is the process capability? What is the probability of meeting the requirements? Are capability and probability related?

The Practical Problem Statement ...

Suppose a certain customer permits only those combinations which yield 3, 4, 5, . . . , or 11.
Used With Permission 6 Sigma Academy Inc. 1995

Tech-Pro Consultants

Computing the Risks- The Statistical Problem Statement


1
1 2 3 4 5 6 2 3 4 5 6 7

2
3 4 5 6 7 8

3
4 5 6 7 8 9

4
5 6 7 8 9 10

5
6 7 8 9 10 11

6
7 8 9 10 11 12
= =

Ways to form a 2
Ways to form a 12 Probability of Defect

in
in

Used With Permission 6 Sigma Academy Inc. 1995

Tech-Pro Consultants

Deeper Insight Into Probability


What is the probability of rolling a 5 using a fair pair of dice?

1 1 2 3 4 5 6

6
Die 1 1 2 3 4 Die 2 4 3 2 1 Total Probability .0278 .0278 .0278 .0278 .1111

.0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278 .0278
Tech-Pro Consultants

Used With Permission 6 Sigma Academy Inc. 1995

Establishing the Odds


Value 2 3 4 5 6 7 8 9 10 11 12 Total Combinations 1 2 3 4 5 6 5 4 3 2 1 36 Probability .0278 .0556 .0833 .1111 .1389 .1667 .1389 .1111 .0833 .0556 .0278 1.0000

Probability of any given value on Die 1 = 1/6 = .1667 Probability of any given value on Die 2 = 1/6 = .1667 Probability of any given combination = 1/6 x 1/6 = 1/36 = .0278
Tech-Pro Consultants

Used With Permission 6 Sigma Academy Inc. 1995

Graphing the Results


Suppose a certain customer permits only those combinations which yield 3, 4, 5, . . . , or 11.
Value 2 3 4 5 6 7 8 9 10 11 12 Total Combinations 1 2 3 4 5 6 5 4 3 2 1 36 Probability .0278 .0556 .0833 .1111 .1389 .1667 .1389 .1111 .0833 .0556 .0278 1.0000

18 16 14 12 10 2.8%

LSL

USL

2.8%

8
6 4 2 0 2 4
Total of Dice Values

10

12

14
Used With Permission 6 Sigma Academy Inc. 1995

Zone of Customer Satisfaction 94.4% . . .Hence, the probability of Customer Satisfaction is 94.4 %
Tech-Pro Consultants

Statistical Distributions

We can describe the behavior of any process or system by plotting multiple data points for the same variable
Over time Across products or business By different people, machines, etc...

The accumulation of these data can be viewed as a distribution of values Represented by:
Dot plots Histograms Normal curve or other smoothed distribution
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Process = Hose
1 Drop = 1 Unit of Output Histogram is ... a pile of individual values

Y = Weight (lbs)

220

160

100

Dotplot
:: :: .: .:: :::::::::: : . : . . ::.::::: :.:::.:.:.:.: : : : : . : . . -----+---------+---------+---------+---------+---------+-C1 100 125 150 175 200 225
Tech-Pro Consultants

Dot Plots
2nd Observation 1st Observation

1.0 1.05

1.1 1.15

1.2 1.25

1.3 1.35

1.4

Diameter

Suppose we have a manufacturing line that is producing shafts. Diameters range from 1.0 to 1.4 inches. As we make a measurement of a shaft, we record the value with a dot on the above scale Ex: 1st Observation = 1.4 inches 2nd Observation = 1.1 inches
Tech-Pro Consultants

Dot Plots

:: :::. . . :.. :::::: : : . :.. ..::::::::::::::: :: ::. :..: .::::::::::::::::::::::::.:::..:.: .


1.0
1.05

1.1
1.15

1.2
1.25

1.3
1.35

1.4

And Suppose we continue sampling until 150 shafts have been measured What Statements Can You Make About Our Process ?

Tech-Pro Consultants

Dot Plots

:: :::. . . :.. :::::: : : . :.. ..::::::::::::::: :: ::. :..: .::::::::::::::::::::::::.:::..:.: .


1.0
1.05

1.1
1.15

1.2
1.25

1.3
1.35

1.4

Now imagine the same data, grouped into intervals with bars used to represent how the data looks.

Tech-Pro Consultants

Histogram Distribution

35 30 25

F requency

20 15 10 5 0 1.0 1.05 1.1 1.15 1.2 1.25 1.3 1.35 1.4

Data represented just with the dots is called a Dot Plot Using data represented in the above bar format is called a Histogram
Tech-Pro Consultants

Histogram
Lower Specification Upper Specification

.001

1.0
1.05

1.1
1.15

1.2
1.25

1.3
1.35

1.4

2.0

Now weve combined the Histogram with our Lower and Upper Specifications.

Question #1 : What are Specifications ? Where do they come from ?

Question #2: What can you say about our process now ?
Tech-Pro Consultants

Histogram
Lower Specification Upper Specification

1.0
1.05

1.1
1.15

1.2
1.25

1.3
1.35

1.4

1.1

1.3

Suppose the customer has given us new specifications !

Question: What can you say about our process now ?

Tech-Pro Consultants

Dotplot Distribution

: : :::. . . :.. :::::: : : . :.. ..::::::::::::::: :: ::. :..: .::::::::::::::::::::::::.:::..:.: . -+---------+---------+---------+---------+------28.0 29.0 30.0 31.0 32.0
Imagine a customer service help line in which the business knows that to stay competitive, it must return the customers telephone calls in less than 30 minutes. The actual response time was measured 150 times and plotted above.

Tech-Pro Consultants

Smoothed (Normal) Distribution


Process Capability Analysis
Lower Spec Upper Spec

26

28

30

32

34

Time
Cp 1.85 Targ * Mean 30.0692 %>USL Exp 0.00 PPM>USL Exp 0

Finally, we can view the data as a smoothed distribution (red line), in this CPU 1.83 USL 35.000 Mean+3s 32.7649 Obs 0.00 Obs 0 CPL 1.88 LSL 25.000 Mean-3s 27.3735 %<LSL Exp 0.00 PPM<LSL Exp example using the normal distribution assumption. It provides an0 Cpk 1.83 k 0.014 s 0.8986 Obs 0.00 approximation of how the data might look if we were to collectObs infinite an 0 Cpm * n 150.000 number of data points
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Forming the Normal Curve


Smooth curve interconnecting the center of each bar

Center of the bar

Units of Measure

p(x > a) =
a

1 2

2 e-(1/2)[(x - )/ ] dx

Performance Limit

Given that 100% of the area under the normal curve lies between , we may calculate that area which lies beyond the performance limit. Doing so would reveal the random chance probability of creating a defect.

Area of Yield

Probability of a Defect

- infinity

a Note: The tails of the normal curve will touch the baseline at infinity.

+ infinity
Used With Permission 6 Sigma Academy Inc. 1995

Tech-Pro Consultants

Basic Statistics

Types of data

Measures of the Center of the Data Mean Median Mode


Measures of the Spread of Data Range Variance Standard Deviation Shape: Normal Distribution and Normal Probabilities

Process Stability and Process Capability


Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Data Example
(Actual # of Days from Order to Ship)

140 145 160 190 155 165 150 190 195 138 160 155 153 145

170 175 175 170 180 135 170 157 130 185 190 155 170 155

215 150 145 155 155 150 155 150 180 160 135 160 140 142

130 155 150 148 155 150 140 180 190 145 150 164 112 125

136 123 155 140 120 130 138 121 125 116 145 150 102 115

130 120 130 131 120 118 125 135 125 118 122 115

150 110 116 108 95 125 133 110 150 108

Tech-Pro Consultants

Where is the Center of the Data? Decribed in 2 ways:

Mean = The average value (the Center of Gravity)

Median = the 50% point, (or the middle number)


To find the median of a data set,

X=

Sum of the data points Number of data points

(1) arrange data in order from smallest to largest (2) the middle number is the median!

- Uses all data points - Heavily influenced by extreme values

1, 2, 3, 14, 85
The median is 3
- Not heavily influenced by extreme values

Tech-Pro Consultants

As head of the universitys Communications Dept. you are asked to summarize the average starting salaries of Communications graduates.

$10, 20, 30, 40, 50 ($ in thousands)

What is the average income (or center of gravity)?

What is the median income?

However, under the advice of the Public Relations Dept. you consider to including one of your former Communications majors: Shaquille ONeal (a rather wealthy rookie basketball star)

$10, 20, 30, 40, 5000 ($ in thousands)

What is the average income (or center of gravity)?


Tech-Pro Consultants

What is the median income?

Where is the Center?

Mode (not used as much): The value that occurs most often.

The Mode may not exist; and if does exist, it may not be unique. -Can be used with categorical/attribute data What is the mode for the following set of defect data? # of change notices issued: -Price change: 13 -Spec change: 112 -Ship to address change: 40 -Delivery date changed: 79
Tech-Pro Consultants

What does Bimodal mean?

Breakout Example

Suppose

your son or daughter is considering going to work for a small, family owned business after graduation. The owner of the business proudly states that, of the last 7 college graduates hired, the mean salary was $25,000; the salaries were bimodal, with modes of $18,000 and $20,000; and the median salary was $19,000. He refuses to identify the individual salaries
Use

your knowledge of the mean, median, and mode to analyze the starting salary opportunities with this company. (Round all salaries to the nearest $1,000)

From Introductory Statistics William D. Ergle Tech-Pro Consultants

Exercise
Minitab can easily calculate the Mean and Median

1. Open up Minitab 2. Open file: Distskew.mtw 3. Perform The Following Stat> Basic Statistics> Descriptive Statistics> 4. Enter The Variables Names 5. Evaluate Results

Tech-Pro Consultants

Descriptive Statistics For 3 Distributions

TABULAR FORM
Variable Normal Pos Skew Neg Skew N 500 500 500 Mean 70.000 70.000 70.000 Median 69.977 65.695 73.783 TrMean 70.014 68.554 71.368 StDev 10.000 10.000 10.000

Look For This In Your Session Window !


Tech-Pro Consultants

Graphical Form

Tech-Pro Consultants

Different Distributions
Sketch in the Means and Medians on each Distribution.

Comparison of Distributions.
300
300

Comparison of Distributions.

Frequency

Frequency

200

200

Tail
100 0 0 10 20 30 40 50 60 70 80

100

Tail
60 70 80 90 100 110 120 130

C3

C2

Negative Skew

Positive Skew

Symmetric Distribution
Comparison of Distributions.
100

Frequency

50

0 20 30 40 50 60 70 80 90 100 110

C1

Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Graphical Reminder
* The 3 Charts On The Previous Page Were Created Under The Minitab Histogram Option Graph>Histogram

Tech-Pro Consultants

Relationship Of The Mean & Median

300

Mean

Median

Median
300

Mean

Frequency

Frequency
0 10 20 30 40 50 60 70 80

200

200

100

100

0 60 70 80 90 100 110 120 130

Neg Skew

Pos Skew

Mean, Median
100

Frequency

50

0 20 30 40 50 60 70 80 90 100 110

Normal
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Basic Statistics

Types of data Measures of the Center of the Data Mean Median Mode Measures of the Spread of Data Range Variance Standard Deviation Normal Distribution and Normal Probabilities

Process Stability and Process Capability


Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Population Parameters vs Sample Statistics


Examples of SAMPLE:
1000 US Citizens Hrs. Worth of Acct. Pay Handful of Sand

Examples of POPULATION:
Entire United States Yrs. Worth of Acct. Payable Every Grain of Sand On The Beach

X
= Population Mean = Population Standard Deviation

= Sample Mean

s=^

= Sample Standard Deviation

Tech-Pro Consultants

3 ways to describe how far the data is spread:

Range = R

the difference between largest and smallest observations

Standard Deviation = s

Variance = s2 (just the square of the std dev!)

Tech-Pro Consultants

CLASS EXERCISE

Calculate manually the Variance and Standard Deviation of These 5 Data Points

X
5 4 3 1 2

X
Avg = ___

X
2

X
4

Sum of the data points Number of data points

-1

S2
Divide the Sum by (n-1): = Variance = S2 = __________

Sum of the last column = _______

Square Root of the Variance = Std.Dev. = S = _________

Tech-Pro Consultants

CLASS EXERCISE

Calculate manually the Variance and Standard Deviation of These 5 Data Points

X
5 4 3 1 2

X
Avg = 3

X
2 1 0 -2 -1

X
4 1 0 4 1

Sum of the data points Number of data points

S2
Divide the Sum by (n-1): = Variance = S2 = 2.5
Instructor Page

Sum of the last column = 10

Square Root of the Variance = Std.Dev. = S = 1.58

Tech-Pro Consultants

Computational Equations
N

Population Mean

Xi =
N
i 1

Population Standard Deviation

(X i =
i=1

)2

N
n

Sample Mean

xi

x=
N

i=1

n
(X i X )2
Used With Permission 6 Sigma Academy Inc. 1995

Sample Standard Deviation

s=
Tech-Pro Consultants

i=1

n -1

The Standard Deviation

Point of Inflection

The distance between the point of inflection and the mean constitutes the size of a standard deviation. If three such deviations can be fit between the target value and the specification limit, we would say the process has three sigma capability.

1
Upper Specification Limit (USL) Target Specification (T) Lower Specification Limit (LSL) Mean of the distribution ( ) Standard Deviation of the distribution ( )

p(d) USL

Used With Permission 6 Sigma Academy Inc. 1995

Tech-Pro Consultants

Basic Statistics

Types of data

Measures of the Center of the Data Mean Median Mode


Measures of the Spread of Data Range Variance Standard Deviation Normal Distribution and Normal Probabilities Process Stability and Process Capability
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

The Normal Distribution


The Normal Distribution is a distribution of data which has certain consistent properties These properties are very useful in our understanding of the characteristics of the underlying process from which the data were obtained Most natural phenomena and man-made processes are distributed normally, or can be represented as normally distributed

Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

The Normal Distribution


Property 1: A normal distribution can be described completely by knowing only the:
mean, and standard deviation
Distribution One

Distribution Two Distribution Three

What is the difference among these three normal distributions?


Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Statistical Number Line

Exercise

300
add 10 add 10 add 10

X Axis
(pounds)

Suppose the weights of players on a football team had =300 lbs and =10 lbs You fill in the X-axis values (weights) above

Tech-Pro Consultants

Statistical Number Line

270

280

290

300

310

320

330

X Axis
(pounds)

Exercise

Suppose the weights of a football team had =300 lbs and =10 lbs You fill in the X-axis values (weights)
Tech-Pro Consultants

Instructor Page

68%

+ 1 = 68%of the individuals

X Axis
270 280 290 300 310 320 330 (pounds)

Tech-Pro Consultants

Instructor Page

95%

+ 2 = 95%of the individuals

X Axis
270 280 290 300 310 320 330 (pounds)

Tech-Pro Consultants

Instructor Page

99.7%

+ 3 = 99.7%of the individuals

X Axis
270 280 290 300 310 320 330 (pounds)

Tech-Pro Consultants

Instructor Page

The Normal Curve and Probability Areas Associated with the Standard Deviation

Property 2: The area under sections of the curve can be used to estimate the cumulative probability of a certain event occurring
Probability of sample value
40% 30% 20% 10% 0% -4 -3 -2 -1 0 1 2 3 4
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

68%

Cumulative probability of obtaining a value between two values

95%

99.73%

Number of standard deviations from the mean


Tech-Pro Consultants

Empirical Rule of Standard Deviation


The previous rules of cumulative probability apply even when a set of data is not perfectly normally distributed. Lets compare the values for a theoretical (perfect) normal distributions to empirical (real-world) distributions

Number of Standard Deviations

Theoretical Normal

Empirical Normal

+/- 1

68%
95% 99.7%
Tech-Pro Consultants

60-75%
90-98% 99-100%
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

+/- 2
+/- 3

How can I tell if my data is bell-shaped? (i.e., Normally Distributed)

Tech-Pro Consultants

Normal Probability Plots

We can test whether a given data set can be described as normal with a test called a Normal Probability Plot If a distribution is close to normal, the normal probability plot will be a straight line. Minitab makes the normal probability plot easy. Using Distskew.Mtw. Choose: Stat>Basic Stats>Normality Tests Produce a normal plot of each of the first 3 columns. Which appear to be normal?

Tech-Pro Consultants

3 Ways To See If Your Data Is Normally Distributed


Normal Distribution

Normal Probability Plots


.999
100

.99 .95

Probability

.80 .50 .20 .05 .01 .001

Frequency

50

0 20 30 40 50 60 70 80 90 100 110
Average: 70 Std Dev: 10 N of data: 500

26

36

46

56

66

76

86

96

106

Normal
Anderson-Darling Normality Test A-Squared: 0.418 p-value: 0.328

C1

Positive Skewed Distribution

Normal Probability Plots


300

.999 .99

Frequency

200

.95 .80 .50 .20

100

.05 .01 .001


0

60
60 70 80 90 100 110 120 130
Average: 70 Std Dev: 10 N of data: 500

70

80

90

100

110

120

130

Pos Skew
Anderson-Darling Normality Test A-Squared: 46.447 p-value: 0.000

C2

Normal Probability Plots


300

Negative Skewed Distribution

.999 .99 .95 .80

Frequency

Probability

200

.50 .20 .05 .01 .001

100

If the Normality Test shows a P-value that is less than 0.05, then the data is NOT represented well by a normal distribution
Used With Permission AlliedSignal 1995 Dr. Steve Zinkgraf

0 0 10 20 30 40 50 60 70 80
Average: 70 Std Dev: 10 N of data: 500

Probability

10

20

30

40

50

60

70

80

C3

Neg Skew
Anderson-Darling Normality Test A-Squared: 43.953 p-value: 0.000

Tech-Pro Consultants

P Value for Normality Test

If your P value is less that than .05, then the data is NOT approximately normal.

Tech-Pro Consultants

Mystery Distribution

Generate a Normal Probability Plot for the Mystery variable in Mystery.mtw What is your conclusion? Is this a normal distribution?

Mystery Distribution

.999 .99 .95

Probability

.80 .50 .20 .05 .01 .001

50

100

150

Mystery
Average: 100 Std Dev: 32.3849 N of data: 500 Anderson-Darling Normality Test A-Squared: 27.108 p-value: 0.000

Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Central Limit Theorem

Various sampling distributions of individual measurements Random sample of g sets with n measurements assigned to each set

For more detail, see the next few pages.

X X
The central limit theorem states that the distribution of the sample means, our estimate of , can be approximated with a normal distribution even though the original population may be non-normal. Given this, we may say that the grand average (resulting from averaging sets of samples) approaches the universe mean as the number of sample sets approaches infinity. This property is at the core of many statistical tests and is very important for resolving a wide array of industrial problems. Tech-Pro Consultants

Used With Permission

6 Sigma Academy Inc. 1995

Important Distinctions:
The Distribution of Averages
VS

The Distribution of Individuals

Tech-Pro Consultants

What would the Distribution of Individuals look like? The Distribution of Individuals

?
96
85 74 Y = Lifetime(Hrs) 96 85 74 Y = Lifetime(Hrs)

= Individual Measurement
Flashlight

= Average of the Subgroup


Tech-Pro Consultants

What would the Distribution of Individuals look like? The Distribution of Individuals

96

85 74 Y = Lifetime(Hrs)

96

85 74 Y = Lifetime(Hrs)

= Individual Measurement
Flashlight

= Average of the Subgroup


Tech-Pro Consultants

What would the Distribution of Averages look like?

The Distribution of Averages

10 9.5 10.5 Y = Weight (lbs)

= Individual Measurement = Average of the Subgroup

Tech-Pro Consultants

What would the Distribution of Averages look like?

The Distribution of Averages

10 9.5 10.5 Y = Weight (lbs)

= Individual Measurement = Average of the Subgroup

Tech-Pro Consultants

Distribution of Individuals
1 point is ... Histogram is... Spread is...

Distribution of Averages
1 Avg (i.e., 1 X-Bar) A Pile of X-Bars
n

1 Individual A Pile of Individuals


X
Compressed by

SE(Mean)
X

The question might be...

What is the probability that an individual battery will last beyond 87 hours?

What is the probability that the average lifetime of an n=20 sample will exceed 87 hours?

Graphically...

74

85

96

74

85

96

Tech-Pro Consultants

__ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __

97

__ 97 Dist. of Avgs spread compresses by factor of _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ _ _ __ 95

95

X
93

93

91

91

89

89

Individuals

87

87

85

85

83

83

81

81

79

79

77

77

75

75

n=1

73

n=2

n=4

n=12
Tech-Pro Consultants

n=20

n=50

73

Basic Statistics

Types of data Measures of the Center of the Data Mean Median Mode Measures of the Spread of Data Range Variance Standard Deviation Normal Distribution and Normal Probabilities Process Stability and Process Capability
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Basic Statistics

Variability Is the process on target with minimum variability? We use the mean to determine if process is on target.

We use the Standard Deviation determine variability Stability How does the process perform over time? Represented by a constant mean and predictable variability over time.

X-Bar Chart for Process A


UCL=77.20 75 80

X-Bar Chart for Process B

UCL=77.27

Sample Mean

Sample Mean

70

X=70.98 LCL=64.70

X=70.91 70

60

65 0 5 1 0 1 5 20 25

LCL=64.62 50 0 5 1 0 1 5 20 25

Sample Number

Sample Number

Which process is the best process?


Tech-Pro Consultants

Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Variation

While every process displays Variation, some processes display controlled variation, while other processes display uncontrolled variation (Walter Shewhart). . Controlled Variation is characterized by a stable and consistent pattern of variation over time. Associated with Common Causes. Uncontrolled Variation is characterized by variation that changes over time. Associated with Special Causes. Process A shows controlled variation. Process B shows uncontrolled variation
X-Bar Chart for Process A
UCL=77.20 75

X-Bar Chart for Process B

80 UCL=77.27

Sample Mean

Sample Mean

70

X=70.98 LCL=64.70

X=70.91 70

60

65 0 5 1 0 1 5 20 25

LCL=64.62 50 0 5 1 0 1 5 20 25

Sample Number

Sample Number

Special Causes
Tech-Pro Consultants

Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Can We Tolerate Variability ?


There will always be variability present in any process We can tolerate variability if The total variability of the Output is relatively small compared to the process specifications and the process is on target The process is stable over time
LSL Nom USL

Traditional
Cost Acceptable Goal Post Mentality

OLD
LSL Nom USL

New Cost

Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Expanding On The Goal Post Mentality


LSL Nom USL

3 Points

UNDER THE OLD RULES, The field goal kicker gets 3 points for his team as long as the ball falls between the LSL and USL.
Tech-Pro Consultants

Expanding On The Goal Post Mentality


LSL Nom USL

Points

UNDER THE NEW RULES, The Field Goal Kicker Might Get... 3 points Target & +/-1 2 points Between +/-1 & +/-2 1 point > +/-2 Out To The LSL & USL

Tech-Pro Consultants

Data Analysis Tasks For Improvement

Determine If Process Is Stable


If process is not stable, identify and remove causes of instability

Determine The Location Of The Process Mean.


Is It On Target?

If not, identify the variables which affect the mean and determine optimal settings to achieve target value

Estimate The Magnitude Of The Total Variability.


it acceptable with respect to the customer requirements (spec limits)?

Is

If not, identify the sources of the variability and eliminate or reduce their influence on the process
Used With Permission AlliedSignal 1995 - Dr. Steve Zinkgraf

Tech-Pro Consultants

Visualizing the Process Dynamics - Is The Process Stable ?


Inherent Capability of the Process . . . also called short-term capability

Time 1

Time 2 Time 3

Time 4

Sustained Capability of the Process . . . also called long-term capability

General Assumptions:: Over time, a typical process will shift and drift by approx. 1.5

LSL

T
Tech-Pro Consultants

USL
Used With Permission 6 Sigma Academy Inc. 1995

The Goal Is ...


Variables Data Attributes Data

Target

0% Rejected
Tech-Pro Consultants

How We Progress Toward The Goal

PHASE TWO - Stability


- IN CONTROL - NATURAL VARIATION ONLY

PHASE ONE - Unpredictable Performance


- VARIATION (SPECIAL / NATURAL CAUSES) - UNPREDICTABLE (HOURLY, DAILY) - DETECT AND ELIMINATE SPECIAL CAUSES
Tech-Pro Consultants

Not capable of getting all the water output into the clowns mouth?

How We Progress Toward The Goal


IN CONTROL AND CAPABLE

Lower Specification

Upper Specification

.001

1.0 1.05

1.1 1.15

1.2 1.25

1.3 1.35

1.4

2.0

(Variation from common causes reduced)

LOWER SPECIFICATION LIMIT

UPPER SPECIFICATION LIMIT


SIZE

IN CONTROL, BUT NOT CAPABLE (Variation from common causes excessive)

Now it is capable of getting all the water output into the clowns mouth

Tech-Pro Consultants

Is The Process on Target ? - Accurate ?


Manufacturing Distribution of the Widget Part

LSL
Increase in nonconformance due to shift in process centering
1.233 1.235 1.237 1.239

USL

T
1.241 1.243 1.245 1.247

Part
Recognize that the process center (m) is independent of the design center (T). In other words, the ability of a process to repeat any given centering condition is independent of the design specifications.

Used With Permission 6 Sigma Academy Inc. 1995

Tech-Pro Consultants

Is The Process on Target ? - Precise?


Manufacturing Distribution of the Widget Part

LSL T
1.235 1.237 1.239 1.241

USL

1.243

1.245

1.247

Part
Recognize that the process width is independent of the design width. In other words, the inherent precision of a process is not determined by the design specifications.

Used With Permission 6 Sigma Academy Inc. 1995

Tech-Pro Consultants

Is The Variability Acceptable To Customer Requirements ?

LSL

USL

Y = f

(X1 . . . XN)

The variation inherent to any dependent variable (Y) is determined by the variations inherent to each of the independent variables.

Poor Process Capability


Very High Probability of Defects Very High Probability of Defects Very Low Probability of Defects

Excellent Process Capability


Very Low Probability of Defects

LSL

USL
Tech-Pro Consultants

LSL

USL

Used With Permission 6 Sigma Academy Inc. 1995

Summary
Reviewed & Enhanced The Basic Statistical & Quality Terms Needed For Six Sigma Process Improvement Began to Build Up Minitab Operating Skills

Tech-Pro Consultants

Six Sigma

Q&A

Tech-Pro Consultants

Six Sigma

Thank You

Tech-Pro Consultants

Das könnte Ihnen auch gefallen