Beruflich Dokumente
Kultur Dokumente
Dr S G Deshmukh
What is a Decision?
Decision
A reasoned choice among alternatives
Examples:
Where to advertise a new product What stock to buy What movie to see Where to go for dinner Where to locate a new plant Which mode of transportation to choose
Decision Elements
Decision Statement
What are we trying to decide?
Alternative:
What are the options?
Decision Criteria:
How are we going to judge the merits of each alternative?
Design
Diagnosing the problem/opportunity Generating alternatives
Choice
Choosing the best alternative
Types of Decisions
Type of structure - Nature of task
Structured Unstructured
Nature of Decision
Structured Problems Routine and repetitive with standard solution Well defined decision making procedure Given a well-defined set of input, a well defined set of output is defined Semi-structured Problems Has some structured aspect Some of the inputs or outputs or procedures are not well defined Unstructured Problems All phases of decision making process are unstructured Not well defined input, output set and procedures
6
Scope of Decision
Operational Planning and Control:
Focus on efficient and effective execution of specific tasks. They affect activities taking place right now E.g... What should be today's production level
Strategic Planning
Long-range goals and policies for resource allocation E.g... What new products should be offered
7
Intuition
Judgment
High Level of Uncertainty Little Precedent to Follow Variables Less Scientifically Predictable When Facts Are Limited When Facts Do Not Clearly Point the Way to Go When Analytical Data Are of Little Use Several Alternatives, With Good Arguments for Each Time Is Limited
Observation..
We face numerous decisions in life & business. We can use Statistics to analyze the potential outcomes of decision alternatives.
10
Few examples..1..
AMUL (Largest Milk producer in India)
Must determine product mix Schedules must meet timely requirements for perishable items Developed optimization model to determine the above (integrated Production-distribution) model:
12
Few examples..2..
Samsung Electronics Leading consumer electronics manufacturer Semiconductor facilities cost $ 2-3 billion High equipment utilization is key Developed comprehensive planning and scheduling system to control WIP Benefits: Cut cycle times in half
13
14
Quantitative Analysis
Quantitative Analysis Process Model Development Data Preparation Model Solution Report Generation
Mathematical Model
MODEL
Definition of the Problem * Construction of the Model Solution of the Model * Validation of the Model Implementation of the Final Result
Implementation
INTERPRETATION
SOLUTION ANALYSIS
Characteristics of Models
Model is an abstraction of reality Models are usually simplified versions of the things they represent A valid model accurately represents the relevant characteristics of the object or decision being studied
19
Benefits of Modeling
Economy - it is often less costly to analyze decision problems using models. Timeliness - models often deliver needed information more quickly than their realworld counterparts. Feasibility - models can be used to do things that would be impossible. Models give us insight & understanding that improves decision making.
20
Statisticians collect and analyze data, then calculate results using a specific design. They are able to draw conclusions and make decisions in the face of uncertainty.
21
What Statisticians Do
Statisticians look for patterns in data to help make decisions in business, industry, and the biological, physical, psychological, and social sciences. Statisticians help make important advances in scientific research and work in opinion polling, market research, survey management, data analysis, statistical experiments, and education. Statisticians use quantitative abilities, statistical knowledge, and computing and communication skills to collaborate with other scientists to work on challenging problems
22
Statistics
a research question(s) (hypothesis) Collect data Analyze and summarize data Draw conclusions to answer research question(s)
Statistical Inference
In
23
24
Variation
What if everyone:
Looked
How many people would you have to interview to know everything about the population with regard to looks, thoughts, and beliefs?
25
Variation
Interviews or observations are required on multiple members of the population for valid conclusions about population characteristics.
26
Variation
Variation is everywhere
Individuals Repeated
27
29
30
Sampling biases:
One
group in a population is overrepresented compared to another. Example: New Longitudinal Study Finds that Having a Working Mother Does No Significant Harm to Children. The sample was not representative of average or higher income families.
31
Sampling biases:
One
group in a population is overrepresented compared to another. Example: Ms Agony asked readers of her column in Readers Digest whether they would have children again if they had it to do over. 70% of respondents said NO. Was the sample representative of all parents? Her invitation attracted parents that regretted having children. Most parents do not regret having children based on scientific studies 32 selecting random samples of parents.
of 2004, full-time employed women earned on average only about 76 percent as much as full-time employed men Does this difference show that women are discriminated against? Occupation has been ignored. More men have received training for higher paying jobs. 33
34
Did
the new jail really cause the decrease in crime? Or did the decrease just happen when the new jail opened?
35
Variables any characteristic of an individual. A variable can take different values for different individuals.
36
Variables
A variable can be: Numerical/Quantitative:
age:
Categorical/Qualitative:
sex:
Variables
of numbers
Age categories
PAN
38
Variables
supports expanded Bus schedules? Does a new diet help weight loss? Does taking aspirin prevent heart attacks? Which rivers are polluted?
39
Data representation Various Probability Distributions Discrete (Binomial, Geometric, Poisson, Uniform etc.) Continuous (Uniform, Exponential, Normal etc.) Central Limit Theorem Moment generating functions Distribution of Sample Means Point Estimates Confidence Interval Type I and Type II errors Hypothesis Testing Regression Anova DOE Non-parametric tests
40
Census gathering data from the entire population Sample a portion of the whole
43
Population Snapshot
45
Most theory about sample means depends on assumptions that the mean comes from a normal distribution. The Central Limit Theorem says that for any population, if the sample size is large enough, the sample means will be approximately normally distributed with the mean equal to the population mean and standard deviation equal to the population standard deviation divided by the square root of n (/n).
46
Normal Distribution
Mother of all !
normal variate (Z) ~ N(, 2 ) 2 : Chi-Square Square of Z t distribution small sample size F Distribution ~ Ratio of 2
Standard Approximation
estimate
X X=
n
XZ n or XZ X+Z n n
48
Interval
Estimate
2
1
Z
2
2
X Z
49
Z
2
X + Z ] = 1 2 n n
50
95%
X X X X X X
51
( X X )
n 1
( n 1) S =
2
degrees of freedom = n - 1
52
( n 1) S
2 2
( n 1) S
1 2
df = n 1
= 1 level of confidence
53
Selected 2 Distributions
df = 3 df = 5 df = 10
54
Statistical Significance
Significance is a statistical term that tells how sure you are that a difference or relationship exists. To say that a significant difference or relationship exists only tells half the story. We might be very sure that a relationship exists, but is it a strong, moderate, or weak relationship? After finding a significant relationship, it is important to evaluate its strength. Significant relationships can be strong or weak. Significant differences can be large or small. It just depends on your sample size.
55
One important concept in significance testing is whether you use a onetailed or two-tailed test of significance. The answer is that it depends on your hypothesis. When your research hypothesis states the direction of the difference or relationship, then you use a one-tailed probability. For example, a one-tailed test would be used to test these null hypotheses: Females will not score significantly higher than males on an IQ test. Blue collar workers are will not buy significantly more product than white collar workers. Superman is not significantly stronger than the average person. In each case, the null hypothesis (indirectly) predicts the direction of the difference. A two-tailed test would be used to test these null hypotheses: There will be no significant difference in IQ scores between males and females. There will be no significant difference in the amount of product purchased between blue collar and white collar workers. There is no significant difference in strength between Superman and the average person. The one-tailed probability is exactly half the value of the two-tailed probability.
56
If your statistic is higher than the critical value from the table
Your finding is significant.
You
reject the null hypothesis. The probability is small that the difference or relationship happened by chance, and p is less than the critical alpha level (p < alpha ).
58
If your statistic is lower than the critical value from the table
Your finding is not significant.
You
fail to reject the null hypothesis. The probability is high that the difference or relationship happened by chance, and p is greater than the critical alpha level (p > alpha ).
59
Partition of Total Sum of Squares in Partition of Total Sum of Squares in RBD RBD
SST (Total Sum of Squares)
Regression analysis is the process of constructing a mathematical model or function that can be used to predict or determine one variable by another variable. Correlation is a measure of the degree of relatedness of two variables.
61
bivariate (two variables) linear regression -- the most elementary regression model dependent variable, the variable to be predicted, usually called Y independent variable, the predictor or explanatory variable, usually called X
62
Regression Models
x
Probabilistic Regression Model Y = 0 + 1X + 0 and 1 are population parameters 0 and 1 are estimated by sample statistics b0 and b1
63
64
XY nXY = X n X
2
( X )( Y ) XY
2
X n
Y X b =Y b X = n b n
0 1 1
65
( X )( Y )
n
2
( X X )
= X
X n
b1 =
SS XY SS XX
Y X b = Y b X = n b n
0 1 1
66
Parametric Statistics are statistical techniques based on assumptions about the population from which the sample data are collected. Assumption that data being analyzed are randomly selected from a normally distributed population. Requires quantitative measurement that yield interval or ratio level data. Nonparametric Statistics are based on fewer assumptions about the population and the parameters. Sometimes called distribution-free statistics. A variety of nonparametric statistics are available for use with nominal or ordinal data.
Etc.
67
Compare two unpaired Unpaired t test groups Compare two paired groups Paired t test
69
Applications of statistics..
Statistical Quality control Simulation Six-sigma
70
Statement of purpose/question of interest Summary of data collection e.g. random sample, stratified sample, available data identify possible sources of bias Why do you believe sample was representative? Summarize the data (concise, well-labeled, easy to read) Numerical or quantitative data Graphs: Pie diagram or histogram measures of central tendency (e.g. mean or median) measures of spread (e.g. range, SD, IQR) a check for outliers (e.g. z scores,) a check for normality (prob. plot, 68-95-99.7 rule) if needed by your analysis Quantitative data Graphs: pie chart or bar graph 71 Proportion in each category
Statistical inference Quantitative data e.g. confidence intervals for mean(s), hypothesis test for mean(s), regression, ANOVA Qualitative data Include a discussion of why our method is appropriate Diagnostics Verification of any assumptions made during statistical inference Interpretation/Explanation of results What does it all mean? Use the above summaries to justify your interpretation Suggest reasons for what you have observed Overall conclusion, recommendations, future questions
72
Observation..
The objective of all experimental design, as well as of statistical methods in general, is to get the greatest amount of accurate information for the outlay of manpower, time, and money. Without a working knowledge of statistical methods no analyst can expect to reach that goal.
73
74