Introduction To Inference

Chapter 14.
Introduction to Inference
Topics covered in this chapter: Estimating with a 95% Confidence Interval Estimating with a 99% Confidence Interval Significance Test with a One-sided P-value Significance Test with a Two-sided P-value
Estimating with a 95% Confidence Interval (s is known)

Example 14.3: Healing of skin wounds The Problem: Biologists studying the healing of skin wounds measured the rate at which new cells closed a razor cut made in the skin of an anesthetized newt. Estimate the mean healing rate using a 95% confidence interval. 1. Open dataset eg14-03.por. 2. Find the sample statistics. a. Click on Analyze. b. Click on Descriptive Statistics. c. Click on Descriptives. d. Move Rate to the Variable(s) box. e. Click OK.
103
Introduction to Inference104
Here we are given the sample mean and sample standard deviation. However, we are given = 8 in this example. We will simply use the sample mean from this output. 3. Find the confidence interval. a. Click on Variable View. b. Add the variables xbar, sigma, and n. Change the number of decimals of xbar to 4.
c. Click on Data View. d. Under xbar, type 25.6667 for the sample mean given in the table above. e. Under sigma, type 8 for the population standard deviation. f. Under n, type 18 for the sample size.
105Chapter 14
4. Compute the confidence interval. For 95% confidence intervals, remember to use z* = 1.96. a. Click on Transform. b. Click on Compute Variable. c. Under Target Variable, type lowerbound. d. Under Numeric Expression, type xbar 1.96 * sigma / SQRT(n).
e. Click OK. f. Repeat steps c through e, using upperbound and the expression: xbar + 1.96 * sigma / SQRT(n).
The 95% confidence interval for the mean healing rate for all newts of this particular species in micrometers per hour is (21.97, 29.36).
Estimating with a 99% Confidence Interval (s is known)

Example (Apply Your Knowledge 14.5): IQ test scores The Problem: Make a stemplot and construct a 99% confidence interval for the IQ scores of 31 seventh-grade girls in a Midwest school district. 1. Make a stemplot. a. Open the data set ex14-05.por. b. Click on Analyze. c. Click on Descriptive Statistics. d. Click on Explore. e. Choose the variable IQ to be in the Dependent List. f. Under Display, choose Plots. g. Click on the Plots button at the right side of the window. h. For Descriptive, choose Stem-and-leaf. i. Click Continue. j. Click OK. The stemplot will show as below in the output:
107Chapter 14
2. Get the necessary descriptive statistics. a. Click on Analyze. b. Click on Descriptive Statistics. c. Click on Descriptives. d. Choose IQ as the Variable(s). e. Click OK.
3. Add the variables to compute the 99% confidence interval. a. Click on Variable View. b. Add the variables xbar, sigma and n. c. Click on Data View. d. Under xbar, type 105.84 in the first row. e. Under sigma, type in 15 since the population standard deviation is 15. f. Under n, type in 31 for the sample size.
4. Compute the 99% confidence interval. Recall that z* = 2.576 when computing 99% confidence intervals. a. Click on Transform. b. Click on Compute Variable. c. Under Target Variable, type lowerbound. d. Under Numeric Expression, type: xbar 2.576 * sigma / SQRT(n).
e. Click OK. f. Repeat steps c through e, instead using upperbound and the expression: xbar + 2.576 * sigma / SQRT(n).
109Chapter 14
The 99% confidence interval for the mean IQ scores for all seventh-grade girls in the school district is (98.90, 112.78).
Significance Test with a One-sided P-value

Example 14.7: Sweetening colas: one-sided P-value The Problem: Diet colas use artificial sweeteners to avoid sugar. These sweeteners gradually lose their sweetness over time. Manufacturers therefore test new colas for loss of sweetness before marketing them. Suppose that we know that for any cola, the sweetness loss scores vary from taster to taster according to a Normal distribution with standard deviation = 1. The mean for all tasters measures loss of sweetness and is different for different colas. The study of sweetness low tests the hypotheses: H0: = 0 Ha: > 0 Given the sample statistics, a one-sided P-value is calculated to determine the evidence against the null hypothesis. 1. Compute a one-sided p-value. a. Click on Variable View. b. Enter the variables z and probability. c. Increase the number of decimal places of probability to four. d. Click on Data View. e. Type 3.23 under z. 2. Calculate a one-sided p-value. a. Click on Transform. b. Click on Compute Variable. c. For target variable, type in probability.
d. e. f. g.
In numeric expression, type 1 . Under Function group, click on CDF and Noncentral CDF. Double-click on Cdfnorm. Click on z, then click on the right-facing arrow.
h. Click OK. i. Change existing variable? Click OK.
The probability of obtaining a sample mean of greater than or equal to 1.02 is 0.0006. The small p-value leads us to believe that the null hypothesis is false.
111Chapter 14
Significance test with a two-sided P-value

Example 14.8: Job satisfaction: two-sided p-value The Problem: Does the job satisfaction of assembly workers differ when their work is machine-paced rather than self-paced? Assign workers either to an assembly line moving at a fixed pace or to a self-paced setting. All subjects work in both settings in random order. This is a matched pairs design. After two weeks in each setting, the workers take a test of job satisfaction. The parameter of interest is the mean of the differences in scores in the population of all assembly workers. Similar to example 14.7, the sample statistics are given, but in this case the twosided p-value is computed. This example will be solved using nearly identical methods in example 14.7. However, in this example, z = 1.20 instead of 3.23. The one method that is different is in step 2d. The value computed needs to be multiplied by two, for a two-sided p-value. Hence, the numeric expression will read:
In this example, the p-value = .2301.
Chapter 14 Exercises
14.3 14.5 14.13 14.17 14.19 14.21 14.23 14.35 14.41 14.51 14.53 14.55 Find a critical value. IQ test scores. Sweetening Colas: find the P-value. Measuring conductivity. Measuring conductivity. Significance from a table. Testing a random number generator. I want more muscle. I want more muscle. Bone loss by nursing mothers. Bone loss by nursing mothers. Eye grease.
385
Chapter 14 SPSS Solutions
**NOTE: SPSS does not do inference based on Z distributions, nor does it perform inference on variables that are already summarized. If you really want to use SPSS for these problems or chapters, follow the instructions below (youll be basically using Transform, Compute Variable as a calculator) or use another technology (such as a graphing calculator or another statistics program like Minitab or Crunchit.)
14.3 Well use Transform, Compute Variable and IDF.Normal from the Inverse DF function group. With 97.5% in the center of a standard Normal distribution, there is 0.025/2 = 0.0125 on each end. Due to the symmetry of the distribution, we can find either z* (and remove the negative sign) or z*. z* = 2.2414. As always, if you do not see all the decimal places you want, go to the Variable View and change them.
14.5 Open data file ex14-05. Statistics, Explore.

IQ Stem-and-Leaf Plot Frequency Stem & Leaf
To create the stemplot, use Analyze, Descriptive
2.00 Extremes 2.00 8 . 4.00 9 . 9.00 10 . 10.00 11 . 2.00 12 . 2.00 13 . Stem width: Each leaf:
(=<74) 69 1368 023334578 1122244489 08 02
10 1 case(s)
This stemplot indicates there are two low outliers (=<74, namely 72 and 74). In the Descriptives block of the output, a confidence interval is given (set the confidence level using the Statistics button in the Explore dialog box).
Descriptives Statistic IQ Mean 99% Confidence Interval for Lower Bound Mean Upper Bound 105.84 98.79 112.89 Std. Error 2.563
386
This confidence interval is based on a distribution we wont meet until Chapter 17 ( the t distributions). To find the confidence interval, well calculate it by hand. We will, however, make use of the mean given above.
Based on this sample, with 99% confidence, the average IQ score of all seventh-grade girls in this school district is between 98.9 and 112.8.
14.13 When = 0, the distribution of x will be N (0, 1/ 10 = 0.316). The P-value is the area to the right of x = 0.3 on this distribution. We use Transform, Compute Variable and CDF.Normal to find the P-value is 1 0.8288 = 0.1712.
14.17 For n = 6 measurements, the standard deviation of the sampling distribution is = .2 / 6 = 0.0816. For this two-sided alternative, well double the area to the left of our observed sample mean (since these are less than the claimed mean). Well find this area using Transform, Compute Variable and CDF.Normal.
387
A sample mean of 4.98 (very close to 5) has P-value 0.8064, while the sample mean of 4.7 (much farther away from 5) has P-value 0.0002; this is much better evidence against the null as it is farther away.
14.19 Well enter the data and use Analyze, Basic Statistics, Descriptives to find the mean of these data, then compute the test by hand.
Descriptive Statistics N Conduct Valid N (listwise) 6 6 Minimum 4.73 Maximum 5.32 Mean 4.9883 Std. Deviation .23828
With the not equal (two-tailed) alternate hypothesis, the P-value of the test is twice the area below z = 0.14.
These data are not good evidence that the mean conductivity is not 5.
14.21 The P-value of this test is the area to the right of z = 1.776. We find this using CDF.Normal,
The P-value is 0.0379, so this result is significant at the = 0.05 level (P < ), but not at the 0.01 level.
14.23 We compute the test statistic and P-value below.
388
The test statistic is z = 2.20 with P-value 0.0278. This result is significant at the = 0.05 level (P < ), but not at the 0.01 level.
14.35 Again, we compute the confidence interval by hand. If you dont know the value of z*, use IDF.Normal to find it.
Based on this sample, the mean muscle gap for American young men (this is where the sample was from) is between 2.06 and 2.64 kg/m2, with 90% confidence.
14.41 This is a continuation of Exercise 14.35 (above). If we assume that is the difference womens preference minus what they have, we have hypotheses H 0 : = 0, H a : > 0. Since the alternative is greater than the P-value will be the area above the computed test statistic.
The test statistic is z = 13.29 with P-value essentially 0. We know this P-value should be very small because the 68-95-99.7 Rule states that being more than 3 standard deviations above or below the mean is extremely unusual.
14.51 Open data file ex14-51 and use Analyze, Descriptive Statistics, Explore to make the stemplot.
389
Change Stem-and-Leaf Plot Frequency 1.00 2.00 5.00 8.00 7.00 5.00 9.00 3.00 2.00 3.00 1.00 1.00 Stem width: Each leaf: Stem & -8 -7 -6 -5 -4 -3 -2 -1 -0 0 1 2 . . . . . . . . . . . . Leaf 3 08 25588 12233679 0347799 01368 011223557 008 38 234 7 2
1.0 1 case(s)
Based on the graph above, there are no strong departures from Normality, so proceeding with inference is reasonable. We use the mean computed by SPSS in calculating the confidence interval.
Descriptives Change Mean 95% Confidence Interval for Lower Bound Mean Upper Bound Statistic Std. Error -3.587 .3655 -4.323 -2.852
We are 99% confident the mean bone loss of all breast-feeding mothers is between 4.53% and 2.65%, based on this sample of 47 mothers.
14.53 If you did Exercise 14.51, you should have noted that the interval does not contain 0, indicating that breast-feeding mothers do lose bone mineral, on average, and that this result is statistically significant. Well compute the z test statistic first for this test.
390
We have a test statistic of z = 9.837 with P-value 0 (being almost 10 standard deviations below the mean ahs essentially no chance of happening). This confirms that breastfeeding mothers lose bone mineral, on average.
14.55 If is the mean difference in sensitivity, (with without grease), we have hypotheses H 0 : = 0, H a : > 0, since the question is if grease increases sensitivity increases sensitivity. We use Analyze, Descriptive Statistics, Descriptives to find the mean of the data, then compute the test and P-value.
Descriptive Statistics N Diff Valid N (listwise) 16 16 Minimum -.18 Maximum .64 Mean .1012 Std. Deviation .22633
Our test statistic is z = 1.84 with P-value 0.0329. At the 5% level, these results indicate that eye grease does increase sensitivity, on average.

Introduction To Inference

Hochgeladen von

Dokumentinformationen

Originalbeschreibung:

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Introduction To Inference

Hochgeladen von

Copyright:

Verfügbare Formate

Chapter 14.

Estimating with a 95% Confidence Interval (s is known)

Estimating with a 99% Confidence Interval (s is known)

Significance Test with a One-sided P-value

h. Click OK. i. Change existing variable? Click OK.

Significance test with a two-sided P-value

In this example, the p-value = .2301.

Chapter 14 SPSS Solutions

14.5 Open data file ex14-05. Statistics, Explore.

To create the stemplot, use Analyze, Descriptive

(=<74) 69 1368 023334578 1122244489 08 02

14.23 We compute the test statistic and P-value below.

Das könnte Ihnen auch gefallen