Sie sind auf Seite 1von 16

Chapter 14.

Introduction to Inference
Topics covered in this chapter: Estimating with a 95% Confidence Interval Estimating with a 99% Confidence Interval Significance Test with a One-sided P-value Significance Test with a Two-sided P-value

Estimating with a 95% Confidence Interval (s is known)


Example 14.3: Healing of skin wounds The Problem: Biologists studying the healing of skin wounds measured the rate at which new cells closed a razor cut made in the skin of an anesthetized newt. Estimate the mean healing rate using a 95% confidence interval. 1. Open dataset eg14-03.por. 2. Find the sample statistics. a. Click on Analyze. b. Click on Descriptive Statistics. c. Click on Descriptives. d. Move Rate to the Variable(s) box. e. Click OK.

103

Introduction to Inference104

Here we are given the sample mean and sample standard deviation. However, we are given = 8 in this example. We will simply use the sample mean from this output. 3. Find the confidence interval. a. Click on Variable View. b. Add the variables xbar, sigma, and n. Change the number of decimals of xbar to 4.

c. Click on Data View. d. Under xbar, type 25.6667 for the sample mean given in the table above. e. Under sigma, type 8 for the population standard deviation. f. Under n, type 18 for the sample size.

105Chapter 14

4. Compute the confidence interval. For 95% confidence intervals, remember to use z* = 1.96. a. Click on Transform. b. Click on Compute Variable. c. Under Target Variable, type lowerbound. d. Under Numeric Expression, type xbar 1.96 * sigma / SQRT(n).

e. Click OK. f. Repeat steps c through e, using upperbound and the expression: xbar + 1.96 * sigma / SQRT(n).

Introduction to Inference106

The 95% confidence interval for the mean healing rate for all newts of this particular species in micrometers per hour is (21.97, 29.36).

Estimating with a 99% Confidence Interval (s is known)


Example (Apply Your Knowledge 14.5): IQ test scores The Problem: Make a stemplot and construct a 99% confidence interval for the IQ scores of 31 seventh-grade girls in a Midwest school district. 1. Make a stemplot. a. Open the data set ex14-05.por. b. Click on Analyze. c. Click on Descriptive Statistics. d. Click on Explore. e. Choose the variable IQ to be in the Dependent List. f. Under Display, choose Plots. g. Click on the Plots button at the right side of the window. h. For Descriptive, choose Stem-and-leaf. i. Click Continue. j. Click OK. The stemplot will show as below in the output:

107Chapter 14

2. Get the necessary descriptive statistics. a. Click on Analyze. b. Click on Descriptive Statistics. c. Click on Descriptives. d. Choose IQ as the Variable(s). e. Click OK.

3. Add the variables to compute the 99% confidence interval. a. Click on Variable View. b. Add the variables xbar, sigma and n. c. Click on Data View. d. Under xbar, type 105.84 in the first row. e. Under sigma, type in 15 since the population standard deviation is 15. f. Under n, type in 31 for the sample size.

Introduction to Inference108

4. Compute the 99% confidence interval. Recall that z* = 2.576 when computing 99% confidence intervals. a. Click on Transform. b. Click on Compute Variable. c. Under Target Variable, type lowerbound. d. Under Numeric Expression, type: xbar 2.576 * sigma / SQRT(n).

e. Click OK. f. Repeat steps c through e, instead using upperbound and the expression: xbar + 2.576 * sigma / SQRT(n).

109Chapter 14

The 99% confidence interval for the mean IQ scores for all seventh-grade girls in the school district is (98.90, 112.78).

Significance Test with a One-sided P-value


Example 14.7: Sweetening colas: one-sided P-value The Problem: Diet colas use artificial sweeteners to avoid sugar. These sweeteners gradually lose their sweetness over time. Manufacturers therefore test new colas for loss of sweetness before marketing them. Suppose that we know that for any cola, the sweetness loss scores vary from taster to taster according to a Normal distribution with standard deviation = 1. The mean for all tasters measures loss of sweetness and is different for different colas. The study of sweetness low tests the hypotheses: H0: = 0 Ha: > 0 Given the sample statistics, a one-sided P-value is calculated to determine the evidence against the null hypothesis. 1. Compute a one-sided p-value. a. Click on Variable View. b. Enter the variables z and probability. c. Increase the number of decimal places of probability to four. d. Click on Data View. e. Type 3.23 under z. 2. Calculate a one-sided p-value. a. Click on Transform. b. Click on Compute Variable. c. For target variable, type in probability.

Introduction to Inference110

d. e. f. g.

In numeric expression, type 1 . Under Function group, click on CDF and Noncentral CDF. Double-click on Cdfnorm. Click on z, then click on the right-facing arrow.

h. Click OK. i. Change existing variable? Click OK.

The probability of obtaining a sample mean of greater than or equal to 1.02 is 0.0006. The small p-value leads us to believe that the null hypothesis is false.

111Chapter 14

Significance test with a two-sided P-value


Example 14.8: Job satisfaction: two-sided p-value The Problem: Does the job satisfaction of assembly workers differ when their work is machine-paced rather than self-paced? Assign workers either to an assembly line moving at a fixed pace or to a self-paced setting. All subjects work in both settings in random order. This is a matched pairs design. After two weeks in each setting, the workers take a test of job satisfaction. The parameter of interest is the mean of the differences in scores in the population of all assembly workers. Similar to example 14.7, the sample statistics are given, but in this case the twosided p-value is computed. This example will be solved using nearly identical methods in example 14.7. However, in this example, z = 1.20 instead of 3.23. The one method that is different is in step 2d. The value computed needs to be multiplied by two, for a two-sided p-value. Hence, the numeric expression will read:

In this example, the p-value = .2301.

Introduction to Inference112

Chapter 14 Exercises
14.3 14.5 14.13 14.17 14.19 14.21 14.23 14.35 14.41 14.51 14.53 14.55 Find a critical value. IQ test scores. Sweetening Colas: find the P-value. Measuring conductivity. Measuring conductivity. Significance from a table. Testing a random number generator. I want more muscle. I want more muscle. Bone loss by nursing mothers. Bone loss by nursing mothers. Eye grease.

385

Chapter 14 SPSS Solutions

**NOTE: SPSS does not do inference based on Z distributions, nor does it perform inference on variables that are already summarized. If you really want to use SPSS for these problems or chapters, follow the instructions below (youll be basically using Transform, Compute Variable as a calculator) or use another technology (such as a graphing calculator or another statistics program like Minitab or Crunchit.)

14.3 Well use Transform, Compute Variable and IDF.Normal from the Inverse DF function group. With 97.5% in the center of a standard Normal distribution, there is 0.025/2 = 0.0125 on each end. Due to the symmetry of the distribution, we can find either z* (and remove the negative sign) or z*. z* = 2.2414. As always, if you do not see all the decimal places you want, go to the Variable View and change them.

14.5 Open data file ex14-05. Statistics, Explore.


IQ Stem-and-Leaf Plot Frequency Stem & Leaf

To create the stemplot, use Analyze, Descriptive

2.00 Extremes 2.00 8 . 4.00 9 . 9.00 10 . 10.00 11 . 2.00 12 . 2.00 13 . Stem width: Each leaf:

(=<74) 69 1368 023334578 1122244489 08 02

10 1 case(s)

This stemplot indicates there are two low outliers (=<74, namely 72 and 74). In the Descriptives block of the output, a confidence interval is given (set the confidence level using the Statistics button in the Explore dialog box).
Descriptives Statistic IQ Mean 99% Confidence Interval for Lower Bound Mean Upper Bound 105.84 98.79 112.89 Std. Error 2.563

386

This confidence interval is based on a distribution we wont meet until Chapter 17 ( the t distributions). To find the confidence interval, well calculate it by hand. We will, however, make use of the mean given above.

Based on this sample, with 99% confidence, the average IQ score of all seventh-grade girls in this school district is between 98.9 and 112.8.
14.13 When = 0, the distribution of x will be N (0, 1/ 10 = 0.316). The P-value is the area to the right of x = 0.3 on this distribution. We use Transform, Compute Variable and CDF.Normal to find the P-value is 1 0.8288 = 0.1712.

14.17 For n = 6 measurements, the standard deviation of the sampling distribution is = .2 / 6 = 0.0816. For this two-sided alternative, well double the area to the left of our observed sample mean (since these are less than the claimed mean). Well find this area using Transform, Compute Variable and CDF.Normal.

387

A sample mean of 4.98 (very close to 5) has P-value 0.8064, while the sample mean of 4.7 (much farther away from 5) has P-value 0.0002; this is much better evidence against the null as it is farther away.

14.19 Well enter the data and use Analyze, Basic Statistics, Descriptives to find the mean of these data, then compute the test by hand.
Descriptive Statistics N Conduct Valid N (listwise) 6 6 Minimum 4.73 Maximum 5.32 Mean 4.9883 Std. Deviation .23828

With the not equal (two-tailed) alternate hypothesis, the P-value of the test is twice the area below z = 0.14.

These data are not good evidence that the mean conductivity is not 5.

14.21 The P-value of this test is the area to the right of z = 1.776. We find this using CDF.Normal,

The P-value is 0.0379, so this result is significant at the = 0.05 level (P < ), but not at the 0.01 level.

14.23 We compute the test statistic and P-value below.

388

The test statistic is z = 2.20 with P-value 0.0278. This result is significant at the = 0.05 level (P < ), but not at the 0.01 level.

14.35 Again, we compute the confidence interval by hand. If you dont know the value of z*, use IDF.Normal to find it.

Based on this sample, the mean muscle gap for American young men (this is where the sample was from) is between 2.06 and 2.64 kg/m2, with 90% confidence.
14.41 This is a continuation of Exercise 14.35 (above). If we assume that is the difference womens preference minus what they have, we have hypotheses H 0 : = 0, H a : > 0. Since the alternative is greater than the P-value will be the area above the computed test statistic.

The test statistic is z = 13.29 with P-value essentially 0. We know this P-value should be very small because the 68-95-99.7 Rule states that being more than 3 standard deviations above or below the mean is extremely unusual.

14.51 Open data file ex14-51 and use Analyze, Descriptive Statistics, Explore to make the stemplot.

389

Change Stem-and-Leaf Plot Frequency 1.00 2.00 5.00 8.00 7.00 5.00 9.00 3.00 2.00 3.00 1.00 1.00 Stem width: Each leaf: Stem & -8 -7 -6 -5 -4 -3 -2 -1 -0 0 1 2 . . . . . . . . . . . . Leaf 3 08 25588 12233679 0347799 01368 011223557 008 38 234 7 2

1.0 1 case(s)

Based on the graph above, there are no strong departures from Normality, so proceeding with inference is reasonable. We use the mean computed by SPSS in calculating the confidence interval.
Descriptives Change Mean 95% Confidence Interval for Lower Bound Mean Upper Bound Statistic Std. Error -3.587 .3655 -4.323 -2.852

We are 99% confident the mean bone loss of all breast-feeding mothers is between 4.53% and 2.65%, based on this sample of 47 mothers.

14.53 If you did Exercise 14.51, you should have noted that the interval does not contain 0, indicating that breast-feeding mothers do lose bone mineral, on average, and that this result is statistically significant. Well compute the z test statistic first for this test.

390

We have a test statistic of z = 9.837 with P-value 0 (being almost 10 standard deviations below the mean ahs essentially no chance of happening). This confirms that breastfeeding mothers lose bone mineral, on average.
14.55 If is the mean difference in sensitivity, (with without grease), we have hypotheses H 0 : = 0, H a : > 0, since the question is if grease increases sensitivity increases sensitivity. We use Analyze, Descriptive Statistics, Descriptives to find the mean of the data, then compute the test and P-value.
Descriptive Statistics N Diff Valid N (listwise) 16 16 Minimum -.18 Maximum .64 Mean .1012 Std. Deviation .22633

Our test statistic is z = 1.84 with P-value 0.0329. At the 5% level, these results indicate that eye grease does increase sensitivity, on average.

Das könnte Ihnen auch gefallen