Beruflich Dokumente
Kultur Dokumente
1) It appears that over the past 45 years, the number of farms in the United States declined while the
average size of farms increased. The data, provided by the U.S. Department of Agriculture,
appear in the Farms worksheet in the Regression Modeling HW data workbook on Moodle. The
data show five-year interval data for US farms.
Intercept – This would be the average farm size when there are no farms. This has no
function meaning in the context of this problem and is outside the scope of the data.
Slope – This represents the change in the average farm size (in acres) for a one unit (one
million) increase in number of farms. The negative coefficient indicates a negative
correlation or that the variables move in opposite directions. Notice from the raw
data that the number of farms has been decreasing over time so I would interpret this
coefficient in the context of a decrease. Interpretation: Each one million farm
decrease corresponds to an increase of 72.3 acres in the average farm size.
Page: 1 of 11
QMM5100 W19 Multiple Regression HW Solution
Verifying this with a hypothesis test you fail to reject Ho, cannot conclude the data is not
Normally distributed. In other words not enough deviation from a Normal distribution to violate
the regression model assumption.
Page: 2 of 11
QMM5100 W19 Multiple Regression HW Solution
2) Larry Swattle, a local attorney, is interested in understanding the relationship between the outside
temperature and his energy usage. Larry finds his electric bills for the last 24 months and records the
electric consumption in Kilowatt Hours. He then uses the internet to find the average monthly
temperature in Fahrenheit for his city for those months. The data appear in the Electricity worksheet
of the Multiple Regression HW data workbook on Moodle.
a) Fit the simple linear regression model using monthly electric consumption as the dependent or
Y variable and average daily temperature as the dependent or X variable. Analyze the
residuals to determine if they satisfy the three regression model assumptions.
Page: 3 of 11
QMM5100 W19 Multiple Regression HW Solution
Prediction: 586.13
d) Determine the leverage for each observation. Do any of the data points have excess leverage?
e) Remove any data points with excess leverage and fit the simple linear regression model
using monthly electric consumption as the dependent or Y variable and average daily
temperature as the dependent or X variable. Using the model without points that have
excess predict electric consumption for a month with an average temperature of 60. Do
the point(s) with excess leverage appear to unduly influence the model?
Page: 4 of 11
QMM5100 W19 Multiple Regression HW Solution
3) The quality of the orange juice produced by a manufacturer (e.g. Minute Maid, Tropicana) is
constantly monitored. There are numerous sensory and chemical components that combine
to make the best tasting orange juice. For example, one manufacturer has developed a
quantitative index of the “sweetness” of orange juice where the higher the index, the sweeter
the juice. To determine if there is a relationship between the sweetness index and the parts
per million (ppm) of water soluble pectin (a chemical measure of the juice) data was
collected on 24 production runs at a juice plant. The data appear in the Sweetness worksheet
of the Multiple Regression HW data workbook on Moodle.
c) Analyze the residuals to determine if they satisfy the three regression model assumptions.
Page: 5 of 11
QMM5100 W19 Multiple Regression HW Solution
d) Use a hypothesis test to determine if any of the data points have too much leverage.
4 4
Critical Value: 0.167
n 24
For runs 11 and 16 the Test Statistic exceeds the critical value, therefore runs 11 and 16
may have undue influence on the model.
Page: 6 of 11
QMM5100 W19 Multiple Regression HW Solution
e) Remove any data points that have excess leverage and refit the regression model. Does
removing these data points change the model?
Page: 7 of 11
QMM5100 W19 Multiple Regression HW Solution
Weight = -362.48 + 2.574 Height + 2.948 Neck + 0.692 Abdomen + 1.856 Hip
Intercept: You would expect a person 0 inches tall with a neck, abdomen and hips that all
have 0 circumference to weigh -362.48 pounds. This value has no meaning in this
problem.
Height: Each additional inch of height adds 2.574 pounds to the average weight, holding
Neck, Abdomen and Hip circumference constant.
Neck: Each additional centimeter of neck circumference adds 2.948 pounds to the
average weight, holding Height and Abdomen and Hip circumference constant.
Hip: Each additional centimeter of hip circumference adds 1.856 pounds to the average
weight, holding Height and Neck and Abdomen circumference constant.
c) Provide a point prediction for the weight of a male 70 inches tall with a 39cm neck, 94cm
abdomen and a 100cm hip.
Weight = -362.48 + 2.574 (70) + 2.948 (39) + 0.692 (94) + 1.856 (100) = 183.33
Page: 8 of 11
QMM5100 W19 Multiple Regression HW Solution
d) Provide a 95% confidence interval for the average weight of males 70 inches tall with
20% body fat.
180.40 ; 186.27
Page: 9 of 11
QMM5100 W19 Multiple Regression HW Solution
The regression model is significant. The Math and Writing variables are significant
while Reading, Citizenship and Science are not significant.
Page: 10 of 11
QMM5100 W19 Multiple Regression HW Solution
d) Provide the equation of the fit line. Interpret the regression model parameters in the
context of passing proficiency tests.
Intercept: A school district 0 percent students passing Math, Science and Writing would
expect -40.401 percent of students to pass all five sections. This value may not have
meaning in the context of this problem.
Writing: Each additional one percent of students that pass Writing corresponds to
0.328% of students passing all sections, everything else being equal.
Math: Each additional one percent of students that pass Math corresponds to 0.616% of
students passing all sections, everything else being equal.
Science: Each additional one percent of students that pass Science corresponds to
0.338% of students passing all sections, everything else being equal.
Page: 11 of 11