
PAM 2100 Spring 2014 - Homework #1

Name: ______________________

Discussion Section:________________ Grade: __________

Complete all of the questions. You may work in small groups on the assignment, but you must submit your own
answers. The assignment is due at the start of class on Thursday, Feb 6.
1. The five starting players for the Cornell men's basketball team have the following heights measured in feet:
5.75, 6.25, 6.5, 6.5, 7.
a. What is the mean height of the Cornell basketball team?
b. What is the median height?
c. What is the modal height?
d. If the heights were measured in inches rather than feet, what would the mean and median be?

e. What is the relationship between the mean measured in feet and the mean measured in inches?

2. The six starting players for the Cornell men's hockey team have the following heights measured in feet: 5.75,
6, 6, 6.25, 6.3, 6.5. What is the median height?

3. Politician A: "Our schools are getting worse! Sixty percent of our schools had lower test scores this year
than last year."
Politician B: "Our schools are getting better! Eighty percent of our students had higher test scores this year
than last year."
Explain how both statements could be true.

4. Use the dataset BMI_2100 for the following questions. The BMI_2100 dataset contains actual data from
Prof. Cawley's research on the relationship between BMI and income. This exercise is meant to
make you use Excel. Here is a link to an online Excel textbook if you need help:
http://catalog.library.cornell.edu/gb6.html?bib_key=ISBN:9780596807696&limit=Select%20chapters%20available
Here are the variable definitions:
gender: M-male; F-female
bmi: The person's body mass index
height: Height in inches
wtlbs: Weight in pounds
income: Income in dollars
age: Age
poverty: 1 - the person is classified as living in poverty; 0 - not living in poverty
hgc: Highest grade completed
employ: 1 - employed; 0 - not employed
hisp: 1 - the person is Hispanic; 0 - not Hispanic
black: 1 - the person is Black; 0 - not Black

a) What are the average and median weights of males? HINT: First sort the data by gender. Then use the
=average( ) and the =median( ) commands.
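
If you want to cross-check your Excel results outside the spreadsheet, the sort-then-summarize idea in the hint can be sketched in Python. This is only an illustration: the rows below are made-up values, not data from BMI_2100, and the helper name weight_summary is hypothetical.

```python
from statistics import mean, median

# Hypothetical (gender, wtlbs) rows. These values are made up for
# illustration and are NOT taken from the BMI_2100 dataset.
rows = [
    ("M", 180.0), ("F", 140.0), ("M", 200.0),
    ("F", 150.0), ("M", 190.0), ("F", 130.0),
]

def weight_summary(rows, gender):
    """Return (mean, median) of the weights for one gender code."""
    weights = [w for g, w in rows if g == gender]
    return mean(weights), median(weights)

male_mean, male_median = weight_summary(rows, "M")
female_mean, female_median = weight_summary(rows, "F")
print("Males:  ", male_mean, male_median)
print("Females:", female_mean, female_median)
```

Filtering on the gender code plays the same role as sorting by gender in Excel: it isolates the block of rows that =average( ) or =median( ) should cover.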

b) What are the average and median weights of females?

c) Create a table that summarizes the frequency, proportion, and percentage of males and females in the
dataset.
              Frequency    Proportion    Percentage
Females
Males
Total

HINT: First get counts of the number of males and females. Sort the entire dataset by gender (Data > Sort), and
then scroll down until the break between males and females occurs to get a count of each.
Alternatively, you could use the function =countif(range, expression) to count the number of males or females directly.
You would probably have to use Help or look in the Excel textbook to figure this command out.
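
The counting step in the hint, and the proportion and percentage columns that follow from it, can be sketched in Python as a cross-check. The gender codes below are made up, and frequency_table is a hypothetical helper, not something provided with the assignment.

```python
from collections import Counter

# Hypothetical gender codes, made up for illustration (not from BMI_2100).
genders = ["F", "M", "F", "F", "M"]

def frequency_table(values):
    """Map each category to (frequency, proportion, percentage)."""
    counts = Counter(values)
    total = sum(counts.values())
    return {
        category: (count, count / total, 100 * count / total)
        for category, count in counts.items()
    }

table = frequency_table(genders)
for category, (freq, prop, pct) in sorted(table.items()):
    print(category, freq, prop, pct)
```

The proportion is the count divided by the total, and the percentage is the proportion times 100, so the proportion column should sum to 1 and the percentage column to 100.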
d) Use the data from part c) to create a bar chart of the proportion of males and females in the dataset. I
would like you to include a title. HINT: Highlight the data, then Insert > Chart. You will probably need
to adjust the data in the Select Data tab, and then use the Design, Layout, and Format tabs to get the chart
to look right.

e) Create a table that summarizes the frequency, proportion, and percentage of people who are Hispanic, Black, and
neither Hispanic nor Black in the dataset. Because of the way the data are coded under the hisp and black
variables, you can just use the command =sum( ) to get the total number of Hispanic people and
Black people, and then calculate the number who are neither Hispanic nor Black. That is, since there is a 1 under
the variable hisp for every person who is Hispanic, you can simply add up all of the 1s in the hisp column
to get the total number of Hispanic people.
                             Frequency    Proportion    Percentage
Hispanic
Black
Neither Hispanic nor Black
Total
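
The trick of summing a 0/1 column to count a group can be sketched in Python as a cross-check. The (hisp, black) pairs below are made up for illustration. Here the "neither" group is counted directly as the rows with 0 in both columns; that equals total minus Hispanic minus Black only when no row has a 1 in both columns.

```python
# Hypothetical (hisp, black) indicator pairs, made up for illustration
# (not from BMI_2100). Each pair is one person.
people = [(1, 0), (0, 1), (0, 0), (0, 0), (1, 0)]

# Summing a 0/1 indicator column counts the rows coded 1.
n_hisp = sum(hisp for hisp, black in people)
n_black = sum(black for hisp, black in people)

# Count "neither" directly: rows where both indicators are 0.
n_neither = sum(1 for hisp, black in people if hisp == 0 and black == 0)
total = len(people)

for label, count in [("Hispanic", n_hisp), ("Black", n_black),
                     ("Neither", n_neither)]:
    print(label, count, count / total, 100 * count / total)
```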

f) Create a pie chart of the distribution from part e).

g) What is the ratio of Black people to people who are neither Hispanic nor Black in the dataset?

h) What is the average BMI in the dataset?

i) What is the median BMI?

j) What is the modal BMI?

k) What are the first, second, and third quartiles of BMI? HINT: You can sort the dataset from low to high
and then determine which observation to use as demonstrated in class. Alternatively, you can find an
appropriate Excel function to use.

l) What is the 95th percentile of BMI? Again, use the method shown in class or find the appropriate Excel
function.
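
For the quartile and percentile questions, a cross-check outside Excel might look like the Python sketch below. The BMI values are made up, and the "inclusive" interpolation method is just one convention. Quartile conventions differ between textbooks and software, so the numbers need not match the method shown in class.

```python
from statistics import quantiles

# Hypothetical BMI values, made up for illustration (not from BMI_2100).
bmi = [18.0, 22.0, 24.0, 27.0, 31.0]

# n=4 returns the three cut points that split the data into quarters.
# method="inclusive" treats the data as including its own minimum and
# maximum and interpolates linearly between sorted observations.
q1, q2, q3 = quantiles(bmi, n=4, method="inclusive")

# n=100 returns 99 cut points; index 94 is the 95th percentile.
p95 = quantiles(bmi, n=100, method="inclusive")[94]

print(q1, q2, q3, p95)
```

Note that the second quartile is the median, which gives a quick consistency check against part i).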

m) Create a histogram of BMI with a column width of 5 BMI units, showing percentages. This is a bit of a pain since
you are going to have to first determine how many people's BMI falls between 0-5, then 5-10, etc. Then
you are going to have to make a table like in e), and then make a chart (remember that for histograms
the columns must touch). One way to do this is to simply sort the dataset by BMI from low to high,
and then either count manually or use the =count( ) function on each range. Alternatively, you could try to figure out
how to use the =countif( ) command. With this command you can just adjust the count criteria to make
Excel count the number of observations in each column. Alternatively, you could install the Excel data
analysis package and learn how to use the Histogram tool in Excel (search Google for "How to use
the Excel Histogram tool").
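
The binning step can be sketched in Python as a cross-check on your Excel counts. The BMI values are made up, bin_percentages is a hypothetical helper, and the sketch assumes each bin includes its lower edge and excludes its upper edge (so a BMI of exactly 25 lands in 25-30). The assignment does not specify how to treat values on a boundary, so use whatever convention was given in class.

```python
from collections import Counter

# Hypothetical BMI values, made up for illustration (not from BMI_2100).
bmi = [17.5, 22.0, 24.9, 25.0, 28.3, 31.0, 33.2, 41.0]

def bin_percentages(values, width=5):
    """Percentage of observations in each bin [low, low + width)."""
    # Floor each value to the lower edge of its bin, e.g. 28.3 -> 25.
    counts = Counter(int(v // width) * width for v in values)
    total = len(values)
    # Walk every bin from the lowest to the highest occupied one so that
    # empty bins in between appear with 0 percent.
    lows = range(min(counts), max(counts) + width, width)
    return {(low, low + width): 100 * counts[low] / total for low in lows}

for (low, high), pct in bin_percentages(bmi).items():
    print(f"{low}-{high}: {pct}%")
```

Keeping the empty bins matters for the chart: a histogram needs every interval on the axis, even one with no observations.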

n) What proportion of the dataset is classified as Overweight or more?