Sie sind auf Seite 1von 6

STAT 701 - Modern Applied Statistics I Fall 2016 - Syllabus

Class Meetings: Online via D2L

Instructor Information:
Instructor Name: Semhar Michael
Office Location: SAM 264
Wednesday, Thursday 10:30am-12:00pm Other times by advances
Office Hours:
appointment, please call or email
Phone: (605) 688-6316
Email: D2L Mail tool (Responses to email will be provided within 24 hours)
Semhar.michael@sdstate.edu, if D2L is unavailable

Graduate Teaching Assistants:


Name: Most Farzana Yesmin;
Email: mostfarzana.yesmin@jacks.sdstate.edu/ or D2L course email (Will
respond to email within 24 hours.)
Name: Md Riaz A. Khan;
Email: mdriazahmed.khan@jacks.sdstate.edu or D2L course email (Will
respond to email within 24 hours.)
Contact GTAs as primary technical assistant and secondarily the instructor

Course Description:
Modern Applied Statistics I covers various advanced statistical techniques with
an emphasis on the application, interpretation and implementation of the
methods. The class will start by given overview of statistical graphics and the
associated modern statistical computing language(s). The next section of the
class will focus on parametric, non- and semi-parametric methods such as
generalized linear and additive models, quantile regression, density estimation,
and survival analysis. The last section of the class will focus on longitudinal and
repeated measure models and conclude with an overview of techniques from
meta-analysis and large-scale inference.

Course Prerequisites: STAT 541 and STAT 700 or permission of the instructor

Suggested Prerequisites:
This course will build upon STAT 541 and assume students have knowledge of
simple linear regression, multiple linear regression, analysis of variance, and
the basics of statistical inference. Familiarity with statistical programing is
required.

Textbooks:
A Handbook of Statistical Analyses Using R, third Edition by Everitt and
Hothorn (Main Text at Bookstore)
R Graphics Cookbook by Winston Chang published through OReilly
(Basic guide to Grammar of Graphics in R)
Supplementary Texts:
Data Analysis with Open Source Tools by Philipp Janert (A good text that
was in the running for being the main textbook for this class. I
recommend this book for any of the Masters in Data Science students as
a good reference.)
Advanced Data Analysis from an Elementary Point of View by Cosma
Rohilla Shalizi (Early version is free online, very good applied statistics
text)
Doing Data Science by Cathy ONeil and Rachel Schutt (A good summary
of the current work environments that statistical methods that are used
within.)

Programing and Computation:


We will be using the R (RStudio)- a free software environment for statistical
computing and graphics. Students will need access to a computer that is capable
of running R and loading R libraries.

Description of Instructional Methods:


Recordings of in-class lectures, lecture slides, homework assignments and
supplementary materials will be posted on D2L under each module. I advise you
to set up email updates for D2L and check it regularly. Homework/exam feedback
will be given through the D2L Dropbox.

Course presentation:
The course is organized in to 10 learning modules, each module contains a
lecture slide, multimedia lectures, and sets of readings from the textbook and
other resources. It will also include homework assignments. There will be 1
learning modules to complete every one-to-two weeks.

Learning Objectives:
Implement and discuss standard methods for summarizing, interpreting, and
presenting data
Implement and interpret standard data analyses using parametric, non- and
semi- parametric methods
Implement and interpret standard data analyses using longitudinal and mixed
models
Develop the skills necessary to coherently present the statistical methods
presented in this class

Course Assessment:
The main form of assessment for this class will be weekly homework
assignments (65%). To be able to keep up with and pass this class it is
necessary to maintain a high completion rate of the homework assignments.
The assignments will be essay style commonly requiring 1-2 pages of written
interpretation (including graphical displays and annotated code) of the results
of a data analysis. The submission of the assignments will be through the D2L
Dropbox and presentation of the homework assignments should be suitable to
this medium of delivery. Weekly participation on D2L discussion board is
required (10%). All questions and comments related to course material and
homework should be posted on the discussion board. Participation grades will
be given each week.

Attendance:
Participation on the discussion board and collaborate live sessions are required.
Attendance is compulsory. Late homework will not be graded. If you have an
extended absence or one in which you seek exception to any of the above
policies, please report the circumstances to the Vice-President of Student
Affairs at 688-4493 and to the instructor.

Evaluation method:
Homework assignments (65%)
HW assignments will be posted on D2L as per schedule.
It is acceptable to collaborate on the problem sets. However, it is
expected that students list with whom they have worked, and the
contributions of each collaborator, including themselves, to the given
problem solution.
No unexcused late assignments will be accepted under any
circumstances.
Each problem (in a problem set) will be graded on a credit/no-credit
basis. (See class schedule for submission due dates.)
There are multiple small homework sets in this class, each consisting
of roughly 2-6 problems.
Barring unforeseen circumstances, homework will be returned on the
Monday or Wednesday of the week following the assignment due date
through the class Dropbox.
Discussion board and collaborate participation (10%)
Questions and comments on course material, homework assignments
or anything related to the course should be posted on the discussion
board (within group).
Attendance during office hours on collaborate session recommended
Final Project (25%)
No collaboration is allowed on the final project.
Will be due on the date of the final exam Tuesday, December 15.
The project will consist of 3-5 data analyses that will each require a 1-3
page write up (including appropriate graphical representations) and
statistical analysis of each.
With preapproval of the course instructor, one of the data analyses for
the exam may consist of a dataset you are currently working on from an
outside project.
Answers to homework / final project must be submitted with the
annotated/commented R scripts if necessary. See the Google style guide for R
as an example on how to format and annotate your R scripts or the Appendix
in Advanced Data Analysis from an Elementary Point of View.
http://google-styleguide.googlecode.com/svn/trunk/google-r-style.html

Course Specific Grading Requirements Grade Description


Components Grade
Percentage
Assignments 65%
Discussion/Collaborate 10%
Final Exam 25%
Total 100 %
90-100 % = A 80-89 % = B
70-79 % = C 60-69 % = D
Below 59% = F

University ADA Statement:


Any student who feels s/he may need an accommodation based on the impact
of a disability should contact Nancy Hartenhoff-Crooks (or successor)
Coordinator of Disability Services (605-688-4504 or Fax, 605-688-4987) to
privately discuss your specific needs. The Office of Disability Services is located
in room 065, the University Student Union.

Academic Honesty:
Academic dishonesty will not be tolerated. CheatingDirectly copying from any
source other than yourself, including but not limited to your classmates and
online sources. It does not mean that you will not talk to other students about
homework problems; however, you need to demonstrate your understanding of
the problem by writing up the solution by yourself.
PlagiarismOffering as ones own work the words, ideas, or arguments of
another person without appropriate attribution by quotation, reference, or
footnote.

Cheating or plagiarism will result in a score of 0 for that particular assignment


for both parties involved. More than one offense will result in a failing grade for
the course AND formal reporting of the incident to the Division of Student
Affairs.

We will follow the University policy on academic integrity and appeals -


The University has a clear expectation for academic integrity and does not
tolerate academic dishonesty. University Policy 2:4 sets forth the definitions of
academic dishonesty, which includes but is not limited to, cheating, plagiarism,
fabrication, facilitating academic dishonesty, misrepresentation, and other
forms of dishonesty relating to academics. The policy and its procedures also
set forth how charges of academic dishonesty are handled at the University.
Academic Dishonesty is strictly proscribed and if found may result in student
discipline up to and including dismissal from the University.

Academic Freedom and Responsibility Statement


University Freedom in learning Policy-
Students are responsible for learning the content of any course of study in
which they are enrolled. Under Board of Regents and University policy,
student academic performance shall be evaluated solely on an academic basis
and students should be free to take reasoned exception to the data or views
offered in any course of study. Students who believe that an academic
evaluation is unrelated to academic standards but is related instead to
judgment of their personal opinion or conduct should first contact the
instructor of the course. If the student remains unsatisfied, the student may
contact the department head and/or dean of the college which offers the class
to initiate a review of the evaluation.
Note: The instructor reserves the right to make changes to this syllabus
during the semester. Announcements of changes will be made in class.

Tentative Class Schedule Fall 2016

THIS WEEK Monday Tuesday Wednesday Thursday Friday


24
Introduction to August 22
23 First Day of 25 26
Statistical Graphics
Class
and GGplot

GGplot: Linear 29 30 31 September 1 2 W grade


Models and 1st HW Due Last Day to begins
Analysis of Add/Drop
Variance
Logistic Regression 5 6 7 8 9 Graduation
I& NO CLASS 2nd HW Due Apps due
Generalized Linear Labor Day
Models
12 13 14 15 16
Density Estimation 3rd HW Due

Recursive 19 20 21 22 23
Partitioning 4th HW Due

Generalized 26 27 28 29 30
Additive Models and 5th HW Due
Spline Models
October 3 4 5 6 7
Survival Analysis 6th HW Due

10 11 12 13 14
NO CLASS 7th HW Due
Survival Analysis Native
American
Day
Longitudinal Data 17 18 19 20 21
Analysis and Mixed 8th HW Due
Models
Longitudinal Data 24 25 26 27 28
Analysis and Mixed 9th HW Due
Models
Longitudinal Data 31 November 1 2 3 4 Last Day to
Analysis and Mixed 10th HW Due Drop with W
Models
Multiple 7 8 9 10 11
Comparisons, 11th HW Due NO CLASS
FDR, Veterans Day
Simultaneous
Inference
Multiple 14 15 16 17 18
Comparisons,
FDR,
Simultaneous
Inference
Multiple 21 22 23 24 25
Comparisons, 12th HW Due NO CLASS NO CLASS NO CLASS
FDR, Thanksgiving Thanksgiving Thanksgiving
Simultaneous Recess Recess Recess
Inference
Final Project
28 29 30 December 1 2
Meta-Analysis

5 6 7 8 9
Reading Day NO CLASS Final Exams Final Exams
Reading Day
12 13 14 15 16
Final Exams Final Exams Final Exams
Final
Projects
Due!