Beruflich Dokumente
Kultur Dokumente
Chapter Objectives
Define editing and distinguish between a field edit and an office edit Define coding and outline the steps it involves Compute measures of central tendency and dispersion of the data for each variable in a data set State the potential uses of frequency distribution or one- way tables
12 | 2
Rockbridge ensures data integrity in how the results are interpreted and explained to management
12 | 3
Editing
Editing is the process of examining completed data collection forms and taking whatever corrective action is needed to ensure the data are of high quality
Preliminary or field edit Final or office edit
12 | 4
Field Edit
A field edit, or preliminary edit, is a quick examination of completed data collection forms, usually on the same day they are filled out Objectives
Ensure that proper procedures are being followed in selecting respondents, interviewing them, and recording their responses Fix fieldwork deficiencies before they turn into major problems
12 | 5
Office Edit
A final, or office edit, verifies response consistency and accuracy
Makes necessary corrections Determines whether some or all parts of a data collection form should be discarded
12 | 6
12 | 7
12 | 8
12 | 9
Incomplete Interviews
Questions not asked Directions not followed (proper segments of the questionnaire were not administered)
12 | 10
12 | 11
12 | 12
12 | 13
12 | 14
Misclassification
Mortgage debt improperly reported as installment debt
Impossible answers
Reports paying $600 for a new Edsel in 1970 - the car should have been recorded as a "used" car; or weekly income reported on the income-per-month line
12 | 15
12 | 16
Preventing Errors
Careful planning before fieldwork begins Automating data entry
12 | 17
Coding
Coding broadly refers to the set of all tasks associated with transforming edited responses into a form that is ready for analysis Steps
Transforming responses to each question into a set of meaningful categories Assigning numerical codes to the categories Creating a data set suitable for computer analysis
12 | 18
12 | 19
12 | 20
Missing-Value Category
A missing value can stem from
A respondent's refusal to answer a question An interviewer's failure to ask a question or record an answer or a "don't know" that does not seem legitimate
12 | 21
12 | 22
Need six variables, each relating to a specific country and having two possible values. For example, 1= No and 2 = Yes Six columns must be set aside in the data spreadsheet to record responses to this question
Copyright Houghton Mifflin Company. All rights reserved. 12 | 23
This question requires as many variables (and columns) as there are objects to be ranked 4 separate variables are needed
12 | 24
12 | 25
12 | 26
12 | 27
Table 12.4 Measures of Central Tendency and Dispersion for Different Types of Variables
12 | 28
Measures of Dispersion
Strictly speaking, the concept of dispersion is not meaningful for nominal data An idea about the distribution of responses can be obtained by examining their relative frequencies of occurrence
12 | 29
Measures of Dispersion
Range: Defined by the highest and lowest response values Interquartile range: Difference between the 75th and 25th percentile responses
12 | 30
Measures of Dispersion
Standard deviation: As defined in Chapter 9
12 | 31
Measures of Dispersion
Standard deviation: As defined in Chapter 9
12 | 32
Mode
The value that occurs most frequently
12 | 33
Table 12.5 How Long Have You Been Using the Services of National? Computing Mode
12 | 34
Median
The observation below which 50 percent of the observations fall
12 | 35
1= Less than a year; 2 = 1 to less than 2 years; 4 = 5 years or more Arranging the 20 values in ascending order: 1 4 1 4 1 4 3 4 3 4 3 4 3 4
4 4
4 4
Because the sample size = 20, there are two middle values: 4 and 4. The median is, therefore, the average of the two middle values = 4.
12 | 36
12 | 37
Mean
n = Number of units in the sample xi = data obtained from each sample unit I = sample mean value, given by X
(X
i 1
n
Copyright Houghton Mifflin Company. All rights reserved. 12 | 38
12 | 39
Measures of Dispersion
Range Variance Standard Deviation
12 | 40
Range
Range is the difference between the largest and smallest value The simplest measure of dispersion
12 | 41
Variance
Variance of a set of data is a measure of deviation of the data around the arithmetic mean (xi x )2 S2 = ---------n-1
12 | 42
Standard Deviation
Standard deviation is the square root of the variance
n (xi x )2 i=1---------n-1
12 | 43
Table 12.9 Overall Quality of Services Provided by National: Computing Range, Variance, and Standard Deviation
12 | 44
12 | 45
12 | 46
12 | 47
Researchers conclusion
Consumers need really neither really hot nor really mild sauce
12 | 48
12 | 49