Beruflich Dokumente
Kultur Dokumente
Data
Data Analysis
Developing a database
structures
The database structure is the manner in which you intend to store the data
for the study so that it can be accessed in subsequent data analysis
In every research project, you should generate a printed codebook that
describes the data and indicates where and how it can be accessed.
Minimally the codebook should include the following items for each
variable:
Variable name
Variable description
Variable format (number, data, text)
Instrument/method of collection
Date collected
Respondent or group
Variable location (in database)
Notes
The codebook is an indispensable tool for the analysis team. Together
with the database, it should provide comprehensive documentation that
enables other researchers who might subsequently want to analyze the
data to do so without any additional information.
You can enter data into a computer in a variety of ways. Probably the
easiest is to just type the data in directly. To ensure a high level of data
accuracy, you should use a procedure called double entry. In this
procedure, you enter the data once. Then, you use a special program
that allows you to enter the data a second time and checks the second
entries against the first. If there is a discrepancy, the program notifies
you and enables you to determine which is the correct entry. This
double-entry procedure significantly reduces entry errors. However,
these double-entry programs are not widely available and require some
training. An alternative is to enter the data once and set up a procedure
for checking the data for accuracy. For instance, you might spot check
records on a random basis.
After you enter the data, you will use various programs to summarize
the data that enable you to check that all the data falls within
acceptable limits and boundaries. For instance, such summaries
enable you to spot whether there are persons whose age is 601 or
whether anyone entered a 7 where you expect a 1-to-5 response.
Data Transformations
Descriptive Statistics
The distribution
The central tendency
The dispersion
Distribution
Distribution (2)
Central tendency
Mean (rata-rata)
Median (nilai tengah)
Mode (modus)
Dispersion
N
Mean
Median
Mode
Standard Deviation
Variance
Range
15,20,21,20,36,15,25,15
8,0000
20,8750
20,0000
15,0000
7,0799
50,1250
21,0000
Inferential Statistics