Beruflich Dokumente
Kultur Dokumente
Outline
Getting data into Stata
Editing in Stata
How does Stata handle data
Stata notation and help
Using Stata and Basic Stata commands
Transferring Data
Stattransfer or DBMS copy work
Stattransfer often seeks to optimize the
Stata dataset by default
Stattransfer
CLICK ON DOUBLE
Editing in Stata
Any ASCII text editor will work
Stata has a built in text editor, but it is
limited.
I recommend using another text editor
http://fmwww.bc.edu/repec/bocode/t/textEditors.html
Handling Data
SAS processes one record at a time
Stata processes all the records at the same
time
Stata Abbreviations
could be written
reg income education female
Stata Help
update all
Stata in Windows
Page up scrolls through the previous
commands
There is a graphical user interface
(menus) if you forget a command
We have Stata on rocky and tasha no
graphical capabilities, no menus, and loss
of some shortcuts
Using Stata
Create batch files called .do files
I work interactively
Sysdir, ls and cd
sysdir
STATA: C:\Program Files\Stata9\
UPDATES: C:\ProgramFiles\Stata9\ado\updates\
BASE: C:\Program Files\Stata9\ado\base\
SITE: C:\Program Files\Stata9\ado\site\
PLUS: c:\ado\stbplus\
PERSONAL: c:\ado\personal\
OLDPLACE: c:\ado\
Delimiters
SAS recognizes ; as a delimiter
Stata recognizes the carriage return
Missing Data
Stata and SAS both use . as missing
Stata implicitly values a missing as a very
large number
SAS implicitly values a missing as a very
small number
Boolean Logic
gen
gen
gen
y=x
y=x
y=x
if
if
if
gen drgnum1=drg==1 or
tab drg, gen(drgnum)
Drop
egen Commands
ICD-9 Codes
Stata has capabilities to handle ICD-9
diagnosis and procedure codes
You can
Dates
Combining Data
Merge
Explicit Subscripting
gsort id -date
by id : gen n=_n
by id : gen N=_N
gen select=n==1
Using Stata
Summarizing Data
. sum gender age educ
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+-------------------------------------------------------gender |
4085
1.496206
.5000468
1
2
age |
4085
64.5601
9.451724
50
94
educ |
4085
4.398286
1.662883
1
9
Tabulating Data
. tab gender
gender |
Freq.
Percent
Cum.
------------+----------------------------------1 |
2,058
50.38
50.38
2 |
2,027
49.62
100.00
------------+----------------------------------Total |
4,085
100.00
. table gender
---------------------gender |
Freq.
----------+----------1 |
2,058
2 |
2,027
----------------------
Tabulating Data
tab gender age
too many values
r(134);
tab age gender
|
gender
age |
1
2 |
Total
-----------+----------------------+---------50 |
49
69 |
118
51 |
72
71 |
143
94 |
1
0 |
1
-----------+----------------------+---------Total |
2,058
2,027 |
4,085
Tabstat
. tabstat age, by (gender)
gender |
mean
---------+---------1 | 64.77454
2 | 64.34238
---------+---------Total |
64.5601
--------------------
----------------------gender | mean(age)
----------+-----------1 |
64.77454
2 |
64.34238
-----------------------
Graphing
Diagnostic graphics
wtp
500
500
500
100
100
-.072394
stage:
. 072394
1
Density
-.060237
stage:
.060237
2
Density
-.05479
stage:
.05479
3
Density
500
500
Presenting
results
125
75
75
0
-. 055777
stage:
. 055777
4
Density
0
-.062437
stage:
.062437
5
Density
Output
Linear regression
Number of obs
F( 21, 1284)
Prob > F
R-squared
Root MSE
wtp
Coef.
Robust
Std. Err. t
ethn1
Ethn2
ethn3
ethn4
english
lifeus
age1999
income
incmis
_cons
1.990048
-25.74654
-35.59552
-3.244168
-11.44402
37.34419
-.6272524
.8068256
14.07434
111.3607
8.742036
11.69993
11.98309
11.16836
9.699576
13.86037
.3097408
.1714309
9.404149
24.13083
0.23
-2.20
-2.97
-0.29
-1.18
2.69
-2.03
4.71
1.50
4.61
=
=
=
=
=
1306
10.88
0.0000
0.1398
90.367
P>t
[95% Conf.Interval]
0.820
0.028
0.003
0.771
0.238
0.007
0.043
0.000
0.135
0.000
-15.16019
-48.69961
-59.1041
-25.15441
-30.47277
10.15274
-1.234906
.4705102
-4.374848
64.02051
19.14029
-2.793467
-12.08694
18.66607
7.584741
64.53564
-.0195987
1.143141
32.52352
158.7009
Outreg
Outputs data to a delimited file
Delimited file can be read into Excel
Very flexible
Creates publishable tables