Professional Documents
Culture Documents
ni dung
hhsize
6
5
10
incomepc
2100
3210
1200
Byte
di ln nht
1
2
1
2
80
80
Phin bn Stata 8 c giao din tng t nh phin bn Stata 7. Khc bit ln nht l Stata 8 c thm tu
chn Statistics trong thanh thc n. Tu chn ny cho php thc hin cc mt s lnh thng k bng cc
tu chn qua giao din ca s m khng phi g cc lnh trong ca s Command.
2
Cc ca s ca Stata
Cc ca s ca Stata c m ra bng vic la chn cc tu chn thanh thc n Windows
(menu bar). Cc ca s ny bao gm:
Results
Hin th cc lnh v kt qu
Graph
Hin th th
Viewer
Command
Dng g cc cu lnh
Review
Variables
Data editor
Do-file editor
M file s liu
3
View
Save
Lu file s liu
Save as
Lu file s liu di tn mi
File name
Log
Save graph
Lu gi file th
Print graph
In th
Print results
In kt qu
Exit
Edit
Copy text
Copy tables
Paste
Dn
Prefs
Windows
Results
M ca s kt qu
Graph
M ca s th
Log
M ca s log file
Viewer
Command
M ca s cu lnh
Review
Variables
Help/Search
M ca s tr gip (help)
Data editor
Do-file editor
Help
Save
Lu tr file s liu ra a
4
Print results
In ni dung ca ca s kt qu
Begin log
Start viewer
M ca s tr tr (help)
a ca s kt qu ra pha trc
a ca s v th ra pha trc
Do-file editor
Data editor
M ca s sa cha s liu
Data browser
M ca s xem s liu
Tt lnh more
Break
replace
text
smcl
V d:
log using baitap1
Tp s liu Stata c th c m bng la chn Open trn thc n File; hoc nt Open (use)
trn thanh cng c tool bar.
Nu file s liu c dung lng ln th chng ta phi thit lp b nh cn dng cho Stata bng
lnh:
set memory #[k|m]
V d:
set mem 32m
set mem 32000k
Nhp s liu
C mt s cch nhp s liu t bn phm vo b nh ca Stata.
6
S dng ca s Stata editor nhp s liu. Hoc t ca s command, g lnh edit. Sau
nhp s liu theo kiu biu bng trong ca s ny.
name
income
comma
tab
clear
V d:
. insheet using c:\income.txt
(3 vars, 4 obs)
. insheet maho hoten thunhap using c:\income.txt
(note: variable names in file ignored)
(3 vars, 4 obs)
Lu tr s liu
C php:
save (ng dn\tn tp) [,replace]
Lnh ny lu tr s liu ang trong b nh ca Stata thnh tp ch nh di tn tp. Nu tu
chn replace c ch ra th tp s liu ny s ghi ln tp hin thi (tt nhin tn tp s liu l
ging nhau).
7
Vic lu tr s liu c th thc hin bng cc ty chn Save v Save as trong thanh thc n
(menu bar); hoc nt Save trn thanh cng c (tool bar).
Ch : Xem thm lnh infile v outfile
i vi tn bin, Stata phn bit ch vit thng vi ch vit hoa. V d, trong cng
mt tp s liu, bin Ho_ten v bin ho_ten l 2 bin khc nhau.
Mt s lnh Stata cho php vit tt. V d lnh summarize c th vit tt l sum. Trong
cun ti liu ny phn gch chn di c php ca cu lnh l c php vit tt ca cu
lnh .
by danh sch bin (by varlist): Stata s thc hin cu lnh vi theo tng gi tr c ch ra bi
danh sch bin. Bin c ch ra bi danh sch bin c yu cu sp xp trc khi thc hin
lnh.
V d:
. sort sex
. by sex: sum
rlpcex1
-> sex = 1
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------rlpcex1 |
4375
2980.906
2430.648
357.318
45801.71
-> sex = 2
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------rlpcex1 |
1624
3748.368
3231.241
376.9805
30624.77
rlpcex1
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------rlpcex1 |
699
5329.628
3962.946
1057.797
30624.77
poor if reg7==1
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------poor |
859
.4982538
.5002882
0
1
Quyn s (weight)
Cho php tnh ton s dng quyn s. Tu chn v quyn s s c trnh by k mc 5 ca
chng ny.
Cc tu chn (Options)
Nhiu cu lnh Stata cho php cc tu chn ring. Cc tu chn ny c ch ra sau du phy.
V d:
Lnh sum c tu chn l detail, cho php tnh ton thm mt s thng k khc ngoi gi tr
trung bnh v lnh chun.
. sum
rlpcex1, detail
comp.M&Reg price adj.pc tot exp
------------------------------------------------------------Percentiles
Smallest
1%
682.9575
357.318
5%
1012.433
366.2792
10%
1238.088
376.9805
Obs
5999
25%
1671.054
381.3502
Sum of Wgt.
5999
50%
2397.042
75%
90%
95%
99%
3711.917
5940.803
8045.32
14163.04
Largest
26944.64
30624.77
31066.5
45801.71
Mean
Std. Dev.
3188.667
2692.567
Variance
Skewness
Kurtosis
7249918
3.791027
29.21398
Ch :
-
Stata cho php vit tt cc lnh v ty chn. Trong ti liu ny, phn gch chn di
cc lnh c ngha l lnh c th vit tt bng k t trong phn gch chn ny. V d
nh lnh use c ngha l c th c vit tt bi u.
C php ca cc cu lnh trong ti liu ny c vit bng ting Anh, cho php ngi
c c th i chiu vi phn hng dn s dng trong Stata.
ngha
S hc
+
Cng
Tr
Nhn
Chia
11
Lu tha
Quan h
>
Ln hn
<
Nh hn
>=
Ln hn hoc bng
<=
Nh hn hoc bng
==
Bng
~=
!=
Lgc
~
Khng
Hoc
&
Ch :
Trong biu thc du == c dng cho vic kim nh biu thc, v d nh c dng sau lnh
if. Cn du = c dng cho lnh to bin.
V d:
gen RRD=0
replace RRD=1 if reg8==1
Cc hm s (function)
Hm s thng c dng trong biu thc (exp) ca cu lnh Stata. Nu coi Y l mt hm s
ca f(X1, X2,, Xn) th lnh v hm s trong Stata s tnh gi tr ca Y nu cho cc gi tr ca
Xi. Stata c 8 loi hm s:
Mathematical functions
Cc hm ton hc
Statistical functions
Hm thng k
Random numbers
String functions
Hm lin quan n dy k t
Special functions
Hm c bit
Date functions
Hm ngy thng
Time-series functions
Matrix functions
Hm ma trn
V d:
gen absx=abs(x)
gen log_exp=log(rlpcex1)
Cc k hiu c th v cc hm s ny c th xem mc help functions.
3. M t d liu (Data reporting)
3.1. Xo b nh ca Stata
C php:
clear
12
----------------------------------------------------------------------------------------Summary statistics
.
Ch :
Chng ta c th tm hng dn s dng theo t kho bng lnh search. Lnh search c th c
thc hin bng tu chn Search thc n help.
3.3. M t d liu
C php:
describe [danh sch bin]
Lnh ny hin th thng tin chung nh tn bin, nh dng, nhn bin ca cc bin c lit k
bi danh sch bin ca file s liu ang m. Nu nh khng c bin no c ch ra th lnh
describe s hin th thng tin ca tt c cc bin.
V d:
. des
storage display
value
variable name
type
format
label
variable label
-----------------------------------------------------------------------------househol
long
%12.0g
household code
year
float %9.0g
Year of interview
month
float %9.0g
Month of interview
vlssmphs
byte
%8.0g
1 if vlss, 2 if mphs
source
13
C php:
list [danh sch bin] [iu kin] [phm vi] [, nolabel]
Lnh ny hin th gi tr ca cc bin c ch ra bi danh sch bin. Tu chn nolable cho
php hin th gi tr s ch khng phi l gi tr gn nhn.
V d:
. list
1.
2.
3.
4.
5.
. list
1.
2.
3.
4.
5.
farm
farm
farm
farm
non farm
non farm
farm
1
1
1
0
0
C php:
count [iu kin] [phm vi]
Lnh ny m s quan st c ch ra bi iu kin (exp) v phm vi (range). Nu iu kin
(exp) v phm vi (range) khng c ch ra th s hin th s quan st ca tp s liu.
V d:
. count
5999
. count if reg7==1
859
. count if reg7==1 & urban98==1
187
. count if reg7==1 & urban98==0
672
3.8. Thng k c bn
C php:
summarize [danh sch bin] [quyn s] [iu kin] [phm vi] [, detail]
Lnh ny tnh ton v hin th nhng thng k c bn ca cc bin c ch ra bi danh sch
bin. Tu chn detail cho php hin th thm mt s thng k nh nhn, lnh v cc gi
tr ca thp v phn.
V d:
. sum
rlpcex1
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------rlpcex1 |
5999
3188.667
2692.567
357.318
45801.71
. sum
rlpcex1, detail
2397.042
3711.917
5940.803
8045.32
14163.04
Largest
26944.64
30624.77
31066.5
45801.71
Mean
Std. Dev.
3188.667
2692.567
Variance
Skewness
Kurtosis
7249918
3.791027
29.21398
15
Negative
Zero
Positive
Total
Missing
Number of Observations
NonTotal
Integers
Integers
2964
2964
3035
3035
------------5999
5999
----5999
nolabel
V d:
. tab sex
Gender of |
HH.head |
(1:M;2:F) |
Freq.
Percent
Cum.
------------+----------------------------------1 |
4375
72.93
72.93
2 |
1624
27.07
100.00
------------+-----------------------------------
16
Total |
5999
100.00
17
Cc tu chn:
chi2
missing
nofreq
cell
column
row
V d:
. tab
. tab
Cc tu chn:
means
standard
freq
Hin th mi gi tr tn sut
missing
V d:
. replace poor=poor*100
(1777 real changes made)
. format poor %4.2f
. tab reg7 urban98, sum(poor) means
Means of poor
| 1:urban 98; 0:rural
Code by 7 |
98
regions |
Rural
Urban |
Total
-----------+----------------------+---------region1 |
61.46
8.02 |
49.83
region2 |
32.57
5.87 |
23.66
region3 |
44.83
10.19 |
39.55
region4 |
37.25
11.51 |
28.65
region5 |
47.28
. |
47.28
region6 |
12.45
2.16 |
7.33
region7 |
35.78
10.28 |
29.32
-----------+----------------------+---------Total |
38.86
6.82 |
29.62
tabstat <danh sch bin> [quyn s] [iu kin] [phm vi] [, statistics(c php tk [...])
by(tn bin) missing format[(%fmt)]]
Lnh ny tnh ton cc thng k ca cc bin c ch ra bi danh sch bin cho tng gi tr
ca bin phn loi (categorical) c ch ra bi by(tn bin).
V d:
. tabstat
Cc tu chn:
statistics(statname [...])
by(tn bin)
Missing
format[(%fmt)]
Stata cho php cc loi thng k c ch ra bi statistics(c php thng k [...]) nh sau:
C php thng k
ngha
mean
count
m s quan st
sum
Tng cng
max
Gi tr ln nht
min
Gi tr nh nht
range
sd
lch chun
sdmean
skewness
kurtosis
nhn
median
p1
1% phn v
p5
5% phn v
p10
10% phn v
p25
25% phn v
p50
p75
75% phn v
p90
90% phn v
p95
95% phn v
p99
99% phn v
iqr
p75 - p25
V d:
. tabstat
table <bin dng> [bin ct [bin ct trn cng]] [iu kin] [phm vi] [quyn s] [,
contents(ni dung) row col format(%fmt) missing]
Lnh ny cho php tnh cc thng k ca cc bin c ch ra trong contents theo dng bng,
trong cc hng c nh ngha bi bin dng, cn cc ct c nh ngha bi bin ct (v
bin ct trn cng). Cc bin hng v ct ny l cc bin phn loi (categorical).
V d:
. table reg7 urban98 farm, contents(mean poor)
---------------------------------------------------|
Type of HH (1:farm; 0:nonfarm) and
|
1:urban 98; 0:rural 98
Code by 7 | ---- non farm --------- farm -----regions
|
Rural
Urban
Rural
Urban
----------+----------------------------------------region1 | 19.35484 6.015038
65.7377 12.96296
region2 | 26.66667 4.624278
33.96524 15.21739
region3 | 40.98361 10.11236
45.8159 10.52632
region4 |
21.6 11.63793
42.44032
10
region5 | 30.76923
49.24012
region6 | 15.04065 2.195609
10.07463
0
region7 | 38.62816 10.04184
34.35805 11.62791
----------------------------------------------------
Cc tu chn:
Contents(ni dung)
row
col
format(%fmt)
missing
V d:
. table reg7 urban98 farm, contents(mean poor) row col format(%4.2f)
-----------------------------------------------------| Type of HH (1:farm; 0:nonfarm) and 1:urban
|
98; 0:rural 98
Code by 7 | ----- non farm ---------- farm -----regions
| Rural Urban Total
Rural Urban Total
----------+------------------------------------------region1 | 19.35
6.02 10.26
65.74 12.96 61.45
region2 | 26.67
4.62 11.29
33.97 15.22 32.70
region3 | 40.98 10.11 27.96
45.82 10.53 44.47
region4 | 21.60 11.64 15.13
42.44 10.00 40.81
region5 | 30.77
30.77
49.24
49.24
region6 | 15.04
2.20
6.43
10.07
0.00
9.78
region7 | 38.63 10.04 25.39
34.36 11.63 32.72
|
Total | 27.91
6.17 14.84
42.30 12.11 40.63
22
-----------------------------------------------------. table urban98 farm, contents(mean poor sd poor) row col format(%4.2f)
---------------------------------------1:urban
|
98;
|
Type of HH (1:farm;
0:rural
|
0:nonfarm)
98
| non farm
farm
Total
----------+----------------------------Rural |
27.91
42.30
38.86
|
44.88
49.41
48.75
|
Urban |
6.17
12.11
6.82
|
24.07
32.71
25.22
|
Total |
14.84
40.63
29.62
|
35.55
49.12
45.66
---------------------------------------. table urban98
format(%4.2f)
farm,
contents(mean
rlpcex1
mean
rlhhex1)
row
col
---------------------------------------1:urban
|
98;
|
Type of HH (1:farm;
0:rural
|
0:nonfarm)
98
| non farm
farm
Total
----------+----------------------------Rural | 2835.83
2212.12
2361.29
| 13242.03 10120.89 10867.36
|
Urban | 5476.86
3232.17
5230.33
| 22984.44 11903.19 21767.43
|
Total | 4423.95
2268.49
3188.67
| 19100.41 10219.39 14010.74
----------------------------------------
gen region7=(reg7==7)
To bin bng lnh egen
C php:
egen <bin mi> = fcn(tham s) [iu kin] [phm vi] [, by(bin)]
Lnh ny cho php to bin mi theo gi tr ca hm s c ch ra bi fcn. Bin mi ny s
nhn gi tr c nh cho mi quan st. Hm s y c th l:
count(exp)
mean(exp)
median(exp)
sd(exp)
Thay th gi tr ca bin
C php:
replace <bin> = biu thc [iu kin] [phm vi]
Lnh ny thay th gi tr ca bin hin c bng gi tr mi xc nh bi biu thc exp.
V d:
replace poor=poor*100
replace pcexp = hhexp/hhsize
To bin phn loi bng lnh encode
C php:
encode <bin> [iu kin] [phm vi], generate(bin mi)
Lnh ny cho php to bin phn loi mi (categorical) kiu s tng ng vi cc gi tr ca
bin kiu ch ch ra bi tn bin (c xp theo vn ch ci).
V d:
25
mucsong
rlpcex1<1790 &
rlpcex1>1290
rlpcex1>=1790
. tab mucsong
mucsong |
Freq.
Percent
Cum.
----------------+----------------------------------Khong ngheo |
4222
70.38
70.38
Ngheo |
1087
18.12
88.50
Rat ngheo |
690
11.50
100.00
----------------+----------------------------------Total |
5999
100.00
. sum mucsong
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------mucsong |
0
. encode mucsong, gen(ma_ms)
. tab ma_ms
ma_ms |
Freq.
Percent
Cum.
------------+----------------------------------Khong ngheo |
4222
70.38
70.38
Ngheo |
1087
18.12
88.50
Rat ngheo |
690
11.50
100.00
------------+----------------------------------Total |
5999
100.00
. sum ma_ms
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------ma_ms |
5999
1.411235
.6871957
1
3
Lnh ny to bin phn nhm cho biu thc theo phn v. Trong nquantiles(#) ch ra s
lng phn v.
V d: To bin ng v phn theo chi tiu
. xtile quinexp= rlpcex1, nq(5)
. tab quinexp
5 quantiles |
of rlpcex1 |
Freq.
Percent
Cum.
------------+----------------------------------1 |
1200
20.00
20.00
2 |
1200
20.00
40.01
3 |
1200
20.00
60.01
4 |
1200
20.00
80.01
5 |
1199
19.99
100.00
------------+----------------------------------Total |
5999
100.00
. tab
4.2. i tn bin
C php:
rename <tn bin c> <tn bin mi>
Lnh ny thc hin vic i tn c ca mt bin sang tn mi.
V d:
rename poor nguoingheo
rename rpcexp1 chitieu
4.3. Lnh xo bin, xo quan st
C php:
drop <danh sch bin>
V d:
drop poor urban98
drop if sex==1
drop in 1/20
Xo quan st t 1 n 20
keep househol
keep in f/50
28
29
30
1.
2.
3.
4.
5.
ngheo
1
0
1
1
0
4.6. Sp xp s liu
C php:
sort <danh sch bin> [phm vi]
gsort [+|-]tn bin [[+|-]tn bin [...]]
Lnh sort sp xp quan st theo th t tng dn ca gi tr ca cc bin c ch ra trong danh
sch bin.
Lnh gsort cho php sp xp cc quan st theo th t tng dn ca ca cc bin (danh sch
bin), nu du + c ch ra (y cng l gi tr ngm nh), hoc theo th t gim dn, nu
du - c ch ra.
V d:
sort reg7 hhsize
collapse <biu thc thng k> [quyn s] [iu kin] [phm vi] [, by(danh sch bin)]
trong :
Biu thc thng k l danh sch cc thng k v cc bin tng ng. Cc thng k c k
hiu nh mc 3.12 ca chng ny.
Lnh collapse s to ra mt tp s liu mi bao gm cc bin c ch ra bi danh sch bin,
vi cc gi tr c tnh theo thng k tng ng. Cc quan st ca tp s liu c s c nhm
li theo cc gi tr cng loi ca bin c ch ra bi by(danh sch bin).
V d:
Chng ta c file s liu v thu nhp v chi tiu ca cc h thnh vin trong gia nh:
ma_tv
ma_ho thunhap Chitieu
1
101
200
500
2
101
1200
400
3
101
0
200
4
101
0
200
1
102
3200
500
2
102
1200
320
3
102
200
200
1
103
300
500
2
103
2100
250
3
103
0
300
4
103
0
300
1
104
4300
800
2
104
3500
500
3
104
300
500
4
104
0
300
5
104
0
200
6
104
0
200
Chng ta s dng lnh collapse to file v thu nhp v chi tiu bnh qun ca cc h, v to
thm 1 bin v qui m h.
. gen quimo=1
. collapse (mean) thunhap (mean) chitieu (sum) quimo, by(ma_ho)
Tp s liu mi c dng:
ma_ho thunhap chitieu
101
350
325
102
1533.33
340
103
600
337.5
104
1350
416.667
Kt hp s liu - lnh merge
quimo
4
3
4
6
C php:
merge [danh sch bin] using <tn tp s dng> [, update replace]
Lnh merge s ni cc quan st ca tp s liu ang m trong Stata (gi l tp ch (master
dataset)) vi cc quan st tng ng ca tp s liu khc c ch ra sau t kho using (gi l
tp s dng (using dataset)) thnh 1 tp mi. Cc bin ch ra trong danh sch bin c gi l
32
bin xc nh (identifying variables), v phi c sp xt bng lnh sort (hoc gsort) trc khi
thc hin lnh merge.
V d:
Chng ta c 2 tp s liu nh sau:
thunhap.dta
ma_ho
101
102
103
104
dialy.dta
thunhap chitieu
350
325
1533.33
340
600
337.5
1350
416.667
quimo
4
3
4
6
ma_ho
thanhthi
vung
204
0
1
102
1
4
103
0
3
104
0
6
Lnh merge s c thc hin nh sau:
. use "C:\dialy.dta", clear
. sort ma_ho
. save "C:\dialy.dta"
file C:\dialy.dta saved
. use "C:\thunhap.dta", clear
. sort ma_ho
. merge ma_ho using "C:\dialy.dta"
ma_ho was byte now int
. edit
Tp kt qu c dng nh sau:
ma_ho thunhap chitieu
quimo thanhthi vung
_merge
101
350
325
4
.
.
1
102
1533.33
340
3
1
4
3
103
600
337.5
4
0
3
3
104
1350
416.667
6
0
6
3
204
.
.
.
0
1
2
Trong tp kt qu c thm 1 bin tn l _merge, bin ny nhn cc gi tr nh sau:
_merge==1
Nu nh quan st ch c to t tp ch
_merge==2
Nu nh quan st ch c to t s dng
_merge==3
Nu nh quan st c to t c tp ch v tp s dng
Cc tu chn:
Trong trng hp hai tp s liu c cc bin trng nhau, cc tu chn sau y cho php x l s
liu theo cc cch khc nhau:
33
variables) dng phn bit cc quan st vi nhau trong s liu dng ngang (gi l quan st cp
1). j(tn bin) ch ra bin dng phn bit gia cc quan st cp 2 s liu dng dc.
V d 1:
Chng ta c th s liu dng bng ngang nh mt ma trn nh sau:
-i-------------------- xj ------------------maho quimo thunhap95 thunhap96 thunhap97
101
5
4500
4400
5400
102
4
3400
3300
3700
103
6
5000
5400
5500
s liu ny s c chuyn sang dng bng dc nh sau:
-i-jmaho
quimo
nam
101
5
95
101
5
96
101
5
97
102
4
95
102
4
96
102
4
97
103
6
95
103
6
96
103
6
97
V lnh reshape s c vit nh sau:
- xji thunhap
4500
4400
5400
3400
3300
3700
5000
5400
5500
V d 2:
35
nguon2
Ngan hang A
.
Ngan hang C
Ngan hang B
lanvay
1
2
1
2
1
2
1
2
sotien
1200
2000
1300
nguon
Ngan hang A
Ngan hang A
Ngan hang B
2500
1000
3000
2000
Ngan hang A
Ngan hang C
Ngan hang A
Ngan hang B
nh. cho mu mang tnh i din cho cc tnh th nn chn 400 quan st ti H Ni v 100
quan st ti Nam nh.
Nu thu nhp bnh qun ca H Ni l 900 nghn/ thng, v ca Nam nh l 300 nghn/thng
th thu nhp bnh qun ca c vng ng bng Sng Hng khng th tnh l (900 + 300)/2, v
cc quan st trong mu khng c chn t l vi cc tnh. Mi quan st ti H ni i din cho
11250 h trong vng (4500000/400). y chnh l quyn s ca quan st, bng gi tr nghch
o ca xc sut c chn vo mu. Cn mi quan st ti Nam nh i din cho 50000 quan
st ca vng (500000/100). Thu nhp ca vng ng bng Sng Hng s c tnh nh sau:
Thu nhap =
Trong VLSS 1998 c 2 quyn s. Th nht l quyn s h, bin wt, chnh l s h ca Vit
Nam m mi h i din. Quyn s th hai l quyn s ca thnh vin h, hhsizewt l s ngi
Vit Nam m mi thnh vin ca h i din. Quyn s ca thnh vin h bng quyn s h
nhn vi quy m h.
V d: Quyn s trong VLSS 1998
. tab reg7, sum(wt)
Code by 7 |
Summary of sample quyn s
regions |
Mean
Std. Dev.
Freq.
------------+-----------------------------------region1 |
3218.4296
850.74246
859
region2 |
3133.7277
849.12325
1175
region3 |
3185.1794
801.74266
708
region4 |
2199.37
492.37202
754
region5 |
1336.3098
269.14747
368
region6 |
1963.8964
528.69328
1023
region7 |
2938.2122
547.72125
1112
------------+-----------------------------------Total |
2688.5003
900.01379
5999
. tab reg7, sum(hhsizewt)
Code by 7 |
Summary of =hhsize*wt
regions |
Mean
Std. Dev.
Freq.
------------+-----------------------------------region1 |
15790.857
7555.7552
859
region2 |
12656.003
5970.9089
1175
region3 |
14814.504
7236.7592
708
region4 |
10794.537
5235.562
754
region5 |
7564.731
3185.9336
368
region6 |
9447.7077
4535.0816
1023
region7 |
14653.702
6639.8297
1112
------------+-----------------------------------Total |
12636.546
6597.6574
5999
. di 2688.5003*5999
16128313
. di 12636.546*5999
75806639
37
pweights:
aweights
iweights
tab
reg7 urban98
.
.
tab
|
Code by 7 |
38
regions |
Rural
Urban |
Total
-----------+----------------------+---------region1 | 11993763
1570583 | 13564346
region2 | 11057932
3812871 | 14870803
region3 |
9582621
906048 | 10488669
region4 |
5618709
2520372 |
8139081
region5 |
2783821
0 |
2783821
region6 |
4545303
5119702 |
9665005
region7 | 13220727
3074190 | 16294917
-----------+----------------------+---------Total | 58802876
17003766 | 75806642
39
40
Lnh ny tnh sai s chun v khong tin cy cho gi tr trung bnh ca mu theo quy lut
chun, nh thc v Poatxng.
Cc tu chn:
level(#)
binomial
poisson
exposure(tn bin)
total
V d:
. ci
poor
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
5999
29.6216
.5895501
28.46587
30.77733
.
.
. sort reg7
. by reg7: ci poor, total
_______________________________________________________________________________
-> reg7 = region1
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
859
49.82538
1.706961
46.47507
53.17569
_______________________________________________________________________________
-> reg7 = region2
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
1175
23.65957
1.240357
21.22601
26.09314
_______________________________________________________________________________
-> reg7 = region3
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
708
39.54802
1.838899
35.93767
43.15838
_______________________________________________________________________________
-> reg7 = region4
41
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
754
28.64721
1.64759
25.4128
31.88163
_______________________________________________________________________________
-> reg7 = region5
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
368
47.28261
2.606121
42.1578
52.40741
_______________________________________________________________________________
-> reg7 = region6
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
1023
7.331378
.8153306
5.731465
8.931292
_______________________________________________________________________________
-> reg7 = region7
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
1112
29.31655
1.365709
26.63689
31.99621
_______________________________________________________________________________
-> Total
Variable |
Obs
Mean
Std. Err.
[95% Conf. Interval]
-------------+------------------------------------------------------------poor |
5999
29.6216
.5895501
28.46587
30.77733
Ch :
Cc lnh c lng c th c s dng khi bit cc tham s v mu. y c th c gi l
cc lnh s dng tham s trc tip (Commands using immediate arguments). Cc lnh ny rt
hu dng khi chng ta khng c s liu gc v bin.
cii <s quan st> <gi tr trungbnh> < lch chun> [, level(#) ]
cii <s quan st> <s ln thnh cng ca quan st> [, level(#) ]
V d:
. cii 5999 1777, level (90)
-- Binomial Exact -Variable |
Obs
Mean
Std. Err.
[90% Conf. Interval]
-------------+------------------------------------------------------------|
5999
.296216
.005895
.2865107
.3060676
42
1.2.
859
---------------------------------------------------------------------------Variable |
Mean
Std. Err.
z
P>|z|
[95% Conf. Interval]
---------+-----------------------------------------------------------------poor | .4982538
.0170597
29.2065
0.0000
.4648174 .5316901
---------------------------------------------------------------------------Ho: proportion(poor) = .44
Ha: poor < .44
z = 3.440
P < z = 0.9997
prtest <bin 1> = <tn bin2> [iu kin] [phm vi] [, level(#)]
Lnh ny thc hin kim nh gi thuyt v s bng nhau ca t l ca hai gi tr bin c ch
ra bi tn bin (Ho: pX = pY).
V d: Kim nh xem t l ngho i gia vng 2 v vng 4 c khac nhau khng:
. gen poor2=poor if reg7==2
(4824 missing values generated)
. gen poor4=poor if reg7==4
(5245 missing values generated)
. prtest poor2 = poor4
Two-sample test of proportion
1175
754
-----------------------------------------------------------------------------Variable |
Mean
Std. Err.
z
P>|z|
[95% Conf. Interval]
---------+--------------------------------------------------------------------
43
poor2 | .2365957
.0123983
19.0829
0.0000
.2122955
.2608959
poor4 | .2864721
.016465
17.3989
0.0000
.2542014
.3187429
---------+-------------------------------------------------------------------diff | -.0498764
.020611
-.0902732
-.0094796
| under Ho:
.0203666 -2.44893
0.0143
-----------------------------------------------------------------------------Ho: proportion(poor2) - proportion(poor4) = diff = 0
Ha: diff < 0
z = -2.449
P < z = 0.0072
Ha: diff ~= 0
z = -2.449
P > |z| = 0.0143
prtest <bin> [iu kin] [phm vi], by(bin phn nhm) [level(#)]
Lnh ny thc hin kim nh gi thuyt v s bng nhau ca t l ca hai nhm c ch ra
bi bin phn nhm (Ho: pX1 = pX2).
V d:
. prtest poor, by(sex)
Two-sample test of proportion
1: Number of obs =
2: Number of obs =
4375
1624
-----------------------------------------------------------------------------Variable |
Mean
Std. Err.
z
P>|z|
[95% Conf. Interval]
---------+-------------------------------------------------------------------1 |
.3248
.00708
45.8755
0.0000
.3109234
.3386766
2 | .2192118
.0102661
21.353
0.0000
.1990906
.239333
---------+-------------------------------------------------------------------diff | .1055882
.0124708
.0811459
.1300304
| under Ho:
.0132673
7.95855
0.0000
-----------------------------------------------------------------------------Ho: proportion(1) - proportion(2) = diff = 0
Ha: diff < 0
z = 7.959
P < z = 1.0000
Ha: diff ~= 0
z = 7.959
P > |z| = 0.0000
= 0.000344
(one-sided test)
44
(one-sided test)
(two-sided test)
(one-sided test)
(one-sided test)
(two-sided test)
ttest
rlpcex1=3200
One-sample t test
-----------------------------------------------------------------------------Variable |
Obs
Mean
Std. Err.
Std. Dev.
[95% Conf. Interval]
---------+-------------------------------------------------------------------rlpcex1 |
5999
3188.667
34.76379
2692.567
3120.518
3256.817
-----------------------------------------------------------------------------Degrees of freedom: 5998
Ho: mean(rlpcex1) = 3200
Ha: mean < 3200
t = -0.3260
P < t =
0.3722
ttest <bin 1> = <bin 2> [iu kin] [phm vi] [, unpaired unequal level(#) ]
Lnh ny thc hin kim nh gi thuyt rng hai bin c gi tr trung bnh bng nhau. (Ho: X
= Y).
Cc tu chn:
unpaired
unequal
V d:
. ttest poor2=poor4, unpaired unequal
Two-sample t test with unequal variances
45
-----------------------------------------------------------------------------Variable |
Obs
Mean
Std. Err.
Std. Dev.
[95% Conf. Interval]
---------+-------------------------------------------------------------------poor2 |
1175
.2365957
.0124036
.425173
.2122601
.2609314
poor4 |
754
.2864721
.0164759
.4524128
.254128
.3188163
---------+-------------------------------------------------------------------combined |
1929
.2560912
.0099404
.436586
.2365962
.2755863
---------+-------------------------------------------------------------------diff |
-.0498764
.0206229
-.0903285
-.0094243
-----------------------------------------------------------------------------Satterthwaite's degrees of freedom: 1532.64
Ho: mean(poor2) - mean(poor4) = diff = 0
Ha: diff < 0
t = -2.4185
P < t =
0.0079
Ha: diff ~= 0
t = -2.4185
P > |t| =
0.0157
ttest <bin> [iu kin] [phm vi], by(bin phn nhm) [ unequal level(#) ]
Lnh ny thc hin kim nh gi thuyt v s bng nhau ca gi tr trung bnh ca hai nhm
c ch ra bi bin phn nhm (Ho: X1 = X2).
V d:
. ttest
rlpcex1, by(sex)
Ha: diff ~= 0
t = -9.8880
P > |t| =
0.0000
V d:
. sum
rlpcex1
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+----------------------------------------------------rlpcex1 |
5999
3188.667
2692.567
357.318
45801.71
. sdtest rlpcex1=2700
One-sample test of variance
-----------------------------------------------------------------------------Variable |
Obs
Mean
Std. Err.
Std. Dev.
[95% Conf. Interval]
---------+-------------------------------------------------------------------rlpcex1 |
5999
3188.667
34.76379
2692.567
3120.518
3256.817
-----------------------------------------------------------------------------Ho: sd(rlpcex1) = 2700
chi2(5998) = 5965.022
Ha: sd(rlpcex1) < 2700
P < chi2 = 0.3838
covariance
_coef
wrap
V d:
. corr hhsize poor
(obs=5999)
rlpcex1 sex
|
hhsize
poor rlpcex1
sex
-------------+-----------------------------------hhsize |
1.0000
poor |
0.2425
1.0000
rlpcex1 | -0.2172 -0.4452
1.0000
47
sex |
-0.2570
-0.1028
0.1267
1.0000
Variable |
Mean
Std. Dev.
Min
Max
-------------+---------------------------------------------------hhsize |
4.752292
1.954292
1
19
poor |
.296216
.4566255
0
1
rlpcex1 |
3188.667
2692.567
357.318
45801.71
sex |
1.270712
.4443645
1
2
|
hhsize
poor rlpcex1
sex
-------------+-----------------------------------hhsize | 3.81926
poor | .216435 .208507
rlpcex1 | -1142.93 -547.335 7.2e+06
sex | -.223195 -.020849 151.543
.19746
pwcorr
[danh sch bin] [quyn s] [iu kin] [phm vi] [, obs sig print(#) star(#)]
Lnh ny tnh h s tng quan cho tng cp bin c ch ra bi danh sch bin.
Cc tu chn:
obs
sig
print(#)
star(#)
V d:
. pwcorr hhsize poor rlpcex1 sex, obs sig star(5)
|
hhsize
poor rlpcex1
sex
-------------+-----------------------------------hhsize |
1.0000
|
|
5999
|
poor |
0.2425* 1.0000
|
0.0000
|
5999
5999
|
rlpcex1 | -0.2172* -0.4452* 1.0000
|
0.0000
0.0000
|
5999
5999
5999
|
sex | -0.2570* -0.1028* 0.1267* 1.0000
|
0.0000
0.0000
0.0000
48
|
|
5999
5999
5999
5999
pcorr <bin> <danh sch bin> [quyn s] [iu kin] [phm vi]
Lnh ny tnh h s tng quan ca bin c ch ra bi tn bin vi cc bin c trong danh
sch bin
V d:
. pwcorr poor hhsize
rlpcex1 sex
|
poor
hhsize rlpcex1
sex
-------------+-----------------------------------poor |
1.0000
hhsize |
0.2425
1.0000
rlpcex1 | -0.4452 -0.2172
1.0000
sex | -0.1028 -0.2570
0.1267
1.0000
rlpcex1
reg7
sex
hhsize
Source |
SS
df
MS
-------------+-----------------------------Model | 3.8639e+09
3 1.2880e+09
Residual | 3.9621e+10 5995 6609032.15
-------------+-----------------------------Total | 4.3485e+10 5998 7249918.40
Number of obs
F( 3, 5995)
Prob > F
R-squared
Adj R-squared
Root MSE
=
=
=
=
=
=
5999
194.88
0.0000
0.0889
0.0884
2570.8
-----------------------------------------------------------------------------rlpcex1 |
Coef.
Std. Err.
t
P>|t|
[95% Conf. Interval]
-------------+---------------------------------------------------------------reg7 |
240.9633
15.5905
15.46
0.000
210.4003
271.5263
sex |
403.2984
77.38324
5.21
0.000
251.5994
554.9974
hhsize | -305.6382
17.70692
-17.26
0.000
-340.3501
-270.9263
_cons |
3160.201
155.6576
20.30
0.000
2855.056
3465.346
------------------------------------------------------------------------------
Cc tu chn:
level(#)
noconstant
noheader
beta
poor
Iteration
Iteration
Iteration
Iteration
0:
1:
2:
3:
reg7 sex
log
log
log
log
hhsize
likelihood
likelihood
likelihood
likelihood
=
=
=
=
-3645.1363
-3367.2185
-3364.8032
-3364.8025
Probit estimates
Number of obs
LR chi2(3)
Prob > chi2
Pseudo R2
=
=
=
=
5999
560.67
0.0000
0.0769
-----------------------------------------------------------------------------poor |
Coef.
Std. Err.
z
P>|z|
[95% Conf. Interval]
-------------+---------------------------------------------------------------reg7 |
-.116342
.0084551
-13.76
0.000
-.1329136
-.0997703
sex | -.1284525
.0422247
-3.04
0.002
-.2112113
-.0456937
hhsize |
.1808115
.0095806
18.87
0.000
.1620338
.1995892
_cons | -.8088731
.0824798
-9.81
0.000
-.9705306
-.6472157
------------------------------------------------------------------------------
stdp
redid
c lng gi tr phn d:
e i = Yi Y
i
V d:
predict exphat, xb
50
educyr98 hhsize
Source |
SS
df
MS
-------------+-----------------------------Model | 1.6960e+10
10 1.6960e+09
Residual | 2.6525e+10 5988 4429712.49
-------------+-----------------------------Total | 4.3485e+10 5998 7249918.40
Number of obs
F( 10, 5988)
Prob > F
R-squared
Adj R-squared
Root MSE
=
=
=
=
=
=
5999
382.87
0.0000
0.3900
0.3890
2104.7
------------------------------------------------------------------------------
51
rlpcex1 |
Coef.
Std. Err.
t
P>|t|
[95% Conf. Interval]
-------------+---------------------------------------------------------------urban98 |
1995.163
66.46943
30.02
0.000
1864.859
2125.467
region1 | -923.7066
132.8334
-6.95
0.000
-1184.108
-663.3052
region2 | -362.6047
130.2254
-2.78
0.005
-617.8934
-107.316
region3 | -558.0354
137.1551
-4.07
0.000
-826.9089
-289.1619
region4 | -100.7586
135.8372
-0.74
0.458
-367.0486
165.5313
region5 | (dropped)
region6 |
1742.688
131.9928
13.20
0.000
1483.934
2001.441
region7 |
151.9854
128.0272
1.19
0.235
-98.99396
402.9648
sex |
270.9142
66.61031
4.07
0.000
140.3339
401.4944
educyr98 |
153.3281
6.836934
22.43
0.000
139.9253
166.731
hhsize |
-257.691
14.73741
-17.49
0.000
-286.5816
-228.8004
_cons |
2362.355
178.3197
13.25
0.000
2012.784
2711.926
-----------------------------------------------------------------------------. test
( 1)
urban98 =2000
urban98 = 2000.0
F(
. test
( 1)
( 1)
( 1)
( 2)
( 3)
1, 5988) =
Prob > F =
27.80
0.0000
. testparm
1)
2)
3)
4)
5)
6)
7)
34.57
0.0000
F(
(
(
(
(
(
(
(
1, 5988) =
Prob > F =
region1 = (region2+region3)/2
F(
. test
0.01
0.9420
region1 = region2
F(
. test
1, 5988) =
Prob > F =
3, 5988) =
Prob > F =
20.22
0.0000
region*
region1 = 0.0
region2 = 0.0
region3 = 0.0
region4 = 0.0
region5 = 0.0
region6 = 0.0
region7 = 0.0
Constraint 5 dropped
F(
6,
5988) =
148.55
52
Prob > F =
0.0000
Chng IV: V th
1. V th (graph)
C php:
graph [danh sch bin] [quyn s] [iu kin] [phm vi] [, loi__th tu_chn_ring
tu_chn_chung]
Trong :
loi__th (graph_type)
Ch ra loi th cn v
tu_chn_ring (specific_options)
tu_chn_chung (common_options)
53
45801.7
357.318
16
95
95
19
45801.7
comp.M&Reg price
adj.pc tot exp
357.318
95
Age of household
head
16
22
schooling year
of HH.head
0
19
Household size
1
357.318
45801.7
22
54
Fraction
.329888
0
357.318
45801.7
357.318
45801.71
55
357.318
57
Audi 5000
Audi Fox
BMW 320i
Datsun 200
Datsun 210
Price
Mileage (mpg)
Repair Record 1978
Datsun 510
Datsun 810
Fiat Strada
Honda Accord
Honda Civic
Headroom (in.)
Trunk space (cu. ft.)
Weight (lbs.)
Length (in.)
Mazda GLC
Renault
Subaru
Toyota Celica
Toyota Corolla
Toyota Corona
VW Dasher
VW Diesel
VW Rabbit
VW Scirocco
Volvo 260
58
size |
Mean Std. Dev.
Freq.
------------+-----------------------------------1 | 3.7897196 4.3956537
214
2 | 5.7545272 4.7225549
497
3 | 7.3023256 4.6396425
731
4 | 8.2578348 4.2659841
1404
5 | 7.7243298 4.2998488
1318
6 | 6.8788927 4.0778062
867
7 | 6.3348958 4.1241759
480
8 | 5.7333333 3.9623557
255
9 | 5.7936508 3.4878474
126
10 | 6.1724138 3.1851516
58
11 | 4.7931034 3.1665586
29
12 | 4.4444444 3.6438685
9
13 |
5 5.0990195
4
14 |
3 2.1602469
4
16 |
4 1.4142136
2
19 |
2
0
1
------------+-----------------------------------Total | 7.0944185 4.4160917
5999
. replace meanexp= meanexp/1000
(16 real changes made)
. replace meanexp= meanexp/1000
. rename var71 ahhsize
. rename var72 meanexp
. rename var73 meanedu
. replace meanexp= meanexp/1000
. label var meanexp Chi tieu binh quan
. label var meanedu So nam hoc
. label var ahhsize Quy mo ho
59
meanexp
meanedu
8.25783
1.57978
1
19
ahhsize
* La chn v tiu :
title("chui k t") t1title("chui k t") t2title("chui k t") b1title("chui k t")
b2title("chui k t") l1title("chui k t") l2title("chui k t") r1title("chui k t")
r2title("chui k t")
Lnh ny ghi cc tiu trn pha trn (top), pha di (bottom), bn tri (left) v bn phi
(right) th.
V d:
gr meanexp meanedu ahhsize, title (Do thi chi tieu va hoc van chu ho) l1title(Chi tieu binh quan
(tr dong)) l2title(So nam hoc cua chu ho) b2title (Quy mo ho gia dinh)
Chi tieu binh quan
So nam hoc
8.25783
1.57978
1
19
60
* Hin th gi tr trc th
xlabel[(gi tr s)] ylabel[(gi tr s)] rlabel[(gi tr s)] tlabel[(gi tr s)]
V d:
gr meanexp meanedu ahhsize, title (Do thi chi tieu va hoc van chu ho) l1title(Chi tieu binh quan
(tr dong)) l2title(So nam hoc cua chu ho) b2title (Quy mo ho gia dinh) xlabel ylabel
Chi tieu binh quan
So nam hoc
2
0
10
Quy mo ho gia dinh
15
20
61
So nam hoc
2
0
10
Quy mo ho gia dinh
15
20
So nam hoc
8
4000
6
3000
So nam hoc
5000
4
2000
1000
2
0
10
Quy mo ho gia dinh
15
20
2.2. th tn sut
C php:
graph [bin] [quyn s] [iu kin] [phm vi], histogram [tu_chn_chung bin(#) freq
normal[(#,#)] density(#)]
Cc tu chn:
bin(#)
Freq
normal[(#,#)]
density(#)]
V d:
th tn sut ca chi tiu binh qun u ngi
. gr rlpcex1, hist bin(20) normal
Fraction
.56026
0
357.318
45801.7
63
Frequency
1979
0
357.318
45801.7
region2
region3
region4
region5
region6
415
Frequency
415
0
357.318
region7
45801.7
357.318
45801.7
415
0
357.318
45801.7
Household size
8.64426
65
Household size
10
region1
region2
region3
region4
region5
region6
region7
La chn stack
. gen persons=1
. gr persons urban98, bar ylabel by(reg7) stack alt
persons
1500
1000
500
region1
region2
region3
region4
region5
region6
region7
V d:
Hy v th sau:
66
foodpoor
poor
600
400
200
region1
region2
region3
region4
region5
region6
region7
67
12% foodpoor
18% poor but still above food povert
70% nonpoor
region2
region3
12% foodpoor
18% poor but still above food povert
70% nonpoor
region4
region7
region5
region6
Total
region2
region3
persons
12% foodpoor
18% poor but still above food povert
1500
70% nonpoor
region4
region7
region5
region6
1000
500
Total
region1
region2
region3
region4
region5
region6
region7
24% poor1
16% poor2
16% poor3
12% poor4
10% poor5
4% poor6
18% poor7
69
Sau khi son tho, do-file s c lu tr bng tu chn Save as trong thc n File ca ca s
do-file editor. Tn ca do-file c th c ch ra ngay ti lnh doedit nh sau:
doedit (tn do-file)
Tp do-file c phn m rng l do.
v d trn chng ta c th lu tr on chng trnh di tn l chng trnh 1 ti th mc
Vlss98 trn a C.
1.2. Thc hin cc tp do-file
chy do-file th ti ca s lnh chng ta g mt trong hai lnh sau:
do filename [, nostop]
run filename [, nostop]
70
Lnh run thc hin cc lnh trong do-file nhng khng hin th kt qu ra mn hnh.
Trong qu trnh thc hin do-file, nu c cu lnh sai th Stata s bo li v ngng vic thc
hin cc cu lnh sau . Tuy nhin nu tu chn nostop c ch ra th Stata s b qua cu
lnh b li v tip tc thc hin cc lnh sau cu lnh li .
V d:
. do "c:\vlss98\chuong trinh 1"
. clear
. set mem 32m
(32768k)
. use "C:\VLSS98\Hhexp98n.dta", clear
. tab urban98
1:urban 98; |
0:rural 98 |
Freq.
Percent
Cum.
------------+----------------------------------Rural |
4269
71.16
71.16
Urban |
1730
28.84
100.00
------------+----------------------------------Total |
5999
100.00
. sum hhsize
Variable |
Obs
Min
Max
-------------+----------------------------------------------------hhsize |
5999
4.752292 1.954292
19
. gen new=hhsizet
hhsizet not found
r(111);
end of do-file
r(111);
Vi tu chn nostop
. do "c:\vlss98\chuong trinh 1", nostop
. clear
. set mem 32m
(32768k)
. use "C:\VLSS98\Hhexp98n.dta", clear
. tab urban98
1:urban 98; |
71
0:rural 98 |
Freq.
Percent
Cum.
------------+----------------------------------Rural |
4269
71.16
71.16
Urban |
1730
28.84
100.00
------------+----------------------------------Total |
5999
100.00
. sum hhsize
Variable |
Obs
Min
Max
-------------+----------------------------------------------------hhsize |
5999
4.752292 1.954292
19
. gen new=hhsizet
hhsizet not found
r(111);
. gen new=hhsize
. end of do-file
Thc hin (chy) bng lnh run
. run "c:\vlss98\chuong trinh 1", nostop
hhsizet not found
Cc do-file c th thc hin bng tu chn Do trong thc n File, hoc thc hin trc tip
trong ca s Do-file editor bng tu chn Do hoc Run trong thc n Tool.
1.3. Mt s lu khi son tho do-file
version #
Khi son tho cc tp do-file chng ta nn a dng lnh ny vo u chng trnh thng
bo phin bn Stata c dng son tho do-file. V d nu nh chng ta dng Stata 7.0
son tho do-file th cu lnh ny s c a vo u chng trnh nh sau:
version 7.0
clear
use Hhexp98n.dta
tab reg7
.
Cc phin bn Stata khc nhau s c th c s khc nhau v c php hoc ngha ca cc cu
lnh. Lnh version cho php chng trnh Stata chy c th hiu ng c ni dung ca tp
do-file c vit bi cc phin bn khc.
set memory #[k|m]
Nu nh file s liu i hi b nh ln hn b nh m Stata ang s dng th chng ta phi
thit lp b nh ln hn cho Stata bng lnh trn. Ch l khng nn thit lp b nh ln hn
b nh ca RAM my tnh.
72
V d:
. use "C:\Hhexp98n.dta", clear
no room to add more observations
r(901);
. set mem 32m
(32768k)
. use "C:\Hhexp98n.dta", clear
set more off/on
Theo ch ngm nh, khi thc hin mt lnh nu nh kt qu ca vic x l lnh di hn
ca s kt qu (Stata Results), mn hnh s dng li v chng ta s phi n phm (chng hn
Enter hoc Space bar) kt qu tip tc c hin th. Lnh set more off cho php kt qu
khng b dng li m c hin th lin tc cho n khi thc hin xong cu lnh hoc do-file.
Lnh set more on khi phc li ch ngm nh.
K t * v /* */
Stata s khng thc hin cc cu lnh c bt u bng k t * hoc nm gia hai nhm k
t /* */. Cc k t ny dng vit ch thch trong do-file.
V d:
-------------------version 7.0
set mem 32m
use "C:\Hhexp98n.dta", clear
* Tao bien thu nhap cua ho gia dinh
/* Bien nay bang Thu nhap binh quan
nhan voi Quy mo ho*/
gen hhexp = rlpcex1 * hhsize
#delimit ;
Khi cu lnh trong do-file editor qu di th chng ta c th dng lnh ny thng bo rng 1
cu lnh c kt thc bng k t (;). Theo ch ngm nh th cu lnh c kt thc khi
xung dng bng vic g phm Enter. khi phc li ch ngm nh th dng lnh #delimit
cr
V d: lnh v th chng trc:
graph meanexp meanedu ahhsize, title (Do thi chi tieu va hoc van chu ho) l1title(Chi tieu binh
quan (tr dong)) l2title(So nam hoc cua chu ho) b2title (Quy mo ho gia dinh) xlabel ylabel xline
(5 10 to 20) yline(2 4 to 8) connect(ll)
tung ng vi:
#delimit ;
graph meanexp meanedu ahhsize, title (Do thi chi tieu va hoc van chu ho)
l1title(Chi tieu binh quan (tr dong)) l2title(So nam hoc cua chu ho)
b2title (Quy mo ho gia dinh) xlabel ylabel xline (5 10 to 20)
73
yline(2 4 to 8) connect(ll) ;
gen hhexp = rlpcex1 * hhsize ;
..
Sau chng ta nn khi phc li ch ngm nh nu nh cc cu lnh sau c th vit
trn 1 dng bng lnh:
#delimit cr
Ch :
-
Cc lnh # delimit v cch vit cu lnh di s dng k t /* */ ch dng c trong dofile ch khng dng c ti ca s lnh command.
province
commune
float %9.0g
float %9.0g
Province code
commune code PSU-SVY commands
V d:
Nu ma trn A c to ra th chng ta c th xem ni dung ca ma trn nh sau:
. matrix list A
A[3,3]
c1 c2 c3
r1
r2
r3 10 11 14
y ma trn A bao gm 9 phn t (element): 1, 2, 4, 3, 4, 7, 10, 11, 14. Cc ct c t tn l
c1, c2, v c3, v cc hng l r1, r2, v r3. Phn t l giao im ca dng 1 v ct 2 c k hiu
l A[1, 2]. Trong v d ny A[1, 2] cha gi tr bng 2.
3.2. Tch v hng (scalar)
Tch v hng cha 1 phn t l s. Tch v hng c nh ngha bng lnh sau:
scalar scalar_name = expression
V d:
. scalar a = 10
. scalar list a
a = 10
. scalar b = a* 2
. scalar list b
b=
20
matrix myvec = (1 5 3 1 3)
To ra vct hng
To ra vct ct
mkmat <danh sch bin> [iu kin] [phm vi] [, matrix(tn ma trn) ]
V d:
. input maho quymo thunhap
maho
quymo
thunhap
1. 101 6 1200
2. 103 5 1400
3. 105 5 3200
4. 107 9 1000
5. 109 4 2500
6. end
. mkmat maho quymo thunhap, matrix(A)
. matrix list A
A[5,3]
maho
quymo thunhap
r1
101
1200
r2
103
1400
r3
105
3200
r4
107
1000
r5
109
2500
=B
matrix C
= (C+C)/2
matrix D = A*A
Xo ma trn
Ma trn v tch v hng c th xo khi b nh bng lnh:
matrix drop <ma trn>
scalar drop <tch v hng>
V d:
. matrix drop A
. scalar drop B
4. Lnh iu kin v vng lp
4.1. Lnh ifelse
C php:
iu kin (iu kin logic) {
Nhm cu lnh 1
}
else Cu lnh
78
}
else {
comands 2
}
-
}
else iu kin (iu kin) {
.
4.2. Lnh while
C php:
while <iu kin logic> {
Nhm cu lnh
}
Stata s kim tra iu kin logic (expression), nu iu kin ny ng th cc lnh Nhm cu
lnh s c thc hin, nu iu kin sai th cc lnh ny s khng c thc hin.
V d:
local i=1
while `i<= 10 {
if mod(`i',2) {
display "`i' is odd"
}
79
else {
display "`i' is even"
}
local i=`i+1
}
Ch :
Vng lp c th c dng li nu s dng tu chn sau y gia vng lp:
continue [, break]
Nu gp lnh continue, Stata s b qua cc lnh sau v quay li lnh u tin ca vng lp.
Nu c tu chn break c ch ra th Stata s thot khi vng lp.
V d: Tm tch s chung nh nht ca 2, 3 v 5
local i=1
while `i<= 1000 {
if mod(`i',2)==0 & mod(`i',3)==0 & mod(`i',5)==0 {
di "The least common multiple of 2, 3, and 5 is `i'"
continue, break
}
}
5. Gii thiu v file ado
To chng trnh
Mt on chng trnh trong Stata c th c nh ngha bng lnh:
Program define <tn chng trnh>
Cc cu lnh
end
on chng trnh ny c vit trong ca s Do-file editor. Mt khi n c chy th on
chng trnh ny s lu tr trong b nh ca Stata, v ch cn gi ra bng cch g tn chng
trnh (progname)
V d:
quietly program define povline
display as text _col(3) "Poverty line" _col(16) "{c |}" _col(20) "Food"
"Overall"
_col(30)
Overall
80
---------------+-------------------------Value
| 1380
1920
Ch :
Nu chng ta chy li lnh program define povline, v nhn c thng bo:
povline already defined
r(110);
Tc l chng trnh povline c to ra ri, xo chng ny i th chng ta dng lnh:
program drop poveline
hoc xo tt c cc chng trnh
program drop _all
Ado-file
Cc ado-file to ra cc lnh ca Stata. Trong Stata c hai loi lnh. Loi th nht c vit
trong Stata, v d nh lnh summarize. Loi th hai c nh ngha bi cc tp ado, v d nh
lnh ci.
bit c lnh Stata thuc loi no, g lnh which:
. which sum
built-in command: summarize
. which ci
C:\STATA\ado\base\c\ci.ado
*! version 3.3.4 04sep2000
Cc ado-file chnh l cc chng trnh c nh ngha bng lnh program define, v lu tr vi
phn m rng l ado. Stata s tm kim cc ado-file cc th mc:
. sysdir
STATA: C:\STATA\
UPDATES: C:\STATA\ado\updates\
BASE: C:\STATA\ado\base\
SITE: C:\STATA\ado\site\
STBPLUS: c:\ado\stbplus\
PERSONAL: c:\ado\personal\
OLDPLACE: c:\ado\
V d:
Chng ta c th lu tr lnh povline di dng ado v lu tr thu mc C:\STATA\ado\base\
Lnh ny s c thc hin khi ta g povline m khng cn chng ta phi thc hin cu lnh
trc do-file.
Bi tp: Vit lnh povline vi cc la chn cho cc nm 1993, 1998, v 2002.
Ph lc
Cc thng k c bn ca mu tun theo quy lut chun
Trung bnh:
n
x=
i =1
Phng sai:
n
s2 =
(x
i =1
x)2
n 1
lch chun:
s=
s2
MAD =
i =1
lch:
n
Skewness =
(x
x)3 / n
i =1
s3
nhn:
n
Kurtosis =
(x
x)4 / n
i =1
s4
82