You are on page 1of 121

TRNG I HC Y DC TP H CH MINH

KHOA Y T CNG CNG


B mn Thng k Y Hc v Tin Hc










STATA NG DNG TRONG
NGHIN CU KHOA HC


Bin son: TS. Vn Dng














THNH PH H CH MINH
4-2008
(Lu hnh ni b)

MC LC

i cng v thng k v thng k m t ...................................................................................... 1
Mt s nh ngha ....................................................................................................................... 1
Bin s v cc loi bin s.......................................................................................................... 1
Phng php m t tm tt v trnh by s liu ......................................................................... 1
Cc s thng k m t................................................................................................................. 2
Phng php trnh by s liu .................................................................................................... 5
i cng v phn tch s liu...................................................................................................... 13
Suy lun thng k...................................................................................................................... 14
Ci t chng trnh Stata 10.0, s liu mu v cc chng trnh c lin quan.......................... 29
Khi ng v kt thc Stata.......................................................................................................... 36
Khi ng Stata ............................................................................................................................ 40
1. Khi ng Stata .................................................................................................................... 40
2. M t giao din ca chng trnh Stata ................................................................................ 40
3. Cch cch thc hin lnh trong chng trnh Stata ......................................................... 41
4. Lu li kt qu phn tch ..................................................................................................... 42
Mt vi phn tch n gin vi Stata............................................................................................ 44
M t s liu vi Stata 10.0 for Windows .................................................................................... 58
Thng k phn tch bin s nh lng vi Stata ......................................................................... 96
Thc hnh ................................................................................................................................. 98

1
i cng v thng k v thng k m t
Mt s nh ngha
Thng k l phng php khoa hc dng thu thp, tm tt, trnh by v phn tch s liu.
S liu: Kt qu c c do vic quan st hay thu thp mt bin s cc i tng khc nhau
hay thi gian khc nhau.
Th d: Khi ti quan st gii tnh ca cc hc vin trong lp, ti c s liu l:
Nam, nam, n, n, n, nam, n, v.v
Th d: Mt nh nghin cu o nng hemoglobin ca 70 thai ph c kt qu nh sau:
10.2 13.7 10.4 14.9 11.5 12.0 11.0
13.3 12.9 12.1 9.4 13.2 10.8 11.7
10.6 10.5 13.7 11.8 14.1 10.3 13.6
12.1 12.9 11.4 12.7 10.6 11.4 11.9
9.3 13.5 14.6 11.2 11.7 10.9 10.4
12.0 12.9 11.1 8.8 10.2 11.6 12.5
13.4 12.1 10.9 11.3 14.7 10.8 13.3
11.9 11.4 12.5 13.0 11.6 13.1 9.7
11.2 15.1 10.7 12.9 13.4 12.3 11.0
14.6 11.1 13.5 10.9 13.1 11.8 12.2
v nhng con s ny c gi l s liu.
Cn lu s liu phi lin kt vi mt bin s nht nh. Nu ti quan st gii tnh ngi ny,
tui ca ngi khc, qun o ca mt ngi khc na th kt qu quan st c khng phi l s
liu.
Bin s v cc loi bin s
Bin s l nhng i lng hay nhng c tnh c th thay i t ngi ny sang ngi khc hay
t thi im ny sang thi im khc.
Nh vy bin s c th th hin i lng hay c tnh.
- Nu bin s th hin mt i lng n c gi l bin s nh lng (quantitative
variable). Bin s nh lng c th cn c chia thnh bin s t s - ratio variable(c gi tr
khng tuyt i) v bin s khong interval variable (khng c gi tr khng tuyt )
- Nu bin s nhm th hin mt c tnh, bin s c gi l bin s nh tnh. Bin s
nh tnh cn c chia lm 3 loi:
- Bin s nh gi binary variable (khi ch c 2 gi tr)
- Bin s danh nh nominal variable (khi c 3 hay nhiu hn cc gi tr v cc
bn thn cc gi tr khng c tnh cht th t)
- Bin s th t - ordinal variable (khi c 3 hay nhiu hn cc gi tr v cc bn
thn cc gi tr c tnh cht th t
- Ngoi ra c khi bin c khng ch c quan tm v phng din n c xy ra hay cha
xy ra m cn c quan tm v phng din bin c xy ra vo lc no. Th d sau khi iu tr
bnh nhn ung th chng ta khng ch quan tm bnh nhn c t vong hay khng m cn quan
tm bnh nhn bnh nhn t vong bao nhiu lu sau khi iu tr v nu bnh nhn cha t vong,
bnh nhn sng c bao lu.
Phng php m t tm tt v trnh by s liu

2
-T l cho tng gi tr nu
l bin th t hay danh
nh
- T l cho gi tr tiu biu
nu l bin nh gi
-T l cho tng gi tr nu
l bin th t hay danh
nh
- T l cho gi tr tiu biu
nu l bin nh gi

Cc s thng k m t
C hai loi thng k m t: thng k m t khuynh hng tp trung v thng k m t tnh phn
tn.
Thng k m t khuynh hng tp trung
Thng k m t khuynh hng tp trung c th l trung bnh (mean), trung v (median) v yu v
(mode). Nhng thng k ny cho bit gi tr tiu biu cho s liu.
Th d: c hai loi thuc h p A v B. Gi s c 5 i tng sau khi s dng thuc h p A s c huyt p
110 - 115 -120 - 125 -130 v 5 i tng khc sau khi s dng thuc h p B s c huyt p 120 - 125 -
130 - 135 - 140. Con s tiu biu nht cho bit tc dng ca thuc A l huyt p trung bnh sau khi s
dng thuc A v l 120. Con s huyt p trung bnh ny thp hn huyt p trung bnh sau khi s dng
thuc B cho bit thuc A c tc dng mnh hn.
Trung bnh ca s liu, c k hiu l (x (c l x gch) l tng cc gi tr ca s liu chia cho
s ln quan st (N).
N
x
x
i

=
Th d: S liu v huyt p tm thu ca 5 i tng l 120, 125, 130, 135, 150. Huyt p tm thu
trung bnh s l 132
132
5
150 125 130 125 120
=
+ + + +
=

=
N
x
x
i


3
Do khng th thc hin cc php ton s hc trn cc bin s nh tnh (danh nh v th t)
chng ta ch c th tnh trung bnh cho s liu ca bin s nh lng.
Nu chng ta sp xp s liu theo th t, gi tr ng gia c gi l trung v. Nu c hai gi
tr cng ng gia, trung bnh cng ca hai gi tr ny l trung v.
Th d: S liu v huyt p tm thu (mmHg) ca 5 i tng l 120, 125, 130, 135, 150. Trung v
ca huyt p tm thu l gi tr ng gia v bng 130
S liu v chiu cao (cm) ca 6 ngi l 153, 155, 160, 162, 165, 161. tnh trung v, trc tin
chng ta phi sp xp s liu ny: 153, 155, 160, 161, 162, 165. Do c hai gi tr 160 v 161 cng
gia, trung v s l (160+161)/2 = 160,5 cm
Do bn cht ca bin s danh nh khng th sp c theo th t, chng ta ch c th tnh trung
v ca s liu nh lng v s liu ca bin s th t.
Ngoi ra yu v (mode) cng c s dng lm con s thng k tiu biu. Yu v l gi tr xut
hin ph bin nht (c tn sut cao nht).
Th d: S liu v huyt p tm thu (mmHg) ca 5 i tng l 120, 125, 130, 135, 150. Trong
trng hp ny khng c yu v.
im s ca 5 hc sinh l 5, 5, 6, 7, 9. Yu v ca im s l 5.
Trong mt p c 361 gia nh ngi Kinh, 120 gia nh ngi Khmer v 27 gia nh ngi Hoa.
Yu v ca bin s dn tc l dn tc Kinh.
Trong mt s liu c th, c th khng c yu v, c th c mt yu v hoc hai hay nhiu yu v.
y l khuyt im chnh ca s thng k ny. Do vy ngi ta thng ch dng yu v cho bin
s danh nh hay trong cc trng hp c bit
C th s dng trung bnh, trung v hay yu v cho bin s nh lng. Khi bin s nh lng c
phn phi bnh thng (hnh chung) th ba con s ny xp x bng nhau v khi ngi ta
thng tnh trung bnh bi v trung bnh c nhng c tnh ton hc mnh. Tuy nhin nu s liu
b lch th con s trung v phn nh gi tr tiu biu mt cch chnh xc hn.
Th d: Bnh nhn b lot d dy - t trng c iu tr theo mt phc dit vi khun
Helicobacter. Sau iu tr, bnh nhn c theo di v ghi nhn thi gian k t khi s dng thuc
n lc bt u ci thin triu chng au. 10 bnh nhn thi gian ny (ngy ) l nh sau: 1, 2, 2,
2, 2, 2, 3, 3, 3, 30. Bnh nhn c thi gian t lc iu tr n lc gim triu chng l 30 ngy trn
thc cht l bnh nhn khng p ng vi iu tr. Trung v v trung bnh ca s liu l 2 v 5
ngy. Con s trung v phn nh chn thc hn bi v vi t cch l mt bc s lm sng t s liu
trn c th nhn xt rng mt bnh nhn tiu biu s gim au sau 2 ngy dng thuc. Con ss 30
trong th d trn c gi l s ngoi lai (outlier) v lm s liu b lch. Nhn chung, khi s liu b
lch th con s trung bnh s b nh hng rt nhiu v khng phn nh gi tr tiu biu nh con s
trung v.
Thng k m t tnh phn tn:
C 3 thng k m t tnh phn tn: lch chun, khong t phn v v phm vi ca s liu.
Vic la chn thng k m t tnh phn tn c trnh by trong bng 2.
Thng k m t tnh phn tn c tm quan trng th hai sau con s m t khuynh hng tp
trung.
Th d: Thuc h p A c s dng trn 5 bnh nhn v huyt p tm thu sau khi dng thuc l 110, 115,
120, 125 v 130. Thuc h p B c s dng trn 5 bnh nhn v c huyt p sau s dng thuc l 100,
110, 120, 130, 140. Nh vy hai thuc h p ny c hiu qu h p l tng ng (bi v trung bnh ca
hai s liu l bng nhau) nhng kt qu ca thuc B phn tn hn v iu ny lm thuc B tr nn km an
ton.
lch chun (standard deviation - vit tt l SD hay s) l con s nh gi mc phn tn v

4
c tnh theo cng thc:


Nh vy lch chun phn nh khong cch trung bnh ca s liu so vi gi tr tiu biu. Khi
nim lch chun ch c th p dng cho bin s nh lng bi v chng ta c th thc hin
cc php ton s hc trn cc i lng nhng khng th thc hin trn cc gi tr ca bin s
nh tnh l cc c tnh.
Th d: S liu v huyt p tm thu (mmHg) ca 5 i tng l 120, 125, 130, 135, 150. Trung bnh ca
huyt p l 132 v lch chun bng
5 , 11 5 , 132
4
530
4
324 9 4 49 144
1 5
) 132 150 ( ) 132 135 ( ) 130 132 ( ) 132 125 ( ) 132 120 (
1
) (
2 2 2 2 2
1
2
= = =
+ + + +
=

+ + + +
=

=

=
n
i
i
N
x x
s

Phng sai v mt t nguyn l bnh phng ca lch chun. Phng sai (variance) c th
c k hiu v Var hay s
2
v c tnh theo cng thc sau:

=
n
i
i
N
x x
s
1
2
2
1
) (

Phm vi ca s liu l tt c cc gi tr ca s liu t gi tr nh nht n gi tr ln nht.
Th d: S liu v huyt p tm thu (mmHg) ca 5 i tng l 120, 125, 130, 135, 150. Phm vi ca bin
s huyt p l 120 n 150.
Th d: Thuc h p A c s dng trn 5 bnh nhn v huyt p tm thu sau khi dng thuc l 110, 115,
120, 125 v 130. Thuc h p B c s dng trn 5 bnh nhn v c huyt p sau s dng thuc l 100,
110, 120, 130, 140. S liu ca thuc B c tnh phn tn cao hn do phm vi thay i t 100-140 trong khi
phm v ca s liu thuc A ch t 110-130.
Khong t phn v (inter-quartile): Nu chng ta chia s liu sp theo th t lm 2 phn u
nhau, khong t phn v l khong cch ca trung v phn trn v trung v phn di.
Th d: S liu v huyt p tm thu (mmHg) ca 5 i tng l 120, 125, 130, 135, 150. S liu ny c
chia lm 2 phn: phn 1 gm 120, 125, 130 v phn 2 gm 130, 135, v 150. Trung v ca phn trn l 125
- trung v ca phn di l 135, do phm t phn v l 125-135.
Do bn cht ca khong t phn v l trung v ca phn s liu trn v phn s liu di, cng
ging nh trung v, khong t phn v khng b nh hng bi cc gi tr ngoi lai nh trong
trng hp ca lch chun. Cng nh trung v, khong t phn v ch c th p dng cho bin
s nh lng hay th t.
Cu hi: Phn tch trn my tnh v bin s hemoglobin cho kt qu sau. Hy th c v l gii
kt qu:

Variable | Obs Mean Std. Dev. Min Max
-----------+-----------------------------------------------------
hemoglobin | 70 11.98429 1.416122 8.8 15.1

=
n
i
i
N
x x
s
1
2
1
) (

5

Phng php trnh by s liu
S liu c th c trnh by thnh bng hoc cc th.
Trnh by bng:
Phn phi tn sut ca bin s nh tnh
S liu ca bin s ri rc c th c trnh by di dng mt phn phi tn sut. Phn phi tn
sut l mt bng ch ra tn sut xut hin ca tng gi tr ri rc ca bin s (Bng 1). Nh vy
bng phn phi tn sut gm 2 ct, mt ct lit k cc gi tr ca bin s v mt ct trnh by tn
sut tng ng ca cc gi tr .
Table 1. Phn phi gii tnh ca 69 hc sinh lp cm thng trng mm non 23 thng 11, Huyn
Hc mn
Gii S tr Phn trm
Nam 45 65%
N 24 35%
Tng s 69 100%
Bng trn l bn phn phi tn sut ca gii tnh. Bi v gii tnh c 2 gi tr nam v n nn ta
lit k 2 gi tr ny mt ct. ct th nh ta ghi tn sut tng ng ca cc gi tr ny. i khi
bng phn phi tn sut c thm ct phn trm nh trong th d trn. Bng 2 l mt th d khc
v bng phn phi tn sut.
Table 2. Phng php ca 600 tr trong bnh vin
Phng php

S sinh Phn trm
Sinh thng 478 79,7
Sinh forceps 65 10,8
Sinh m 57 9,5
Tng s 600 100,0

Phn phi tn sut ca bin s nh lng
Nu bin s l bin s lin tc chng ta khng th lit k tt c cc gi tr ca bin s. Trong
trng hp ny chng ta c th nhm (lm trn) gi tr ca bin s li.
C th cc bc xy dng bng phn phi tn sut cho bin s nh lng nh sau:
1- Tm phm vi (gi tr cc tiu v gi tr cc i) ca s liu. Trong th d v hemoglobin ca
70 ph n phm vi l 8,8 n 15,1
2. Chia phm vi s liu ra lm n khong vi rng ca mi khong l d. Cn lu rng mi
khong d nn l i lng chn nh 1, 2, 5, 10 hay 0,5, 0,2 v s cc khong n nn t 5-12 (trung

6
bnh l 7-8). Trong th d trn ta c th chia phm vi ra lm 8khong vi chiu rng khong bng
1 n v. Khi cc khong l: 8-8,9; 9-9,9; 10-10,9; 11-11,9; 12-12,9; 13-13,9; 14-14,9; 15-
15,9.
3. m cc gi tr thch hp vo khong nh trc

Hemoglobin
(g/100ml)
m
8-8,9 1
9-9,9 111
10-10,9 1111 1111 1111
11-11,9 1111 1111 1111 1111
12-12,9 1111 1111 1111
13-13,9 1111 1111 111
14-14,9 1111
15-15,9 1
4. Xy dng bng phn phi tn sut vi bin s v cc khong gi tr ca bin s v tn sut
tng ng vi cc khong gi tr . Chng ta cng c th thm vo ct phn trm v ct phn
trm tch ly (nu thch hp)
Table 3. Hemoglobin ca 70 ph n
Hemoglobin Tn sut Phn trm Phn trm tch ly
8-8,9 1 1.43 1.43
9-9,9 3 4.29 5.71
10-10,9 14 20.00 25.71
11-11,9 19 27.14 52.86
12-12,9 14 20.00 72.86
13-13,9 13 18.57 91.43
14-14,9 5 7.14 98.57
15-15,9 1 1.43 100.00

Th d nh nu bin s l chu vi vng cnh tay ca tr chng ta c th lm trn chu vi vng cnh
tay n 1 cm. Khi ta c th xem thang o ca bin s l ri rc v trnh by bng phn phi
tn sut ca bin s (bng 2).
Table 4. Phn phi s o vng cnh tay ca 69 tr lp cm thng nh tr 23 thng 11, Hc mn.
Vng cnh tay Tn sut Phn trm Phn trm tch ly

7
13- <14 2 2.78 2.78
14- <15 31 43.06 45.83
15- <16 27 37.50 83.33
16- <17 9 12.50 95.83
17- <18 0 12.50 95.83
18- <19 2 2.78 98.61
19- <20 1 1.39 100.00

Biu v th
S liu cng c th c trnh by di dng th hoc biu . Mc d khng c ranh gii
tuyt i hon ton r rt, ni chung th (graph) c tnh cht ton hc nhiu hn, trong c
trc honh v trc tung cn biu (chart) l hnh nh mang tnh cht tng trng.
Nu bin s l bin ri rc, c th trnh by di dng biu hnh thanh (bar chart - hnh 1)
hoc biu hnh bnh (pie chart). Nu bin s l bin lin tc, th phn phi ca bin s c th
trnh by di dng t chc (histogram - hnh 2) hoc a gic tn sut.
Hnh thc ca bng
-C ta ngn gn v r rng
-t tn cho cc hng v ct
-Trnh by tng s ca hng v ct
-nh ngha cc k hiu v ch vit tt di bng
-Ghi ngun s liu di bng
Biu hnh thanh
Biu hnh thang l biu nhm m t s phn b ca bin s ri rc. Biu hnh thanh
gm c trc honh trn xc nh nhng gi tr ca bin s. ng vi tng gi tr ca bin s
ngi ta v cc thanh c chiu cao t l vi tn sut ca gi tr . Cn lu lun lun c khong
trng gia cc thanh.

8
45
24
0
10
20
30
40
50
Nam N

Hnh 1. Biu hnh thanh (bar chart) m t phn b gii tnh ca nhng hc sinh trong trng
mm non 23/11, Hc mn
Chng ta cng c th xy dng cc thanh theo chiu ngang nh trong v d sau
478
65
57
0 100 200 300 400 500
Sinh thng
Sinh forceps
Sinh mo

Hnh 2. Phng php sinh ca 600 tr sanh ti bnh vin X trong nm 1998
i vi bin s th t, iu cn lu l cc gi tr ca bin s phi c sp xp th t theo trc
honh.

9
T
a
n

s
u
a
t
edumat
mu ch ca p 1 ca p 2-3 a i ho
0
1000
2000


Hnh 3. Trnh hc vn ca cc b m trong nghin cu
4,3%
19,5%
0,8%
3,9%
0%
5%
10%
15%
20%
25%
Dung ZDV Khong dung ZDV
ng am ao
Mo lay thai

Hnh 4. T sut ly truyn t m sang con nhng ngi m b nhim HIV theo iu tr
ha d phng v phng php sinh (Ngun: The European Mode of Delivery
Collaboration, Lancet, 27/3/1999)
Biu hnh bnh
Biu hnh bnh cng c dng m t s phn b ca bin s ri rc. Biu hnh bnh l
mt vng trn c chia lm nhiu cung tng ng vi cc gi tr ca bin s. ln ca cung
t l vi tn sut ca gi tr bin s.

10
N
35%
Nam
65%

Hnh 5. Biu hnh bnh (pie chart) m t phn b gii tnh ca nhng hc sinh trong trng
mm non 23/11, Hc mn
Sinh
thng
Sinh mo
Sinh
forceps

Hnh 6. Biu hnh bnh th hin phng php sinh ca 600 a tr sinh ti bnh vin X
T chc v a gic tn sut
T chc (histogram) v a gic tn sut (polyline) c dng trong m t phn b ca bin s
lin tc. v t chc , ngi ta chia bin ca gi tr lm nhiu khong gi tr v tnh tn
sut ca nhng khong gi tr . Nhng khong gi tr ny c biu th trn trc honh. ng
vi mi khong gi tr ngi ta v nhng hnh ch nht c din tch t l vi tn sut ca khong
gi tr . Bi v cc khong gi tr ny nm st nhau trn trc honh, cc hnh ch nht ca t
chc cng thng nm st nhau.

11

F
r
e
q
u
e
n
c
y
hemoglobin
8 9 10 11 12 13 14 15 16
0
5
10
15
20

Hnh 7. T chc mc hemoglobin ca 70 ph n.



F
r
e
q
u
e
n
c
y
hemoglobin
8 9 10 11 12 13 14 15 16
0
5
10
15
20

Hnh 8. a gic tn sut ca hemoglobin ca 70 ph n.
v a gic tn sut, ngi ta thng v t chc v ni cc trung im ca cc cnh trn
ca cc hnh ch nht. a gic tn sut thng khng p nh cc t chc nhng n c u
im l c th v nhiu a gic tn sut trn cng mt th d so snh cc phn phi ca
chng.

12


hemoglobin
8 9 10 11 12 13 14 15 16
0
5
10
15

Hnh 9. a gic tn sut hemoglobin ca 28 ph n ngho (ng ) so v 42 ph n trung bnh
v kh (ng xanh)


13
i cng v phn tch s liu
Php c lng
Dn s v mu
Thng thng chng ta khng th nghin cu ton b dn s m chng ta quan tm. Chng ta
thng ch c th nghin cu ch mt phn dn s , phn ny c gi l mu (sample) v t
c on v nhng c tnh ca dn s.
Trong nghin cu khoa hc, chng ta i t c trng ca c th (bin s - variable) c c
c trng ca mu (c gi l thng k - statistics) v t c trng ca mu chng ta s dng
phng php suy lun thng k v l gii c c c trng ca dn s (c gi l tham s -
parameter)
Mt loi mu thng c gp trong nghin cu l mu ngu nhin n. Khi ly mu ngu
nhin n, chng ta c th tnh c gi tr trung bnh v lch chun ca mu. R rng l gi
tr trung bnh v lch chun s khc nhau vi nhng mu khc nhau. Tuy vy cc nh thng
k chng minh rng gi tr trung bnh ca mu s c phn phi bnh thng v cc gi tr
trung bnh ny s tp trung ti trung bnh ca dn s. Do nu chng ta tnh trung bnh ca mu
th chng ta hi vng trung bnh ca dn s s nm ngay ti hay ln cn trung bnh ca mu.
phn tn ca trung bnh mu xung quanh chung bnh dn s c gi l sai s chun (standard
error) v s gim i khi c mu cng ln:

n
s
n
s
e s
2
. . = =

lch chun v sai s chun l hai i lng th hin s phn tn nhng lch chun th hin
s phn tn ca c th chung quanh gi trnh trung bnh dn s cn sai s chun l i lng th
hin s phn tn ca con s thng k (trung bnh mu hay t l ca mu) chung quanh gi tr ca
tham s (trung bnh dn s hay t l ca dn s).
c lng khong tin cy ca trung bnh
Nh chng ta trnh by, trung bnh ca mu s dao ng nhng tp trung ti gi tr trung bnh
ca dn s, nn chng ta c th c lng trung bnh dn s bng cch tnh trung bnh ca mu.
Nhng do trung bnh mu c dao ng, chng ta khng chc l trung bnh mu s chnh xc bng
trung bnh ca dn s m ch c th tin l trung bnh dn s nm v tr u chung quanh
trung bnh ca dn s. Cc nh thng k cho rng 95% cc trng hp trung bnh dn s khng
nm xa qu 1,96 x SE so vi trung bnh mu: phm vi ny c gi l khong tin cy 95%. Nh
vy khong tin cy 95% ca trung bnh ca bin s nh lng
Khong tin cy 95% (95% CI) : x 1,96s/n
Trong trng hp c mu nh (n < 30), chng ta khng th s dng gi tr 1,96 nh trong cng
thc trn m cn phi s dng cc gi tr hi ln hn (v cng ln nu c mu cng nh), gi tr
ny c gi l gi tr ca phn phi t vi (c mu 1) t do.
Khong tin cy 95% (95% CI) : x t
(1-/2)
s/n
Bi tp:

14
1. Mt nghin cu ghi nhn trn c mu 1235 tr s sinh tnh ng Thp cho thy trng lng
trung bnh ca tr s sinh l 3121 gram v lch chun l 435 gram. Hy c lng khong tin
cy 95% ca trng lng trung bnh ca tr s sinh tnh ng Thp.
S dng cng thc trn ta tnh c:
95%CI=3096.74 - 3145.26 gram.
2. Chiu cao ca 10 thanh nin l 160; 162; 165; 166; 169; 170; 172; 172; 176; 176. Hy c
lng khong tin cy 95% ca chiu cao trung bnh.
Trc tin chng ta phi xc nh trung bnh ca chiu cao l 168,8 cm v lch chun ca
chiu cao l 5,493. Do c mu l 10 chng ta phi d bng phn phi t 9 t do ta c gi
tr t (tng ng vi khong tin cy 95%) l 2,26. T chng ta tnh c khong tin cy 95%
95%CI=164.87 - 164.87.
c lng khong tin cy ca t l
c lng khong tin cy ca mt t l, chng ta cn xc nh t l p sau da vo p c
lng khong tin cy 95% ca p

n
) - (1 p p
p 96 , 1 n
n
) - (1 p p
p + 96 , 1
Bi tp
iu tra trn 127 thanh nin c 45 thanh nin ht thuc l. Hy tnh t l thanh nin ht thuc l
v khong tin cy 95% ca t l ht thuc l.
Chng ta tnh c t l ht thuc l thanh nin l 0.354 (35.4%). Da vo cng thc trn
chng ta tnh c khong tin cy 95% ca t l ht thuc l l 0,271 n 0,438
Suy lun thng k
Kim nh ngha
Phng php kim nh ngha c Fisher xut v da trn cn bn ca php phn chng.
Php phn chng trong logic hc s dng bng mnh : Nu A ko theo B th khng B s ko
theo khng A.
A B B A
Mt th d ca php phn chng l khi chng ta gp mt bnh nhn nghi ng tc rut v chng ta
hi bnh s xem bnh nhn c b trung tin hay khng. Gi s bnh nhn khng b trung tin th
chng ta s bc c chn on tc rut vi suy lun sau: Nu bnh nhn b tc rut s b trung tin
th bnh nhn s b trung tin, do bnh nhn khng b trung tin nn bnh nhn khng b tc rut.
Mt cch tng quan hn, khi chng ta a ra gi thuyt chn on (th d nh chn on tc
rut), chng ta thng s xem xt cc h qu ph bin gi thuyt ny (Bnh nhn tc rut thng
b au bng,nn i, b trung tin v chng bng). Vic khng c mt trong cc hu qu ph
bin ca gi thuyt ny (th d nh bnh nhn khng c au bng, khng c nn i, khng b b
trung tin hay khng c chng bng) th chng ta c th bc b chn on. Cc bin c nm
ngoi cc h qu ph bin ca gi thuyt (bin c khng c au bng, khng c nn i, khng b
b trung tin hay khng c chng bng) c gi l min bc b ca chn on.
Trong kim nh thng k ngi ta cng s dng cc lp lun tng t. kim nh mt gi
thuyt thng k (c gi l gi thuyt Ho) cn phi xc nh min xy ra ph bin ca cc con

15
s thng k (nh trung bnh, t l, thng k t, thng k z, thng k chi bnh phng, v.v.) v nu
con s thng k ny nm ngoi min xy ra ph bin th chng ta s bc b gi thuyt Ho. Min
nm ngoi min xy ra ph bin ca s thng k c gi min bc b.

Hnh 1. Nguyn tc kim nh ngha theo Fisher. ng cong phn phi hnh chung th hin
phn phi ca thng k ca z khi =0 (gi thuyt Ho). Vng din tch di ng cong mu trng
th hin min cc thng k z thng xy ra nu gi thuyt Ho l ng. Vng din tch di ng
cong mu sm l min bc b gi thuyt Ho v c din tch l xc sut sai lm loi 1 (5%).
Khi s dng kim nh ngha chng ta cn lu cc im sau:
- Kim nh da trn nguyn tc phn chng ngha l chng ta ch c th bc b ch khng
th chng minh c gi thuyt Ho. V vy nu chng ta mun chng minh ht thuc l
l yu t nguy c ca ung th phi th phi t ra gi th.uyt thng k Ho l ht thuc l
khng phi l yu t nguy c ca ung th phi v s dng phng php kim nh bc
b iu ny.
- Gi thuyt Ho phi th hin bng ng thc (th d nh gi thuyt Ho: RR=1 hay Ho:
im trung bnh v bnh ly truyn qua ng tnh dc nam thanh nin = im trung
bnh v bnh ly truyn qua ng tnh dc n thanh nin ) th mi c th tnh c
phn phi ca thng k. Gi thuyt Ho khng th th hin bng bt ng thc (Ho: RR>1
l sai)
- Do din tch min bc b l mt con s c nh (thng l 0,05), xc nh con s
thng k T c nm trong min bc b hay khng ngi ta tnh xc sut xy ra thng k
cc oan hn gi tr T nu gi thuyt Ho l ng (c th hin bng cng thc: P (>T
|Ho) ). Xc sut ny c gi l gi tr p. V nu gi tr p nh hn ngng bc b ngha
l thng k T nm trong vng bc b v chng ta c th bc b gi thuyt Ho.
Gi tr p c k hiu khc nhau trn cc phn mm thng k. Th d phn mm Epi-Info, gi
tr p c k hiu l p-value, phn mm SPSS, gi tr p c k hiu l Sig. phn mm Stata,

16
cc gi tr p thng c k hiu khc nhau ty theo thng k c s dng l thng k g. C
th, trong phn mm Stata, gi tr p c k hiu nh sau:
P > |T| (nu kim nh t)
P > |z| (nu kim nh z)
Prob > chi2 (kim nh chi bnh phng)
Prob > F (Kim nh F; Kim nh ANOVA)


Kim nh gi thuyt
Khuyt im ca phng php kim nh ngha khi khng bc b c gi thuyt H
0
chng ta
khng bit c xc sut H
0
ng l bao nhiu. Mt nh thng k hc khc tn l Neyman
ra phng php kim nh gi thuyt trong c xt n sai lm loi 2.


Phat bien H
0
; H
a
Tnh so thong ke
(z; t; chi
2
; F)
Xac suat sai
lam loai 1
Nho
Bac bo gia thuyet
Xac suat sai
lam loai 2
Khong nho
Nho
Chap nhan gia
thuyet
Thc hien nghien
cu vi c mau
ln hn
Khong nho
tra bang tnh p

Sai lm loi mt v sai lm loi hai
Sai lm loi mt: bc b gi thuyt H
0
trong khi gi thuyt H
0
l ng.
Sai lm loi hai: Khng bc b gi thuyt H
0
trong khi gi thuyt H
0
sai.
Trong nghin cu thng k ngi ta khng bao gi c th chc chn. Do vy, khi nh nghin cu
i n kt lun bc b gi thuyt H
0
, ngi nghin cu c th b sai lm (sai lm loi mt - vi
mt xc sut no ). Khi nh nghin cu khng bc b gi thuyt H
0
, nh nghin cu cng c
th b sai lm (sai lm loi hai - cng vi mt xc sut no ). Mt iu nn nh l bng kim
nh thng k ngi ta c th xc nh c xc sut sai lm loi mt nhng khng th tnh c
xc sut sai lm loi hai m ch c th tnh c da vo i thuyt Ha v c mu ca nghin
cu.
i khi ngi ta cn s dng khi nim nng lc (power) ca kim nh thng k. Nng lc ca
kim nh thng k = 1 - xc sut sai lm loi 2. Khi nim nng lc ca thng k hay c dng
trong tnh c mu.

17
Bng 1. Tm tt v sai lm loi 1, sai lm loi 2 v gi tr ngng ca n
Chn l l Ho ng
(Khng c s khc bit)
Chn l l Ha ng
(Khng c s khc bit)
Bc b gi thuyt H
0
Sai lm loi 1
(Xc sut = )
Kt lun ng
(Xc sut = 1- =
Power ca nghin cu)
Khng bc b gi thuyt H
0
Kt lun ng
(Xc sut = 1-)
Sai lm loi II
(Xc sut = )
Chn la kim nh ph hp

Nh vy nguyn l ca kim nh ngha (hay kim nh gi thuyt l nh nhau). Cc kim nh
ch khc nhau vic la chn thng k xut pht t gi thuyt H
0.
Vic la chn ny ph thuc
vo bin s ca vn quan tm v thit k ca nghin cu.

18
Bng 10. Chn la kim nh ph hp theo thit k nghin cu
Loi thit k nghin cu


Thang o ca bin s
ph thuc
Hai nhm
iu tr
gm cc c
nhn khc
nhau
Ba (hay
nhi)
nhm iu
tr gm cc
c nhn
khc nhau
Trc v
sau mt
iu tr
(hoc 2
iu tr)
trn cng
cc i
tng
Nhiu iu
tr trn cng
cc i
tng
Lin h
gia hai
bin s
nh lng (mu rt t
mt dn s c phn phi
bnh thng v phng
sai hai nhm ng nht
t-test khng
bt cp
Phn tch
phng sai
t-test bt
cp
Phn tch
phng sai
o lng
lp li
Hi quy
tuyn tnh
v tng
quan
pearson
nh tnh - Danh nh
2
bng 2 x
n

2
bng 3 x
n
test
McNemar
Cochrance
Q
H s ca
bng n x m
(phi, OR,
RR)
nh tnh -Th t
(hay bin nh lng
khng bnh thng)
Kim nh
tng sp
hng
Mann-
Whitney
Kruskal-
Wallis
Kim nh
sp hng c
du
Wilcoxon
Friedman h s tng
quan
Spearman

Bng 11. Chn la kim nh ph hp tm s lin h gia bin c lp v bin ph
thuc
Bin c lp Bin ph thuc
Nh gi Danh nh (hoc th
t)
nh lng, a
bin (hoc th t)
nh lng phn phi bnh
thng
T-test ANOVA Hi quy tuyn tnh
Bin nh lng phn phi
khng bnh thng Bin th t
Mann-Whitney Kruskal-Wallis TQ Spearman
Nh gi Chi bnh phng Chi bnh phng Hi quy logistic
Sng cn Wilcoxon tng qut
Logrank
Wilcoxon tng qut
Logrank
Hi quy Cox


19
Php kim t bt cp
Tin lng ca bnh nhn suy h hp mn tnh tng carbonic thng km (t l t vong trong 3
nm thay i t 30% n 100%) v hin ti cha c phng php iu tr hu hiu. Tilapur v
Mir (Am J Med 1984; 77:987) cho rng ch n gim carbonhydrate c th ci thin tnh trng
h hp. Cc nh nghin cu ny tin hnh thc nghim trn 8 ngi suy h hp mn tnh (c du
hiu ca tim ln, gan ln, ph v tng p phi) vi ch iu tr bng ch n 600 Kcal v
ghi nhn PaO2 (phn p oxy ng mch) v PaCO2 (phn p carbon dioxide ng mch) trc
v sau iu tr. Kt qu nghin cu c trnh by trong Bng 1. Hy so snh trung bnh ca
phn p oxy ng mch trc v sau khi iu tr.

20
Bng 1. Phn p Oxy ng mch v phn p CO2 ng mch trn 8 i tng trc v
sau ch iu tr vi ch n gim carbonhydrate

i tng Pa0
2
trc Pa0
2
sau Hiu s PaC0
2
trc PaC0
2
sau Hiu s
1 70 82 12 49 45 -4
2 59 66 7 68 54 -14
3 53 65 12 65 60 -5
4 54 62 8 57 60 3
5 44 74 30 76 59 -17
6 58 77 19 62 54 -8
7 64 68 4 49 47 -2
8 43 59 16 53 50 -3

Thc hnh:
Bc 1: Xy dng gi thuyt Ho:
Ho: Phn p oxy ng mch trc v sau iu tr khng thay i
Bc 2: Chn kim nh ph hp
Kim nh ph hp l kim nh t bt cp vi 7 t do
Bc 3: Tnh thng k t
Tnh trung bnh v lch chun ca bin s d (hiu s ca phn p oxy ng mch trc v sau
iu tr) tnh thng k t
66 , 4
/
; 2 , 8 ; 5 , 13 = = = =
n s
d
t s d
d

Bc 4: tnh xc sut ca gi tr thng k t
tnh xc sut ca gi tr thng k t ta s dng hm tdist(gi tr t, t do, 2). C th tnh p
tng ng vi gi tr t = 4.63 7 t do chng ta nh cng thc "=tdist(4.63, 7, 2) vo mt .
Kt qu ta c gi tr p= 0.002397687.
Bc 5: Kt lun
V gi tr p= 0.002397687 nh hn 0.05 nn chng ta bc b gi thuyt Ho ngha l phn p oxy
ng mch c ci thin sau khi iu tr.
Php kim t (khng bt cp)
Nhm tm hiu vai tr ca catecholamine trong tng huyt p v cn, de Champlain (Circ Res
1976; 38:109) nghin cu 22 bnh nhn tng huyt p v cn (gm 13 ngi c nng
catecholamine cao v 9 bnh thng), ghi nhn nhp tim, huyt p tm thu, huyt p tm trng.
Kt qu ca nghin cu c trnh by trong bng 2. Hy so snh nhp tim hai nhm, nhm c
tng catecholamine v nhm khng tng catecholamine.

21
Bng 1. Trung bnh v lch chun ca Lung catecholamine huyt thanh, nhp tim, huyt p
tm thu v huyt p tm trung 13 bnh nhn tng huyt p tng catecholamine v 9 bnh nhn
tng huyt p khng tng catecholamine
Tng catecholamine Khng tng
S bnh nhn 13 9
catecholamine huyt thanh (ug/mL) x=0.484 s=0.133 x=0.206 s=0.060
Nhp tim x=90.7 s=11.5 x=77.8 s=13.2
Huyt p tm thu x=171.3 s=13.7 x=147.4 s=9.9
Huyt p tm trng x=103.0 s=8.3 x=95.6 s=12.9

Thc hnh:
Bc 1: Xy dng gi thuyt Ho:
Ho: Trung bnh nhp tim nhm bnh nhn c tng catecholamine = nhp tim trung bnh nhm
bnh nhn khng tng catecholamine
Bc 2: Chn kim nh ph hp
Kim nh ph hp l kim nh t vi (n
1
+n
2
-2) = 20 t do
Bc 3: Tnh thng k t
Trc tin chng ta phi tnh lch chun gp
21 . 12
) 1 ( ) 1 (
) 1 ( ) 1 (
2 1
2
2 2
2
1 1
=
+
+
=
n n
s n s n
s
p

( d nh cng thc tnh lch chun gp chng ta cn lu phng sai gp l trung bnh
ca phng sai ca mi nhm vi trng s l t do ca phng sai )
Sau chng ta tnh thng k t
44 . 2
/ 1 / 1
) (
2 1
2 1
=
+

=
n n s
x x
t
Bc 4: tnh xc sut ca gi tr thng k t
S dng my vi tnh chng ta tnh c gi tr p= 0,024123071 (nu s dng bng s thng k
chng ta s tm c p <0,05)
Bc 5: Kt lun
V gi tr p= 0,024123071 nh hn 0,05 nn chng ta bc b gi thuyt Ho ngha l gia hai
nhm bnh nhn c s kh bit v nhp tim trung bnh.
Phn tch phng sai
Anionwo et al. (1981, BMJ; 282:283) mun tm hiu xem mc hemoglobin trong 3 nhm bnh
hng cu lim c khc nhau hay khng bng cch ghi nhn mc hemoglobin 3 nhm bnh
nhn.

22
Bng 7. Phn tch phng sai mt chiu: s khc bit trong nng hemoglobin gia cc bnh
nhn b cc loi bnh hng cu lim khc nhau. S liu t Anionwo et al. (1981) British Medical
Journal, 282, 283-6
(a) S liu
Loi bnh hng cu
lim
S bnh
nhn
(n
i
)
Trung bnh

(x
i
)
s.d.

(s
i
)
Gi tr ca cc c th
hemoglobin g%
(x)
Hb SS 16 8,712 0,844 7,2; 7,7; 8,0; 8,1; 8,3; 8,4;
8,4; 8,5; 8,6; 8,7; 9,1; 9,1;
9,1; 9,8; 10,1; 10,3
Hb S/b-
thalassaemia
10 10,630 1,284 8,1; 9,2; 10,0; 10,4; 10,6;
10,9; 11,1; 11,9; 12,0; 12,1
Hb SC 15 13,300 0,942 10,7; 11,3; 11,5; 11,6; 11,7;
11,8; 12,0; 12,1; 12,3; 12,6;
12,6; 13,3; 13,8; 13,8; 13,9
Hy s dng kim nh thng k ph hp so snh nng Hemoglobin trung bnh 3 nhm
bnh nhn b hng cu lim.

Thc hnh:
Bc 1: Xy dng gi thuyt Ho:
Ho: Trung bnh Nng hemoglobin 3 nhm bnh HC lim bng nhau
Bc 2: Chn kim nh ph hp
Kim nh ph hp l phng php phn tch phng sai (ANOVA) vi thng k F vi (s
nhm, s quan st - s nhm) = (2,38) t do ; F ti hn= 3,32
Bc 3: Lp bng ANOVA v Tnh thng k F
Chng ta lp thnh bng phn tch phng sai nh sau:
Ngun bin thin SS d.f. MS=SS/d.f. MS gia cc nhm
F= ----------------------------
MS bn trong nhm
Gia cc nhm 99,92 2 49,96 50.03 , P<0,001
Trong cc nhm 37,95 38 1,00
Tng cng 137,85 40

Cc gi tr trn c th tnh theo cng thc sau:
Gia cc nhm
SS
b
= n
i
(x
i
-x)
2
= n
i
x
i
2
-(x)
2
/N
= 16 8,7125
2
+10 10,6300
2
+15 12,300
2

- 430,2
2
/41=99,92

23
df
b
= k-1 = 2
MS
b
= SS/d.f.
Trong cc nhm
SS
w
= (n
i
-1)s
i
2
=15 x 0,84452 + 9 x 1,28412 + 14 x 0,9419 = 37,96
df
w
= N - k = 41-3 = 38
MS
w
= SS/d.f.
V gi tr thng k F
F = MS
b
/MS
w

Bc 4: tnh xc sut ca gi tr thng k F
Da vo my tnh chng ta tnh c gi tr p= 2.26 x 10
-11
. Chng ta cng c th da vo bng
thng k F tm c p <0,001
Bc 5: Kt lun
V gi tr rt nh nn chng ta bc b gi thuyt Ho ngha l ba nhm bnh nhn bnh hng cu
lim c gi tr hemoglobin trung bnh khc nhau c ngha thng k.
Php kim chi bnh phng
C 240 ngi c tim vaccine phng bnh cm v 220 ngi c tim placebo. Trong nhm
tim vaccine c 20 ngi b cm v trong nhm tim placebo c 80 ngi b cm. Hy so snh t
l mc cm gia 2 nhm: nhm tim vaccine v nhm tim placebo? Hy cho bit mc lin
h gia vaccine cm v bnh cm?
Thc hnh
Bc 1: Xy dng gi thuyt Ho:
Ho: T l mc cm nhm tim vaccine = t l mc cm nhm khng tim vaccine
Bc 2: Chn kim nh ph hp
Kim nh ph hp l kim nh chi bnh phng vi 1 t do
Bc 3: Lp bng 2 x 2 v Tnh thng k chi bnh phng
Lp bng 2 x 2 nh sau
Kt qu Mc bnh cm Khng mc Tng
C 20 a
(8,3%)
220 b

240 a+b

Placebo 80 c
(36,4%)
140 d 220 c+d
Tim chng
Tng 100 a+c 360 b+d 460 N

tnh thng k chi bnh phng c hai cch:
Phng php chnh thc:
- Tnh cc gi tr k vng (E) cc , gi tr k vng ca mt bng tch cc bin chia
cho tng s chung (th d gi tr k vng ca a E
a
= (a+b) (a+c) /N, gi tr k vng ca c E
c

= (a+b) (c+d) /N)

24
- Tnh gi tr chi bnh phng theo cng thc
1) - cot (so 1) - hang so =

= ( . . ,
) (
2
2
f d
E
E O

Trong th d ny
09 , 53 02 , 6 52 , 5 69 , 21 86 , 19
2 , 172
) 2 , 172 140 (
8 , 187
) 8 , 187 220 (
8 , 47
) 8 , 47 80 (
2 , 52
) 2 , 52 20 (
2 2 2 2
2
= + + + =

=

Cng thc tnh tt cho bng 2 2
) )( )( )( (
) (
2
2
d b d c c a b a
N bc ad
+ + + +

=
Bc 4: tnh xc sut ca gi tr thng k
2

S dng my vi tnh chng ta c gi tr p= 3,31 x 10
-13
ngha l gi tr ca p rt nh. S dng
bng s chng ta bit c p < 0,001.
Bc 5: Kt lun
V gi tr rt nh nn chng ta bc b gi thuyt Ho. Chng ta c th kt lun t l mc cm
nhm tim vaccine thp hn c ngha thng k so vi nhm tim placebo.
S tng quan ca hai bin s nh tnh
Mc lin h gia tim chng vaccine v mc bnh cm
Mc lin h gia hai bin s nh tnh c c lng bng cch s dng RR (hoc OR nu
trong nghin cu bnh chng). Gi s s liu ca bng 2 x2 nm vng C2:D3 chng ta c th
tnh RR bng cch nhp cng thc "=MHRR(C2:D3)" ta c RR=0,23 vi khong tin cy 95%
ca RR t 0,15 n 0,36
So snh t l ca bin s nh gi : Kim nh chi-bnh phng
Khi hai bin s l bin s nh gi ngi ta s dng gi tr RR hay OR o lng mc lin h
(xem li phn cc s o dch t).
Kt qu Mc bnh Khng mc
bnh
Tng
Phi nhim
a
1
b
1
N
1

Khng phi
nhim
a
o
b
0
N
0

Bin s phi
nhim
Tng
a
1+
a
0
b
1+
b
0
N=N
1
+N
0


T s nguy c (RR) l t s ca nguy c ca nhm phi nhim trn nguy c ca nhm khng phi
nhim:
RR = (a
1
/N
1
)/(a
0
/N
0
)
Khong tin cy 95% ca t s nguy c:

25
0 0 1 1
1 1 1 1
96 , 1
N a N a
e RR
+
hay
2
96 , 1
1

RR (test-based CI)

T s s chnh (OR) l t s ca s chnh mc bnh ca nhm phi nhim trn s chnh mc
bnh nhm khng phi nhim. Trong trng hp nghin cu bnh chng t s s chnh l t s
ca s chnh phi nhim ca nhm bnh trn s chnh phi nhim nhm khng chng.
RR = (a
1
/b
1
)/(a
0
/b
0
)
Khong tin cy 95% ca t s s chnh:
0 0 1 1
1 1 1 1
96 , 1
b a b a
e OR
+ + +

Bi tp
Mt nghin cu bnh chng nhm tm mi lin h gia s n tht v vim rut hoi t tm
c 61 trng hp vim rut hoi t v 57 trng hp chng. Trong nhm b vim rut hoi t
c 50 trng hp c tin cn n tht (gn y) v trong nhm chng c 16 trng hp c tin
cn n tht. Hy tm c lng s o lin h gia n tht v vim rut hoi t.
Table 5. S lin h gia n tht trong thi gian gn u v vim rut hoi t Papua New Guinea
(OR=11,6)
n tht trong thi gian gn y Khng n tht trong thi gian
gn y
Tng s
Nhm bnh 50 a
1
11 b
1
61
Nhm chng 16 a
0
41 b
0
57
Tng s 66 52 118

Nu t l n tht nhm bnh (50/61) cao hn t l n tht trong nhm chng (16/57) c ngha
thng k th chng ta c th kt lun rng c s lin quan gia n tht v vim rut hoi t. y
l bi ton so snh t l ca mt bin s nh tnh hai nhm v c gii quyt bng kim nh
chi bnh phng.
Tuy nhin bng vic kim nh gi thuyt chng ta ch xc nh c mi lin h m khng bit
ln ca s lin h. Bi v y l nghin cu bnh chng chng ta khng tnh c RR m phi
s dng OR o lng sc mnh lin h. S dng cng thc tnh OR v khong tin cy ca
OR ta c:
OR = (a
1
/b
1
)/(a
0
/b
0
) = (a
1
b
0
)/(a
0
b
1
) = 11.65 v
khong tin cy 95% ca OR = 4.87 n 27.85
Bi tp
C 240 ngi c tim vaccine phng bnh cm v 220 ngi c tim placebo. Trong nhm
tim vaccine c 20 ngi b cm v trong nhm tim placebo c 80 ngi b cm. Hy so snh t
l mc cm gia 2 nhm: nhm tim vaccine v nhm tim placebo? Hy cho bit mc lin
h gia vaccine cm v bnh cm?

26
Kt qu Mc bnh cm Khng mc Tng
C 20 a
1

(8,3%)
220 b
1


240 N
1


Placebo 80 a
0

(36,4%)
140 d
220 N
0

Tim chng
Tng 100 360 460 N
Ta tnh c RR = (a
1
/N
1
)/(a
0
/N
0
) = (20/240)/(80/220) = 0.23
Khong tin cy 95% ca t s nguy c:
0 0 1 1
1 1 1 1
96 , 1
N a N a
e RR
+
= 0.15 n 0.36
Quan h gia hai bin s nh lng
Tng quan
Tng quan l s o mc hai bin s nh lng cng thay i vi nhau. C nhiu loi h s
tng quan, nhng chng u c gi tr t -1 n 1. Nu chng c gi tr bng zero c ngha l
hai bin s c lp v khng quan h g vi nhau. Nu chng c gi tr dng c ngha l hai
bin s ng bin vi nhau, nu chng c gi tr m ngha l hai bin s nghch bin. Gi tr
tuyt i ca h s tng quan cng gn mt ngha l hai bin s c lin h cht vi nhau v vai
tr ca sai s ngu nhin s t hn. Khi tr tuyt i ca h s tng quan bng mt c ngha l
hon ton khng c sai s ngu nhin.
Loi h s tng quan c s dng ph bin nht l h s tng quan Pearson r:

1
/ ) (
) ( ) (
) )( (
2 2


=


=

n
n y x n xy
y y x x
y y x x
r
y x
i i
i i


L gii ngha ca h s tng quan Pearson
- H s tng quan lun lun nm trong on [-1,1]
- H s tng quan r dng chng t hai bin s l ng bin; h s tng quan r m chng t
hai bin s l nghch bin.
- Tr s tuyt i ca h s tng quan r ni ln mc lin quan gia hai bin s. Nu tr tuyt
i ca r bng 1 (r=1 hay r=-1), quan h hon ton tuyn tnh ngha l tt c cc im nm trn
ng hi quy (Hnh 9.2 d v 9.2f). Nu tr tuyt i ca r nh hn 1 s c cc im s liu phn
tn chung quanh ng hi quy (hnh 9.2 c v 9.2e).
- Bnh phng ca h s tng quan (r
2
) th hin t l bin thin ca bin s ph thuc c gii
thch bng s bin thin ca bin s c lp (nu mi lin h ny l nhn qu)
- Nu r=0, khng c mi lin h tuyn tnh gia hai bin s. iu ny c ngha l (1) khng c
mi lin h g gia hai bin s hoc (hnh 9.2a) (2) mi lin h gia hai bin s khng phi l
tuyn tnh (hnh 9.2b)
- Theo quy c, quan h vi r t 0,1 n 0,3 l quan h yu, t 0,3 n 0,5 quan h trung bnh v
trn 0,5 l quan h mnh.

27
Hi quy
Hi quy l mt m hnh ton hc m t s bin i ca mt bin s ny theo nhng bin s khc.
Mt phng trnh hi quy c th c dng nh sau:
cn nng (kg) = 6,85 + 0,18 thng tui
(phng trnh hi quy tnh cn nng ca tr t 9 n 40 thng tui theo thng tui)
theo phng trnh ny ngi ta gi:
cn nng: bin s ph thuc
thng tui: bin s c lp
6,85: h s ca hng s, hay cn gi l im chn (intercept)
0,18: h s ca bin s thng tui.
Mt cch tng qut phng trnh hi quy s c dng:
Y = b
0
+ b
1
x
1
+ b
2
x
2
+ b
3
x
3
Vi y l bin s ph thuc
x
1
, x
2
, x
3
l cc bin s c lp
b
0
: im chn ca phng trnh
b
1
, b
2
, b
3
: h s ca cc bin s c lp
H s ca bin s c lp ni ln nu bin s c lp tng mt n v th bin s ph thuc y s
thay i bao nhiu. C th hn nu bin s x
2
thay i mt n v th bin s y s tng gi tr l
b
2
(bin s y s gim nu gi tr b
2
m).

Bi tp
1. Mt nh nghin cu ghi nhn lng mui n v huyt p tm thu ca 5 i tng trong bng
4.
i tng Lng mui Huyt p
1 5 110
2 10 120
3 12 110
4 18 120
5 20 140
Hy tm mi lin h gia huyt p tm thu v lng mui s dng.
Thc hnh
tm s lin h gia hai bin s nh lng chng ta s dng h s tng quan. Da vo cng
thc ta tnh c
r = 0,771829.
Nh vy c mi lin quan thun gia lng mui n v huyt p tm thu. Mi lin quan ny l
mnh v lng mui n gii thch cho n 60% (0.77 0.77) s thay i ca huyt p tm thu.
Chng ta cng tm c phng trnh ca huyt p theo lng mui tiu th s l:
Huyt p tm thu = 99,8 mmHg + 1,55 x Lng mui.

28
Gi tr 99,8 c gi l im chn ca phng trnh hi quy v 1,55 l h s gc ca bin s
lng mui tiu th. iu ny c ngha l nu lng mui n tng thm 1 gram/ngy th huyt p
tm thu s tng trung bnh 1,55 mmHg.
2. L gii ngha ca phn tn sau
Figure 8. Trng lng s sinh theo tui thai (tun) ca 641 tr sinh do th thai trong ng nghim
Anh quc
t
r
o
n
g

l
u
o
n
g

t
r
e
tuoi thai
20 24 28 32 36 40 44
0
1000
2000
3000
4000
5000


29
Ci t chng trnh Stata 10.0, s liu mu
v cc chng trnh c lin quan
1. Ci t chng trnh Stata v s liu mu
C nhiu cch ci t chng trnh Stata 10. Di y s trnh by cch ci t chng trnh
Stata/SE 10.0 t tp tin "Setup Stata 10 and Data.exe" (c th ti xung t website ca khoa Y t
cng cng i Hc Y dc TP H Ch Minh hay chp t a CD ca b mn Thng k Y hc)
- Tm tp tin "Setup Stata 10 and Data.exe" ( mt s my khi khng cho php hin phn m
rng ca tn tp tin, tn tp tin ch hin ra l Setup_Stata10)
- Nhp p vo tp tin ny (hay nhp chut chn tp tin ny v sau nhn phm Enter). Tp
tin ny s thc hin vic khi ng ci t trong vng vi giy.

V tip theo, ca s cho mng (Welcome) s hin ra

Nhp vo nt lnh Next sang ca s tip theo chn th mc ca ni ci t (Choose
destination location).

30

Nu bc ny nu chng ta quyt nh khng ci t na hy nhp vo nt lnh Cancel
thot khi chng trnh ci t. Nu chng ta mun tip tc ci t th cn phi quyt nh th
mc ca ni ci t (Destination Directory). Theo mc nh th mc ca ni ci t s l
C:\Program Files v nu khng c nh g t bit, chng ta cng nn ci t th mc ny
bng cch nguyn tn th mc nm trong hp vn bn Destination Directory ri nhp Next.
Nu mun ci t vo th mc khc, nhp vo nt lnh Browse ri sau chn th mc ph hp
trc khi nhp vo nt lnh Next. Gi s chng ta tip tc ci t v chn th mc ni ci t
mc nh (C:\Program Files) th ca s ci t (Setup) s hin ra v cho tip tin ca vic
thc hin ci t.

Sau qu trnh ci t ca s hon tt (Finised) s hin ra.

31

Chng ta hy nhp vo nt lnh Close ca ca s hon tt kt thc qu trnh ci t. Sau qu
trnh ci t, chng trnh ci t s to ra mc chng trnh Stata 10 trong nhm chng trnh
MediStat. iu ny c ngha sau khi ci t khi ng chng trnh Stata trong Windows,
chng ta nhp chut vo nt lnh Start ca h iu hnh Windows v sau ch vo Alls
Program v sau di chuyn (navigate) n nhm chng trnh MediStat ri nhp vo mc
chng trnh Stata 10. Vic thc hin ton b qu trnh khi ng chng trnh Stata c th
hin tm tt nh sau Start :: Alls Program :: MediStat :: Stata 10 (Ch nt lnh u tin v
mc cui cng l im cn phi nhp chut, cc du :: th hin s di chuyn (navigate) ca con
tr chut. Vic ci t Stata 10 cng ng thi ci t cc tp tin s liu mu vo th mc Data
nm trong th mc Stata 10 ca th mc ni ci t.
2. Ci t tp tin s liu mu
c th thc tp cc bi tp c trong ti liu ny, cc bn nn c cc tp tin s liu mu. Khi
bn ci t chng trnh Stata 10 theo cch k trn th cc tp tin s liu mu c a vo
th mc C:\Data v cc bn khng cn phi thao tc g thm c tp tin s liu mu nhm thc
tp. Do vic ci t tp tin s liu mu ch nn thc hin khi cc s liu mu v mt l do g
b xa i hoc h hng. Ci t tp tin s liu mu cn i hi tp tin Data_Stata10.exe (c th
ti xung t website ca khoa Y t cng cng i Hc Y dc TP H Ch Minh hay chp t a
CD ca b mn Thng k Y hc)
- Tm tp tin "Setup Data.exe" ( mt s my khi khng cho php hin phn m rng ca tn tp
tin, tn tp tin ch hin ra l Data_Stata10)
- Nhp p vo tp tin "Setup Data.exe" (hay nhp chut chn tp tin ny v sau nhn
phm Enter). Tp tin ny s thc hin vic khi ng ci t trong vng vi giy.

32

Tip theo l ca s Choose Destination Location s hin ra.

Nu bc ny nu chng ta quyt nh khng ci t na hy nhp vo nt lnh Cancel
thot khi chng trnh ci t. Nu chng ta mun tip tc ci t th cn phi quyt nh th
mc ca ni ci t (Destination Directory). Theo mc nh th mc ca ni ci t s l
C:\Program Files v nu khng c nh g t bit, chng ta cng nn ci t th mc ny
bng cch nguyn tn th mc nm trong hp vn bn Destination Directory ri nhp Next.
Khi sau khi ci t tp tin s liu mu s nm th mc C:\Data.
Nu mun ci t vo th mc khc, nhp vo nt lnh Browse ri sau chn th mc ph hp
trc khi nhp vo nt lnh Next. Gi s chng ta tip tc ci t v chn th mc ni ci t
mc nh (C:\Program Files) th ca s ci t (Setup) ca chng trnh Data Stata 10 s hin ra
v cho tip tin ca vic thc hin ci t.

Sau khi thc hin xong vic ci t chng trnh s t ng ng li.

33
3. Ci t chng trnh chuyn i s liu
i khi chng ta c s liu c nhp bng chng trnh Epi-Info 6.04, Epi-Info for Windows,
Access hay Excel nhng chng ta li mun phn tch s liu bng Stata chng ta cn phi s
dng chng trnh chuyn i s liu nh DBMSCopy for Win hay StatTransfer. Sau y l
hng dn ci t chng trnh StatTransfer 7.0 s dng tp tin StatTransfer7Setup.exe (c th
ti xung t website ca khoa Y t cng cng i Hc Y dc TP H Ch Minh hay chp t a
CD ca b mn Thng k Y hc)
- Tm tp tin "StatTransfer7Setup.exe" ( mt s my khi khng cho php hin phn m rng
ca tn tp tin, tn tp tin ch hin ra l StatTransfer7Setup)
- Nhp p vo tp tin ny (hay nhp chut chn tp tin ny v sau nhn phm Enter). Tp
tin ny s thc hin vic khi ng ci t trong vng vi giy.

Tip theo cc ca s Welcome, Choose Destination Location

Nu bc ny nu chng ta quyt nh khng ci t na hy nhp vo nt lnh Cancel
thot khi chng trnh ci t. Nu chng ta mun tip tc ci t th cn phi quyt nh th
mc ca ni ci t (Destination Directory). Theo mc nh th mc ca ni ci t s l
C:\Program Files\StatTransfer7 v nu khng c nh g t bit, chng ta cng nn ci t
th mc ny bng cch nguyn tn th mc nm trong hp vn bn Destination Directory ri
nhp Next.

34
Nu mun ci t vo th mc khc, nhp vo nt lnh Browse ri sau chn th mc ph hp
trc khi nhp vo nt lnh Next. Gi s chng ta tip tc ci t v chn th mc ni ci t
mc nh (C:\Program Files\StatTransfer7) th ca s Ready to Install s hin ra.

Nhp vo nt lnh Next tip tc, ca s ci t (Setup) s hin ra v cho tip tin ca vic
thc hin ci t.

Sau khi ci t chng trnh StatTransfer, ca s Finished s hin ra.

35

Chng ta hy nhp vo nt lnh Close ca ca s hon tt kt thc qu trnh ci t. Sau qu
trnh ci t, chng trnh ci t s to ra mc chng trnh st32w trong nhm chng trnh
MediStat. iu ny c ngha sau khi ci t khi ng chng trnh StatTransfer7 trong
Windows, chng ta nhp chut vo nt lnh Start ca h iu hnh Windows v sau ch vo
Alls Program v sau di chuyn (navigate) n nhm chng trnh MediStat ri nhp vo
mc chng trnh st32w. Vic thc hin ton b qu trnh khi ng chng trnh Stata c th
hin tm tt bng hng dn sau Start :: Alls Program :: MediStat :: st32w (Ch nt lnh u
tin v mc cui cng l im cn phi nhp chut, cc du :: th hin s di chuyn (navigate)
ca con tr chut.








36
Khi ng v kt thc Stata
1. Khi ng Stata
khi ng Stata trong Windows XP hy thc hin
Nhp chut vo Start
Nhp chut vo All Programs
Di chuyn chut th mc MediStat v
Nhp chut vo mc Stata 10

Hoc nu c biu tng ca Stata trn desktop ca my tnh c th khi ng Stata bng cch
nhp p chut vo biu tng ca Stata 10 (Stata icon)
Ngi dng s nhn thy mn hnh nh sau khi khi ng Stata 10.0

37

Nu mn hnh Stata khng khi ng c, nguyn nhn thng thng nht l ngi s dng
cha ng k v m kho s dng Stata. Trong trng hp ny ngi s dng cn lin h vi
cng ty Stata c c s hiu (serial number) m chng trnh (code) v cha kho ch quyn
(Authorization key). Cng c th xy ra trng hp ngi s dng m kho ri nhng do v
xo file Stata.lic. Trong trng hp ny c th chp li tp tin Stata.lic ca ngi c kha
hp l.
2. Kim tra tnh hp l ca Stata
Trong ln khi ng Stata u tin, bn c th mun kim tra rng bn ci t ng. Hy
g lnh verinst v bn s thy kt xut tng t nh sau:
. verinst
Stata/SE 10.0 for Windows
Born 25 Jul 2007
Copyright (C) 1985-2007

Total physical memory: 1038712 KB
Available physical memory: 191512 KB

Unlimited-user Stata for Windows (network) perpetual license:
Serial number: 56437637415
Licensed to: Khoa Y te Cong cong
Dai hoc Y Duoc
Lnh verinst l mt lnh cn nh. Gi s nu chng ta thay i cu hnh ca my tnh v khng

38
bit mnh lm tn thng cho Stata hay khng, chng ta c th g verinst c trn an
rng Stata vn cn c ci t ng.
3. Thot khi Stata
thot khi Stata/SE 10.0 for Windows chng ta c th thc hin mt trong 2 vic sau:
- Nhp vo ng nm pha trn phi ca ca s Stata
Lu : Trong trng hp c d liu trong b nh v d liu c thay i nhng
cha c lu vo a th khi chng nhp vo ng, my tnh s hi chng ta rng
chng ta c mun thot m khng lu li s liu hay khng. Nu chng ta ng th
Stata s thot, nu khng th chng ta li tr li Stata chng ta c th lu li s liu.
- G lnh exit trong ca s Stata Command.
Lu : Trong trng hp c d liu trong b nh v d liu c thay i nhng
cha c lu vo a th khi chng g exit, my tnh s khng ng cho chng ta thot
v s thng bo no; data in memory would be lost. Trong trng hp ny nu chng
mun thot m khng lu li s liu th chng ta hy g exit, clear. Nu chng ta mun
lu li s liu hy s dng lnh save.
4. Cc loi hnh ca Stata
C mt s loi hnh ca Stata chy trn cc h iu hnh khc nhau: Stata cho Windows
98/95/NT, Stata cho Windows 3.1, Stata cho Power Macintosh, Stata cho 680x0 Macintosh,
Stata cho Linux, Stata cho RS/6000, v.v. Tuy nhin bt k bn dng loi hnh Stata no, Stata
vn l Stata v bn c th s dng cng mt cu lnh v Stata s cho ra cng mt kt qu,
chnh xc n s l tn cng.
Ngay c cc tp tin cng c th chia x. Th d tp tin s liu, tp tin chng trnh, tp tin
ho ca Stata cho Macintosh c th dng trn cc my tnh khc m khng cn phi chuyn
i.
5. Stata nh, Intercooled Stata v Stata bn c bit (Stata SE)
Stata cho Windows v Stata cho Macintosh c hai kiu: Stata nh v Intercooled Stata (trn
h iu hnh Unix ch c Intercooled Stata). C hai kiu Stata ny u c nhng nt chung
nhng Intercooled Stata c th lm vic vi tp tin d liu ln hn v nhanh hn. Tu theo
loi my Intercooled Stata c th nhanh hn Stata nh t 50 n 600%.
Sau y l s khc bit gia v gii hn kch thc gia Intercooled Stata v Stata nh
Stata nh Intercooled Stata
S quan st 1.000 Tu thuc vo b nh
S cc bin s 99 2.047
Chiu rng s liu 200 8.192
Kch thc ma trn ti a 40 800
S k t trong mt macro 1.000 18.632
S k t trong mt dng lnh 1.100 18.648

39

Ti sao Intercooled Stata chy nhanh hn Stata nh? iu ny l do s khc bit trong vic lp
chng trnh. Th d c tch s ca cc ma trn RZR, Intercooled Stata s s dng b nh
c th ghi nh kt qu tm thi l ma trn T=RZ ri sau tnh TR. Stata nh do khng c
th s dng nhiu b nh nn phi tnh ton trc tip RZR, v do mt s kt qu trung gian
phi tnh ton li nhiu ln v iu ny lm Stata nh b chm .
D sao, s khc bit ca Intercooled Stata v Stata nh mang tnh k thut v ni b, i vi
ngi dng, vic s dng Intercooled Stata v Stata nh khng c g khc bit. Nu Stata
c ci t v bn mun bit bn ang dng Stata g th c th g lnh about:
. about
Stata/SE 10.0 for Windows
Born 25 Jul 2007
Copyright (C) 1985-2007

Total physical memory: 1038712 KB
Available physical memory: 192392 KB

Unlimited-user Stata for Windows (network) perpetual license:
Serial number: 56437637415
Licensed to: Khoa Y te Cong cong
Dai hoc Y Duoc
Nh vy, chng ta ang s dng Stata Phin bn c bit 10.0 cho Windows.

40
Khi ng Stata
1. Khi ng Stata
Khi ng chng trnh STATA bng cch nhp vo nt Start :: All Programs :: Medistat ::
Stata 10 hoc nhp vo biu tng (icon) Stata 10 trn mn hnh Desktop.
2. M t giao din ca chng trnh Stata
Giao din ca Stata s hin ra vi 3 thanh v 4 ca s:
3 thanh bao gm:
1. Thanh tiu vi dng ch "Intercooled Stata 6.0"

2. Thanh menu vi cc menu File (ng m tp tin); Edit (hiu chnh); Prefs (Ty chn); Data
(S liu) Graphics ( ha) Statistics (Thng k) User (Ngi dng) Window (m ra cc ca
s) v Help (Tr gip)

3. Thanh cng c (toolbar)

Thanh cng c gm 12 nt cng c (1- Open file; 2- Save; 3- Print Results; 4- Begin (Close) log;
5- Start Viewer (Bring Viewer to Front) ; 6- Bring results window to Front 7-Bring
graph windows to Front; 8- Do-file Editor; 9-Data Editor; 10-Data Browser; 11-Clear -
more - Condition v 12- Break)
ngha ca tng cng c nh sau:

1- Open file (m tp tin)
2- Save (Lu tp tin)
3- Print Results (In kt qu)

4- Begin (Close) log: (Bt u (Kt thc) ghi bin bn kt qu)
5- Start Viewer (Bring Viewer to Front) : Bt u s dng ca s Viewer
7-Bring graph windows to Front (a ca s ha ra trc)

8- Do-file Editor: (Bin son tp tin chng trnh - do file)
9-Data Editor: Bin tp s liu (sa cha, thm bt s liu)
10-Data Browser: Duyt s liu (xem nhng khng sa cha)
11-Clear - more - Condition (Xa lnh more tip tc chng trnh)

41
12- Break: (Ngng tp tin chng trnh)

Bn ca s lit k theo ngc chiu kim ng h bao gm
1. Ca s Command (ca s lnh)

2. Ca s Result (ca s Kt qu)

3. Ca s Review (ca s Lu tr)

4. Ca s Variables (ca s Bin s)


3. Cch cch thc hin lnh trong chng trnh Stata
C hai cch thc hin lnh trong chng trnh Stata: Dng bn phm g lnh vo ca s
lnh (Stata Command) hay s dng con tr chut chn cc trnh n (menu) giao din ha
(Graphic Interface)
Dng bn phm g lnh

42
Dng bn phm g lnh vo ca s lnh (Stata Command). y l cch s dng Stata
ca ngi chuyn nghip v n cho php thc hin tt c cc lnh ca Stata mt cch
nhanh chng vi y cc chc nng ph ca lnh. Tuy nhin phng php ny c
th khng thch hp cho ngi mi s dng do n i hi ngi dng phi thuc cc cu
lnh v c php ca n
Con tr chut vi giao din ha (Graphic Interface)
C th dng chut thc hin cc lnh nhm thao tc s liu (menu Data), v th (menu
Graphics) v phn tch s liu (menu Statistics). Phng php s dng chut v menu l
phng php d s dng nn s c u tin trnh by trong ti liu ny.
4. Lu li kt qu phn tch
Kt qu ca phn tch c th hin trn ca s Stata Result v ca s ny c mt thanh trt
dc cho php xem li nhng kt qu phn tch c. Tuy nhin trnh gy nhm ln cho ngi
phn tch, ca s ny ch lu li nhng kt qu gn nht. Do nu chng ta mun lu tr li
ton b kt qu phn tch chng ta cn phi m ca s log bng cch nhp vo nt cng c Stata
Log nm v tr th t t tri trn thanh cng c . Khi ca s Open Stata Log m ra,
chng ta c th nhp tn ca tp tin lu tr (log file) vo hp vn bn File name.

Gi s chng ta chn tp tin ny l "baitap.smcl" hy g "baitap" vo hp File Name ri nhp
OK.
Khi trn ca s kt qu (Stata results) s hin ra thng bo cho bit rng bin bn kt qu
phn tch s c lu ti tp tin "D:\Dung\Science\BSCK2_Hieu_mat\baitap.smcl"

43
. log using "D:\Dung\Science\BSCK2_Hieu_mat\baitap.smcl"
------------------------------------------------------------------------------
log: D:\Dung\Science\BSCK2_Hieu_mat\baitap.smcl
log type: smcl
opened on: 10 Oct 2004, 12:01:34
Sau bn c th thc hin cc bc phn tch.
Khi mun xem li bin bn (kt qu phn tch) hy nhp vo nt cng c log mt ln na
hin ra ca s Stata Log Options.

Sau chn vo nt chn View snapshot of log file v nhp vo nt lnh OK xem bin bn.
Khi mun chm dt vic ghi bin bn (kt qu phn tch) hy nhp vo nt cng c log
hin ra ca s Stata Log Options.

Sau chn vo nt chn Close log file v nhp vo nt OK.
Li khuyn: Ngi s dng Stata c kinh nghim sau khi m tp tin s liu lun lun m tp tin
log trc khi tin hnh cc phn tch thng k khng b mt cc kt qu ca qu trnh phn
tch.


44
Mt vi phn tch n gin vi Stata
Mc tiu:
Sau khi nghin cu bi ny cc hc vin c kh nng:
- Nu c s khc bit gia bin s nh tnh v bin nh lng
- S dng c cc lnh ca Stata: edit, sum, tab1, bysort
- Hiu c khi nim v trng s (weight)
Chng ta s minh ha nhng lnh c bn trong phn tch thng k vi Stata s dng s liu n
gin gi nh ca 5 i tng nghin cu. S liu ny c 2 bin s gii v ng huyt (n v
ca ng huyt l mg/100mL).
Hnh 2. ng huyt v gii tnh ca 5 i tng nghin cu
STT Tn Gii tnh ng huyt (mg/100mL)
1 Truc nam 80
2 Phuoc nam 90
3 Han n 100
4 Hoa n 110
5 Dung nam 130
Cu hi:
1- Bin s gii tnh l bin nh lng hay bin nh tnh? Bin s ng huyt l bin
nh lng hay bin nh tnh? Hy nu s khc bit gia bin s nh lng v bin s
nh tnh.
2. Bin s tn hc l bin s nh lng hay nh tnh?
2- tm tt c trng v gii tnh ca 5 i tng nghin cu ny chng ta s dng
thng k g?
3- tm tt c trng v ng huyt ca 5 i tng nghin cu ny chng ta s dng
thng k g? Hy cho bit ng huyt trung bnh ca 5 ngi ny? Hy cho bit ng
huyt trung bnh ca 2 ngi n v ng huyt trung bnh ca 3 ngi nam?
1. Khi ng Stata
Trc tin, chng ta thc hin theo hng dn: Start :: Alls Program :: MediStat :: Stata 10
khi ng Stata t Windows (Ch nt lnh u tin Start v mc cui cng Stata 10 l im
cn phi nhp chut, cc du :: th hin s di chuyn (navigate) ca con tr chut). Khi
chng ta c mn hnh Stata vi 4 ca s bin s (variables), lnh (command), kt qu (results) v
xem li (review) nh sau:

45

Hnh 3. Giao din ca Stata vi 4 ca s bin s (variables), lnh (command), kt qu (results) v
xem li (review) nh sau
Vi Stata, chng ta c 2 cch yu cu cho Stata thc hin cc lnh v qun l s liu, v th
hay phn tch thng k (1) g lnh vo ca s lnh hay (2) s dng menu. Ngi c c
khuyn khch s dng c 2 phng php trn thc hin phn tch thng k. Nhng vi mc
ch gip ngi c pht trin nng lc t pht trin v nng lc phn on trong thc hin phn
tch thng k bng menu, cc hng dn s tp trung vic gip ngi c chuyn mt cu lnh
c vit theo c php vo mn hnh giao din.
2. Nhp liu vi lnh Edit
Trc tin chng ta s dng lnh edit nhp liu n gin. Lnh edit ngoi kh nng nhp liu
cng c th s dng iu chnh s liu. G edit trong ca s lnh (cn lu lnh edit cng nh
phn ln cc lnh khc trong Stata c vit ch thng, nu chng ta g lnh EDIT (hay Edit
hay eDit) th chng trnh Stata s khng hiu v s hin thng bo mu :
unrecognized command: EDIT
Sau khi chng ta g lnh edit, mn hnh Editor s hin ra:

46

Trong ca s Editor mi hng l mt i tng v mi ct l mt bin s, do s liu ca v
tn, gii tnh v ng huyt ca 5 i tng s th hin bng 5 hng v 3 ct. Bin s tn (l
bin s hnh chnh) c th hin Ct 1, bin gii l bin nh tnh c th hin ct 2 v
bin ng huyt l bin nh lng c th hin ct 3. Trong th d ny chng ta nhp liu
5 gi tr ca bin tn trc v sau nhp 5 gi tr ca bin gii v 5 gi tr ca bin ng
huyt:
- Trc tin nhp gi tr tn cho i tng 1 (Truc) bng cch di chuyn con tr n hng 1 ct 1
(lu khi nhn trc hp vn bn th hin l var1[1] th hin con tr ang bin var1 ca
i tng s 1. Nhp "Truc" vo hp vn bn.

Khi nhn Enter th gi tr "Truc" s c a vo hng 1 ct 1 v con tr s nm xung hng 2
ct 1 v nhn trc hp vn bn s th hin ch var1[2] th hin con tr ang bin var1

47
ca i tng s 2

Tip tc thc hin cho n khi nhp 5 tn ca 5 i tng.


Sau di chuyn con tr sang hng 1 ct 2 v quan st nhn trc hp vn bn l var2[1].

Tip tc nhp cc s 1,1,0,0,1 vo 5 trn cng ca ct 2. Cc s 1,1,0,0,1 l m ha ca cc

48
gi tr Nam, Nam, N, N, Nam ca bin s gii tnh.

Sau khi nhp cc gi tr ca gii tnh, chng ta s nhp vo cc gi tr ca ng huyt bng cch
di chuyn con tr sang hng 1 ct 2 v quan st nhn trc hp vn bn l var3[1].

Tip tc nhp cc gi tr ng huyt ca nm i tng ln lt l 80, 90, 100, 110,130
Khi nhp bin s nh lng (hoc nhp gi tr ca bin nh tnh c m ha) cn lu trnh
g ch ci (nh a, b, c, ) vo nhp liu u tin v khi trong nhp liu u tin c ch ci
(nh 8o) th d sau chng ta c xa i v nhp li cho ng (th d nh 80) th kiu (Type)
ca bin s vn l str# v khng th c x l nh l mt bin nh lng. khc ph sai lm
ny c th s dng lnh destring vi c php sau:
. destring [varlist], replace
(Data :: Create or Change variable :: Other variable transformation command :: Convert variable
from string to numeric)

49

Sau khi nhp liu chng ta c th nhp i vo tn bin var1 i tn bin (Name) bin thnh
ten.

Vic t tn bin (name) phi theo mt s quy tc: Tn bin phi bt u bng mt ch ci hoc
du gch chn v khng c du khong trng hoc du ni gia tn bin. Nu tn bin c t
c khong trng gia (th d nh "ho ten") th chng trnh stata s nhn nhm tn bin ny l
2 tn bin c t cnh nhau. Nu tn bin c t c du cch gia (th d nh "ho-ten")
th chng trnh stata s nhn nhm y l biu thc s hc (bin ho tr cho bin ten). V nhng
hn ch ny nn tn bin thng khng m t y c ngha ca bin. Khi nu mun
m t y ngha ca bin phi s dng nhn bin (label). Ni khc hn tn bin l tn ngn
gn ca bin, nhn bin (label) l tn di dng ca bin.
Trong trng hp bin ten, nu chng ta thy tn bin ten c ngha th chng ta khng cn
s dng nhn bin v trng ny.
Chng trnh Stata cho bit nh dng (format) ca bin ny l %9s, ch s ca nh dng ch
nh y l bin chui (string) v khi my tnh s khng thc hin php ton s hc trn cc

50
bin s ny.
Sau khi t xong tn bin v nhn bin, chng ta c th nhp vo nt lnh OK tip tc t
tn cho bin s gii tnh ct 2 bng cch nhp p vo ct s 2

Ta hy khai bo tn bin ct ny l gioi v nhn ca bin ny l gioi tinh. nh dng ca bin
ny l %8.0g. nh dng ny nhm cho bit y l bin th hin bng con s (g), bin s ny c
8 ch s trc du thp phn v 0 (khng) con s no sau du thp phn.

Do chng ta mun th hin l 1 l m ha cho gi tr Nam ca bin s, 0 l m ha ca gi tr N
ca bin gii. Chng ta phi khai bo nhn gi tr (Value label) bng cch nhp vo nt lnh
Define/Modify.

51

Khi khi ca s "label define Define value labels" hin ra chng ta nhp vo nt lnh
Define ca ca s ny to ra nhn gi tr mi. t tn ca nhn (Label name) l gioi (lu : c
th t tn ca nhn ging hoc khc vi tn ca bin Trong trng hp ny tn ca nhn trng
vi tn ca bin)

V sau nhp vo OK. Sau khi ca s Add value hin ra, nhp gi tr m ha (value) 1 vo hp
vn bn value v nhp gi tr cha m ha (text) nam vo hp vn bn text v sau nhp nt
lnh OK.

52

Ca s Add value li hin ra mt ln na, nhp gi tr m ha (value) 0 vo hp vn bn value v
nhp gi tr cha m ha (text) nu vo hp vn bn text v sau nhp nt lnh OK.

Ca s Add value li hin ra thm mt ln na. Tuy nhin ln ny chng ta khng nhp vo cc
hp vn bn value v hp vn bn text, m ch nhp vo nt lnh Cancel bi v chng ta khai
bo y cch m ha ca bin s gii


53

Sau chng ta nhp vo nt lnh Close ca ca s label define Define value labels. ca
s Variable properties nhp vo hp combo trn nt lnh Define/Modify chn nhn gi tr
gioi.

Sau nhp nt lnh OK ca ca s Variable properties hon tt phn khai bo cho bin
gii.
Tng t chng ta cng i tn v nhn ca bin var3 thnh duonghuyet v duong huyet luc doi.

54

Sau khi hon thnh nhp liu v m t thuc tnh ca bin s, chng ta ng s liu li bng
cch Close ca s Data Editor.
Khi hy quan st ca s Variable th hin c 3 bin s ten, gioi, duonghuyet v cc c
tnh ca bin ny (nhn, loi, nh dng).
Mt cch khc i tn bin bng cch nhp chut phi tn bin ca s Variables:

3. Thng k m t cho bin nh tnh vi lnh tab1
lm thng k m t cho bin nh tnh chng ta s dng s dng lnh tab1:
. tab1 varlist
(Statistics :: Summaries, tables, and tests :: Tables :: Multiple one-way tables)
C th trong trng hp ny, c bng phn phi tn sut ca gii tnh, bn hy g lnh

55
. tab1 gioi

-> tabulation of gioi

gioi | Freq. Percent Cum.
------------+-----------------------------------
nu | 2 40.00 40.00
nam | 3 60.00 100.00
------------+-----------------------------------
Total | 5 100.00

bng kt qu Freq. l vit tt ca frequency c ngha l tn sut, Percent l phn trm v Cum.
l ch vit tt ca Cummulative percent c ngha l phn trm cng dn. Cn lu l ch nn s
dng phn trm cng dn trong bng tn sut ca bin nh lng phn nhm hay bin th t.
Trong bng ny gi tr n c lit k trc v n c m ha bng gi tr 0 nh hn gi tr m
ha 1 ca gi tr nam
4. Thng k m t cho bin nh lng vi lnh sum
lm thng k m t cho bin nh tnh chng ta s dng s dng lnh tab1:
. summarize varlist
(Statistics :: Summaries, tables, and tests :: Summary and Descriptive Statistics :: Summary
statistics)
C th trong trng hp ny, bit trung bnh, lch chun, gi tr ti thiu v ti a ca bin
ng huyt, bn hy g lnh
. summarize duonghuyet
Hoc n gin hn
. sum duonghuyet

Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
duonghuyet | 5 102 19.23538 80 130

bng kt qu Obs c ngha l s quan st, mean l trung bnh (hay chnh xc hn l trung bnh
cng), Std. Dev. l vit tt ca Standard Deviation c ngha l lch chun, Min l gi tr ti
thiu v Max l gi tr ti a.
5. Thng k phn tng theo nhm
Trong cc nghin cu c nhng trng hp chng ta thng phi thc hin thng k phn tng
theo nhm th d nh cn xc nh t l suy dinh dng ca tr di 5 tui phn tng theo ni c
tr (c ngha l xc nh t l suy dinh dng ca tr di 5 tui ni thnh v t l suy dinh
dng tr di 5 tui ngoi thnh), t l c kin thc ng theo ngh nghip (t l c kin
thc ng nhm cng nhn, nhm nng dn, nhm ngh nghip khc) hoc ng huyt
trung bnh theo gii tnh (ng huyt trung bnh nhm nam v ng huyt trung bnh
nhm n). Tin t (prefix) bysort c th s dng trc cc lnh thng k thc hin cc phn
tch thng k phn tng.

56
C php cho vic s dng tin t bysort l :
bysort varlist: Stata_command
C th bit trung bnh, lch chun, gi tr ti thiu v ti a ca bin ng huyt phn
tng theo gii cn s dng tin t bysort gioi trc lnh thng k cho bin ng huyt
. bysort gioi: sum duonghuyet

-> gioi = nu

Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
duonghuyet | 2 105 7.071068 100 110
------------------------------------------------------------------------------------
-> gioi = nam

Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
duonghuyet | 3 100 26.45751 80 130

Kt qu ny cho bit ng huyt trung bnh ca 3 ngi nam l 100 mg% v ca 2 ngi n l
105 mg%.
6. Trng s
Nu chng ta l mt nghin cu v bit c ng huyt trung bnh ca 3 ngi nam l 100
mg% v ca 2 ngi n l 105 mg%, vy ng huyt trung bnh ca c 2 nhm s l bao
nhiu?
Nu s ngi nhm nam v nhm n bng nhau th ng huyt trung bnh ca chung 2 nhm
s l trung bnh ca ng huyt trung bnh ca nhm nam (100 mg%) v ng huyt trung
bnh ca nhm n (105 mg%) v s l 102,5 mg%. Tuy nhin trong trng hp ny khng ng
v s ngi nhm nam (3 ngi) khc vi s ngi nhm n. Khi ng huyt trung bnh
ca chung 2 nhm s l trung bnh ca ng huyt trung bnh ca nhm nam (100 mg%) vi
trng s l 3 v ng huyt trung bnh ca nhm n (105 mg%) vi trng s l 2. Chng ta hy
minh ha cc tnh ny vi Stata bng cch xa b s liu c v nhp s liu mi vo
. clear
. edit
V nhp s liu nh sau:


57
Sau chng ta c th s dng lnh summarize vi trng s tnh ng huyt trung bnh
chung ca c 2 nhm:
. summarize duonghuyet [fweight = trongso]

Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
duonghuyet | 5 102 2.738613 100 105
C th nhn xt rng nu chng ta s dng trung bnh c trng s t cc trung bnh tng nhm
chng ta s c con s trung bnh chung c gi tr. lch chun (standard deviation) trong
trng hp ny l 2,74 khng sai nhng phi l gii khc i. lch chun ny khng ni ln s
phn tn ca gi tr c th chung quanh gi tr trung bnh m ni ln s phn tn ca trung bnh
nhm chung quanh gi tr trung bnh chung. V vy lch chun ny l lch chun gia cc
nhm (hay cn li l between group standard deviation) v Tng bnh phng sai lch gia cc
nhm s bng phng sai gia cc nhm nhn vi cn bc hai ca c mu -1 . Chng ta s quay
tr li khi nim ny trong phn phn tch phng sai (analysis of variance) nhng chng ta cng
nhn th d ny minh ha cho cng thc c s dng rng ri trong phn tch phng sai l:
SS =1480=4 x 19.23538
2
= SS
b
+ SS
w

30 2.738613 4 ) 1 ( ) (
2
1
2
= = = =

=
b
k
j
j j b
Var N X X N SS
1450 7.071068 1 + 26.45751 2 ) 1 (
2 2
1
2
= = =

=
k
j
j j w
s N SS
Trong k hiu s i tng trong mi nhm l N
1
, N
2
, , N
k
. S i tng trong nhm j c
k hiu l N
j
. Tng s i tng trong tt c cc nhm l N
1
+ N
2
+ + N
k
= N. S liu c
trnh by nh sau



58
M t s liu vi Stata 10.0 for Windows
Chng ny s hng dn bn phng php m t s liu vi phn mm Stata 10.0 s dng b
s liu ivf.dta c trong th mc C:\DATA sau khi bn ci t cc tp tin s liu mu.
Thng thng trc khi m t s liu chng ta cn thc hin bc chun b v vic thao tc s
liu (data processing). Cng tc chun b bao gm vic m tp tin s liu, m tp tin log (Open
log file), kho st s liu c bao nhiu bn ghi v c nhng bin s no cng nh nghin cu
cng nghin cu (ch yu l mc tiu nghin cu) gip vic phn loi bin s. Vic thao tc
s liu l vic r sot s liu c b sai st hay nhm ln g hay khng, to bin s mi theo yu
cu ca phn tch v tin hnh vic dn nhn s liu gip cho vic hiu r hn s liu v c
kt qu ca phn tch thng k.

Trc tin chng ta hy khi ng Stata theo cch hng dn chng Khi ng Stata.
Sau thc hin cc bi tp 1 n 3 cho cng tc chun b v cc bi tp 4 n 6 cho cng tc
thao tc s liu.
1- M tp tin ivf_v.dta v m tp tin log
Khi ng ca s Use New Data bng cch 1 trong 2 cch:
- Nhn nt cng c m file ( v tr u tin trn thanh cng c).
- Chn menu File :: Open

Sau khi ca s Use New Data s hin ra. Nhp vo mi tn bn phi hp Look in chn a
thch hp v dng con chut nhp vo cc th mc chn th mc c cha s liu (thng

59
thng tp tin s liu nm th mc C:\Data). Tm tp tin s liu ivf_v.dta, nhp p vo tn tp
tin ny m tp tin (hoc nhp vo tp tin ny tn tp tin ri vo hp File Name ri sau
nhp vo nt lnh Open m tp tin).
lu tr li ton b kt qu phn tch s c thc hin, cn nh nhp vo nt cng c Stata
Log nm v tr th t t tri trn thanh cng c bt u log kt qu (begin log). My
tnh s hin ra hp thoi Begin Logging Stata Output chng ta chn tn tp tin (File name) v
th mc lu (Save In) ca tp tin log.

Th d chng ta mun lu tp tin log vi tn l ivf_v.smcl vo th mc c:\data; chng ta nhp
vo cc thng tin nh trn. Khi cui ca s kt qu c thanh trng thi vi dng ch log on
(smcl) cho bit l tt c cc kt qu phn tch ang c ghi chp li (log).


2. Kho st cc bin s ca tp tin v nghin cu mc tiu nghin cu phn loi bin s
Hng dn: xem lit k cc bin s chng ta c th nhn phm chc nng F3 hay s dng

60
menu (nhp vo menu Data :: Describe data :: Describe variable in memory) xem cc bin
s ca s liu


Chng ta c th xem danh sch cc bin s lit k sau:

. describe

Contains data from C:\DATA\ivf_v.dta
obs: 641
vars: 7 15 Aug 2006 15:27
size: 20,512 (99.8% of memory free)
-------------------------------------------------------------------------------
storage display value
variable name type format label variable label
-------------------------------------------------------------------------------
maso float %9.0g ma so
tuoime float %9.0g tuoi me (nam)
tangha float %9.0g tang huyet ap thai ki - 1=tang
ha, 0=khong tang ha
tuoithai float %9.0g tuoi thai (tuan)
gioi float %9.0g gioi tinh tre - 1=trai, 0=gai
tlsosinh float %9.0g trong luong so sinh (gram)
nghenghiep float %9.0g nghe nghiep me - 1=tu do,
2=cong nhan, 3=vien chuc
-------------------------------------------------------------------------------
Sorted by: maso


Gi s t cng nghin cu chng ta bit y l tp tin ca s liu 641 a tr c sinh t b
m th thai trong ng nghim (in-vitro fertilisation) vi mc tiu nghin cu l xem tui thai v
tng huyt p trong thai k c nh hng ln trng lng thai hay khng. Cch l gii s liu
c minh ha

STT Tn bin ngha ca bin Phn loi bin s:
(c lp hay Ph thuc)
(nh tnh hay nh lng)

61
1 Maso M s
2 Tuoime Tui ca m (nm tui)
3 Tangha Tng huyt p thai k 1= c
0= khng

4 Tuoithai Tui thai (tnh theo tun)
5 Gioi Gii tnh ca tr 1=trai 0=gi
6 Tlsosinh Trng lng sinh tnh theo
grams

7 Nghenghiep Ngh nghip ca m 1=t do
2=cng nhn 3=vin chc


3. Lm th no xem s liu
Hng dn: C th xem s liu bng 2 cch:
- Dng nt lnh Data Browser (v tr 3 tnh t pha bn phi ca thanh cng c)
- Dng menu Data :: Data browser (read-only editor)


S dng Data Browser cho php nhn s liu trong li (nh cc ca chng trnh Excel)
nhng n khng cho php in s liu. Mun nhn s liu ra ca s kt xut (output) sau in ra
hy s dng menu Data:: Describe Data :: List data.

4. Hy thc hin thng k m t tt c cc bin s trong b s liu ny:
Hng dn: trc tin chng ta phi xc nh bin s no l bin s nh lng v bin s no l
bin s nh tnh. Sau thc hin thng k m t cho cc bin s: i vi bin nh lng, thc
hin lnh summarize c trung bnh v lch chun, i vi bin nh tnh thc hin lnh
tab1 c bng phn phi tn sut ca cc bin s.
Trong b s liu ny c cc bin tuoime, tuoithai, tlsosinh l bin nh lng. m t
bin s ny chng ta s dng menu Statistics :: Summaries, tables, & tests :: Summary
Statistics.

62


Sau khi hp thoi Summarize hin ra, thc hin cc bc sau:

Bc 1: Dng con chut nhp vo du mi tn xung ( ) ca hp combo Variables. Khi s
c danh sch cc bin s c s ra.

Bc 2: Di chuyn con tr trong danh sch cc bin, v nhp vo cc bin cn m t thng k

63
(tuoime, tuoithai, tlsosinh) tn cc bin ny xut hin trn hp vn bn Variables


Bc 3: Nhp vo nt lnh OK
Kt qu c trnh by nhu sau:
. summarize tuoime tuoithai tlsosinh

Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
tuoime | 641 33.97192 3.87046 23 43
tuoithai | 641 38.68725 2.329931 24.69 42.35
tlsosinh | 641 3129.137 652.7827 630 4650

Cc bin s nh tnh bao gm tang_ha (tng huyt p), gioi (gii tnh ca tr), nghenghiep
(ngh nghip ca m). tm tt cc bin s nh tnh ny (tang_ha, gioi, nghenghiep) chng ta
s dng menu Statistics :: Summaries, tables & test :: Tables :: Multiple one-way tables.







64
Cc bc thc hin gm:
- Bc 1: Dng con chut nhp vo du mi tn xung ( ) ca hp combo Categorical
variables. Khi s c danh sch cc bin s c s ra.

Bc 2: Di chuyn con tr trong danh sch cc bin, v nhp vo cc bin cn m t thng k
(tang_ha, gioi, nghenghiep) tn cc bin ny xut hin trn hp combo Categorical variables
Bc 3: nhp nt lnh OK hon tt

. tab1 tang_ha gioi nghenghiep

-> tabulation of tang_ha

tang huyet |
ap thai ki |
- 1=tang |
ha, 0=khong |
tang ha | Freq. Percent Cum.
------------+-----------------------------------
0 | 552 86.12 86.12
1 | 89 13.88 100.00
------------+-----------------------------------
Total | 641 100.00

-> tabulation of gioi

gioi tinh |
tre - |
1=trai, |
0=gai | Freq. Percent Cum.
------------+-----------------------------------
0 | 315 49.14 49.14
1 | 326 50.86 100.00
------------+-----------------------------------
Total | 641 100.00

-> tabulation of nghenghiep


65
nghe nghiep |
me - 1=tu |
do, 2=cong |
nhan, |
3=vien chuc | Freq. Percent Cum.
------------+-----------------------------------
1 | 104 16.22 16.22
2 | 238 37.13 53.35
3 | 299 46.65 100.00
------------+-----------------------------------
Total | 641 100.00

5. Cc tm tt s liu nh trn l t yu cu. Tuy nhin vic m ho cc gi tr ca bin s
khin cho vic c bng bng tn sut ca bin s danh nh (nh gioi v nghenghiep) b kh
khn (nht l cho nhng ngi khng trc tip lm thng k hay phi c li kt qu sau mt
khong thi gian di). Do nhng ngi lm thng k chuyn nghip lun lun thc hin ghi
ch (dn nhn) cho cc bin s nh tnh. Hy thc hin vic dn nhn s liu.
Hng dn:
Vic dn nhn cho cc gi tr m ha l vic lm tn cng nhng n gip phn bit ngi lm
thng k chuyn nghip v ngi lm thng k khng chuyn nghip. Mc d tn cng nhng
li ch do n em li vt qua cng sc b ra v vy chng ta cn phi thc hin vic dn nhn
ny.
Vic dn nhn gi tr bin s c thc hin qua 2 bc: to nhn (define label value) v dn
nhn cho gi tr (Assign value label to variable).
- To nhn sex, tang_ha, nhn nghenghiep
to nhn s dng menu Data :: Labels :: label values :: Define or modify value label. Ca s
Define value label s c hin ra.

Khi khi ca s "label define Define value labels" hin ra chng ta nhp vo nt lnh
Define ca ca s ny to ra nhn gi tr mi. t tn ca nhn (Label name) l sex (lu : c
th t tn ca nhn ging hoc khc vi tn ca bin Trong trng hp ny tn ca nhn (sex)
khc vi tn ca bin(gioi))

66

V sau nhp vo OK.

Sau khi ca s Add value hin ra, nhp gi tr m ha (value) 1 vo hp vn bn value v nhp
gi tr cha m ha (text) nam vo hp vn bn text v sau nhp nt lnh OK.

67

Ca s Add value li hin ra mt ln na, nhp gi tr m ha (value) 0 vo hp vn bn value v
nhp gi tr cha m ha (text) nu vo hp vn bn text v sau nhp nt lnh OK.


Ca s Add value li hin ra thm mt ln na. Tuy nhin ln ny chng ta khng nhp vo cc
hp vn bn value v hp vn bn text, m ch nhp vo nt lnh Cancel bi v chng ta khai
bo y cho tn nhn sex


68

Sau c th nhp vo nt lnh Close (ca hp thoi Define value labels) thot ra hay nhp
vo nt lnh Define (ca hp thoi Define value labels) tip tc to nhn tang_ha.

Vi cc cch m ha c quy nh trong nhn c tn tang_ha l: 1 l c tng huyt p v 0 l
khng tng huyt p.

69

Cn lu : tn nhn c th khc vi tn bin (th du nh trng hp trn ta t tn nhn l sex
trong khi tn bin l gioi) hoc tn nhn c th trng vi tn bin (th d ta c th t tn nhn l
tang_ha cho bin tang_ha).
Tng t ta cng tip tc to nhn nghenghiep bng cch nhp vo nt lnh define v sau
nhp tn ca nhn nghenghiep vo hp vn bn label name ri nhp OK.

Tip tc quy nh cch m ha ca nhn c tn nhn nghenghiep l: 1 l c nghe t do v 2 l
cng nhn v 3 l vin chc.

70

hon tt vic to nhn ta nhn vo nt lnh Close

Dn nhn gi tr (Assign value label) cho cc bin gioi, tang_ha, v nghenghiep
Sau khi d to c nhn, chng ta hy dn nhn gi tr cho bin s bng cch dng menu
Data :: Labels :: Label values :: Assign value label to variable


Khi hp thoi labels value Attach value label hin ra dn nhn sex cho mi bin s gioi cn
thc hin 4 bc sau:

71

- Bc 1: khung Add or remove value label, m bo l nt chn Attach a value label to
variable c chn.
- Bc 2: t con tr vo hp combo Variable; chn bin gioi trong hp combo ny.
- Bc 3: a con tr vo hp combo Value lable v chn nhn sex hp combo ny
- Bc 4: Nhp vo nt lnh Submit thc hin vic dn nhn.

tip tc thc hin tng t dn nhn tang_ha cho bin tang_ha, hy tin hnh cc bc
sau:

- Bc 1: khung Add or remove value label, m bo l nt chn Attach a value label to
variable c chn.
- Bc 2: t con tr vo hp combo Variable; chn bin tang_ha trong hp combo ny.
- Bc 3: a con tr vo hp combo Value lable v chn nhn tang_ha hp combo ny

72
- Bc 4: Nhp vo nt lnh Submit thc hin vic dn nhn.

tip tc thc hin tng t dn nhn nghenghiep cho bin nghenghiep, hy tin hnh cc
bc sau:

- Bc 1: khung Add or remove value label, m bo l nt chn Attach a value label to
variable c chn.
- Bc 2: t con tr vo hp combo Variable; chn bin nghenghiep trong hp combo ny.
- Bc 3: a con tr vo hp combo Value lable v chn nhn nghenghiep hp combo ny
- Bc 4: Bi vy y l bc dn nhn cui cng, do khng nhp vo nt lnh Submit
thc hin vic dn nhn v nhp vo nt lnh OK ng thi thc hin vic dn nhn v ng
ca s label values

6. Lp bng phn phi tn sut cho cc bin s nh tnh sau khi dn nhn cho cc bin ny.
Hng dn:
Cc bin s nh tnh c dn nhn bao gm tang_ha sex matagegp gestcat. tm tt cc
bin s nh tnh ny (tang_ha sex matagegp gestcat) chng ta s dng menu Statistics ::
Summaries, tables & test :: Tables :: Multiple one-way tables.

73


Khi hp thoi tab1 One-way tables hin ra, chng ta tin hnh 3 bc (1) t con tr vo hp
Categorical value (2) Nhp vo ca s variable chn cc bin s tin hnh phn tch v (3)
Nhp vo nt lnh OK. Kt qu s xut hin nh sau:

. tab1 gioi tang_ha nghenghiep

-> tabulation of gioi

gioi tinh |
tre - |
1=trai, |
0=gai | Freq. Percent Cum.
------------+-----------------------------------
gai | 315 49.14 49.14
trai | 326 50.86 100.00
------------+-----------------------------------
Total | 641 100.00

-> tabulation of tang_ha

tang huyet ap |
thai ki - |
1=tang ha, |
0=khong tang |
ha | Freq. Percent Cum.
--------------+-----------------------------------
huyet ap bt | 552 86.12 86.12

74
huyet ap tang | 89 13.88 100.00
--------------+-----------------------------------
Total | 641 100.00

-> tabulation of nghenghiep

nghe nghiep |
me - 1=tu |
do, 2=cong |
nhan, |
3=vien chuc | Freq. Percent Cum.
------------+-----------------------------------
tu do | 104 16.22 16.22
cong nhan | 238 37.13 53.35
vien chuc | 299 46.65 100.00
------------+-----------------------------------
Total | 641 100.00

7. V t chc (histogram) ca bin trng lng s sinh (tlsosinh)
Hng dn:
v t chc , ta phi s dng menu Graphics :: Histogram


Khi hp thoi histogram hin ra, chng ta thc hin cc bc sau:


75

Bc 1: t con tr vo hp combo Variable nhp vo mi tn xung bn phi hp combo
s ra danh sch cc bin
Bc 2: T trong danh sch cc bin chn tlsosinh a bin ny vo hp combo Variable.
Bc 3 Bc 4: nhm xc nh t chc s bt u t gi tr 600 (Lower limit of first bin)
v mi khong tip theo (bin) c rng l 300 (Width of bins)
Bc 5: Cho bit t chc s ghi nhn t l phn trm ca cc khong gi tr bng cch nhp
vo nt chn Percent.
Nu mun th hin mt ca phn phi, nhp vo nt chn Density, nu mun th hin
t l th nhp vo nt chn Fraction, nu mun th hin tn sut th nhp vo nt chn
Frequency. Mi lin h gia cc hm phn phi ny nh sau:
T l (Fraction) = Tn sut (Frequency) / C mu (N)
Mt (Density) = T l (Fraction) / rng ca khong chia (Width of bins)

Kt qu tip theo s c trnh by trong hnh sau.

76
0
5
1
0
1
5
2
0
2
5
P
e
r
c
e
n
t
1000 2000 3000 4000 5000
t r ong l uong so si nh ( gr am)


8. th ny cho chng ta thy hnh dng ca phn phi s liu, tuy nhin chng ta cng c th
thay i thc hin vic chia khong cho trc honh, ghi ch cho trc honh, chia khong cho trc
tung v ghi ch cho trc tung. Gi s chng ta mun thc hin cc yu cu chia khong v ghi
ch nh sau:
Trc honh phi c khong gi tr t 600 n 4800 (bin l 4200). Chng ta mun chia
lm mi khong c ln l 300 nh vy cn thit phi c 14 khong. Chng ta cng mun
ghi gi tr t 600 n 4200 v mi nhn gi tr cch nhau 600 gram.
Trc honh c ghi ch l "trong luong so sinh (gram) cua 641 tre"
Trc tung c khong gi tr l 0 n 0.3, ghi nhn cho cc gi tr v cc nhn ny cch nhau
0.1
Trc tung c ghi ch l "Phn trm" (ch khng phi l Percent).

Cc bc thc hin chia khong v ghi ch cho trc hong (trc X) nh sau:
Bc 1: Nhp vo Tab X-axis
Bc 2: nhp ghi ch cho trc honh, hp vn bn Title, nhp vo ghi ch l "trong luong
so sinh (gram)"
Bc 3: chia khong cho trc honh, chn Major tick/label property, chn Range/Delta v
sau nhp cc gi tr ti thiu, ti a v khong delta.


77

Sau nhp vo nt lnh accept.

Cc bc thc hin chia khong v ghi ch cho trc hong (trc Y) nh sau:

Bc 4: Nhp vo Tab Y-axis
Bc 5: nhp ghi ch cho trc tung, hp vn bn Title, nhp vo ghi ch l "Phan tram"

78
Bc 6: chia khong cho trc tung, chn Major tick/label property, chn Range/Delta v sau
nhp cc gi tr ti thiu, ti a v khong delta.

Nhp vo nt lnh Accept.

Bc 7: Nhp vo nt lnh OK cui ca s hon tt

0
.
1
.
2
.
3
P
h

n

t
r

m
600 1200 1800 2400 3000 3600 4200 4800
Tr ng l - ng s si nh ( gr am)


9. Chng ta cng c th v th xut (p-p plot) xem bin s tlsosinh c tun theo phn phi
bnh thng

79
Hng dn:
S dng menu Graph Distributional graph - normal quantile plot


Khi ca s qnorm hin ra, chng ta tin hnh cc bc sau:
Bc 1: t con tr vo hp vn bn Variable
Bc 2: a con tr vo ca s Variables v nhp vo bin tlsosinh a bin ny vo hp vn
bn Variable.
Bc 3: nh du vo hp kim: Show grid at percentiles:
Bc 4: Nhp vo nt lnh OK

80


Kt qu nh sau:




81
Nu phn phi bnh thng th ng cong phn phi (ng nt m) s trng vi ng cho
ca hnh ch nht (ng thng mnh). Nu phn phi lch m th xc sut 0,5 ng cong
phn phi nm bn tri ng cho. Nu phn phi lch dng th xc sut 0,5 ng cong
phn phi s nm bn phi ng cho.
Nu dc ca ng cong phn phi ln hn mt (1) c ngha l phn phi thc nghim tng
chm hn phn phi bnh thng, nu ng cong phn phi nh hn mt (1) c ngha l ng
cong thc nghim tng nhanh hn phn phi bnh thng.

Nh vy, phn phi ca trng s sinh b lch tri v khong trng lng thp, phn phi trng
lng s sinh tng chm hn phn phi chun. khong trng lng cao trng lng s sinh
tng hi nhanh hn phn phi chun.
0
.
0
5
.
1
.
1
5
F
r
a
c
t
i
o
n
1000 2000 3000 4000 5000
trong luong so sinh (gram)


10. Hy v biu hnh thanh (bar chart) ca nhm ngh nghip
Hng dn:
Trc tin s dng menu Graphics :: Bar chart

hin ra ca s graph bar Chng ta hy 2 th Main v th Categories l 2 th nm bn
tri ca ca s.


82

th Main tin hnh cc bc sau:
Bc 1: Chn mc count nonmissing trong hp Combo Statistic
Bc 2: t con tr vo hp vn bn variable(s) nhp vo mi tn hng xung di bn phi
hp combo hin ra danh sch bin s.
Bc 3: a con tr chut chn danh sch bin s v nhp vo bin maso bin ny xut
hin trn hp vn bn Variable(s)
Bc 4: Nhp vo th (tab) Categories hin th ny ra

83


Bc 5: Khi th Categories, a con tr chut vo hp vn bn Variable
Bc 6: t con tr vo hp vn bn variable(s) nhp vo mi tn hng xung di bn phi
hp combo hin ra danh sch bin s.
Bc 7: a con tr chut chn danh sch bin s v nhp vo bin nghenghiep bin ny
xut hin trn hp vn bn Variable(s)
Bc 8: Nhp vo nt lnh OK xem biu hnh thanh c to ra.

84
0
1
0
0
2
0
0
3
0
0
c
o
u
n
t

o
f

m
a
s
o
t u do cong nhan vi en chuc



11. Hy v biu hnh thanh (bar chart) trung bnh trng lng s sinh ca cc a tr con ca
nhng b m c ngh nghip khc nhau.
Hng dn:
Trc tin s dng menu Graphics :: Bar chart
hin ra ca s graph bar Chng ta hy 2 th Main v th Categories l 2 th nm bn
tri ca ca s.


85

th Main tin hnh cc bc sau:
Bc 1: Chn mc mean trong hp Combo Statistic
Bc 2: t con tr vo hp vn bn variable(s) nhp vo mi tn hng xung di bn phi
hp combo hin ra danh sch bin s.
Bc 3: a con tr chut chn danh sch bin s v nhp vo bin tlsosinh bin ny xut
hin trn hp vn bn Variable(s)
Bc 4: Nhp vo th (tab) Categories hin th ny ra

86


Bc 5: Khi th Categories, a con tr chut vo hp vn bn Variable
Bc 6: t con tr vo hp vn bn variable(s) nhp vo mi tn hng xung di bn phi
hp combo hin ra danh sch bin s.
Bc 7: a con tr chut chn danh sch bin s v nhp vo bin nghenghiep bin ny
xut hin trn hp vn bn Variable(s)
Bc 8: Nhp vo nt lnh OK xem biu hnh thanh c to ra.

87
0
1
,
0
0
0
2
,
0
0
0
3
,
0
0
0
m
e
a
n

o
f

t
l
s
o
s
i
n
h
t u do cong nhan vi en chuc

12. Hy v biu hnh bnh (Pie chart) phn phi bin s ngh nghip m (nghenghiep).
Hng dn:
Trc tin s dng menu Graphics :: Pie Chart


thc hin biu hnh bnh, chng ta tip tc cc bc sau:
Bc 1: Lu nt chn Graph by categories c nh du
Bc 2: t con tr vo hp combo Category variable v nhp vo mi tn xung bn phi
s ra danh sch bin.
Bc 3: Dng con tr chn bin nghenghiep (ngh nghip m) trong danh sch bin tn bin
ny xut hin trn hp combo Category variable.
Bc 4: Nhp vo nt lnh OK

88

Chng ta s c c biu hnh bnh nh sau:


89
t u do cong nhan
vi en chuc

13. Hy to bin mi nhomtuoi, bin ny c gi tr
0 tng ng vi tui ca m t thp nht n 29
1 tong ng vi tui m t 30 n 34
2 tong ng vi tui m t 35 n 39
3 tong ng vi tui m t 40 tr ln
iu ny c ngha l chng ta chia tui m lm 4 nhm vi 3 im chia l 30, 35 v 40. iu ny
c th thc hin bng cch to bin mi vi hm irecode.




Cch thc hin vic to bin mi c thc hin vi menu Create or Change variables :: Create
new variable
30
29-30 34-35 39-40
0 1 2 3

90


Sau khi ca s generate - Generate a new variable thc hin vic to bin mi vi cc bc sau:

Bc 1: Nhp tn bin mi (nhomtuoi) vo hp vn bn Generate variable
Bic 2: Nhp cng thc to bin mi irecode(tuoime,29,34,39)
Bc 3: Nhp vo nt lnh OK hon tt
Sau khi to ra bin mi nhomtuoi, chng ta nn thc hin thm 2 bc: to nhn (define label
value) v dn nhn gi tr cho bin s (Assign value label to variable) nh c trnh by bi
5. (0 l di 30; 1 l 30 den 34; 2 l 35-39; 3 l 40+)

14. Hy to bin mi sinh non, bin ny c gi tr
1 tng ng vi tui thai <37
0 tong ng vi tui thai >=37 tun
Yu cu c ngha l chng ta cn to ra mt bin nh gi vi 2 gi tr 0 v 1.. iu ny c th
thc hin bng cch to bin mi v s dng biu thc boolean (biu thc th hin mt mnh

91
c gi tr l ng hay sai)
Vic thc hin c th bao gm vic to bin mi c thc hin vi menu Create or Change
variables :: Create new variable


Sau khi ca s generate - Generate a new variable thc hin vic to bin mi vi cc bc sau:


Bc 1: Nhp tn bin mi (sinhnon) vo hp vn bn Generate variable
Bic 2: Nhp cng thc to bin mi tuoithai<37
Bc 3: Nhp vo nt lnh OK hon tt
Sau khi to ra bin mi sinhnon, chng ta nn thc hin thm 2 bc: to nhn (define label
value) v dn nhn gi tr cho bin s (Assign value label to variable) nh c trnh by bi
5. (1 l sinh non, 0 l khng sinh non)
15. Lu li s liu
Hng dn: lu s liu chng ta c th s dng menu File :: Save (hay Ctrl-S) hoc nhn

92
vo nt save file (v tr th hai ca thanh cng c). Mt hp thoi s bt ln v hi chng ta
c mun chp chng vo tp tin s liu hay khng. Nu ng chng ta hy nhp vo nt OK
ng .

Nu chng ta khng mun thay i tp tin s liu c, chng ta nn nhp vo nt Cancel v lu s
liu vi tn mi s dng menu File :: Save As. khi hp thoi "Save Stata Data File" s hin
ra. G tn mi vo hp File Name (th d nu chng ta mun t tn tp tin l ivf_revised.dta th
chng ta g vo hp vn bn File name: ivf_revised)

nhp nt lnh Save hon tt.
16. Hy thot khi chng trnh Stata
Hng dn:
thot khi Stata/SE 10.0 for Windows chng ta c th thc hin mt trong 2 vic sau:
- Nhp vo ng nm pha trn phi ca ca s Stata

93
Lu : Trong trng hp c d liu trong b nh v d liu c thay i nhng
cha c lu vo a th khi chng nhp vo ng, my tnh s hi chng ta rng
chng ta c mun thot m khng lu li s liu hay khng.

Nu chng ta ng bng cch nhp vo nt lnh Yes th Stata s thot, nu khng (nhp
nt lnh No) th chng ta li tr li Stata chng ta c th lu li s liu.
- G lnh exit trong ca s Stata Command.
Lu : Trong trng hp c d liu trong b nh v d liu c thay i nhng
cha c lu vo a th khi chng g exit, my tnh s khng ng cho chng ta thot
v s thng bo no; data in memory would be lost. Trong trng hp ny nu chng
mun thot m khng lu li s liu th chng ta hy g exit, clear. Nu chng ta mun
lu li s liu hy s dng lnh save.
17. Nu chng ta mun xem li cc kt qu phn tch c thc hin chng ta c th xem li
tp tin log.
Cch xem li tp tin log gm cc bc sau:



94

Bc 1: Vo menu File:: View
Bc 2: Khi hin ra hp thoi Choose file to View, nhp vo nt lnh Browse, khi ca s
Choose file Name s hin ra
Bc 3: Trn ca s Choose file Name, chn thmc cha tp tin log trong hp thoi Log gin
Bc 4: Chn tp tin log cn xem li (th d tp tin baitap.smcl)
Bc 5: Nhp vo nt lnh Open ng ca s Choose file Name v tr v hp thoi Choose
file to view

Bc 6: Nhp vo nt lnh OK xem tp tin log

95





96
Thng k phn tch bin s nh lng vi Stata
S lc l thuyt v so snh 2 trung bnh
Kim nh t dng so snh 2 trung bnh ca ca bin s nh lng c phn phi bnh thng.
Kim nh t gm c (a) Kim nh t bt cp so snh trung bnh trc v sau khi can thip trn
mt nhm v (b) kim nh t khng bt cp so snh trung bnh ca 2 nhm c lp.
C hai loi kim nh t khng bt cp (khi so snh trung bnh ca 2 nhm c lp). Kim nh t
c gi nh 2 phng sai bng nhau v kim nh t khng c gi nh phng sai bng nhau. Hai
loi kim nh ny c chung nguyn l nhng khc nhau trong cch tnh ton t do (ca kim
nh t) v cch tnh sai s chun.
Kim nh t khng bt cp gi nh 2 phng sai bng nhau
Kim nh t khng bt cp gi nh 2 phng sai bng nhau dng so snh trung bnh ca 2
nhm c lp v i hi 2 gi nh.
- Cc gi tr ca bin s ca c 2 dn s c phn phi bnh thng
- lch chun 2 nhm dn s l bng nhau.
Nu chng ta k hiu:
x
1
: gi tr trung bnh nhm 1
x
2
: gi tr trung bnh nhm 2
n
1
: c mu ca nhm 1
n
2
: c mu ca nhm 2
s
1
2
: phng sai nhm 1
s
2
2
: phng sai nhm 2
Chng ta c th xc nh t do, sai s chun v gi tr ca thng k t theo cng thc sau:
- t do ca kim nh t: df = n
1
+ n
2
- 2
- Sai s chun:
2 1
/ 1 / 1 n n s se
p
+ = vi
) 1 ( ) 1 (
) 1 ( ) 1 (
2 1
2
2 2
2
1 1
+
+
=
n n
s n s n
s
p

- Gi tr thng k t:
2 1
2 1 2 1
/ 1 / 1 n n s
x x
se
x x
t
p
+

=
Sau khi tnh c gi tr thng k t, ngi ta tra bng phn phi t vi (n1 +n1 - 2) t do v
tnh c xc sut p. Thng thng nu p <0,05 ngi ta bc b gi thuyt H
0
.
Kim nh t khng bt cp khng c gi nh 2 phng sai bng nhau
Kim nh t khng bt cp gi nh 2 phng sai bng nhau dng so snh trung bnh ca 2
nhm c lp v ch i hi 1 gi nh.
- Cc gi tr ca bin s ca c 2 dn s c phn phi bnh thng
Nu chng ta k hiu:
x
1
: gi tr trung bnh nhm 1
x
2
: gi tr trung bnh nhm 2

97
n
1
: c mu ca nhm 1
n
2
: c mu ca nhm 2
s
1
2
: phng sai nhm 1
s
2
2
: phng sai nhm 2
Chng ta c th xc nh t do, sai s chun v gi tr ca thng k t theo cng thc sau:
- t do ca kim nh t (theo cng thc ca Satterthwaite):

+
=
) 1 ( ) 1 (
. .
2
2
2
4
2
1
2
1
4
1
2
2
2
2
1
2
1
n n
s
n n
s
n
s
n
s
f d
- Sai s chun:
2
2
2
1
2
1
n
s
n
s
se + =
- Gi tr thng k t:
2
2
2
1
2
1
2 1 2 1
n
s
n
s
x x
se
x x
t
+

=
Sau khi tnh c gi tr thng k t, ngi ta tra bng phn phi t vi t do ph hp (nh tnh
ton trn) v tnh c xc sut p. Thng thng nu p <0,05 ngi ta bc b gi thuyt H
0
.
Kim nh t bt cp
Gi s so snh hiu qu ca thuc A v thuc B trong ci thin th tch th ra gng sc trong
1 giy u tin (FEV1) ngi ta cho cc bnh nhn tham gia nghin cu dng thuc A (hay
thuc B) trong mt thi gian v cui thi gian ny o lng FEV1 ca bnh nhn (gi l
FEV1
A
). Sau cho li i cho bnh nhn dng thuc B (hay thuc A) trong mt khong thi
gian v cui thi gian ny li o lng FEV1 ca bnh nhn (gi l FEV1
B
). Thit k nghin
cu ny c gi l th nghim lm sng bt cho. Chng ta lu cc c im sau khi phn
tch thng k cho cc nghin cu c cng loi thit k ny.
- Trong nghin cu ny c 2 bin s o lng trn cng dn s: FEV1
A
v FEV1
B

- Cc gi tr ca bin s FEV1
A
v FEV1
B
l ca cng mt bnh nhn nn hiu s (FEV1
A
-
FEV1
B
) cng l bin s ca bnh nhn . V nu khng c s khc bit v hiu qu ca 2 loi
thuc, trung bnh ca hiu s ny bng 0.
- Khi kim nh so snh hiu qu ca thuc A v thuc B cng kim nh so snh gi tr
trung bnh ca FEV1
A
v FEV1
B
kim nh hiu s (FEV1
A
- FEV1
B
)=0
- Php kim nh ny c gi l kim nh t bt cp. Kim nh t bt cp l trng hp c bit
ca kim nh t mt mu.
Tm li kim nh t bt cp l kim nh c s dng khi thit k nghin cu cho mt i
tng (hay 2 i tng rt ging nhau) c th nghim 2 loi thuc khc nhau.
Kim nh phi tham s
Nu phn phi khng phi l bnh thng (th d nh b lch dng), c th s dng php bin
i (thng l bin i log) a phn phi v bnh thng hoc dng test phi tham s. Kim
nh phi tham s c u im l khng i hi gi nh v phn phi ca bin s nh lng
nhng c khuyt im l khng th c lng c tham s, l nh khng th c lng

98
khong tin cy 95% hiu s ca trung bnh gia 2 nhm.
S lc l thuyt v so snh cc trung bnh ca 3 nhm.
Khi chng ta cn so snh trung bnh ca nhiu nhm, chng ta khng th dng nhiu kim nh t
so snh tng cp ca nhm v nh vy chng ta s lm tng nguy c ca sai lm loi 1.
Phng php thch hp c dng cho trng hp ny c gi l test ANOVA. Test
ANOVA (phn tch phng sai) c xem nh l s tng qut ha ca test t (test t dng cho 2
nhm v test ANOVA dng cho 2 hay nhiu hn cc nhm). iu kin test ANOVA hp l l
cc gi tr c phn phi bnh thng v phng sai ca cc nhm xp x nhau.
Trong kt xut ca test ANOVA, chng ta thy c s hin din ca thng k F (thng k Fisher).
Trong trng hp ch c 2 nhm, thng k F chnh xc bng bnh phng ca thng k t v 2
phng php cho ra cng mt mc ngha.

n

Hnh 1. Gii thut la chn kim nh ph hp cho bin s ph thuc l bin nh lng
Thc hnh
1- M tp tin ivf_v2.
Chng ta hy khi ng Stata. M tp tin ivf_v2.dta bng cch s dng menu File :: Open hay
nhp vo nt cng c Open file (Use), nm v tr th hai ca thanh cng c. Khi hp
thoi Use New Data s hin ra. Nhp vo mi tn bn phi hp Look in chn a thch hp
v dng con chut nhp vo cc th mc chn th mc c cha s liu. Khi gp tp tin s liu
BPT: phn phi bnh
thng
2 nhm
Phng sai ng nht
BPT: nh lng
Phn phi bnh thng
BPT: th t
Kim nh phi tham s
BPT: danh nh
Kim nh
2

Kim nh t
Kim nh t
PS khng ng nht
Phng sai ng nht
ANOVA
ng
ng
ng
Trn 3 nhm
Khng ng nht
ng nht
ng nht
Khng ng nht
ng ng

99
ivf_v2.dta, nhp p vo tn tp tin ny m tp tin (hoc nhp vo tp tin ny tn tp tin
ri vo hp File Name ri sau nhp vo nt lnh Open m tp tin). Cn nh nhp vo nt
cng c Stata Log nm v tr th t t tri trn thanh cng c nu mun lu tr li ton b
kt qu phn tch s c thc hin.
2. Sau khi m tp tin, cn c thng tin g trc khi phn tch s liu:
Trc khi phn tch s liu, nh nghin cu (hay chuyn vin thng k) cn c li cng
nghin cu, c bit l s liu (bin s v s cc bn ghi), mc tiu v thit k nghin cu. Gi
s chng ta c thng tin v nghin cu nh sau:
MRC Working Party on Children Conceived by In Vitro Fertilisation. Births in Great Britain
resulting from assisted conception, 1978-87. BMJ 1990;300:1229-33.
Births in Great Britain resulting from assisted conception, 1978-87. MRC Working Party on
Children Conceived by In Vitro Fertilisation.

OBJECTIVE--To describe the characteristics at birth of children conceived by in vitro
fertilisation (IVF) or by gamete intrafallopian transfer (GIFT) and to assess whether they differ
from those of children conceived naturally. DESIGN--Survey of children resulting from IVF or
GIFT and comparison of their characteristics at birth with national statistics. SETTING--
England, Scotland, and Wales from 1978 to 1987. SUBJECTS--1267 Pregnancies conceived by
IVF or GIFT, which resulted in 1581 liveborn or stillborn children. MAIN OUTCOME
MEASURES--Sex ratio, multiplicity, gestational age at birth, birth weight, stillbirth rate,
perinatal and infant mortality, and prevalence of congenital malformations. RESULTS--The ratio
of male to female births was 1.07:1; 23% (249/1092) of the deliveries were multiple births
compared with 1% for natural conceptions; 24% (278) of 1015 deliveries were preterm
compared with 6% in England and Wales; 32% (406) of 1269 babies weighed less than 2500 g
compared with 7% in England and Wales. The high percentage of preterm deliveries and of low
birthweight babies was largely, but not entirely, due to the high frequency of multiple births. The
rate of stillbirth, perinatal mortality, and infant mortality were twice the national average, these
excesses being due to the high frequency of multiple births. One or more major congenital
malformations were detected during the first week of life in 35 (2.2%) of 1581 babies. This
figure is comparable with population based estimates of the prevalence of congenital
malformations. The types of malformations reported varied, and the number of each specific type
was small. The health of the children was not evaluated beyond the perinatal period.
CONCLUSIONS--Multiple pregnancies often result from assisted conception and are the main
determinant of the outcome of the pregnancies and of the health of the children at the time of
birth. Congenital malformations are comparatively rare, so larger numbers of children need to be
studied before firm conclusions can be drawn. The pooling of data from different countries is
recommended.

PMID: 2354290 [PubMed - indexed for MEDLINE]
S liu ny bao gm nhng bin s v nhng a tr sinh mt ca nhng b m c th thai
trong ng nghim (in-vitro fertilisation). Nghin cu ny c bo co trong tp ch BMJ
(1990;300:1229-1233). Tp tin ny bao gm 641 a tr v gm 8 bin s c chi tit nh sau:
STT Tn bin Gii thch ting Anh Gii thch ting Vit

100
1 Maso
identity number of mother and
baby
M s
2 tuoime maternal age in years Tui ca m (nm tui)
3 tang_ha hypertension 1=yes, 0=no
Tng huyt p thai k 1= c 0 =
khng
4 tuoithai gestational age in weeks Tui thai (tnh theo tun)
5 gioi sex of baby 1=male, 0=female Gii tnh ca tr 1=trai 0=gi
6 tlsosinh birth weight in gms Trng lng sinh tnh theo grams.
7 nghenghiep
Occupation of mother (1= self
employed; 2=blue collar
worker; 3=white collar worker)
Ngh nghip m (1= ngh t do;
2=cng nhn; 3=vin chc)
8 nhomtuoi
maternal age groups(0=<30;
1=30-34;2=35-39;3=40+)
Tui ca m phn nhm (0=<30;
1=30-34; 2=35-39; 3=40+)
9 sinhnon
gestational category (1= <37
tun; 0=37+tun)
Sinh non (1: di 37 tun; 0: thng
trn 37 tun thai)
Vic nhn bit s liu cng c th thc hin bng cch s dng lnh describe (nhn phm F3).
iu ny c bit c ch nu cc bin s v gi tr ca bin s c dn nhn y .
Trong nghin cu ny, tc gi mun xc nh tc ng ca tng huyt p ca m v tui thai ln
trng lng thai.
3. Nh vy trong cc bin s k trn, bin no l bin c lp, bin no l bin s ph thuc, bin
s ny l gy nhiu.
Hng dn:
Bng s liu viewivf ny c cha nhng bin s khc nhau. Trong bng sau hy xc nh tnh
cht ca tng bin s bng cch khoanh trn vo la chn thch hp.
Bin s Thang o bin s Quan h
tuoime
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
tang_ha
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
tuoithai
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
gioi
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
tlsosinh
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
nghenghiep - Nh gi - Danh nh - c lp - Ph thuc

101
- Th t - nh lng - Gy nhiu
nhomtuoi
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
sinhnon
- Nh gi - Danh nh
- Th t - nh lng
- c lp - Ph thuc
- Gy nhiu
4. Trc khi phn tch s liu cn thc hin thao tc s liu v cc thng k m t. Thc hin li
cc bc thao tc s liu v thng k m t nh chng trc

5. Hy so snh trng lng ca tr nam v tr n
Hng dn: Theo gii thut c trnh by u chng, so snh trng lng (bin ph
thuc c phn phi bnh thng) 2 nhm trc tin chng ta cn phi xem phng sai ca 2
nhm c bng nhau hay khng. Nu phng sai 2 nhm tng ng chng ta c th s dng t-
test thng thng (t-test phng sai ng nht). Nu phng sai 2 nhm khng tng ng,
chng ta phi s dng t-test phng sai khng ng nht hay kim nh phi tham s.
Kim nh 1: So snh 2 phng sai
so snh trung bnh ca mt bin nh lng hai hay nhiu nhm, chng ta s dng menu
Statistics :: Summaries, tables, & tests :: Classical tests of hypothesis :: Group variance
comparison test.


102
Sau khi ca s sdtest Two sample test of variance hin ra tin hnh 5 bc sau:

Bc 1: t con tr vo hp vn bn Variable name
Bc 2: a con tr vo ca s Variables v nhp vo bin tlsosinh a bin ny vo hp vn
bn Variable name
Bc 3: t con tr vo hp vn bn Group name variable
Bc 4: a con tr vo ca s Variables v nhp vo bin gioi a bin ny vo hp vn bn
Group name variable.
Bc 5: Nhp vo nt lnh OK.
Kt qu c trnh by nh sau:
. sdtest tlsosinh, by(gioi)

Variance ratio test

------------------------------------------------------------------------------
Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
---------+--------------------------------------------------------------------
gai | 315 3044.127 35.421 628.6603 2974.434 3113.819
trai | 326 3211.279 36.88521 665.9798 3138.715 3283.843
---------+--------------------------------------------------------------------
combined | 641 3129.137 25.78336 652.7827 3078.507 3179.767
------------------------------------------------------------------------------

Ho: sd(gai) = sd(trai)

F(314,325) observed = F_obs = 0.891
F(314,325) lower tail = F_L = F_obs = 0.891
F(314,325) upper tail = F_U = 1/F_obs = 1.122

Ha: sd(gai) < sd(trai) Ha: sd(gai) != sd(trai) Ha: sd(gai) > sd(trai)
P < F_obs = 0.1518 P < F_L + P > F_U = 0.3032 P > F_obs = 0.8482

Vi gi tr p = 0,3032 chng ta khng th bc b gi thuyt Ho: lch chun ca nhm tr trai

103
bng lch chun ca nhm tr gi. V vy chng ta c th s dng kim nh t phng sai
ng nht nh bc 2.
Kim nh 2: So snh 2 trung bnh s dng t-test phng sai ng nht.
so snh trung bnh ca mt bin nh lng hai hay nhiu nhm, chng ta s dng menu
Statistics :: Summaries, tables, & tests :: Classical tests of hypothesis :: Group mean comparison
test

Ca s ttest- group mean comparision tests hin ra. Tin hnh cc bc sau:



104
Bc 1: t con tr vo hp vn bn Variable name
Bc 2: a con tr vo ca s Variables v nhp vo bin tlsosinh a bin ny vo hp vn
bn Variable name
Bc 3: t con tr vo hp vn bn Group name variable
Bc 4: a con tr vo ca s Variables v nhp vo bin gioi a bin ny vo hp vn bn
Group name variable.
Bc 5: Nhp vo nt lnh OK.
. ttest tlsosinh, by(gioi)

Two-sample t test with equal variances

------------------------------------------------------------------------------
Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
---------+--------------------------------------------------------------------
gai | 315 3044.127 35.421 628.6603 2974.434 3113.819
trai | 326 3211.279 36.88521 665.9798 3138.715 3283.843
---------+--------------------------------------------------------------------
combined | 641 3129.137 25.78336 652.7827 3078.507 3179.767
---------+--------------------------------------------------------------------
diff | -167.1522 51.18935 -267.6718 -66.63249
------------------------------------------------------------------------------
Degrees of freedom: 639

Ho: mean(gai) - mean(trai) = diff = 0

Ha: diff < 0 Ha: diff != 0 Ha: diff > 0
t = -3.2654 t = -3.2654 t = -3.2654
P < t = 0.0006 P > |t| = 0.0012 P > t = 0.9994


Tr li: Tr trai c trng lng s sinh trung bnh l 3211.28 gram, ca tr gi l 3044.13 gram.
Vi gi tr t = 3,2654 v mc ngha (p-value) l 0.0012 chng ta kt lun c s khc bit v
trng lng s sinh gia tr trai v tr gi (p=0.0012).
6. Hy so snh trng lng s sinh ca con b m tng huyt p v b m khng tng huyt p.
Hng dn: Theo gii thut c trnh by u chng, so snh trng lng (bin ph
thuc c phn phi bnh thng) 2 nhm trc tin chng ta cn phi xem phng sai ca 2
nhm m tng huyt p v m khng tng huyt p c bng nhau hay khng. Nu phng sai 2
nhm tng ng chng ta c th s dng t-test thng thng (t-test phng sai ng nht).
Nu phng sai 2 nhm khng tng ng, chng ta phi s dng t-test phng sai khng
ng nht hay kim nh phi tham s.
Kim nh 1: So snh 2 phng sai
so snh trung bnh ca mt bin nh lng hai hay nhiu nhm, chng ta s dng menu
Statistics :: Summaries, tables, & tests :: Classical tests of hypothesis :: Group variance
comparison test.
Sau khi ca s sdtest - Group variance comparison test chng ta a bin tlsosinh vo hp vn
bn Variable name v bin tang_ha vo hp vn bn Group name variable ri nhp vo nt lnh
OK.
Kt qu c trnh by nh sau:
. sdtest tlsosinh, by( tang_ha )
Variance ratio test

105

------------------------------------------------------------------------------
Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
---------+--------------------------------------------------------------------
Ha bt | 552 3191.531 25.58435 601.0962 3141.276 3241.786
Ha tang | 89 2742.157 86.17222 812.9471 2570.908 2913.406
---------+--------------------------------------------------------------------
combined | 641 3129.137 25.78336 652.7827 3078.507 3179.767
------------------------------------------------------------------------------

Ho: sd(huyet ap) = sd(huyet ap)

F(551,88) observed = F_obs = 0.547
F(551,88) lower tail = F_L = F_obs = 0.547
F(551,88) upper tail = F_U = 1/F_obs = 1.829

Ha: sd(1) < sd(2) Ha: sd(1) != sd(2) Ha: sd(1) > sd(2)
P < F_obs = 0.0000 P < F_L + P > F_U = 0.0003 P > F_obs = 1.0000

Kt qu cho thy gi tr p = 0,0003 c ngha l phng sai ca trng lng lc sinh ca 2 nhm
khng ng nht. V vy chng ta khng th dng t-test phng sai ng nht m phi s dng t-
test phng sai khng ng nht (kim nh 2A) hay kim nh phi tham s (kim nh 2B).
Kim nh 2A: so snh 2 trung bnh t-test phng sai khng ng nht
so snh trung bnh ca mt bin nh lng hai hay nhiu nhm, chng ta s dng menu
Statistics :: Summaries, tables, & tests :: Classical tests of hypothesis :: Group mean comparison
test (xem li cu 4) v bin tlsosinh vo hp vn bn Variable name; bin tang_ha vo hp vn
bn Group name variable ca ca s ttest- group mean comparison. Cn lu nh du vo hp
kim Unequal variances ri nhp vo nt OK.


Kt qu trnh by nh sau:
. ttest tlsosinh, by(tang_ha) unequal

106

Two-sample t test with unequal variances

------------------------------------------------------------------------------
Group | Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
---------+--------------------------------------------------------------------
ha bt | 552 3191.531 25.58435 601.0962 3141.276 3241.786
ha tang | 89 2742.157 86.17222 812.9471 2570.908 2913.406
---------+--------------------------------------------------------------------
combined | 641 3129.137 25.78336 652.7827 3078.507 3179.767
---------+--------------------------------------------------------------------
diff | 449.3735 89.88999 271.1197 627.6273
------------------------------------------------------------------------------
Satterthwaite's degrees of freedom: 104.069

Ho: mean(ha bt) - mean(ha tang) = diff = 0

Ha: diff < 0 Ha: diff != 0 Ha: diff > 0
t = 4.9991 t = 4.9991 t = 4.9991
P < t = 1.0000 P > |t| = 0.0000 P > t = 0.0000


Tr li: Con b m b tng huyt p c trng lng s sinh trung bnh l 2742 gram, con ca
b m khng tng huyt p l 3192 gram. S khc bit ny c ngha thng k vi p<0,0001.
Kim nh 2B: so snh 2 trung bnh vi php kim phi tham s Mann-Whitney
Thc hin kim nh phi tham s tng sp hng Mann-Whitney (Mann-Whitney rank sum test)
bng dng menu Statistics :: Summaries, tables, & tests :: Non-parametric test of hypotheses ::
Mann-Whitney two-sample ranksum test.

Sau ca s ranksum - Mann-Whitney two-sample statistic hin ra.


107

Tin hnh cc bc sau:
Bc 1: t con tr vo hp vn bn Variable name
Bc 2: a con tr vo ca s Variables v nhp vo bin tlsosinh a bin ny vo hp vn
bn Variable name
Bc 3: t con tr vo hp vn bn Group name variable
Bc 4: a con tr vo ca s Variables v nhp vo bin tang_ha a bin ny vo hp vn
bn Group name variable.
Bc 5: Nhp vo nt lnh OK.
Kt qu nh sau:

. ranksum tlsosinh, by( tang_ha )

Two-sample Wilcoxon rank-sum (Mann-Whitney) test

tang_ha | obs rank sum expected
-------------+---------------------------------
ha bt | 552 185203 177192
ha tang | 89 20558 28569
-------------+---------------------------------
combined | 641 205761 205761

unadjusted variance 2628348.00
adjustment for ties -144.78
----------
adjusted variance 2628203.22

Ho: tlsosinh(tang_ha==ha bt) = tlsosinh(tang_ha==ha tang)
z = 4.941
Prob > |z| = 0.0000
7. Hy so snh trng lng s sinh ca tr sinh ra t con ca cc nhm ngh nghip khc nhau
ca ngi m.
Hng dn: so snh trung bnh ca mt bin nh lng nhiu nhm, chng ta phi s dng
phng php phn tch ANOVA mt chiu. S dng menu Statistics :: ANOVA/MANOVA ::
oneway analysis of variance

108

Do chng ta mun phn tch tc ng ca yu t ngh nghip m (nghenghiep) ln trng lng
sinh ca tr (tlsosinh) khi ca s oneway hin ln, ta tin hnh cc bc sau:
Bc 1: t con tr vo hp vn bn Response variable
Bc 2: a con tr vo ca s Variables v nhp vo bin tlsosinh a bin ny vo hp vn
bn Response Variable.
Bc 3: t con tr vo hp vn bn Factor
Bc 4: a con tr vo ca s Variables v nhp vo bin nghenghiep a bin ny vo hp
vn bn Factor.
Bc 5: nh du vo hp kim Produce summary table th hin thng k m t trng lng
s sinh trung bnh cc nhm ngh nghip
Bc 6: nh du vo hp kim Scheffe c kim nh so snh trng lng trung bnh tng
cp i ngh nghip khc nhau
Bc 7: Nhp vo nt lnh OK


109


Trn ca s Output, trn cng thng k m t ca s liu v trng lng s sinh theo nhm tui
ca m:
nghe nghiep |
me - 1=tu |
do, 2=cong | Summary of trong luong so sinh
nhan, | (gram)
3=vien chuc | Mean Std. Dev. Freq.
------------+------------------------------------
tu do | 2981.4135 643.76283 104
cong nhan | 3118.084 646.69338 238
vien chuc | 3189.3177 654.19649 299
------------+------------------------------------
Total | 3129.1373 652.78265 641


Con b m ngh nghip t do c trng lng trung bnh l 2981 gram, ca b m vi ngh
nghip l 3118 gram, ca b m vi ngh nghip vin chc l l 3190 gram. Chng ta bit kim
nh ANOVA c th s dng kim nh s khc bit v trung bnh ca nhiu nhm, nhng
trc tin chng ta hy kim tra cc iu kin ca phn tch ANOVA l (a) bin s ph thuc c
phn phi bnh thng - iu ny c xc nhn t th ca trng lng s sinh v (b)
phng sai ca bin ph thuc cc nhm bng nhau - iu ny cng c xc nhn qua thng
k Bartlett vi p-value l 0,973.
Analysis of Variance
Source SS df MS F Prob > F
------------------------------------------------------------------------
Between groups 3381483.56 2 1690741.78 4.00 0.0187
Within groups 269338638 638 422160.875
------------------------------------------------------------------------
Total 272720122 640 426125.19

Bartlett's test for equal variances: chi2(2) = 0.0558 Prob>chi2 = 0.973
V vy trong trng hp ny kim nh ANOVA l c gi tr. Ta c kt qu ca bng ANOVA.

110
Chng ta c c gi tr F = 0.0187 v mc ngha (p-value) l 0.9723 chng ta kt lun khng
c s khc bit v trng lng s sinh con ca nhng b m c ngh nghip khc nhau. Vi
kt lun ny chng ta c th kt lun l c t nht c 1 cp i (2 nhm) ngh nghip ca m c
s khc bit v trng lng con nhng chng ta khng bit l s khc bit ny cp i ngh
nghip no. bit cp i no c s khc bit ta xem kt xut ca so snh sau kim nh (post-
hoc test) ca Scheffe:
Comparison of trong luong so sinh (gram)
by nghe nghiep me - 1=tu do, 2=cong nhan, 3=vien chuc
(Scheffe)
Row Mean-|
Col Mean | tu do cong nha
---------+----------------------
cong nha | 136.671
| 0.202
|
vien chu | 207.904 71.2337
| 0.020 0.451

Kt qu ca kim nh Scheffe c trnh by theo bng v mi ca bng c 2 con s: con s
trn th hin s khc bit v trng lng ca ngh nghip ca hng so vi ngh nghip ca ct
v gi tr di th hin gi tri p (mc ngha) ca s khc bit ny. Da vo gi tr p, c th
kt lun c s khc bit v trng lng s sinh ca con 2 nhm ngh nghip vin chc v t do
(gi tr p=0,020) v nhm ngh nghip vin chc c trng lng trung bnh cao hn nhm ngh
nghip t do l 207,9 gram.
Nhc li l thuyt v Tng quan v c lng
Tng quan l s o mc hai bin s nh lng cng thay i vi nhau. C nhiu loi h s
tng quan, nhng chng u c gi tr t -1 n 1. Nu chng c gi tr dng c ngha l hai
bin s ng bin vi nhau, nu chng c gi tr m ngha l hai bin s nghch bin. Gi tr
tuyt i ca h s tng quan cng gn mt ngha l hai bin s c lin h cht vi nhau v vai
tr ca sai s ngu nhin s t hn. Nu h s tng quan c gi tr bng zero c ngha l hai bin
s c lp v khng quan h g vi nhau. Khi tr tuyt i ca h s tng quan bng mt c
ngha l hon ton khng c sai s ngu nhin. Bnh phng ca h s tng quan (r
2
) th hin t
l cc bin thin ca bin s ph thuc c th c gii thch bng bin s c lp.
Loi h s tng quan c s dng ph bin nht l h s tng quan Pearson r:



=
2 2
) ( ) (
) )( (
y y x x
y y x x
r
i i
i i


L gii ngha ca h s tng quan:
- H s tng quan lun lun nm trong on [-1,1]
- H s tng quan r dng chng t hai bin s l ng bin; h s tng quan r m chng t
hai bin s l nghch bin; h s tng quan bng zero nu hai bin khng lin h.
- Tr s tuyt i ca h s tng quan r ni ln mc lin quan gia hai bin s. Nu tr tuyt
i ca r bng 1 (r=1 hay r=-1), quan h hon ton tuyn tnh ngha l tt c cc im nm trn
ng hi quy (Hnh 9.2 d v 9.2f). Nu tr tuyt i ca r nh hn 1 s c cc im s liu phn
tn chung quanh ng hi quy.

111
- Bnh phng ca h s tng quan (r
2
) th hin t l bin thin ca bin s ph thuc c gii
thch bng s bin thin ca bin s c lp (nu mi lin h ny l nhn qu)
- Nu r=0, khng c mi lin h tuyn tnh gia hai bin s. iu ny c ngha l (1) khng c
mi lin h g gia hai bin s hoc (2) mi lin h gia hai bin s khng phi l tuyn tnh.
- Theo quy c, quan h vi r t 0,1 n 0,3 l quan h yu, t 0,3 n 0,5 quan h trung bnh v
trn 0,5 l quan h mnh. iu quan trng l s tng quan gia hai bin s cho thy s lin h
nhng khng nht thit c ngha l c quan h 'nhn qu'.

kim nh h s tng quan Pearson c thc s khc 0 hay khng, kim nh t c th c s
dng
t r
n
r
=

2
1
2
c phn phi student vi n-2 t do.
Hi quy
Hi quy l mt m hnh ton hc m t s bin i ca mt bin s ny theo nhng bin s khc.
Mt phng trnh hi quy c th c dng nh sau:
cn nng (kg) = 6,85 + 0,18 x thng tui
(phng trnh hi quy tnh cn nng ca tr t 9 n 40 thng tui theo thng tui)
theo phng trnh ny ngi ta gi:
cn nng: bin s ph thuc
thng tui: bin s c lp
6,85: h s ca hng s (Constant), hay cn gi l im chn (intercept)
0,18: h s (Coeficient) ca bin s thng tui hay cn gi l dc (Slope) ca ng hi
quy
9. V phn tn (scattergram) gia ca bin s tui thai (tuoithai) v trng lng thai
(tlsosinh).
Hng dn: s dng menu Graphics :: Overlaid twoway graph


hin ra ca s twoway Twoway graphs

112

Trn ca s twoway Twoway graphs, nhp tn bin s ph thuc vo hp Y-axis variable v
tn bin s c lp vo hp X-axis variable sau nhp OK xem biu phn tn. Cch lm
c th tng bc nh sau:
Bc 1: Trn hp combo Type chn Scatter
Bc 2: t tn bin s c lp (tuoithai) vo vn bn X
Bc 3: t tn bin s ph thuc (tlsosinh) vo vn bn Y
Bc 4: Nhp nt lnh OK


C th cho th phn tn. Tuy nhin chng ta c th thm cc ty chn thc hin cc yu
cu sau:
B sung tiu trng lng tr s sinh (gam)" cho trc tung
Cho cc gi tr trc y t 500 n 5000 gram v chia cc khong 500 gram.
B sung tiu tuoi thai (tuan tuoi)" cho trc honh

113
Cho cc gi tr ca trc x t 24 tun tui n 42 tun tui v chia lm cc khong 4 tun
Bng cch trong ca s Trn ca s twoway Twoway graphs thc hin cc bc:
Trn th Plot 1: Bc 1: Trn hp combo Type chn Scatter
Bc 2: t tn bin s c lp (tuoithai) vo vn bn X
Bc 3: t tn bin s ph thuc (tlsosinh) vo vn bn Y
Trn th Y-Axis: Bc 4: Trn hp vn bn Title g "Trong luong tre so sinh (gam)"
Bc 5: Trn hp vn bn Rule g quy tc "500(500)5000"
Bc 6: Trn hp combo Angle chn "Horizontal"
Trn th X-Axis: Bc 7: Trn hp vn bn Title g "Tuoi thai (tuan)"
Bc 8: Trn hp vn bn Rule g quy tc "24(2)42"
V nhp vo nt lnh OK.

Tr li: C s tng quan thun tuyn tnh gia trng lng s sinh v tui thai. Mi tng quan
ny kh cht do m my c tnh cht i ln (khi n i v phi) v c ng knh b nh hn
nhiu so vi ng knh ln.
10. Hy xc nh h s tng quan gia trng lng s sinh (tlsosinh), tui thai (tuoithai) v tui
ca m (tuoime)
Hng dn: S dng menu Statistics :: Summaries, tables, & tests :: Summary statistics ::
Pairwise correlations.

114

Khi hp thoi pwcorr Pairwise correlations of variables s hin ra.


115
Tin hnh cc bc sau:
Bc 1: Nhp con tr chut vo hp vn bn Variables
Bc 2: a con tr chut vo ca s Variables v nhp vo cc bin tlsosinh, bin tuoithai v
bin tuoime tn 3 bin ny xut hin hp vn bn Variables.
Bc 3: nh du vo hp kim Print significance level for each entry
Bc 4: nh du vo hp kim Significance level for displaying with a star.
Bc 5: Nhp vo nt lnh OK xem kt qu.

. pwcorr tlsosinh tuoithai tuoime, sig star(5)

| tlsosinh tuoithai tuoime
-------------+---------------------------
tlsosinh | 1.0000
|
|
tuoithai | 0.7376* 1.0000
| 0.0000
|
tuoime | 0.0337 0.0151 1.0000
| 0.3941 0.7026

Tr li: Chng trnh cho kt qu h s tng quan ca trng lng thai vi trng lng thai l
1, gia trng lng thai v tui thai l 0.7376 (gi tr p=0,0000), gia trng lng thai v tui
ca m l 0,0337 (gi tr p = 0,3941). Nh vy c s tng quan mnh c ngha thng k gia
trng lng thai v tui thai trong khi s tng quan gia trng lng thai v tui m rt yu
v khng c ngha thng k. Do c s lin h c ngha thng k (gi tr p <0,05) gia trng
lng thai v tui thai nn gi tr ca h s tng quan c nh du sao (*).
11. Hy xy dng phng trnh hi quy ca trng lng thai theo tui thai.
Hng dn: S dng phng php hi quy n bng cch nhp vo menu "Statistics :: Linear
regression and related :: Linear regression" hin ra hp thoi regress Linear regression

116

Nhp tn bin s ph thuc vo hp Dependent variable v tn bin s c lp vo hp
Independent variable ri nhn OK tip tc.
Kt qu c trnh by nh sau:
. regress tlsosinh tuoithai
Source | SS df MS Number of obs = 641
---------+------------------------------ F( 1, 639) = 762.25
Model | 148354317 1 148354317 Prob > F = 0.0000
Residual | 124365805 639 194625.673 R-squared = 0.5440
---------+------------------------------ Adj R-squared = 0.5433
Total | 272720122 640 426125.19 Root MSE = 441.16

tlsosinh | Coef. Std. Err. t P>|t| [95% Conf. Interval]
---------+--------------------------------------------------------------------
tuoithai | 206.6412 7.484572 27.609 0.000 191.9439 221.3386
_cons | -4865.245 290.0814 -16.772 0.000 -5434.873 -4295.617
Tr li: H s tng quan bnh phng R-squared = 0.544 = 54.4% ni ln tui thai c th gii
thch cho 54.4% s thay i v trng lng s sinh. Bng ANOVA cho bit c tng cc sai lch
ca bnh phng trng lng s sinh 272.720.122 (272.7 triu) m phng trnh hi quy c th
gii thch cho 148.3 triu ca s sai lch ny (nh vy cn 124.4 triu tng bnh phng sai lch
cha c gii thch gi l Residual Sum of Square v gi tr 0.45 chnh l gi tr 148.3/272.7).
Mc ngha c trnh by trong bng ANOVA cho bit mc ngha ca phng trnh.
Da vo bng cc h s chng ta c th xy dng phng trnh hi quy nh sau:
Trng lng s sinh = -4865.245 + 206.641 x tui thai (tnh theo tun).
Mc ngha (P-value) ca bin s tui thai (Gestational age) l kt qu ca kim nh ngha
ca bin s ny trong phng trnh c thc s khc khng hay khng.
H s (coefficient) ca bin s c lp ni ln s thay i ca bin s ph thuc khi bin s

117
c lp thay i mt n v. Trong phng trnh ny (vi bin s c lp l TUOITHAI v bin
s ph thuc l TLSOSINH) chng ta c th l gii nu a tr ln hn 1 tun tui trng lng
lc sanh ca n s tng thm 206.641 gram.
12. Hy xy dng phng trnh hi quy ca trng lng thai theo tui thai, gii tnh ca tr v
huyt p cao ca m.
Hng dn: S dng phng php hi quy n bng cch nhp vo menu "Statistics :: Linear
regression and related :: Linear regression" hin ra hp thoi regress Linear regression

Nhp tn bin s ph thuc (tlsosinh) vo hp Dpendent variable v tn cc bin s c lp
(tuoithai gioi tang_ha) vo hp Idependent variables, ri nhn OK tip tc. Khi hp thoi
chn on s hin ra. Tuy nhin nu chng ta khng quan tm n vic chn on cc vn
trong phng trnh hi quy chng ta hy nhp vo nt Cancel.
. regress tlsosinh tuoithai gioi tang_ha
Source | SS df MS Number of obs = 641
-------------+------------------------------ F( 3, 637) = 275.43
Model | 153998584 3 51332861.4 Prob > F = 0.0000
Residual | 118721538 637 186376.04 R-squared = 0.5647
-------------+------------------------------ Adj R-squared = 0.5626
Total | 272720122 640 426125.19 Root MSE = 431.71

------------------------------------------------------------------------------
tlsosinh | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
tuoithai | 201.4248 7.541441 26.71 0.000 186.6157 216.2339
gioi | 167.8167 34.17884 4.91 0.000 100.6999 234.9335
tang_ha | -142.14 50.8685 -2.79 0.005 -242.0302 -42.24979
_cons | -4729.048 294.1447 -16.08 0.000 -5306.659 -4151.438
------------------------------------------------------------------------------
Tr li: Chng ta tm c r
2
(R-squared) l 0.5647 cho thy phng trnh hi quy gii thch

118
c 56.47% s bin thin ca trng lng thai v iu ny cho thy m hnh c c gii tnh v
tng huyt p gii thch tt hn so vi m hnh ch c tui thai (r
2
=0.54).
Chng ta cng c th vit c phng trnh hi quy theo kt qu trn:
Trng lng thai = -4729.048 + tui thai x 201.425 - tng huyt p x 142.14 + gii x 167.817
10. Bn c gi g trnh by phng trnh hi quy mt cch d hiu hn i vi ngi khng
chuyn v thng k.
Hng dn: Bi v ngi khng chuyn v thng k hay ngi cha c lm quen vi phng
php m ho s khng bit lm sao nhn tng huyt p vi 142.14 hay gii vi 167,817.
Chng ta nh li quy c ca tp tin ny:
Bin tng huyt p (tang_ha) c gi tr =0 nu m khng b tng huyt p
Bin gii tnh (gioi) c gi tr =0 nu tr l tr gi
a) Do phng trnh hi quy i vi tr gi c m khng tng huyt p l:
Trng lng thai = -4729.048 + tui thai x 201.425 (a)
b) tr trai vi m khng tng huyt p, trng bin s ph thuc ca phng trnh hi quy s
tng ln 167,817 gram nn phng trnh hi quy s l
Trng lng thai = -4561.23 + tui thai x 201.425 (b)
c) tr gi vi m b tng huyt p, trng bin s ph thuc ca phng trnh hi quy s s gim
i 142,14 gram so vi phng trnh (a) nn phng trnh hi quy cho nhm ny l
Trng lng thai = -4871.19 + tui thai x 201.425
d) tr trai vi m b tng huyt p, trng bin s ph thuc ca phng trnh hi quy s s
gim i 142,14 gram so vi phng trnh (b) nn phng trnh hi quy cho nhm ny l
Trng lng thai = -4703.37 + tui thai x 201.425
Do cc mc ngha (p-value) ca bin s u nh hn 0.05 nn tt c cc bin s c lp ca
m hnh u c ngha thng k v khng nn loi b khi m hnh.
13. Xt hai m hnh
trng lng thai = tui thai + tng huyt p m + gii tnh (cho h s ca bin s tui t hai l
201.4) trong khi m hnh
trng lng thai = tui thai (choh s ca bin s tui thai l 206.6). H s trong m hnh no l
ph hp hn nh gi s tng trng ca trng lng thai.
Tr li:
Chng ta c th gi nh yu t tng huyt p ca m l yu t gy nhiu. Do tng huyt p ca
m c th lm gim trng lng ca con v trong tng huyt p ca m ph bin hn nhm
sanh thiu thng nn a tr sinh sm 1 tun b mt trng lng l 206.6 gram nhng iu ny
l c do tc ng ca sanh non v c tc ng do tng huyt p mt s b m. Tuy nhin
nhm khng b tng huyt p tr sanh non mt tun ch b mt c 201.4 gram v do con s
201.4 l ph hp hn nh gi s tng trng ca trng lng thai.
Trn thc tin do con s 201.4 rt gn vi con s 206.6 nn c th b qua tc ng gy nhiu ca
tng huyt p ca m ln tc pht trin thai.

14. S dng kim nh t chng ta pht hin trng lng tr con cc b m b tng huyt p thp
hn con nhng ngi khng tng huyt p l 449.37 gram. Trong khi m hnh ca trng lng

119
sinh theo tui thai, tng huyt p m v gii tnh cho h s ca bin tng huyt p l 142.14
gram. Hy l gii nhng s liu ny?
Tr li: C hai con s 449.37 v 142.14 u ni ln s khc bit do tnh trng tng huyt p ca
m nhng con s 449.37 l con s khc bit th v con s 142.14 l con s khc bit c hiu
chnh theo thng tui v gii tnh. Da vo nhn xt trn ta c gii thch nhng con s ny nh
sau:
con cc b m b tng huyt p c trng lng nh con nhng ngi khng tng huyt p l
449.37 gram v iu ny do tc ng ca c tng huyt p, tui thai (v c tc ng ca gii
tnh nhng gi s chng ta bit rng tc ng gy nhiu cao gii tnh l khng ng k).
con cc b m b tng huyt p c trng lng nh con nhng ngi khng tng huyt p l
142.14 gram v iu ny do tc ng ca c tng huyt p khi khng xt n tc ng ca
tui thai. Nh vy tc ng do sinh thiu thng l 449.37-142.14 = 307.23 g
Cao huyet
ap me
Trong
lng con
Sinh thieu
thang
C che
khac
142.14g
449.37g

Nh vy % tc ng do c ch sinh thiu thng trong tng s tc ng ca tng huyt p m ln
trng lng ca con l:
% 68 68 . 0
37 . 449
22 . 307
37 . 449
14 . 142 37 . 449
= = =

= =
tho ong tac
chnh hieu ong tac - tho ong tac



Chng ta c th xem xt tc ng ca c ch sinh thiu thng trong khi so snh trng lng s
sinh ca 2 nhm m tng huyt p v m khng tng huyt p bng cch so snh tui thai trung
bnh gia 2 nhm. Nhm c m b tng huyt p c tui thai trung bnh l 37.3 tun trong khi
nhm m khng b tng huyt p c tui thai trung bnh l 38.9 v s khc bit v tui thai l 1.6
tun. S khc bit v tui thai s gii thch cho khong 200 gram/tun x 1.6 =320 gram trng
lng s sinh.

You might also like