Professional Documents
Culture Documents
176.0
155.6
171.5
165.8
162.0
156.1
166.2
171.7
166.8
160.8
160.6
165.1
151.9
172.4
158.6
162.5
155.3
162.7
157.2
165.2
158.4
170.0
166.0
162.0
164.4
158.4
157.9
155.8
158.8
161.8
165.3
167.4
166.9
149.6
176.6
156.8
167.4
161.4
162.7
163.8
158.0
166.4
162.0
159.9
159.5
167.8
171.8
163.4
157.1
164.2
155.3
162.3
152.5
157.0
149.9
168.7
170.2
148.3
165.9
174.7
164.2
167.1
147.6
154.6
164.0
164.6
178.7
160.9
162.7
158.2
157.2
154.0
163.6
162.3
162.2
170.6
171.7
156.1
176.7
162.3
159.0
159.3
163.5
171.2
162.0
165.2
171.5
165.6
172.1
168.9
148.3
155.8
157.9
159.9
162.2
163.6
165.2
166.9
170.0
172.1
149.6
156.1
158.0
160.6
162.3
163.8
165.2
167.1
170.2
172.2
149.9
156.1
158.2
160.8
162.3
164.0
165.3
167.3
170.6
172.4
151.9
156.8
158.4
160.9
162.3
164.0
165.6
167.4
171.1
174.7
152.5
157.0
158.4
161.4
162.5
164.2
165.8
167.4
171.2
176.0
154.0
157.0
158.6
161.8
162.7
164.2
165.9
167.7
171.5
176.1
154.6
157.1
158.8
162.0
162.7
164.4
166.0
167.8
171.5
176.6
155.3
157.2
159.0
162.0
162.7
164.5
166.2
168.7
171.7
176.7
155.3
157.2
159.3
162.0
163.4
164.6
166.4
168.9
171.7
178.7
Cch sp xp ny (ting Anh gi l sort) cho chng ta thy ngi c chiu cao
thp nht l 148.7 cm, v ngi cao nht l 178.7 cm. Nhng nu nhn k, chng ta cng
ch rng phn ln cc i tng c chiu cao khong 160 n 165 cm.
n y th cu hi t ra l c bao nhiu i tng vi mi chiu cao t 160 n
165 cm, v c bao nhiu i tng c chiu cao thp hn hay cao hn hai gi tr ? C
0.6
0.2
0.4
(1:n)/n
15
10
0.0
Frequency
20
0.8
25
1.0
145
150
155
160
165
Height
170
175
180
150
155
160
165
170
175
Height
Biu 1: (a) Mt phn phi ca chiu cao, vi trc tung l s i tng. (b) Biu
bn phi l xc sut tch ly (cumulative probability) ca chiu cao.
Trong Biu trn (pha tri), trc tung l s i tng v trc honh l chiu
cao. Nh bn c c th thy, c 4 i tng vi chiu cao t 145 n 150 cm, v t 151
n 155 cm. Tng t, ch c 4 i tng c chiu cao t 175 n 180 cm. ng nh
cm nhn ban u, nh ca biu l s i tng c chiu cao t 160 n 170 cm.
Biu bn phi th hin xc sut tch ly chiu cao. Nhn qua biu ny,
chng ta c th ni rng khong 30% i tng c chiu cao thp hn 160 cm, v khong
80% i tng c chiu cao thp hn hay bng 170 cm. Ni cch khc, s i tng c
chiu cao t 160 n 170 cm chim khong 50% tng s c mu.
Do , ni n phn phi l cp n tn s kh d (hay xc sut) ca cc gi
tr chiu cao.
V hnh dng, chng ta d dng thy rng s phn phi chiu cao 100 i tng
ny ging nh mt hnh chung. Cc phn phi c hnh dng ny c gi l Normal
distribution (ch N ca normal vit hoa), hay phn phi bnh thng. Nhng v tnh
cch chun ha ca phn phi ny, nn ti tm dch l phn phi chun. cho c v
khoa hc v tr thc mt cht (v lm cho nhiu ngi phi bc tc gi u), gii ton
hc thnh thong thm ch lut thnh lut phn phi!
Phn phi bnh thng cn c gi l Gaussian distribution, bi v ngi pht
hin ra lut phn phi ny l nh ton hc danh ting Carl F. Gauss (ngi c). Tht ra,
P X = x | , 2 = ?
P X = x | ,
( x )2
1
=
exp
2 2
2
[1]
Ch rng cng thc trn i khi cng xut hin trong cc sch gio khoa vi
mt hnh thc khc: thay v vit P ( X = x | , 2 ) , c tc gi vit kh hiu hn l f(x)!
Tt nhin, trong cng thc trn = 3.1416
Nh c th thy qua cng thc [1] trn y, lut phn phi chun c hon ton
xc nh bi 2 thng s: trung bnh v lch chun . Ni cch khc, nu chng ta
bit c 2 thng s ny, chng ta c th c tnh xc sut cho bt c chiu cao no. (Do
chng ta cn phi chn mu (sample) nghin cu nh th no cho cc c s ca
mu nghin cu l rt st vi cc thng s tng ng ca qun th. Phn ny c
cp chi tit trong bi chn mu nghin cu). Trong trng hp ca chng ta, c s
cho v chnh l s trung bnh v lch chun ca mu. Cc c s ny l (cc bn
c th kim tra):
Trung bnh: m = 163.3 cm
lch chun: s = 6.6 cm
Thay th cc c s ny cho cho v , chng ta c th tr li cu hi c bao nhiu n
ng ngi Vit Nam c chiu cao chnh xc l 160 cm:
(160 163.3)2
1
P ( X = 160 ) =
exp
= 0.0533
2
6.6 2 3.1416
2 ( 6.6 )
Chiu cao
(cm)
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
Xc sut
(tnh bng
%)
0.0118
0.0200
0.0331
0.0533
0.0840
0.1290
0.1947
0.2863
0.4116
0.5781
0.7935
1.0645
1.3958
1.7886
2.2398
2.7412
3.2788
3.8327
4.3786
4.8887
5.3343
Chiu cao
(cm)
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
Xc sut
(tnh bng
%)
5.6885
5.9285
6.0383
6.0107
5.8474
5.5594
5.1656
4.6908
4.1630
3.6107
3.0606
2.5354
2.0527
1.6242
1.2559
0.9491
0.7010
0.5060
0.3570
0.2461
0.1658
Probability
140
150
160
170
180
190
Height
Biu trn chnh l lut phn phi chun (theo cng thc [1]). Tt nhin, tng
din tch di ng biu din phi bng 1 (hay 100%). iu ny c ngha l nu chng
ta mun c tnh xc sut cho bt c khong chiu cao no. V d nu chng ta mun
bit c bao nhiu n ng Vit Nam c chiu thp hn 150 cm, chng ta ch cn tnh din
tch m trc honh t 150 cm hay thp hn di ng biu din. Pht biu theo ngn
ng ton hc cu hi ny l: P(X < 150) = ? Hay ni chnh xc hn na:
149
f ( x )dx
( x 163.3)2
exp
trong , f ( x ) =
. Kt qu tt nhin l 0.018. Bn c
2
6.6 2
2 ( 6.6 )
khng cn phi lm cc tnh ton tch phn phc tp, v phm mm R c mt lnh n
gin tnh tch phn trn (ti trnh by lnh ny trong phn ch thch pha cui bi).
0.03
P(X < 150) = 1.8%
0.00
0.01
0.02
Probability
0.04
0.05
0.06
140
150
160
170
180
190
Height
0.018
Tng t, chng ta c th c tnh xc sut cho bt c khong chiu cao no gia
a v b theo cng thc tch phn trn y. Chng hn nh xc sut n ng Vit Nam c
chiu cao t 160 n 170 cm l:
P (160 X 170 ) =
170
160
f ( x )dx
[2]
Trong phn trn, chng ta quan tm n vic phn tch chiu cao bng cch ng
dng lut phn phi chun. Tuy nhin, nh cp trong phn u, lut phn phi chun
c th ng dng cho rt nhiu hin tng t nhin. Nhng cc bin khc nhau v n v
o lng, nh chiu cao o bng cm, nhng huyt p o bng mmHg, nn chng ta kh
m so snh hai bin s ny bi v chng c n v o lng khc nhau, v c th lch
chun cng khc nhau. Chng hn nh nu mt i tng c chiu cao l 175 cm v
huyt p l 120 mmHg, lm sao chng ta bit cc thng s c nhn ny cao hay thp. Do
, chng ta cn phi c mt cch chun ha lut phn phi sao cho chng ta c th so
snh cc bin s ny m khng cn bit n n v o lng.
Mt trong nhng cch chun ha l phn phi chun ha, m c l bn c
tng thy u trong sch gio khoa ngi ta gi l standardized normal distrubution.
Nh thy trong cng thc [1], hai thng s trung bnh v lch chun hon ton xc
nh lut phn phi chun, cho nn, mt cch chun ha l hon chuyn chiu cao (hay
mt bin s) sao cho chng c lp vi n v o lng. Cch hon chuyn ny c tn l
z-transformation hay hon chuyn z. Kt qu ca hon chuyn l mt ch s z (thut ng
ting Anh l z-score).
Trong v d v chiu cao, z l khc bit gia chiu cao mt c nhn (k hiu l
x) v chiu cao trung bnh ca qun th chia cho lch chun. Ni cch khc:
z=
[3]
Nu x = , ch s z s l 0;
z2
1
exp
2
2
[4]
e 0.5 z
dz
2
[5]
Biu 4 di y minh ha cho phn phi chiu cao tnh bng cm v bng ch s z:
Probability
140
150
160
170
180
190
Height
10
0.2
0.1
Probability
0.3
0.4
0.0
-4
-2
Z score
khong 1. Ni cch khc, P ( 4 < z < 4 ) = f ( z )dz ; 1 . Ngoi ra, phn phi chun
4
11
n y, chng ta thy hng s 1.96, 1.64 hay 3.0 xut pht t u! Cc hng
s ny chng c g b mt c: chng l ch s z ca phn phi chun. Bng sau y s
cung cp mt s xc sut cho cc ch s z thng dng trong thng k hc v ng dng
trong y khoa:
Bng 2. Xc sut cc gi tr z
z
P(Zz)
-3.090
0.001
-2.326
0.01
-1.96
0.025
-1.645
0.05
-1.282
0.10
0
0.50
1.282
0.90
1.96
0.975
2.326
0.99
3.090
0.999
x = + z
[6]
12
IV. Kt lun
Qua bi ny, hi vng ti gii thch phn phi chun l g, v hng s 1.96 trong
cch tnh khong tin cy 95% xut pht t u. Phn phi chun ng mt vai tr thit
yu trong khoa hc thng k. Hu ht tt c cc suy lun thng k u da vo lut phn
phi chun pht trin cc kim nh thng k (statistical tests). Ngay c cc lut phn
phi nh phn hay phn phi Poisson (m ti s bn n trong mt bi khc) cng c th
m hnh bng lut phn phi chun.
Nh l mt qui lut t nhin, rt nhiu bin s lm sng v khoa hc thc nghim
ni chung u tun theo lut phn phi chun. Cng c th c mt s bin s sinh ha
khng tun theo lut phn phi chun, nhng c th hon chuyn chng tun theo lut
phn phi chun. Do , cc phng php phn tch tham s (parametric methods) vn
c th p dng cho cc bin loi ny.
13
160.6,
165.1,
151.9,
172.4,
158.6,
162.5,
155.3,
162.7,
157.2,
165.2,
158.4,
170.0,
166.0,
162.0,
164.4,
158.4,
157.9,
155.8,
158.8,
161.8,
165.3,
167.4,
166.9,
149.6,
176.6,
156.8,
167.4,
161.4,
162.7,
163.8,
158.0,
166.4,
162.0,
159.9,
159.5,
167.8,
171.8,
163.4,
157.1,
164.2,
155.3,
162.3,
152.5,
157.0,
149.9,
168.7,
170.2,
148.3,
165.9,
174.7,
164.2,
167.1,
147.6,
154.6,
164.0,
164.6,
178.7,
160.9,
162.7,
158.2,
157.2,
154.0,
163.6,
162.3,
162.2,
170.6,
171.7,
156.1,
176.7,
162.3,
159.0,
159.3,
163.5,
171.2,
162.0,
165.2,
171.5,
165.6,
172.1,
168.9)
n <- length(ht)
plot(sort(ht), (1:n)/n,
type="s", ylim=c(0,1), xlab="Height")
plot(density(ht), main="Plot of density distribution of height",
xlab="Height")
# Tm s trung bnh v lch chun ca chiu cao
mean(ht)
sd(ht)
# c tnh xc sut chiu cao = 160 cm vi trung bnh=163.3 v sd=6.6
dnorm(160, mean=163.3, sd=6.6)
# c tnh xc sut cho bng 1
height <- seq(140, 181, 1)
dnorm(height, mean=163.3, sd=6.6)*100
# V biu 2
14
P ( X < 150 ) =
149
f ( x )dx
<<<<<<-
1.65
-1.65
1.96
-1.96
2.58
-2.58
15
120,
110,
120,
150,
120,
130,
130,
100,
150,
110,
130,
110,
120,
110,
120,
130,
120,
110,
120,
120,
100,
120,
110,
140,
130,
140,
150,
140,
110,
120,
150,
110,
150,
140,
110,
100,
100,
125,
120,
150,
100,
110,
120,
120,
110,
110,
120,
100,
150,
120,
120,
120,
120,
110,
120,
110,
100,
140,
100,
130,
100,
110,
120,
120,
120,
110,
120,
110,
110,
160,
110,
85,
110,
110,
140,
120,
140,
120,
120,
90)
16