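1 Introduction
Chapter 5.
Data entry and processing
Course: Economic Research Methods
Faculty of Development Economics
University of Economics Ho Chi Minh City
Preparing the research proposal
Research report
Decision making
Implementation:
Figure 5.2 How to enter data into the SPSS spreadsheet
Determining the measurement scale of a variable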
[Figure: bar chart of motorbike names]
Motobike Names       Frequency   Percent   Valid Percent   Cumulative Percent
Honda AirBlade           10        10.0        10.0              10.0
Honda Future Neo          8         8.0         8.0              18.0
Yamaha Sirius             7         7.0         7.0              25.0
Yamaha Jupiter           13        13.0        13.0              38.0
Honda Wave               24        24.0        24.0              62.0
Yamaha Cygnus             4         4.0         4.0              66.0
SYM Attila               11        11.0        11.0              77.0
Honda Dream               6         6.0         6.0              83.0
Honda @                   7         7.0         7.0              90.0
Others                   10        10.0        10.0             100.0
Total                   100       100.0       100.0
[Figure: pie chart and bar chart of Motobike Names (percent of users per brand)]
Each row of the plot is called a stem, and each value shown on a stem is called a leaf.
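The stem and leaf split described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `stem_and_leaf` and the sample ages are hypothetical, and each stem covers a full ten rather than being split into two rows as statistical packages often do.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Return stem-and-leaf rows for non-negative integers.

    The stem is the tens part of each value, the leaf is its last digit.
    """
    rows = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, 10)
        rows[stem].append(str(leaf))
    return [f"{stem} | {''.join(leaves)}" for stem, leaves in sorted(rows.items())]


# Hypothetical ages, chosen only to illustrate the layout.
ages = [18, 19, 22, 25, 25, 31, 34, 40, 47, 47, 52]
plot = stem_and_leaf(ages)
for row in plot:
    print(row)
```

Reading a row back is simple: stem 4 with leaves 077 stands for the values 40, 47 and 47.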
 1 . 889999
 2 . 000111122222233344
 2 . 55677788
 3 . 0012233334444
 3 . 5556
 4 . 123333334444
 4 . 5555566777789
 5 . 0123344444
 5 . 566667779
 6 . 03
 6 . 5567
 7 . 6
A box plot, also called a box-and-whisker plot, gives another visual picture of a distribution's location, spread, shape, tail length, and outliers.
A box plot summarizes a distribution with five statistics: the median, the upper and lower quartiles, and the largest and smallest observed values.
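As a rough sketch, the five statistics can be computed with Python's standard library. The ages below are rebuilt from the leaves of the stem-and-leaf display, assuming each row's stem is the tens digit and that the one case missing from the leaves is the maximum, 76. Those assumptions reproduce N = 100 and the mean of 39.01 reported in the descriptive statistics. The quartile method of `statistics.quantiles` may differ slightly from the one SPSS applies.

```python
import statistics

# (stem, leaves) pairs rebuilt from the stem-and-leaf display.
# Assumption: stems are tens digits, and the missing case is the maximum, 76.
rows = [
    (1, "889999"), (2, "000111122222233344"), (2, "55677788"),
    (3, "0012233334444"), (3, "5556"), (4, "123333334444"),
    (4, "5555566777789"), (5, "0123344444"), (5, "566667779"),
    (6, "03"), (6, "5567"), (7, "6"),
]
ages = sorted(10 * stem + int(leaf) for stem, leaves in rows for leaf in leaves)

# quantiles(..., n=4) returns the lower quartile, the median and the upper quartile.
q1, median, q3 = statistics.quantiles(ages, n=4)
five_numbers = (min(ages), q1, median, q3, max(ages))
print(five_numbers)
```

The box spans the two quartiles, the line inside it marks the median, and the whiskers reach toward the smallest and largest observed values.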
Extremes: values more than 3 box lengths above the third quartile (75th percentile).
Outliers: values more than 1.5 box lengths above the third quartile (75th percentile).
Largest observed value that is not an outlier.
50% of cases have values inside the box.
Median.
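The outlier and extreme rules above can be sketched as a small classifier. This is a minimal illustration, not SPSS's implementation: the function name and the sample values are hypothetical, only the upper side of the box is checked (as in the annotations), and the quartiles come from `statistics.quantiles`.

```python
import statistics


def classify_high_values(values):
    """Label values far above the box as 'outlier' or 'extreme'.

    Distances are measured from the third quartile in units of the
    box length (the interquartile range).
    """
    q1, _, q3 = statistics.quantiles(values, n=4)
    box_length = q3 - q1
    labels = {}
    for value in values:
        if value > q3 + 3 * box_length:
            labels[value] = "extreme"
        elif value > q3 + 1.5 * box_length:
            labels[value] = "outlier"
    return labels


# Hypothetical sample with two unusually high values.
sample = [10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 30, 45]
labels = classify_high_values(sample)
print(labels)
```

The same check mirrored below the first quartile would flag unusually low values.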
Descriptive statistics cover:
central tendency,
variability, and
the shape of the data's distribution.
Figure 5.11 Left-skewed and right-skewed distributions compared with the normal distribution
Age of motorbike user   Statistic   Std. Error
N                          100
Range                       58
Minimum                     18
Maximum                     76
Mean                        39.01       1.44
Std. Deviation              14.42
Variance                   207.909
Skewness                      .242       .241
Kurtosis                     -.948       .478
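The standard errors in this table can be checked from N and the variance alone. The sketch below assumes the usual formulas for the standard error of the mean and for the standard errors of sample skewness and kurtosis. That these are the formulas behind the table is an assumption, although they do reproduce its values.

```python
import math

n = 100             # N from the table
variance = 207.909  # Variance from the table

# Standard error of the mean: sqrt(variance / n)
se_mean = math.sqrt(variance / n)

# Standard errors of skewness and kurtosis depend only on n.
se_skew = math.sqrt(6 * n * (n - 1) / ((n - 2) * (n + 1) * (n + 3)))
se_kurt = 2 * se_skew * math.sqrt((n * n - 1) / ((n - 3) * (n + 5)))

print(round(se_mean, 2), round(se_skew, 3), round(se_kurt, 3))
```

Dividing a skewness or kurtosis statistic by its standard error gives a rough gauge of how far the shape departs from a normal distribution.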
Descriptives by user gender (Statistic, with Std. Error in parentheses)

                                Age of motorbike user           Number of used days in a month
                                female          male            female          male
Mean                            38.46 (2.11)    39.39 (1.97)    20.71 (1.07)    19.76 (1.01)
95% CI for Mean, Lower Bound    34.19           35.45           18.54           17.74
95% CI for Mean, Upper Bound    42.74           43.33           22.88           21.79
5% Trimmed Mean                 38.13           38.87           20.95           19.90
Median                          41.00                           22.00
Variance                        183.205         228.173         47.212          60.460
Std. Deviation                  13.54           15.11           6.87            7.78
Minimum                         19              18
Maximum                         65              76              30              32
Range                           46              58              23              27
Skewness                        .118 (.369)     .292 (.311)     -.513 (.369)    -.175 (.311)
Kurtosis                        -1.089 (.724)   -.932 (.613)    -.838 (.724)    -1.271 (.613)

Remaining values (male medians and the interquartile ranges; column assignment unclear): 42.00, 23.00, 21.00, 11.00, 28.00, 15.00
Motobike Names by Age groups: Count (Row %)

Motobike Names     under 20    under 30    under 40    under 50    under 60    older than 60
Honda AirBlade     2 (20.0%)   3 (30.0%)   3 (30.0%)   1 (10.0%)   1 (10.0%)   -
Honda Future Neo   -           4 (50.0%)   -           2 (25.0%)   2 (25.0%)   -
Yamaha Sirius      -           1 (14.3%)   -           1 (14.3%)   2 (28.6%)   3 (42.9%)
Yamaha Jupiter     -           4 (30.8%)   1 (7.7%)    4 (30.8%)   4 (30.8%)   -
Honda Wave         1 (4.2%)    2 (8.3%)    8 (33.3%)   7 (29.2%)   5 (20.8%)   1 (4.2%)
Yamaha Cygnus      -           1 (25.0%)   1 (25.0%)   1 (25.0%)   1 (25.0%)   -
SYM Attila         3 (27.3%)   4 (36.4%)   1 (9.1%)    2 (18.2%)   -           1 (9.1%)
Honda Dream        -           3 (50.0%)   1 (16.7%)   1 (16.7%)   -           1 (16.7%)
Honda @            -           2 (28.6%)   -           1 (14.3%)   4 (57.1%)   -
Others             -           2 (20.0%)   2 (20.0%)   5 (50.0%)   -           1 (10.0%)
Motobike Names by User gender: Count

Motobike Names     female   male
Honda AirBlade        3       7
Honda Future Neo      4       4
Yamaha Sirius         3       4
Yamaha Jupiter        6       7
Honda Wave            9      15
Yamaha Cygnus         2       2
SYM Attila            5       6
Honda Dream           2       4
Honda @               3       4
Others                4       6
Total                41      59
User gender by Motobike Names crosstabulation

User gender: female
Motobike Names     Count   Expected Count   % within User gender   % within Motobike Names   % of Total
Honda AirBlade        3         4.1                7.3%                   30.0%                 3.0%
Honda Future Neo      4         3.3                9.8%                   50.0%                 4.0%
Yamaha Sirius         3         2.9                7.3%                   42.9%                 3.0%
Yamaha Jupiter        6         5.3               14.6%                   46.2%                 6.0%
Honda Wave            9         9.8               22.0%                   37.5%                 9.0%
Yamaha Cygnus         2         1.6                4.9%                   50.0%                 2.0%
SYM Attila            5         4.5               12.2%                   45.5%                 5.0%
Honda Dream           2         2.5                4.9%                   33.3%                 2.0%
Honda @               3         2.9                7.3%                   42.9%                 3.0%
Others                4         4.1                9.8%                   40.0%                 4.0%
Total                41        41.0              100.0%                   41.0%                41.0%

User gender: male
Motobike Names     Count   Expected Count   % within User gender   % within Motobike Names   % of Total
Honda AirBlade        7         5.9               11.9%                   70.0%                 7.0%
Honda Future Neo      4         4.7                6.8%                   50.0%                 4.0%
Yamaha Sirius         4         4.1                6.8%                   57.1%                 4.0%
Yamaha Jupiter        7         7.7               11.9%                   53.8%                 7.0%
Honda Wave           15        14.2               25.4%                   62.5%                15.0%
Yamaha Cygnus         2         2.4                3.4%                   50.0%                 2.0%
SYM Attila            6         6.5               10.2%                   54.5%                 6.0%
Honda Dream           4         3.5                6.8%                   66.7%                 4.0%
Honda @               4         4.1                6.8%                   57.1%                 4.0%
Others                6         5.9               10.2%                   60.0%                 6.0%
Total                59        59.0              100.0%                   59.0%                59.0%

Total
Motobike Names     Count   Expected Count   % within User gender   % within Motobike Names   % of Total
Honda AirBlade       10        10.0               10.0%                  100.0%                10.0%
Honda Future Neo      8         8.0                8.0%                  100.0%                 8.0%
Yamaha Sirius         7         7.0                7.0%                  100.0%                 7.0%
Yamaha Jupiter       13        13.0               13.0%                  100.0%                13.0%
Honda Wave           24        24.0               24.0%                  100.0%                24.0%
Yamaha Cygnus         4         4.0                4.0%                  100.0%                 4.0%
SYM Attila           11        11.0               11.0%                  100.0%                11.0%
Honda Dream           6         6.0                6.0%                  100.0%                 6.0%
Honda @               7         7.0                7.0%                  100.0%                 7.0%
Others               10        10.0               10.0%                  100.0%                10.0%
Total               100       100.0              100.0%                  100.0%               100.0%
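The expected counts in the crosstab follow from the margins: each one is the row total times the column total, divided by N. The sketch below recomputes them from the observed counts and also accumulates the Pearson chi-square statistic for a test of independence. The variable names are illustrative, and the chi-square value is not taken from the slides.

```python
# Observed counts from the crosstab (rows: gender, columns: brands).
brands = [
    "Honda AirBlade", "Honda Future Neo", "Yamaha Sirius", "Yamaha Jupiter",
    "Honda Wave", "Yamaha Cygnus", "SYM Attila", "Honda Dream", "Honda @", "Others",
]
observed = {
    "female": [3, 4, 3, 6, 9, 2, 5, 2, 3, 4],
    "male": [7, 4, 4, 7, 15, 2, 6, 4, 4, 6],
}

row_totals = {gender: sum(counts) for gender, counts in observed.items()}
col_totals = [sum(column) for column in zip(*observed.values())]
n = sum(row_totals.values())

# Expected count under independence: row total * column total / N
expected = {
    gender: [row_totals[gender] * col_total / n for col_total in col_totals]
    for gender in observed
}

# Pearson chi-square statistic and its degrees of freedom.
chi_square = sum(
    (o - e) ** 2 / e
    for gender in observed
    for o, e in zip(observed[gender], expected[gender])
)
df = (len(observed) - 1) * (len(brands) - 1)

print([round(e, 1) for e in expected["female"]])
print(round(chi_square, 3), df)
```

For example, the expected number of female Honda AirBlade users is 41 * 10 / 100 = 4.1, which matches the table.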
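General objective, specific objective, and type of statistics:
Compare groups: difference; difference statistics (e.g. t-test, ANOVA)
Degree of association, related variables: association; association statistics (e.g. correlation, regression)
Summarize data (purely descriptive): description; descriptive statistics (e.g. mean, proportion)

Stating hypotheses H0 and H1:
Question: Is there a difference in age between men and women?
  H0: There is no difference in age between men and women. (H0: μ_male = μ_female)
  H1: There is a difference in age between men and women. (H1: μ_male ≠ μ_female)
Question: Is there any relationship between gender and motorbike brand?
  H0: There is no relationship between gender and motorbike brand. (H0: ρ_GM = 0)
  H1: There is a relationship between gender and motorbike brand. (H1: ρ_GM ≠ 0)
Question: Does the level of motorbike use differ between age groups?
  H0: There is no difference between age groups in the level of motorbike use.
  H1: There is a difference between age groups in the level of motorbike use.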
Probability values (p values)
Most statistical software reports results together with probability values (p values).
The p value is the probability of obtaining a result at least as extreme as the one actually observed, given that hypothesis H0 is true.
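To make the definition concrete, the sketch below converts a standard normal test statistic into a two-sided p value using only the standard library. The function name is illustrative. The input Z = -0.067 is the Mann-Whitney statistic reported in this chapter's SPSS output, and the result lands close to the reported significance of .946.

```python
import math


def two_sided_p_from_z(z):
    """Two-sided p value for a standard normal test statistic z."""
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)), where Phi is the normal CDF.
    return math.erfc(abs(z) / math.sqrt(2))


p = two_sided_p_from_z(-0.067)
print(round(p, 3))
```

A statistic near zero gives a p value near 1: results at least this far from H0 would be very common if H0 were true.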
3. Obtain the p value
4. Compare the p value with the significance level and make a decision
5. Interpret the test result
The p value is compared with the significance level (α), and on that basis the hypothesis is rejected or not rejected.
If the p value is smaller than the significance level, the hypothesis is rejected (p value < α: reject H0).
If the p value is equal to or greater than the significance level, the hypothesis is not rejected (p value ≥ α: do not reject H0).
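The decision rule can be written as a one-line function. In this sketch the function name is illustrative and α = 0.05 is an assumed, conventional significance level. The p values are the ones reported in this chapter's SPSS output.

```python
def decide(p_value, alpha=0.05):
    """Apply the decision rule: reject H0 only when p < alpha."""
    return "reject H0" if p_value < alpha else "do not reject H0"


# p values reported in this chapter; alpha = 0.05 is an assumption.
reported = [
    ("two-sample t test", 0.754),
    ("Mann-Whitney U", 0.946),
    ("Kruskal-Wallis", 0.914),
]
for test_name, p in reported:
    print(test_name, p, decide(p))
```

Note that a p value exactly equal to α does not lead to rejection under this rule.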
Parametric tests
Parametric tests require several assumptions:
Tests by measurement scale: one-sample case, two-samples tests (related, independent), k-samples tests (related, independent)

Nominal
  One-sample case: Binomial; χ² one-sample test
  Two related samples: McNemar
  Two independent samples: Fisher exact test; χ² two-sample test
  k related samples: Cochran Q
  k independent samples: χ² for k samples
Ordinal
  Two related samples: Sign test; Wilcoxon matched-pairs test
  Two independent samples: Median test; Mann-Whitney U; Kolmogorov-Smirnov; Wald-Wolfowitz
  k independent samples: Median extension; Kruskal-Wallis one-way ANOVA
Interval and ratio
  One-sample case: T-test; Z test
  Two related samples: T-test for paired samples
  Two independent samples: T-test; Z test
  k related samples: Repeated-measures ANOVA
  k independent samples: One-way ANOVA; N-way ANOVA
5.7 Some specific applications
1. One-Sample T Test
Example 1 (Parametric test)
Analyze > Compare Means > One-Sample T Test (WHY?)
Example 2 (Nonparametric test)
Survey data on motorbike use.
Hypothesis H0: all motorbike brands have an equal chance of being chosen by users.
Analyze > Nonparametric Tests > Chi-Square
We have 100 observations and 10 motorbike brands. Each brand has a 10% chance of being chosen, so the expected count is 10 motorbikes per brand.
However, the differences between the observed N and the expected N are large for individual brands.
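The comparison of observed and expected counts can be sketched as a chi-square goodness-of-fit calculation. The observed frequencies come from the frequency table in this chapter. The critical value 16.919 is the standard chi-square value for 9 degrees of freedom at α = 0.05, and α itself is an assumed, conventional level. The statistic computed here is not quoted from the SPSS output.

```python
# Observed frequencies per brand, from the frequency table in this chapter.
observed = [10, 8, 7, 13, 24, 4, 11, 6, 7, 10]

n = sum(observed)             # 100 observations
expected = n / len(observed)  # 10 per brand under H0

chi_square = sum((o - expected) ** 2 / expected for o in observed)
df = len(observed) - 1

# Chi-square critical value for df = 9 at alpha = 0.05 (alpha is an assumption).
critical_value = 16.919
reject_h0 = chi_square > critical_value

print(round(chi_square, 3), df, reject_h0)
```

Honda Wave alone contributes (24 - 10)² / 10 = 19.6 to the statistic, which shows how far the observed counts depart from equal shares.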
3. Two-Sample T Test
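Independent samples test: Age of motorbike user

                                      Equal variances assumed   Equal variances not assumed
Levene's test, F                              1.239
Levene's test, Sig.                            .268
t                                             -.315                      -.321
df                                               98                     91.785
Sig. (2-tailed)                                .754                       .749
Mean Difference                                -.93                       -.93
Std. Error Difference                          2.95                       2.89
95% CI of the Difference, Lower               -6.77                      -6.66
95% CI of the Difference, Upper                4.92                       4.81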
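The two rows of this table can be approximately reproduced from summary statistics. The sketch below assumes the group sizes from the gender crosstab (41 female, 59 male) and the age means and variances from the by-gender descriptives. Because those inputs are rounded, the t values agree with the table only to about two decimals.

```python
import math

# Summary statistics for age by gender (assumed from the tables in this chapter).
n_f, mean_f, var_f = 41, 38.46, 183.205  # female
n_m, mean_m, var_m = 59, 39.39, 228.173  # male

diff = mean_f - mean_m

# Equal variances assumed: pooled variance, df = n_f + n_m - 2
df_pooled = n_f + n_m - 2
pooled_var = ((n_f - 1) * var_f + (n_m - 1) * var_m) / df_pooled
se_pooled = math.sqrt(pooled_var * (1 / n_f + 1 / n_m))
t_pooled = diff / se_pooled

# Equal variances not assumed (Welch): separate variances, Satterthwaite df
a, b = var_f / n_f, var_m / n_m
se_welch = math.sqrt(a + b)
t_welch = diff / se_welch
df_welch = (a + b) ** 2 / (a ** 2 / (n_f - 1) + b ** 2 / (n_m - 1))

print(round(t_pooled, 3), df_pooled, round(se_pooled, 2))
print(round(t_welch, 3), round(df_welch, 3), round(se_welch, 2))
```

Levene's test (Sig. = .268) gives no reason to doubt equal variances, so the first row is the one usually read. Either way the mean ages of men and women do not differ significantly.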
Test Statistics(a): Motobike Names
Mann-Whitney U            1200.000
Wilcoxon W                2970.000
Z                            -.067
Asymp. Sig. (2-tailed)        .946
a. Grouping Variable: User gender
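The Z value in this table can be approximated from U with the normal approximation and a correction for ties. The sketch assumes group sizes of 41 and 59 from the gender crosstab and treats the brand frequencies as tie groups, since every case with the same brand shares a rank. It is a hand check under those assumptions, not SPSS's own routine.

```python
import math

n_f, n_m = 41, 59  # group sizes (female, male), assumed from the crosstab
u = 1200.0         # Mann-Whitney U from the table
# Brand frequencies: every case sharing a brand is tied on the test variable.
ties = [10, 8, 7, 13, 24, 4, 11, 6, 7, 10]

n = n_f + n_m
mean_u = n_f * n_m / 2
tie_term = sum(t ** 3 - t for t in ties) / (n * (n - 1))
sd_u = math.sqrt(n_f * n_m / 12 * ((n + 1) - tie_term))
z = (u - mean_u) / sd_u

# The reported Wilcoxon W equals U plus n_m * (n_m + 1) / 2.
w = u + n_m * (n_m + 1) / 2

print(round(z, 3), w)
```

With Z this close to zero, the two genders show no detectable difference in the distribution of brands.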
Kolmogorov-Smirnov test: Motobike Names
Most Extreme Differences, Absolute     .045
Most Extreme Differences, Positive     .045
Most Extreme Differences, Negative    -.018
Kolmogorov-Smirnov Z                   .224
Asymp. Sig. (2-tailed)                1.000
ANOVA: Number of used days in a month

                 Sum of Squares   df   Mean Square     F      Sig.
Between Groups      1428.944       5     285.789     6.737    .000
Within Groups       3987.806      94      42.423
Total               5416.750      99
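The mean squares and the F ratio follow directly from the sums of squares and degrees of freedom. The short sketch below recomputes them from the table's own values. The variable names are illustrative.

```python
# Sums of squares and degrees of freedom from the ANOVA table.
ss_between, df_between = 1428.944, 5
ss_within, df_within = 3987.806, 94

# Mean square = sum of squares / degrees of freedom
ms_between = ss_between / df_between
ms_within = ss_within / df_within

# F is the ratio of between-group to within-group mean squares.
f_ratio = ms_between / ms_within

ss_total = ss_between + ss_within
df_total = df_between + df_within

print(round(ms_between, 3), round(ms_within, 3), round(f_ratio, 3))
```

An F of about 6.7 on 5 and 94 degrees of freedom corresponds to the reported Sig. of .000, so mean usage differs between at least some age groups.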
Tukey HSD(a,b) and Duncan(a,b)

Age groups        N
under 60         19
under 50         25
under 20          6
under 30         26
under 40         17
older than 60     7
Age Group        Value   Grouping
Under 60         14.5
Under 50         17.9    ab
Under 20         18.3    ab
Under 30         22.6    abc
Under 40         24.1    abc
Older than 60    26.1    abc
6. Nonparametric Test for k-Independent Samples
Kruskal-Wallis Test

Ranks: Motobike Names
Age groups        N    Mean Rank
under 20          6      46.25
under 30         26      49.40
under 40         17      50.62
under 50         25      55.66
under 60         19      45.87
older than 60     7      52.07
Total           100

Test Statistics(a,b): Motobike Names
Chi-Square      1.493
df              5
Asymp. Sig.     .914
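The Chi-Square value in the test statistics can be approximated from the Ranks table. The sketch below computes the Kruskal-Wallis H from the group sizes and mean ranks and then applies the usual correction for ties, treating the brand frequencies as tie groups. That SPSS arrives at its value this way is an assumption, and the mean ranks are rounded, so only agreement to about two decimals is expected.

```python
# (group size, mean rank) per age group, from the Ranks table.
groups = {
    "under 20": (6, 46.25), "under 30": (26, 49.40), "under 40": (17, 50.62),
    "under 50": (25, 55.66), "under 60": (19, 45.87), "older than 60": (7, 52.07),
}
# Brand frequencies: cases with the same brand share a tied rank.
ties = [10, 8, 7, 13, 24, 4, 11, 6, 7, 10]

n = sum(size for size, _ in groups.values())
grand_mean_rank = (n + 1) / 2

# Kruskal-Wallis H before the tie correction.
h = 12 / (n * (n + 1)) * sum(
    size * (mean_rank - grand_mean_rank) ** 2
    for size, mean_rank in groups.values()
)

# Dividing by the tie correction factor raises H slightly.
tie_correction = 1 - sum(t ** 3 - t for t in ties) / (n ** 3 - n)
h_corrected = h / tie_correction
df = len(groups) - 1

print(round(h, 3), round(h_corrected, 3), df)
```

The mean ranks all sit close to the overall mean rank of 50.5, which is why the statistic is small and the test is far from significant.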