You are on page 1of 13

KHAI THC D LIU & NG DNG (DATA MINING)

GV : ThS. NGUYN HONG T ANH

NI DUNG
Gii thiu v mn hc

Gii thiu v khai thc

d liu (DM)
2

GII THIU MN HC

Tai sao chn mn hc ny ?


Th mnh v nn tng kin thc :

TTNT, h QTCSDL, thng k, kinh t,

S quan tm n kin thc, vn mi. Cung cp cc khi nim v k thut c bn ca khai thc d liu (DM) Chuyn d liu v dng ph hp Tm tri thc t d liu Biu din, nh gi tri thc ng dng ca DM Cc k nng gii quyt vn
3

Mc tiu mn hc :

Thng tin lin lc

Ging vin l thuyt : Th.s. Nguyn Hong T Anh


nhtanh@fit.hcmus.edu.vn Tel : 38354266 803

Website mn hc :

http://courses.cs.hcmus.edu.vn/

CHNG TRNH
45 tit l thuyt v 30 tit thc hnh
Tng quan Chun b d liu Tp ph bin v lut kt hp Bi ton phn lp Bi ton gom nhm Cc nghin cu xa hn
5

Hnh thc hc
L thuyt:
Bi ging : GV cung cp slide theo tin . Bi tp theo nhm v bi tp c nhn. Tm hiu, nghin cu ti liu bo co xemina

Thc hnh :
Hnh thc 2 S dng PM Weka Ci t mt s thut ton

HNH THC KIM TRA V NH GI

H thng thang im:


Bi thi l thuyt: Bi tp theo nhm v c nhn: Bo co xemina:

4.5 im 1.5 im 1.5 im

Bi tp thc hnh hng tun: 2.5 im im cng cho phn TH: ti a 1 im


7

HNH THC KIM TRA V NH GI

Thi l thuyt:

4.5 im 1.5 im

Thi vit, c s dng ti liu, KHNG s dng laptop, mang theo my tnh : thi gian 120

Bi tp theo nhm v c nhn:

Bi tp lm theo nhm v c nhn trn lp v qua Moodle. Ti a 4 SV/nhm. Hn cht ng k nhm qua Moodle: 15/09/2009

Bo co xemina:

1.5 im

Thc hin theo nhm ng k bi tp nhm (4SV/nhm). Cc nhm s ng k ni dung xemina theo thng bo trn website mn hc. (trong tun t 21/9 -26/9)
8

HNH THC KIM TRA V NH GI

Bi tp (theo nhm v c nhn): 1.5 im


Bi tp lm theo nhm hoc c nhn trn lp v qua Moodle hng tun. nh gi s tham gia lp hc v s chun b bi trong sut qu trnh hc tp. im bi tp s nh gi trn tt c cc bi tp hng tun trn lp v qua Moodle.

i vi cc bi tp lm theo nhm, trng nhm cn thng k t l ng gp ca tng thnh vin trong nhm.
9

HNH THC KIM TRA V NH GI

Bi tp (theo nhm v c nhn): 1.5 im


nh gi s tham gia lp hc v s chun b bi trong sut qu trnh hc tp. 30% - bi tp c nhn trong gi hc v 70% l bi tp theo nhm. c th t kt qu tt, cc SV cn xem trc bi ging chun b.
Cc mc nh gi:

A Xut sc 100% s im B - t yu cu ~70% s im C - Khng t yu cu ~30% s im F - Khng lm hoc ging bi ca SV khc 0% s im


10

HNH THC KIM TRA V NH GI

Bo co xemina:

1.5 im

Cc nhm s ng k ni dung xemina theo thng bo trn website mn hc. (trong tun t 21/9 -26/9) Th t bo co ph thuc vo ni dung cc nhm ng k. Bt u xemina t tun th 10. Trc bui bo co, cc nhm phi gi ni dung trnh by (file.ppt) cho GV gp v post ln website cc nhm khc tham kho.
11

HNH THC KIM TRA V NH GI

Bo co xemina:

1.5 im

Cc nhm s ng k ni dung xemina theo thng bo trn website mn hc. (trong tun t 21/9 -26/9) im bo co xemina s nh gi trn ni dung trnh by, tr li cu hi ti bui xemina, trn c ni dung ca bo co chi tit v s tham d cc bui xemina. Trong tun th 16, cc nhm s post ni dung bn bo co vit chi tit (file .doc theo mu) ln website mn hc. Trong bi thi vit l thuyt cui k s c 1 cu hi lin quan n cc ni dung xemina.
12

HNH THC KIM TRA V NH GI

Bi tp thc hnh hng tun:

2.5 im

Bi tp lm theo nhm. Mt nhm : 2 SV S lng : 4 bi . Thi gian : 2 tun/bi Hn cht ng k nhm TH qua Moodle: 15/09/2009 Ni dung bi tp TH :

S dng phn mm Weka gii quyt mt s bi ton trong ni dung l thuyt : x l DL, khai thc lut kt hp, phn lp v gom nhm . C yu cu ci t mt s thut ton Thi gian np qua website mn hc theo thng bo ca 13 GV HDTH.

Cu hi v ngh ?

Chia s cu hi, thc mc vi c lp c th c nhng bn khc cng quan tm. B vo cng nhiu cng sc, cc em s t c kt qu cng cao im ca cc em t l thun vi cc n lc b ra.

14

TI LIU THAM KHO

J.Han, M.Kamber, Data mining : Concepts & Technique (ppt) http://www.cs.sfu.ca/~han/dmbook P.Tan, M. Steinbach, V. Kumar, Introduction to data Mining, 2006, - http://wwwusers.cs.umn.edu/~kumar/dmbook/index.php Phn mm WEKA - http://www.cs.waikato.ac.nz/ml/weka/ Trang web u ngnh v KTDL - Kdnuggets : www.kdnuggets.com
15

NI DUNG
Gii thiu v mn hc
Gii thiu v khai thc

d liu (DM)
16

V D : Tp D liu
age <=30 <=30 3140 >40 >40 >40 3140 <=30 <=30 >40 <=30 3140 3140 >40 income student credit_rating buys_computer high no fair no high no excellent no high no fair yes medium no fair yes low yes fair yes low yes excellent no low yes excellent yes medium no fair no low yes fair yes medium yes fair yes medium yes excellent yes medium no excellent yes high yes fair yes medium no excellent no 17

TH NO L KHAI THC DL
L qu trnh lp, khng phi plug - and play Khai thc d liu l qu trnh khng tm

thng ca vic xc nh cc mu tim n c tnh hp l, mi l, c ch v c th hiu c ti a trong CSDL - Fayyad, Piatetsky-Shapiro & Smyth, 1996

18

V d ng dng
Marketing

Phn khc th trng : Ai mua sn phm ca cng ty? Mc tiu hng khch hng (customer targeting): Lm th no tng s mail tr li? Nn qung co ci g trn web site ? Nhng mt hng no thng c khch hng mua cng vi nhau?

V d ng dng
Qun l ri ro -Risk Management

Khch hng no c th s chuyn sang nh cung cp dch v khc? Khch hng no c mc ri ro tn dng tt? Giao dch th tn dng no b li hoc gian ln ?

10

V d ng dng

C phi t bo ung th ? Nu ng th mc pht trin nh th no ?

TH NO L KHAI THC DL

Ti sao cn Khai thc d liu (KTDL)? Nhng i tng no s dng KTDL ? S dng KTDL u v khi no? S dng KTDL nh th no ? Ti sao cn nghin cu KTDL? Lch s pht trin KTDL ?
Xem bi 1 : Tng quan.
22

11

CC CNG VIC CN LM
1. ng nhp vo Moodle
2.

ng k tham gia vo lp, tho lun v ly ti liu Hn cht : 16/9/2009 Sau ngy 16/9/09, Website mn hc s kho li ng k nhm
Hn cht ng k nhm cho bi tp nhm /xemina (4Sv/nhm) v cho bi tp Thc hnh (2Sv/nhm) qua Moodle : 15/09/2009 Chun b sn BNG TN NHM v mang theo khi n lp v tt c cc bui hc tip theo.
23

CC CNG VIC CN LM
3. Chun b bi 1 : Tng quan Xem ni dung bi tp nhm s 1 Tho lun v xy dng mt v d ca khai thc d liu: nn chn la mt lnh vc nh, mt sn phm c th. Cch thc hin : c slide, xem cc v d Tham kho trn Internet cc v d v KTDL.

24

12

25

13

You might also like