You are on page 1of 33

TP ON BU CHNH VIN THNG HC VIN CNG NGH BU CHNH VIN THNG -----------@-----------

TIU LUN MN HC

X L TN HIU S NNG CAO

GVHD: TS. Nguyn Ngc Minh Nhm thc hin: Trn Minh c
H Quang Thng L Tin Hiu Phan Trung Kin

Lp: Cao hc in t (CH10DT3)

Ha Ni 6/2011

TP ON BU CHNH VIN THNG HC VIN CNG NGH BU CHNH VIN THNG -----------@-----------

TIU LUN MN HC

X L TN HIU S NNG CAO

Ni dung:

PHN TCH TN HIU TING NI TRONG KHNG GIAN WAVELETS

GVHD: TS. Nguyn Ngc Minh Nhm thc hin: Trn Minh c
H Quang Thng L Tin Hiu Phan Trung Kin

Lp: Cao hc in t (CH10DT3)

Tiu lun mn hc: X l tn hiu s nng cao

Mc Lc

MC LC
PHN TCH TN HIU TING NI...................................................................................3 TRONG KHNG GIAN WAVELETS................................................................................3 I. GII THIU...................................................................................................................3 II. CC KHI NIM C BN V TING NI.............................................................3 2.1. Ting ni ................................................................................................................3 2.2. C ch to ra ting ni ............................................................................................4 2.3. Ng m hc ............................................................................................................7 III. PHN TCH TN HIU TING NI.......................................................................11 3.1. S pht trin ca x l ngn ng v ting ni trong Vin thng v CNTT..........11 3.2. Cc phng php x l tn hiu ting ni.............................................................13 3.3. M hnh Fujisaki v p dng trong phn tch thanh iu ting Vit.....................18 IV. NHN DNG TING NI TRONG KHNG GIAN WAVELETS.......................23 4.1 php bin i wavelet ri rc (The Discrete Wavelet Transform - DWT)............24 4.2 Php bin i Fast Wavelet (Fast Wavelet Transform - FWT)............................26 TI LIU THAM KHO................................................................................................32

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets

PHN TCH TN HIU TING NI TRONG KHNG GIAN WAVELETS


I. GII THIU.
Ngy nay, cng vi s pht trin ca ngnh in t v tin hc, cc h thng my t ng dn thay th con ngi trong nhiu cng on ca cng vic. My c kh nng lm vic hiu qu v nng sut cao hn con ngi rt nhiu. Song cho n ny, vn giao tip ngi my tuy c ci thin nhiu nhng vn cn rt th cng: thng qua bn phm v cc thit b nhp d liu khc. Giao tip vi thit b my bng ting ni s l phng thc giao tip vn minh v t nhin nht, du n giao tip ngi my s mt i m thay vo l cm nhn ca s giao tip gia ngi vi ngi, nu hon thin th y s l mt phng thc giao tip tin li v hiu qu nht. Do c s khc bit v mt ng m gia cc ngn ng nn ta khng th p dng cc chng trnh nhn dng khc nhn dng ting Vit. Mt h thng nhn dng ting ni nc ta phi c xy dng trn nn tng ca ting ni ting Vit. Vn nhn dng ting ni ting Vit ch mi c quan tm nghin cu trong nhng nm gn y v cha c mt chng trnh nhn dng hon chnh no c cng b. Trn th gii c rt nhiu h thng nhn dng ting ni (ting Anh) v ang c ng dng rt hiu qu nh: Via Voice ca IBM, Spoken Toolkit ca CSLU (Central of Spoken Laguage Understanding)nhng trong ting Vit th cn rt nhiu hn ch.

II. CC KHI NIM C BN V TING NI


2.1. Ting ni Ting ni thng xut hin di nhiu hnh thc m ta gi l m thoi, vic m thoi th hin kinh nghim ca con ngi. m thoi l mt qu trnh gm nhiu ngi, c s hiu hit chung v mt nghi thc lun phin nhau ni. Nhng ngi c iu kin th cht v tinh thn bnh thng th rt d din t ting ni ca mnh, do ting ni l phng tin giao tip chnh trong lc m thoi. Ting ni c rt nhiu yu t khc h tr nhm gip ngi nghe hiu c cn din t nh biu hin trn gng mt, c ch, iu b. V c c tnh tc ng qua li, nn ting ni c s dng trong nhu cu giao tip nhanh chng. Trong khi , ch vit li c khong cch v khng gian ln thi gian gia tc gi v ngi c. S biu t ca ting ni h tr mnh m cho vic ra i cc h thng my tnh c s dng ting ni, v d nh lu tr ting ni nh l mt loi d liu, hay dng ting ni lm phng tin giao tip qua li. Nu chng ta c th phn tch qu trnh giao tip qua nhiu lp, th lp thp nht chnh l m thanh v lp cui cng l ting ni din t ngha mun ni. 1 2.1.1. Ngun gc ca ting ni m thanh ca li ni cng nh m thanh trong th gii t nhin xung quanh ta, v bn cht u l nhng sng m c lan truyn trong mt mi trng nht nh (thng l khng kh). Khi chng ta ni dy thanh trong hu b chn ng, to nn nhng sng m, sng truyn trong khng kh n mng nh mt mng mng rt nhy cm ca tai ta
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets lm cho mng nh cng dao ng, cc dy thn kinh ca mng nh s nhn c cm gic m khi tn s dao ng ca sng t n mt ln nht nh. 1 2.1.2. Phn loi ting ni Ting ni l m thanh mang mc ch din t thng tin, rt uyn chuyn v c bit. L cng c ca t duy v tr tu, ting ni mang tnh c trng ca loi ngi. N khng th tch ring khi nhn vo ton th nhn loi, v nh c ngn ng ting ni m loi ngi sng v pht trin x hi tin b, c vn ha, vn minh nh Ngy nay. Trong qu trnh giao tip ngi ni, c nhiu cu ni, mi cu gm nhiu t, mi t li c th gm 1 hay nhiu m tit. ting Vit, s m tit c s dng vo khong 6700. Khi chng ta pht ra mt ting th c rt nhiu b phn nh li, thanh mn, mi, hng, thanh qun, kt hp vi nhau to thnh m thanh. m thanh pht ra c lan truyn trong khng kh n tai ngi nhn. V m thanh pht ra t s kt hp ca rt nhiu b phn, do m thanh mi ln ni khc nhau hu nh khc nhau dn n kh kh khn khi ta mun phn chia ting ni theo nhng c tnh ring. Ngi ta ch chia ting ni thnh 3 loi c bn nh sau: 1 - m hu thanh: L m khi pht ra th c thanh, v d nh chng ta ni i, a, hay o chng hn. Thc ra m hu thanh c to ra l do vic khng kh qua thanh mn (thanh mn to ra s khp m ca dy thanh di s iu khin ca hai sn chp) vi mt cng ca dy thanh sao cho chng to nn dao ng. 2 - m v thanh: L m khi to ra ting th dy thanh khng rung hoc rung i cht to ra ging nh ging th, v d h, p hay th 3 - m bt: pht ra m bt, u tin b my pht m phi ng kn, to nn mt p sut, sau khng kh c gii phng mt cch t ngt, v d ch, t. 2.2. C ch to ra ting ni 1 2.2.1. Cu to ca h thng cu m Li ni l kt qu ca s hot ng vi mi lin kt gia cc b phn h hp v nhai. Hnh ng ny din ra di s kim sot ca h thn kinh trung ng, b phn ny thng xuyn nhn c thng tin bng nhng tc ng ngc ca cc b phn thnh gic v cm gic bn th. B my h hp cung cp lc cn thit khi kh c th ra bng kh qun. nh kh qun l thanh qun ni p sut kh c iu bin trc khi n tuyn m ko di t hu n mi

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets Thanh qun l tp hp cc c v sn ng bao quanh mt khong nm phn trn ca kh qun. Cc dy thanh ging nh l mt i mi i xng nm ngang thanh qun, hai mi ny c th khp hon ton thanh qun v khi m ra chng c th to ra m hnh tam gic gi l thanh mn. Khng kh qua thanh qun mt cch t do trong qu trnh th v c trong qu trnh cu m ca nhng m ic hay m v thanh. Cc m hu thanh th ngc li, li l kt qu ca s rung ng tun hon ca nhng dy thanh. V nh vy nhng rung ng lin tip s n c tuyn m. Tuyn m l tp hp nhng khong nm gia thanh mn v mi, trn hnh ta c th phn bit c khong hu (hng), khong ming v khong mi. Khi ni, lng ngc m rng v thu hp, khng kh c y t phi vo kh qun, i qua thanh mn do cc dy thanh to thnh. Lung kh ny c gi l tn hiu kch cho tuyn m v sau n c y qua tuyn m v cui cng tn x ra mi. Tuyn m c th c coi nh mt ng m hc (gm cc on ng vi di bng nhau v thit din cc mt ct khc nhau mc ni tip) vi u vo l cc dy thanh (hay thanh mn) v u ra l mi. Nh vy tuyn m c dng thay i nh mt hm theo thi gian. Cc mt ct ca tuyn m c xc nh bng v tr ca li, mi, hm, vm ming v thit din ca nhng mt ct ny thay i t 0 cm2 (khi ngm mi) n khong 20 cm2 (khi h mi). Tuyn mi to thnh tuyn m hc ph tr cho truyn m thanh, n bt u t vm ming v kt thc cc l mi. Khi vm ming h thp, tuyn mi c ni vi tuyn m v mt m hc v to nn ting ni m mi. Cc m ca ting ni c to trong h thng ny theo ba cch ph thuc vo tn hiu kch. m hu thanh nh m / i / c to nn khi kch tuyn m bng chui xung (hay chu k dao ng ca i dy thanh) xc nh chu k pitch P v i lng nghch o ca n l tn s c bn F0. i vi ngn ng c thanh iu th kiu thay i ny cn ph thuc vo thanh iu. m v thanh nh m / s / c to nn khi cc dy thanh khng dao ng, xung kch c coi nh cc tp ngu nhin, kch bi cc dng kh xoy qua cc ch hp ca tuyn m (thng l pha khong ming). m n nh m / p / c to ra bng cch ng hon ton tuyn m, gy nn p sut bn cnh v tr ng, ri nhanh chng gii phng m ny. V tuyn m v tuyn mi bao gm cc ng m hc c mt ct khc nhau nn khi m truyn trong ng, ph tn s thay i theo tnh chn lc tn s ca ng. Trong phm vi to ting ni, nhng tn s cng hng ca tuyn m c gi l tn s formant hay n gin l formant. Nhng tn s ny ph thuc vo dng v kch thc ca tuyn m, do mi dng tuyn m c c trng bng mt t hp tn s formant. Cc m khc nhau c to bi s thay i dng ca tuyn m. Nh vy tnh cht ph ca tn hiu ting ni thay i theo thi gian ging vi s thay i dng ca tuyn m. Qu trnh truyn m qua tuyn m lm mnh ln mt vng tn s no bng cng hng v to cho mi m nhng tnh cht ring bit gi l qu trnh pht m. m c pht c ngha n mang thng tin v m v c tn x ra ngoi t mi. Trong mt vi trng hp, i vi nhng m mi (nh / m /, / n / trong ting Anh), tuyn mi cng tham gia vo qu trnh pht m v m c tn x ra t mi. Tm li, sng tn hiu c ch to bng ba ng tc: to ngun m (hu thanh v v thanh), pht m khi truyn qua tuyn m v tn x m t mi hoc t mi, nh hnh 1.2 sau y:

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets

2.3. Ng m hc Ng m hc l khoa hc nghin cu v m thanh ca ngn ng loi ngi trong tt c cc hnh thi v chc nng ca n. Ng m hc ng dng cc phng php khoa hc t nhin nghin cu nhng c trng m hc ca m thanh thc t v nhng phng cch sn xut ra cc m thanh m khng cn bit chng thuc ngn ng no. m v hc l khoa hc nghin cu mt x hi ca ng m, nghin cu cc c im s dng ca ng m trong tng ngn ng vi nhng phng php v khi nim ring ca mnh. 1 2.3.1. C s vt l ca ng m m thanh ngn ng l s chn ng ca lung khng kh i qua b my pht m ca con ngi m c quan thnh gic ca ngi ta c th cm th c. m truyn i khng kh di dng sng m vi tc 340 m/giy. Mi m c phn bit bng cc yu t sau: 1 - cao: ph thuc vo tn s rung ng. Tai bnh thng con ngi c th nghe c trong gii hn tn s 16Hz n 20.000 Hz. 2 - vang: ph thuc vo bin dao ng. Bin cng rng th vang cng to. 3 - di: ph thuc vo thi gian lu hay mau ca m. 1 - m sc: L sc thi ring ca m. m ni chung l hp th ca nhiu m c bn, trong m c bn c tn s thp nht v cc ho m c tn s bng bi s tn s m c bn. m sc khc nhau l do s lng v tnh cht ca cc ho m, m tnh cht ca cc ho m b tc ng nhiu bi hin tng cng hng. Trong b my pht m ca con ngi, dy thanh khong thanh hu l vt th pht ra m c bn vi tn s thp nht. Khong ming, khong mi l nhng hp cng hng to ra cc ho m. Cc khong ny do hot ng ca li, mi, ... lun c kh nng thay i hnh dng, th tch, li thot ca khng kh v do c kh nng cng hng khc nhau, to ra cc nhm ho m c tng cng khc nhau. Cc nguyn m khc nhau do c s cng hng khc nhau to ra cc nhm ho m tng cng khc
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets nhau. Di tn s c tng cng c trng cho mi nguyn m nh th c gi l formant. 1 2.3.2. C s sinh l ca ng m Trong ng m, ting thanh c to thnh do s rung ng ca dy thanh v ting ng do lung hi b cn tr v c xt. Do nu dy thanh khng hot ng hay hot ng t ta c chu k v tn s khng xc nh v gi l ting ng. Nguyn m ch yu thuc loi ting thanh, cn ph m c s tham gia nhiu ca ting ng, thm ch t l ting ng cao hn ting thanh. V cao, m v thanh cao hn m hu thanh. Mt s ngn ng nh ting Vit, Hn, Thi cao to ra n v ng m l thanh iu. V vang, cc nguyn m nghe vang to hn cc ph m. 1 2.3.3. m tit ting Vit m tit l n v ng m nh nht khi ni. D pht m c tht chm, tht tch bch th nhng m thanh ca pht ngn cng khng th chia nh c na. m tit v th c tnh ton vn c pht m bng mt t cng ca b my pht m [2]. Cc t cng ca c ni tip nhau lm thnh mt chui m tit v c th hnh dung bng mt chui ng cong hnh sin. Trong s di l hai pht m "c " v "qu". Trong pht m th nht c 2 m tit, m [u] nm nh m tit u. Trong pht m th hai c mt m tit v m [u] nm sn ca m tit.

1 2.3.4. c im m tit ting Vit a) Tnh c lp cao : - Ting Vit, m tit c th hin kh y , r rng, c tch v ngt thnh tng khc on ring bit. m tit no ca ting Vit cng mang mt thanh iu v cu trc n nh. iu ny lm cho s th hin ca m tit ting Vit trong chui li ni ni bt v tch bch hn. Do nn vic vch ra ranh gii gia cc m tit trong ting Vit d dng hn nhiu vic phn chia ranh gii m tit trong cc ngn ng chu u [2] (trong ngn ng chu u, vic phn chia m tit c khi phi dng phng php phn tch ph). Vic tch bch m tit cn c th hin ch vit, mi m tit c vit tch ra thnh mt t ring bit. C th ni so vi cc m tit chu u, ting Vit c tnh c lp cao hn hn. - Trong ting Vit khng c hin tng ni m (liaison). Trong cc ngn ng chu u thng gp cc hin tng ni m, v d nh trong ting Php: Vous allez Ting Vit khng c hin tng ni m nh vy. - Ting Vit thng khng b nhc ho (reduction) hay mt i ??? b) C kh nng biu hin ngha Tuyt i a s cc m tit ting Vit u c ngha. Gn nh ton b cc m tit u hot ng nh t. Ni cch khc trong ting Vit ranh gii ca m tit trng vi ranh gii ca hnh v [2]. (hnh v l n v c ngha nh nht trong mt ngn ng). Chnh v vy trong mt pht ngn, s lng m tit trng vi s lng hnh v.
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets c) C cu trc cht ch Mi m tit ting Vit dng y c 5 phn nh hnh: Thanh iu m u Vn m m m chnh

m cui

m u: thng l ph m, c gi l ph m u, n c chc nng to ra m sc cho m tit lc m u. m u c th khuyt trong mt s trng hp. m m: c chc nng lm thay i m sc ca m tit lc khi u v lm khu bit m tit ny vi m tit khc. V d tn v ton. m m c th khuyt trong mt s trng hp. m chnh: lun lun c mt trong mi m tit c chc nng qui nh m sc ch yu ca m tit. m chnh lun l nguyn m. m cui: C th l ph m hoc l bn nguyn m. C chc nng l c s phn chia loi hnh m tit, nhn ra s phn b, xut hin ca thanh iu. m cui c th khuyt trong mt s trng hp Thanh iu: Lun c mt v l yu t siu on tnh c chc nng khu bit m tit v cao S lng m tit trn l thuyt ca ting Vit l 11900, cn s lng m tit trn thc t l 6700. So vi cc cc ngn ng thng thng trn th gii c s lng m tit vo khong 3000-5000, ting Trung Quc s lng m tit trn l thuyt l 1620, trn thc t l 1255 . iu ny cho thy ting Vit c s lng m tit rt ln, v chnh v th t c hin tng ng m, t gy tr ngi cho vic nhn din m tit. Trong ting Vit c 6 thanh iu, 22 m u v 155 phn vn v phn vn ng vai tr khu bit ln hn c so vi cc yu t khc trong bc 1.

4 Thanh iu c hnh thnh bng s rung ng ca dy thanh, tu theo s rung nhanh hay chm, mnh hay yu, bin chuyn ra sao m ta c cc thanh iu khc nhau. Thanh iu ting Vit thuc loi thanh lt, c ngha l cc thanh iu phn bit vi nhau bng s di chuyn cao t thp ln cao hay t cao xung thp. Theo cc nh ngn ng hc th thanh iu c nh hng bao chm ln ton b m tit, mc d gnh nng ch yu tp trung phn vn. Ting Vit c su thanh iu. Nu chia thang ca ging ni bnh thng thnh 5 bc th ta c thanh iu ting Vit c miu t nh sau: (1) Khng du (2) Huyn (3) Ng (4) Hi (5) Sc (6) Nng
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets Cc m tit c m cui l /p,t,k/ ch c thanh nng v thanh sc. Sau y l bng tng kt s phn b gia nguyn m m chnh v cc m m v bn nguyn m cui Thanh iu lun lun l thnh phn bt buc phi c mt ca tng m tit, cho nn ci quan trng trong ng iu ca cu ting Vit l tit tu, cng v trng ; tuyn iu ca cu rt b hn ch v mt l do n gin: s ln ging, xung ging - mt cch bin ho v cao ca ng m c th nh hng n thanh iu, lm cho ngha ca t ca cu bin i.. Theo nh nghin cu V Xun Ho, thanh iu c kh nng th hin ngha gi t . Bng nhng c liu thng k c th, chng minh rng chc nng th hin ngha cm tnh ca thanh iu ting Vit c th hin ch yu nt khu bit v m vc v ng nt din bin cao. S i lp gia cc thanh iu c m vc v ng nt khc nhau gy nn nhng kh nng lin tng khng hon ton trng kht nhau, khc nhau v thm ch i lp nhau nhng nt gi t m thanh iu th hin trong cc t n tit v song tit c cng khun m tit ch khc nhau v thanh iu. . Kt qu kho st cho thy c s tng ng v sc thi biu cm mt s thanh iu nht nh. Da v bng ly kt qu nh gi chung, ta thy trong 50 t c a ra kho st, cc thanh iu ca yu t x c s tng ng vi cc sc thi biu cm nh sau: Thanh iu m vc cao m vc thp Bng phng Khng bng phng Bng phng Khng bng phng STBC Khng du Ng Sc Huyn Hi Nng Tch cc 4 0 10 1 0 1 Trung ho 0 0 2 1 1 1 Tiu cc 1 2 8 7 3 8 Tng cng 5 2 20 9 4 10 50 T bng 2 c th rt ra mt s nhn xt v s tng ng gia sc thi ngha vi thanh iu yu t x: - Cng l thanh iu c ng nt bng phng nhng thanh c m vc cao (ngang) c xu hng biu t sc thi tch cc, cn thanh c m vc thp (huyn) c xu hng biu t sc thi tiu cc. Khng du: 4/5 (chim t l 80%) c nh gi tch cc. Huyn: 7/9 (chim t l 77%) c nh gi tiu cc. - Nhng thanh c ng nt khng bng phng, gy (hi, ng) c xu hng biu t sc thi tiu cc. Hi 2/2 (chim t l 100%) c nh gi tiu cc. Ng: 3/ 4 (chim t l 75%) c nh gi tiu cc. -Cng l thanh c ng nt khng bng phng, khng gy nhng thanh c m vc thp (nng) c tnh biu trng r rng hn, thin v nh gi tiu cc. Trong khi , thanh c m vc cao biu th mc cao c 2 hng tch cc v tiu cc. Nng: 8/ 10 (chim t l 80%) c nh gi tiu cc. Sc: 10/ 20 (chim t l 50%) c nh gi tch cc, 8/ 20 (chim t l 40%) c nh gi tiu cc. Nu nh cc thanh m vc cao c xu hng nh gi c 2 hng tch cc v tiu cc: 14/ 27 (chim t l 52%) nh gi tch cc, 11/ 27 (chim t l 40%) nh gi tiu cc; s nh gi thin v tiu cc: 18/ 23 (chim t l 77%) nh gi tiu cc.

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

10

Ni dung: Phn tch ting ni trong khng gian Wavelets

III. PHN TCH TN HIU TING NI


3.1. S pht trin ca x l ngn ng v ting ni trong Vin thng v CNTT C th ni x l ngn ng t ng trn my tnh l mt trong nhng vn kh nht ca CNTT. Ci kh nm ch lm sao cho my c hiu ngn ng con ngi, t vic hiu ngha tng t trong mi hon cnh c th, n vic hiu ngha mt cu, ri hiu c vn bn. Ta ly li th d ca Marvin Minsky (1992), mt cy a cy ca ngnh tr tu nhn to (artificial intelligence): Xt t si dy chng hn. Ngy nay khng mt my tnh no c th hiu ngha t ny nh con ngi. Cn chng ta ai cng bit c th dng si dy ko mt vt, nhng khng th y mt vt bng si dy ny. Ta c th gi mt gi hng hoc th diu bng mt si dy, nhng khng th n si dy. Trong vi pht, mt a tr nh c th ch ra hng trm cch dng hoc khng dng mt si dy, nhng khng my tnh no c th lm vic ny. Mu cht y l bn cht phc tp ca ngn ng ca con ngi, c bit l s a ngha v nhp nhng ngha ca ngn ng. Thm na, c mt khc bit su sc na l con ngi ngm hiu v dng qu nhiu l thng (common sense) trong ngn ng, nh bit la th nng cn chim th bit bay, hay si dy th khng dng y hay khu cc vt, trong khi rt kh lm cho my hiu cc l thng ny. Nhn dng ting ni l mt qu trnh nhn dng mu, vi cc mu l cc n v nhn dng, c th l cc t hoc cc m v. Kh khn c bn ca bi ton ny l ting ni lun bin thin theo thi gian v c s khc bit ln gia ting ni ca nhng ngi ni khc nhau, tc ni, ng cnh v mi trng m hc khc nhau. Cc nghin cu v nhn dng ting ni da trn ba nguyn tc c bn: Tn hiu ting ni c biu din chnh xc bi cc gi tr ph trong mt khung thi gian ngn (short-term amplitude spectrum). Nh vy ta c th trch ra cc c im ting ni t nhng khong thi gian ngn v dng cc c im ny lm d liu nhn dng ting ni. Ni dung ca ting ni c biu din di dng ch vit, l mt dy cc k hiu ng m. Do ngha ca mt pht m c bo ton khi chng ta phin m pht m thnh dy cc k hiu ng m. Nhn dng ting ni l mt qu trnh nhn thc. Ngn ng ni c ngha, do thng tin v ng ngha (semantics) v ng dng (pragmatics) c gi tr trong qu trnh nhn dng ting ni, c bit khi thng tin v m hc l khng r rng. Lnh vc nghin cu ca nhn dng ting ni l kh rng lin quan n nhiu ngnh khc nhau, nh x l tn hiu s (digital signal proccessing), vt l hay m hc (acoustic), nhn dng mu, l thuyt thng tin v khoa hc my tnh (information and computer science theory), ngn ng hc (linguistics), sinh l hc (physiology), tm l hc ng dng (applied psychology). Cc h thng nhn dng ting ni c th c phn chia thnh hai loi khc nhau: h thng nhn dng t ri rc v h thng nhn dng t lin tc. Trong h thng nhn dng ting ni lin tc, ngi ta li phn bit h thng nhn dng c kch thc t in nh v h thng nhn dng vi kch thc t in trung bnh hoc ln. Trong khi x l ting Vit cn ang chp chng nhng bc i u, cc nghin cu v ng dng v x l ngn ng ni chung trn th gii v nhiu nc khc c mt lch s hn na th k, tri qua nhiu giai on, v iu quan trng hn c l nhiu con ng v cch thc x l ngn ng c tri nghim v tha nhn. Lch s x l ngn ng c th c chia ra cc giai on nh sau [Jurafsky and Martin, 2000]:
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

11

Ni dung: Phn tch ting ni trong khng gian Wavelets Cc l thuyt nn tng c xy dng trong cc nm 1940 v 1950: Hai kiu m hnh nn tng ca giai on ny c nh hng su sc n x l ngn ng l m hnh cc my t ng (automaton) v cc m hnh ca l thuyt thng tin hay xc sut: My tnh in t bt ngun t cc m hnh my Turing (1936), l thuyt ngn ng hnh thc, m ha, entropy, Hai nhnh tch ri nhau t cui cc nm 1950 n u nm 1970: Hai kiu x l khc bit nhau r rt: (a) kiu hnh thc (symbolic paradigm) cho vn bn nh l thuyt ngn ng hnh thc ca Chomsky v tr tu nhn to, v (b) kiu ngu nhin (stochastic paradigm) cho ting ni nh cc phng php Bayes. Bn kiu x l ngn ng ph bin trong thp k 70 n gia thp k 80: Cc m hnh ngu nhin ng mt vai tr ln, tiu biu l cc m hnh Markov n (HMM, Hidden Markov Model) trong x l ting ni; Kiu da trn logic (logic-based paradigm); Hiu ngn ng t nhin (natural language understanding); M hnh ha cc cuc i thoi lin tc (discourse modeling). Cng trong thi gian ny xut hin cc bi ton v ti nguyn chun (Penn Treebank, WordNet, MUC, etc.). Ch ngha kinh nghim v cc m hnh hu hn trng thi t gia thp k 80 n gia thp k 90: giai on ny c th hun luyn cc m hnh hu hn trng thi ra i trong thp k 60. Cc m hnh xc sut v tip cn da vo d liu (data-driven approach) xut hin trong hu ht cc nhim v ca x l ngn ng [Manning and Schutze, 1999], [Jelinek, 1998]. X l vn bn v ting ni gp nhau trong mi nm qua: y l giai on tin b vt bc vi m hnh thng k v tip cn da vo d liu, vi vic tng trng nhanh chng ca tc v b nh my tnh, ca cc ng dng da trn Web [Jurafsky and Martin, 2000]. Cng ngh x l ting ni khng th ch da trn cc k thut x l tn hiu, m cn phi da c trn vic hiu ngn ng. Do tham s ca cc m hnh thng k hoc cc m hnh hu hn trng thi c th hun luyn c t cc kho ng liu ln, nhiu m hnh kiu ny tip tc ra i v c ng dng rng ri nh Maximum Entropy (MaxEnt), Maximum Entropy Markov Model (MEMM), Conditional Random Fields (CRF), [Cohen and McCallum, 2003] Nh trnh by s b trn, x l ngn ng l mt vic kh, phc tp, ch c th lm lu di theo nhiu bc tun t, ch c th t c kt qu bc sau khi bc trc c kt qu. Chng hn nh cc chng trnh dch t ng trn th gii c theo ui hng chc nm v chng ng n ch cui vn cn rt xa. Nu chng ta mun lm dch t ng Anh-Vit, bt buc chng ta i qua cc tng ca x l ngn ng k trn, v ni chung khng th hy vng thi gian s ngn hn nhiu so vi ngi i trc [Dorr et al., 2000]. Trn th gii, nhiu t chc quc t, nhiu hip hi x l ngn ng t nhin c thnh lp vi cc hot ng phong ph hng nm, vi lc lng nghin cu ng o: ACL (Association Computational Linguistics), NAACL (North American Association on Computational Linguistics), EACL (Euro Association on Computational Linguistics), PACLIC (Pacific Association on Computational Linguistics), ICCL (International committee Computational Linguistics). Rt ng ch l nhiu ngun ti nguyn, ng liu phong ph c to ra, c chia s d i khi di dng thng mi nhng vi gi c hp l , tiu biu l LDC (Linguistic Data Consortium, http://ww.ldc.upenn.edu).
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

12

Ni dung: Phn tch ting ni trong khng gian Wavelets Nhiu chnh ph u t ln cho x l ngn ng trong CNTT (M, Nht bn, Trung quc, Singapore, etc.). Hy th nhn n cc nc quanh Vit Nam. Trung quc, nghin cu v x l ngn ng c t lu v hin ang rt pht trin vi s u t mnh m t chnh ph. H lm ra nhiu cng c, ti nguyn phong ph, nh Wordnet cho ting Trung quc, cc h dch gia ting Trung quc v qua ting mt s nc, cc ngn hng ng liu cho phn tch c php nh Chines Bank. Ngay vi nc lng ging Thailand, x l ngn ng Thi l phn c u t ln ca chnh ph ti National Electronics and Computer Technology Center (NECTEC), National research council of Thailand (NRCT), Thai Research Foundation (TRF) vi nhng kt qu ban u v dch my, v POS tagging. Thailand c cc phng th nghim ln nghin cu v NLP, Thai Computational Linguistics Laboratories (http://www.tcllab.org). L ngi i sau trong lnh vc x l ngn ng, vic hiu cc cng ngh ngn ng, cc xu th cng ngh, cc ngun ng liu, kinh nghim v bi hc t cc nc khc trong lnh vc ny l ht sc quan trng. Bit, hc v khai thc c chng s gip ta cn nhc chn c ng i hp l trong x l ting Vit. 3.2. Cc phng php x l tn hiu ting ni C nhiu phng php c s dng rng ri trong cc h thng nhn dng hin ti tnh ton cc h s MFCC (Mel Scale Frequency Cepstral Coefficients) v h s PLP (Perceptual Linear Prediction). Hai k thut x l ting ni RASTA (RelAtive SpecTral) v CMS (Cepstral Mean Subtraction) cng c gii thiu. y l hai k thut loi b nhiu hay c s dng c bit l trong pht m c nhiu nhiu nh cc cu thu m qua in thoi. ngoi ra m hnh Markov n HIDDEN MARKOV MODEL (HMM) c dng rng ri cho nhn dng ting lin tc nh vo kh nng c hu bao gm tnh thng k v tun t ca tn hiu ting ni 3.2.1. Phng php tnh h s MFCC.

Hnh 3.1 miu t cc bc tnh ton h s MFCC: 1 1) Nhn mnh tn hiu (pre-emphasis). Trong bc x l u tin ny, tn hiu c a qua mt b lc s:
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

13

Ni dung: Phn tch ting ni trong khng gian Wavelets 2 H pre ( Z ) = 1 + a pre Z 1 Trong apre l h s nhn mnh, thng c gi tr l 0,9700002861. B lc c tc dng tng cng tn hiu ti tn s cao (trn 1KHz) vi hai l do chnh: Ging ni c s suy gim khong 20dB/decade khi ln tn s cao do c im sinh l ca h thng pht m ca con ngi. Bc x l ny s tng cng tn hiu ln mt gi tr gn 20dB/decade khc phc s suy gim ny. H thng thnh gic ca con ngi nhy cm hn vi vng tn s cao, bc x l ny nhn mnh vng tn s cao, tr gip cho qu trnh m hnh ho m thanh sau ny ca h thng nhn dng. 1 2) To khung tn hiu (framing). Tn hiu ting ni lun lun bin thin theo thi gian, tuy nhin trong khong thi gian khong 10-20ms, tn hiu ting ni c coi l tng i n nh. Do tn hiu thng c chia thnh cc khung kch thc 20-30ms vi vng gi ln nhau khong 10-15 ms. 1 3) Lm ca s (frame windowing). Ca s Hamming thng c p dng cho mi khung tn hiu gim s tc ng ca vic chia khung tn hiu:

Trong N l s mu d liu (sample) ca ca s. 1 4) DFT (Discrete Fourier Transform). Ti bc ny vi mi khung tn hiu, bin i Fourier c p dng chuyn tn hiu v min tn s. Cng vic tnh ton c thc hin bng thut ton FFT (Fast Fourier Transform). 2 5) Lc theo thang tn s Mel (Mel-frequency bandpass filter). Cc b lc s c p dng lc cc tn hiu theo cc gii tn s khc nhau. Phn ng ca tai ngi vi cc thnh phn ca tn s l khng tuyn tnh. S khc nhau v tn s vng tn s thp (<1KHz) d c nhn bit bi con ngi hn l vng tn s cao. Lc theo thang tn s Mel m phng tnh cht ny bng cch dng cc b lc c phn b theo mt hm phi tuyn trong khong khng gian tn s, thng thng l hm Mel: 1 6) Logarit gi tr nng lng (logarit of filter energies). Cc gi tr nng lng thu c ti mi knh lc c ly logarit nn cc gi tr ny vo mt min gi tr hp hn. 2 7) DCT (Discrete Cosin Transform). Do ging ni pht m bi con ngi c ph kh trn (smooth) trn min tn s, do vy cc gi tr nng lng ca cc b lc gn nhau c s tng quan (correlated) kh gn. Bc x l ny bin i cc gi tr nng lng thnh cc h s t tng quan vi nhau hn, cc h s ny c gi l h s cepstral.

Trong N l s knh lc, mj l gi tr logarit nng lng ca mch lc th j, i l bc ca h s cepstral. 1 8) Chnh cc gi tr cepstral. Gi tr cepstral bc cao thng c gi tr rt thp, so vi cc gi tr cepstral bc thp. S khc bit ny gy kh khn cho vic m hnh ho d
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

14

Ni dung: Phn tch ting ni trong khng gian Wavelets liu, v d nh khi s dng cc hm mt xc sut Gauss. Do cc h s cepstral c iu chnh li (re-scaled) theo cng thc: Sau bc hiu chnh ny ta thu c cc gi tr MFCC. 1 9) Tnh gi tr delta MFCC. Cc gi tr delta ca cc h s MFCC c tnh ton nhm phn nh s bin thin ting ni theo thi gian. Cc gi tr delta c tnh ton da trn cc gi tr MFCC ca cc khung tn hiu ln cn:

Trong l s khung tn hiu ln cn c dng (thng thng l 2). Ngoi ra gi tr delta ca delta (hay cn gi l acceleration) cng c th c tnh ton t cc gi tr delta dng cng mt cng thc nh trn. 3.2.2. Phng php tnh h s PLP

Hnh 3.2. Cc bc x l ca phng php tnh ton h s PLP Phng php PLP c pht trin da trn phng php m d bo tuyn tnh LPC (Linear Prediction Coding). Phng php LPC hay cn gi l m hnh ho t hi qui (autoregression modeling) l phng php m hnh ho tn hiu bng s kt hp tuyn tnh cc mu tn hiu trc :

trong N l s h s hay l bc ca d bo, cc a(i) l cc h s d bo tuyn tnh (linear prediction coefficients), e(n) l hm li. Cc h s a(i) c chn lm cc tiu hm li d bo trung bnh bnh phng. C mt vi phng php tnh cc h s ny: phng php dng ma trn hip phng sai (covariance matrix), phng php t tng quan (auto-correlation method), phng php li hay cn gi l phng php iu ho (lattice or harmonic). Phng php hay c p dng nht trong nhn dng ting ni l
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

15

Ni dung: Phn tch ting ni trong khng gian Wavelets phng php t tng quan dng thut ton qui Levinson-Durbin. Thut ton Levinson-Durbin nh sau: Tnh p+1 cc h s t tng quan u tin (p l bc ca d bo tuyn tnh) bng cng thc sau:

Trong s(n) l tn hiu trong ca s, N l s lng mu trong ca s. Cc h s sau c tnh ton qui nh sau:

Cc bc trn c tnh ton lp vi i=1,2,...p. Cui cng ta thu c cc h s: Phng php tnh cc h s PLP da vo phng php LPC. Hnh 3.2 miu t cc bc x l tnh ton h s PLP: 1 1) FFT. Tng t nh phng php MFCC, tn hiu ting ni c chia thnh cc khung tn hiu v c bin i Fourier sang min tn s bng thut ton FFT. 2 2) Lc theo thang tn s Bark. Tng t nh phng php tnh MFCC, tn hiu ting ni c lc qua cc b lc phn b theo thang tn s phi tuyn, trong trng hp ny l thang tn s Bark:

1 3) Nhn mnh tn hiu dng hm equal-loudness. Bc x l ny tng t nh b-c nhn mnh pre-emphasis ca phng php MFCC. Hm ny m phng ng cong cn bng n (equal-loudness curve).

1 4) Dng lut cng nghe (power law of hearing). Bc x l ny ging nh bc ly gi tr logarit trong phng php MFCC. Hm cn lp phng c s dng nn cc gi tr nng lng. 5) Bin i Fourier ngc (invers DFT). Cc h s t tng quan c bin i Fourier ngc sau dng lm gi tr u vo cho phng php LPC. 2 6) Thut ton Durbin c s dng tnh ton cc h s d bo tuyn tnh ging nh trong phng php LPC. 3 7) Tnh cc gi tr delta. Phng php tnh tng t nh phng php tnh h s MFCC.
1

3.2.3. Phng php m hnh Markov n HMM


Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

16

Ni dung: Phn tch ting ni trong khng gian Wavelets M hnh Markov n c dng rng ri cho nhn dng ting lin tc nh vo kh nng c hu bao gm tnh thng k v tun t ca tn hiu ting ni Trong nhn dng ting ni chia ra 2 loi: - Nhn dng cc t ring bit. - Nhn dng ting ni lin tc. HMM l mt qu trnh c 2 ln tnh cht ngu nhin: mt qu trnh ngu nhin m khng quan st c (c ngha l n b n) nhng ch c quan st qua mt tp hp cc qu trnh ngu nhin khc pht sinh trnh t cc k hiu c quan st. Mt HMM cung cp mt khung xc sut cho vic m hnh mt qu trnh thay i theo thi gian. N c trng bi nhng iu sau: - Mt s hu hn cc trng thi: mc d cc trng thi l n , chng c nhng c tnh ring bit ca m hnh . Cc trng thi biu din cc la chn khng xc nh t trong cc tp hp ca nhng la chn khc nhau , c trnh by nh cc vng trn trong s HMM. - Mt s hu hn cc k hiu quan st phn bit i vi trng thi (ngha l mt bn k t ri rc c kch thc hu hn ): cc k t quan st tng ng vi cc output c th ca h thng c m hnh. - Mt phn b xc sut chuyn v :Xc sut chuyn v nh ngha xc sut ca m hnh t mt trng thi ny n mt trng thi khc . Cc chuyn v c biu din nh cc mi tn trong hnh . Cc chuyn v t trng thi u tin ca HMM i vo trng thi u tin c gi l chuyn v qui - Cc phn b xc sut k hiu quan st: Sau mi chuyn v , m hnh pht sinh mt k hiu quan st theo mt phn b xc sut ph thuc trn trng thi hin hnh . Phn b xc sut k hiu hin hnh quan st c gi c nh i vi mi trng thi , bt k th no v khi no mt trng thi c bit c a vo . V vy vi K trng thi c K phn b xc sut k hiu quan st c xem xt . - Mt phn b xc sut trang thi ban u. Mt s HMM tiu biu c cho bi hnh sau:

HMM c th c xy dng theo nhiu cch . Kin trc HMM chung nht tm thy trong nhn dng ting ni l t tri sang phi , c ch trong hnh . Hnh trn biu din mt kiu c bit ca HMM c gi l m hnh tri-phi hay bakis, v trnh t trng thi c bn kt hp vi m hnh c tnh cht m khi thi gian tng, cc trng thi c x l t tri qua phi. Trong hnh thc HMM ca nhn dng ting ni , gi s rng tn hiu ca ting ni c sn sinh bi mt automaton ngu nhin (my trng thi hu hn) c xy dng t mt tp hp cc trng thi. Q = ( q1 , q2 , . . . , qk ). Automaton ny c iu khin bi qui lut thng k . c bit i vi mi n v ting ni ( nh t vng ) hoc n m (phoneme) chng ta c c mt HMM c bit lm t L trng thi ql Q vi l = 1 , 2 , 3 , . ,L , theo mt v tng (topo ) c m t trc.
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

17

Ni dung: Phn tch ting ni trong khng gian Wavelets Mt tin x l(preprocessor) c s dng bin i tn hiu ting ni thnh mt chui vector m thanh m c xc nh bi tn s ca n (ngha l s rung ng m mt ging thun ty pht sinh mi giy), bin (ngha l n ca mt m), cu trc ha iu (ging, ngha l sng hnh sin so vi sng nhiu, ngha l sng khng chu k) v s cng hng (ngha l kh nng ca mt ngun rung ca m thanh gy ra mt vt khc rung): X = (x1 , x2 , , xN ) Nhng thng s ca cc vector m thanh c trch ra t my tin x l trong mt khong thi gian c nh, th d 10 ms. Nhng vector m thanh bao gm nhng c tnh cc b nh ph v cc thng s lin h. Thm vo , xc sut chuyn v c dng m hnh s di ch cc c im ny theo thi gian. Cc vect m thanh s c dng nh cc vect input v chng s c so snh vi cc m hnh c lu tr. Cc so snh pht sinh cho ra mt xc sut biu din s chnh xc ca m hnh tham chiu HMM c lu tr so vi d liu input. Cch tip cn ny c gi l thut gii ng cc i B aum-Welch. Mt phng php thng dng khc c dng cho nhn dng ngu nhin l thut gii Viterbi. Thut gii Viterbi tm trong mng Noron mt trnh t cc trng thi HMM tng ng gn nht vi input. Nhng trng thi gn nht ny c ni kt li thnh mt ng din t c input tt nht. Gii hn ch yu ca cch tip cn HMM l yu cu v cc ga thit thng k nghim ngt m khng chc c gi tr cho lnh vc nhn dng ting ni. 3.3. M hnh Fujisaki v p dng trong phn tch thanh iu ting Vit Ngn iu ca li ni lin kt cht ch vi khi nim ng iu. C th ni ng iu l s nng cao h thp ca ging ni trong cu. Tn s c bn F0 l c trng chnh ca ng iu. Ng iu l mt thnh phn ca ngn iu. Ting Vit l ngn ng c thanh iu, cc thanh iu c cc c trng rt khc nhau v ng nt F0. Trong li ni lin tc, ng nt F0 ca cc thanh iu b bin i ph thuc vo thanh iu ca cc m tit lin k v v tr ca m tit trong cu. Vic m hnh ho ng nt F0 cc thanh iu c ngha quan trng trong vic tng hp ting ni. Fujisaki v cc ng s pht trin mt cch m t ton din ng iu ting Nht da trn mt m hnh nh lng sau ny mang tn Fujisaki [2]. M hnh Fujisaki c ng dng rng ri trong cc h thng tng hp ca ting Nht nh tng hp cc bn tin thi tit. M hnh MFGI (Mixdorff-Fujisaki model of German Intonation) c ng dng trong h thng Textto-Speech ting c. Vi mt s thay i nh, m hnh Fujisaki thch hp trong vic phn tch ng nt F0 trong ting Anh, ting Thy in, ting Ty Ban Nha, ting c, ting Hy Lp, phn tch v tng hp thanh iu ca ngn ng c thanh iu nh ting Trung, ting Thi. Trong phn ny chng ti tin hnh th nghim m hnh Fujisaki vi cc cu ni ting Vit. Cc cu ni c thit k va mang su thanh iu va th hin cc t hp thanh quan trng nh thanh ng v thanh nng. Cc tham s m hnh c iu chnh thch ng vi cc c trng ca ngn ng ting Vit. 3.3.1. M hnh Fujisaki 18

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

Ni dung: Phn tch ting ni trong khng gian Wavelets

Fujisaki l mt m hnh nh lng dng m hnh ha ng iu (intonation). M hnh Fujisaki hng vo vic m hnh ha qu trnh sinh ra tn s c bn F0, gii thch v mt vt l hc, sinh l hc qu trnh sinh ra F0 v cc tnh cht ca qu trnh . M hnh c p dng ch yu trong ng dng tng hp nhm xy dng phn ng iu trong ting ni tng hp. M hnh sinh ra F0 theo b ba cng thc sau

Cc tham s ca m hnh gm c - Cc hng s: Fb l gi tr khi u ca ng tn s c bn. Fb l gi tr ph thuc vo ngi ni ch khng ph thuc vo cc mu ting ni. Gi tr l tn s gc t nhin ca lnh ng (phras command). Gi tr l tn s gc t nhin ca lnh trng m (accent command). Gi tr l mc gi tr trn tng ng vi cc thnh phn trng m - Cc i s: I l s lnh ng. J l s lnh trng m. Api l cng ca lnh ng th i. Aaj l bin ca lnh trng m th j. T0i l thi im bt u lnh ng th i. T1j v T2j l thi im bt u v kt thc thanh iu lnh trng m th j. Trong m hnh, ng F0 c xt min logF0, mc ch ca php bin i ny l lm cho ging ni ca nam v n ging nhau. Theo trn, cc gi tr =2.0/s v =20.0/s, trong mt s trng hp c bit =3.0/s. Tuy nhin theo quan st th nm trong khong [1.0;3.0], cn thuc khong [19.5;20.5]. Thnh phn ng Gp(t) trong cng thc (2) nh ngha c ch iu khin ng. u vo ca c ch iu khin ng l cc lnh ng bao gm cng Ap vi thi gian bt u T0. H s l hng s thi gian v l khng i vi mt cu ni. Thnh phn trng m Ga(t) trong cng thc (3) nh ngha c ch iu khin trng m vi u vo l cc lnh trng m bao gm bin Aa, thi gian bt u T1, thi gian kt thc T2. H s l hng s theo thi gian ca c ch iu khin trng m v l khng i vi mt cu ni. Thnh phn trng m khng bao gi vt qu gi tr trn (thng c gn gi tr 0.9). Vic phn tch ng nt F0 c thc hin bi phng php phn tch bng tng hp (vit tt l AbS: Analysis-by-Synthesis). Gi tr cc tham s ca m hnh c thay i cho ti khi xp x tt nht ng nt F0 ca cu ni c phn tch. Vi s lng lnh khng gii hn (lnh ng v lnh trng m), bt k ng nt F0 no cng c th c xp x vi chnh xc khng gii hn. V th cn c cc rng buc m bo tnh c
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

19

Ni dung: Phn tch ting ni trong khng gian Wavelets ngha v mt ngn ng hc ca cc kt qu phn tch. Cc rng buc l cc c trng v ngn ng v lin quan ti mi quan h gia cc cu trc v n v ngn ng (nh ngn iu v trng m) v cc lnh ng v lnh trng m. Cc tham s Ap, T0, , Aa, T1, T2, , Fb c gi l l cc tham s Fujisaki v phng php phn tch bng tng hp ng nt F0 s dng m hnh Fujisaki c gi l phn tch Fujisaki. Cc tham s ca m hnh c th c sinh ra t ng bi nhiu cch khc nhau ty vo tng ngn ng c phn tch 3.3.2. Phn tch thanh iu ting Vit bng m hnh Fujisaki

Khi p dng m hnh Fujisaki cho cc ngn ng c thanh iu, thnh phn trng m c gi l thnh phn thanh iu v s dng c lnh thanh iu dng (At > 0), lnh thanh iu m (At < 0) m hnh ho cc thanh iu nh trong hnh trn. C s d liu phn tch ng nt F0 ca thanh iu ting Vit v s lin cu m gia cc thanh iu lin k, mt tp gm 72 cu ni, mi cu ni gm 6 m tit c xy dng t cu gc: nha mai lm nhan nhiu ng, mi m tit trong cu gc s mang cc thanh iu khc nhau th hin c nhiu t hp thanh iu lin k nh: 1) Nh mai lm nhn nhiu ng 2) Nh mi lm nhan nhiu ng 3) Nha mi lm nhn nhiu ng ..... Cc cu c pht m vi ging chun min Bc bi hai ngi mt nam, mt n. th hin c nhiu t hp m ch dng mt cu gc nn a s cc cu trong c s d liu u khng c ngha. m bo tnh t nhin ca li ni, hai ngi ni u c chun b trc, cc cu ni c pht m nhiu ln v kim tra li chn cu ni t nhin nht. ng nt F0 c tnh ton theo tng on 10ms. Phng php phn tch phn tch ng nt F0, mt cng c phn tch cc tham s ca m hnh Fujisaki c s dng. Cng c ny h tr cc lnh thanh iu m v c s dng trong phn tch ng nt F0 ting Thi. Fb c t bng 96 Hz cho ging nam v 210 Hz cho ging n. v cho c ging nam v n c ln lt t bng 2 Hz v 25 Hz. Cc bc tin hnh phn tch bao gm: 1. Tnh ng nt F0. 2. La cc chn lnh ng cu ni. 3. Da vo thanh iu ca cc m tit la chn cc lnh thanh iu ph hp. 4. iu chnh cc tham s sao cho ng nt F0 sinh ra xp x tt ng nt F0 thc. 5. Tng hp li cu ni vi ng nt thanh iu mi s dng phng php PSOLA. 6. Cm nhn bng tai cu ni tng hp, so snh vi cu ni gc v iu chnh li.
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

20

Ni dung: Phn tch ting ni trong khng gian Wavelets 3.3.3 Mt s nhn xt v thanh iu ting Vit trong khi phn tch c s d liu S khc bit gia thanh nng v ng vi cc thanh iu khc l r rng. Trong 6 thanh iu ting Vit, thanh nng v thanh ng c nhng c im khc bit so vi cc thanh iu cn li. ng nt F0 b t t ngt hai thanh ny. Cc thanh ny khng ch khc cc thanh iu khc ng nt F0 m cn cc c trng khc. Do , khi cn tng hp li thanh ng v thanh nng th ch ng nt F0 l cha .

Thanh ng b gy gia. Khng nhng b gy F0, thanh ng cn b gy ph, cng chnh l khc bit ln nht gia thanh ng v cc thanh khc. Thanh nng c c trng l b gy, t, v i xung t ngt cui m, lc ny ng nt F0 khng cn quan trng, v d nh khi cho thanh nng ng nt F0 ca thanh sc th ngi nghe vn cm nhn c thanh nng v ch c cm tng ngi ni nhn mnh hn m tit mang thanh nng. nhng m tit ng kt thc bng t, p, c, k (ch c th c 2 thanh sc hoc nng), khi cho ng nt ca thanh nng i ln ri tng hp li ta nhn c thanh sc v ngc li, cho ng nt thanh sc i xung ri tng hp li ta nhn c thanh nng. cc m tit khc iu ny l khng ng. S ging nhau gia thanh ng v thanh sc l c hai thanh u c ng nt thanh iu i ln, v m vc bt u ca m sc cao hn thanh ng. Khi lm th nghim c hin tng: Cho ng nt ca thanh sc v ng ging ht nhau, khi tng hp li ngi nghe vn phn bit c hai thanh ny. 3.3.4. Kt qu phn tch thanh iu bng m hnh Fujisaki Cc kt qu phn tch c s d liu cho thy, cc thanh ngang, sc, ng c biu din bng mt lnh thanh iu dng, thanh huyn v hi c biu din bng mt lnh thanh iu m, thanh nng khng cn lnh thanh iu. Thanh iu Biu din bng lnh thanh iu Khng du 1 lnh thanh iu dng trc m tit Sc 1 lnh thanh iu dng Hi 1 lnh thanh iu m Huyn 1 lnh thanh iu m Ng 1 lnh thanh iu dng Nng khng dng lnh thanh iu
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

21

Ni dung: Phn tch ting ni trong khng gian Wavelets Bng 1: Biu din cc 6 thanh iu ting Vit bng cc lnh thanh iu Cc cu c phn tch ch s dng mt lnh ng cho c cu, ph hp vi hin tng trong cu ni, ngi ni thng ln ging u cu v h ging cui cu. Tuy nhin trong ting Vit hin tng ny khng r rt nh cc ngn ng khc nn cng ca lnh ng ny khng ln. Thanh ng v thanh sc c biu din bng mt lnh ng iu dng ph hp vi nhn xt v s ging nhau gia 2 thanh ny trong phn trc. Thanh hi c ng nt F0 i xung, n gia thanh, ng nt F0 li i ln, thanh ny ging thanh T3 (low tone) ca ting Trung v c biu din bng mt lnh thanh iu m ging nh trng hp ca ting Trung.

Hnh 5: Kt qu phn tch thanh iu ting Vit bng m hnh Fujisaki Tm li: M hnh v c bn khng th p dng cho bi ton nhn dng ting ni. L do ch yu l m hnh ny thc cht tng hp ng F0 mt cch tuyn tnh, n l mt php bin i lm trn ng F0. u vo ca m hnh ny l mt F0 th, khi i qua m hnh ta c mt F0 mi vi cc c trng v ng iu, hay cn gi l F0 trn. Cc kt qu phn tch thanh iu ting Vit chng t rng c th p dng m hnh Fujisaki vo vic m hnh ha thanh iu ting Vit, cao hn na l m hnh ha ng iu ting Vit vi vic th hin ng nt F0 khng ch ca cc thanh iu m cn ca c cc loi cu nh cu trn thut, cu hi, ... T lm nng cao cht lng ca h thng tng hp ting ni v cc kt qu phn tch cng c th p dng kt qu tnh ton ng m hc vo vic nhn dng ting ni. Tuy nhin, c th lm c iu trn cn phi c nhng phn tch su hn na trn b d liu thc ln hn, phn tch thanh iu di ng cnh ca cc thanh iu khc.
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

22

Ni dung: Phn tch ting ni trong khng gian Wavelets

IV. NHN DNG TING NI TRONG KHNG GIAN WAVELETS


Trong khong mt thp nin tr li y, thut bit i Wavelet c pht trin v nhanh chng tr thnh mt mi nhn nghin cu trong lnh vc x l tn hiu. Cc ng dng v nghin cu v thut bin i ny v c trin khai trong nhiu nghnh t ton hc, vin thng n chun on y khoa. Mt s chun trong gii thut nn tn hiu, hnh nh (JPEG), k thut nhn dng du vn tay ca FBI cng nh nhiu cng trnh nghin cu khc v x l hnh nh, h thng radar d tm, nhn dng ting ni v ang s dng thut bin i Wavelet.

Wavelet l mt sng con, tc l n c nh ngha trong mt khong thi gian hu hn. i vi FT. khong thi gian kho st l ton b chiu di tn hiu. i vai STFT, n chia thanh nhiu khong thi gian xc nh v cc khong thi gian ny ging nhau. Trong phn tch Wavelet, kich thc ca ca s ny s thay i tng ng vi moi vng tn s. Cc tn s thp s c phn tch vi mt khong thi gian di, v cc tn s cao s c phn tch vi mt khong thi gian ngn.

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

23

Ni dung: Phn tch ting ni trong khng gian Wavelets Php bin i Wavelet (WT) l mt k thut phn tch tn hiu rt hiu qu. N l mt gii php la chn c th thay th cho STFT gii quyt cc vn lin quan vic phn tch thi gian, bin v tn s ca tn hiu. 4.1 php bin i wavelet ri rc (The Discrete Wavelet Transform - DWT) Php bin i Wavelet ri rc (DWT) l mt trng hp c bit ca WT. N cung cp mt cch biu din tn hiu di dng nn trong min thi gian-tn s gip cho vic tnh ton mt cch nhanh chng v hiu qu. Vi bi ny, DWT c dng trong vic phn tch tn hiu ting ni theo min tn s rt ra cc c trng v thanh iu ting Vit 4.1.1 Php bin i Wavelet 1 mc Trong phn tch Wavelet, khi chng ta a mt tn hiu qua cc b lc Lowpass v Highpass, tn hiu c chia thanh 2 tn hiu thnh phn: phn Approximations - k hiu A v phn Details - k hiu D. Phn A tng ng vi cc thnh phn tn s thp ca tn hiu (h s co dn ca Wavelet ln), D tng ng vi cc thnh phn tin s cao ca tn hiu (h s co dn ca Wavelet nh). Qu trnh lc 1 mc c bn nht nh sau: a tn hiu qua b lc Highpass, Lowpass v sau gim mu (downsample), n tch thnh 2 tn hiu thnh phn D, A c minh ha bi hnh v 2.12:

Vi du: Ta c tn hiu ting ni S c tch thanh hai tn hiu thnh phn: A cha cc tn s thp v D cha cc tn s cao.

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

24

Ni dung: Phn tch ting ni trong khng gian Wavelets

4.1.2 Php bin i Wavelet a mc Php bin i Wavelet a mc l qu trnh thc hin php bin i Wavelet 1 mc to nn 2 thnh phn A v D, sau ta tip tc qu trnh ny trn thnh phn A (thnh phn tn s thp ca tn hiu). Ta s c cy phn tch Wavelet biu din bi hinh 2.14 sau:

Chng ta c th xy dng li tn hiu ban u bng cch thc hin kt hp ln lt cc tn hiu Aj v Dj (i = J.... 1, J l s mc phn tch). Qu trnh phn tch v tng hp Wavelet a mc c biu din bi hinh 2.15:

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

25

Ni dung: Phn tch ting ni trong khng gian Wavelets

Nh vy, chng ta s cho tn hiu D, A ln lt qua cc b lc Highpass H' v b lc Lowpass L' tng ng (thc cht l tnh tch chp gia D vi b lc FT v tich chap giua A vi b lc L'), ket qua ta s thu c tn hiu ban u. Vi du: Cho tn hiu S. Php phn tch Wavelet (mc 6) tn hiu S nh sau:

Sau khi phn tch Wavelet, qu trnh xy dng li tn hiu ban u S nh sau: S = A6 + D6 + D5 + D4 + D3 + D2 + D,. 4.2 Php bin i Fast Wavelet (Fast Wavelet Transform - FWT) FWT l php bin i nhm lm tng hiu qu tnh ton ca DWT. Php bin i DWT ca hm f(x) c xc nh bi:
Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

26

Ni dung: Phn tch ting ni trong khng gian Wavelets

c xc nh bi cng thc (2.3.1) v (2.3.2) c gi l cc h s Approximation v Detail. V bin i DWT ngc (IDWT) c xc nh bi cng thc:

Thc hin: Minh c- Quang Thng- Tin Hiu- Trung Kin

27

Trong : - f(x), j 0,k ( x) v j , k ( x ) l cc hm ca cc bin ri rc x = 0,1,... ,M-1. Thng thng chn j0 = 0, M= 2J => J = log2M. Khi do: x = 0,1,...M-l; j = 0,1,...,J-1 v k = 0,1,...2j - 1. -
j,k

(x) l hm Baby Scaling xc nh bi cng thc: Ta c cng thc khai trin ca hm Scaling (x) nh sau:

Cc h s h(n) c gi l cc h s ca hm Scaling (x) v h l vecto Scaling. Thay x bi 2jx-k vo cng thc (2.6) v t m = 2k + n, ta c: Ta c cng thc khai trin ca hm (mother) Wavelet (x): Cc h s h,,,(n) c gi l cc h s ca hm Wavelet \|/(x) v hv l vecto Wavelet.

Nhiu nh phn tch tn hiu nhn ra c nhc im ca bin i fourier thi gian ngn STFT(Short Time Fourier Transform) i hi cn phi c gii quyt. V vy ngi ta s dng bin i wavelets gii quyt cc vn ca STFT. Wavelet l mt dng sng hiu qu trong mt khong thi gian so vi gi tr bnh qun lch khng . Do s bt li ca STFT l khng xc nh c chnh xc thi gian i vi tn hiu c tn s cao, vi kch thc ca s qu nh. Nhng nu ca s c thit lp qu nh th thay vo l bt nh v tn s li ln. Nh vy ngi ta thy rng cn phi c mt cch tip cn khc. Thay vo vic gi cho kch thc ca ca s c nh v lm y n vi nhng dao ng c tn s khc nhau, gii php ny gi cho s nhng s dao ng l c nh v thay i chiu rng ca ca s.

Hnh 1: M hnh phn tch tn hiu s dng wavelets.

Phn tch wavelet l mt phin bn ca k thut ca s, nhng vi mt kch thc ca s khc nhau. N cho php cho s s dng ca s di hn khi c thng tin tn s thp v nhng ca s ngn hn vi nhng thng tin tn s cao. Phng php phn r tn hiu da trn bin i wavelets ng nng ca lung khng kh trong ting ni c th ch ra mc hn lon da trn hc thuyt fractal, tn hiu ting ni l tng cc quy trnh ngu nhin t tng ng. Hnh 2 di y ch ra m hnh phn r tn hiu da trn wavelets, vic chia bng tn thnh cc bng tn con c thc hin nh sau: Tn hiu gc c a qua hai b lc thng thp v thng cao nhm tch ra h s tng quan v h s chi tit ca tn hiu gc. Qu trnh phn tch c lp li c c nhiu h s nhm thu c thng tin tn hiu gc chi tit nht, mi mt h s ny ng vi mt di tn ca tn hiu gc. Qua mc ta c th tch ra cc mc nhiu v mc m vi cc c tnh khc nhau cho php nhn dng tn hiu hu ch.

Trong : S (Orginal Signal) : Tn hiu gc,

L (Low pass filter): B lc thng thp

H (High pass filter): B lc thng cao Hnh 2: M hnh phn r tn hiu da trn wavelets Qu trnh tin x l tn hiu l chuyn ting ni t dng sng (wave form representation) sang dng biu din tham s (parametric form representation). Cc tham s biu din tn hiu ting ni c th l: nng lng thi gian ngn (short time energy), t l qua im khng (zero crossing rate) t l qua mc (level crossing rate) . c nhiu cch dng trch chn ra c nhng thng tin v m thanh mt cch trc tip t nhng tn hiu s ca ting ni v hiu qu hn l phng php biu din tn hiu theo ph c s dng rng ri. Phn tch ph theo phng php m d on tuyn tnh LPC c nghin cu nhm trch chn ra cc c tnh c bn ca tn hiu ting ni, l nhng tham s u vo cho h thng nhn dng ting ni. Phng php ny biu din mu ting ni ti thi im n, x(n) bng php xp x tuyn tnh p mu qu kh: x(n) a1x(n - 1) + a2x(n - 2) + ...+ apx(n - p)

Trong , x(n) l mu d on ti thi im n v cc h s a1, a2, ..., ap c coi l cc hng s trn khung (frame) phn tch ting ni. Thm thnh phn kch thch Gu(n) th:

Trong u(n) l ngun kch thch chun ho v G l h s khuych i kch thch. Ngun kch thch chun ho em nhn t l vi h s khuych i G v a vo h thng ton im cc to ra tn hiu ting ni. Mt khc, ta bit rng hm kch thch phi l xung tun hon (i vi m hu thanh) hoc l ngun nhiu ngu nhin (i vi m v thanh). i vi m hnh ny, ngun kch thch l mt cng tc c iu khin bi c tnh hu thanh/v thanh ca ting ni tng ng cho php chn xung tun hon hoc nhiu ngu nhin. Mc khuch i kch thch G c nh gi trc tip t tn hiu ting ni. i vi m hnh LPC, cc tham s l s phn lp m hu thanh/m v thanh, v tr nh m tit v cc h s b lc {ak}. Nu gi l t hp tuyn tnh ca cc mu qu kh:

th

sai

tin

k =1

on

tuyn

tnh

nh

ngha:

e ( n ) = x ( n ) x ( n ) = x ( n ) ak x ( n k ) Vn c bn ca phng php LPC l xc nh cc h s tin on sao cho cc c tnh ph ca b lc s trong m hnh tng hp ting ni ph hp vi dng sng ting ni trong ca s phn tch. Do c tnh bin thin theo thi gian ca ph tn hiu ting ni nn cc h s tin on ti thi im n phi c nh gi trong khong thi gian ngn gn vi n. Do , cch tip cn c bn xc nh cc h s tin on l ti thiu ho sai s bnh phng tin on tuyn tnh trong mt on sng ting ni ngn. Thng thng, trong x l ting ni php phn tch ph thi gian ngn c thc hin trn cc khung ting ni lin tip vi khong cch mi khung l 10 ms.

TI LIU THAM KHO


1. LAWRENCE R. RABINER, RONALD W. SCHAFER, Digital Processing of Speech Signals, Prentice-Hall, Inc. Englewood Cliffs, NewJersey, 1978. 2. LAWRENCE R. RABINER, BIING-HWANG JUANG, Fundamentals of Speech Recognition, Prentice-Hall PTR, Englewood Cliffs, NewJersey, 1992. 3. GORDON E. PELTON, Voice Processing, McGraw-Hill, Inc. 1991. 4. LAWRENCE R. RABINER, Acoustics Research Laboratory, Bell Telephone Laboratories, Murray Hill, NewJersey, 1999 5. Collins Cobuild English Language Dictionary, Collins London and Glasgow, 1st Published 1987 Reprinted 1988. 6. T in Ting Vit, Vin Ngn ng hc, H Ni, 1992.

You might also like