You are on page 1of 5

Kho st tnh n nh ca mt s c trng ng m trong nhn dng ngi ni

To Study Stability of Some Features of Speech in Speaker Recognition


Ng Minh Dng, ng Vn Chuyt
Abstract: This paper presents the results of studying stability of the fundamental frequency and the first four formant frequencies in speaker recognition. The results were learned from the experiment of 35 people speaking Vietnamese in laboratory condition. Finally the use of these in the world of forensics is examined.

c chia ra lm hai loi: Nhn dng vi t kho bit trc (Text dependent Identification) v nhn dng vi t kho cha bit trc (Text Independent Identification). C hai phng php nhn dng ngi ni l : nhn dng theo khong cch ( ging nhau) cc tham s v nhn dng theo phng php thng k trung bnh cc c trng ph trong thi gian ln. Thng phng php khong cch c s dng trong nhn dng ngi ni vi t kho bit trc, cn phng php thng k th ngc li, thng dng trong nhn dng ngi ni vi t kho cha bit trc. Trong nhn dng ngi ni, vn quan trng nht l trch chn c cc c trng khu bit trong cu m mang tnh n nh i vi mi ngi. L tng nht l cc c trng ny phi tho mn cc yu cu sau: D o lng n nh theo thi gian i vi mi ngi nhng phi thay i t ngi ny sang ngi khc t nh hng bi mi trng xung quanh Khng d b bt chc hoc c tnh thay i Xut hin mt cch t nhin v thng xuyn khi ni. y l mt vn kh v phc tp bi l ting ni t nhin pht ra khng ch ph thuc vo tm sinh l ngi ni m cn ph thuc nhiu yu t x hi nh phng ng hay tnh trng sc kho khi ni... Trn th gii, mc d nhn dng ngi ni thu c nhng kt qu nht nh nhng vn cn nhiu vn cn gii quyt nhm nng cao kh nng nhn

I. H NHN DNG NGI NI Ting ni t nhin ngoi thng tin ng ngha m ngi ni mun truyn t cho ngi nghe (thng tin c th ghi li di dng ch vit) cn cha nhng thng tin khc nh trng thi tnh cm khi ni, hay nhng thng tin ring ca ging ngi ni ... Cc thng tin ring bit ny c chia lm hai loi: mc cao v mc thp. Mc cao bao gm cc thng tin nh phng ng, ng cnh, phong cch ni ... v mc thp l cc thng tin nh cc formant, b rng formant, cao (pitch), trng ... Cc thng tin mc cao d nhn ra bng thnh gic nhng rt kh nh lng, cn cc thng tin mc thp th ngc li d nh lng hn. Cc thng tin ny i vi mi ngi tuy khng phi l bt bin t lc ngi bit ni n lc cht gi, nhng n c tnh kh n nh trong nhng giai on di ca cuc i. Khi con ngi trng thnh, nhng thi tt khi ni, nhng c trng khu bit trong cu m s hnh thnh v mang tnh n nh cao. Hin nay, hu ht cc h nhn dng ngi ni ch yu da vo cc thng tin thuc tnh mc thp do tnh d nh lng ca chng. S khi ca mt h nhn dng ngi ni c th hin trn hnh 1. Ph thuc vo thut ton nhn dng, cc h nhn dng ngi ni

dng. Hu ht cc vn ny ny sinh t tnh khng n nh ca tn hiu ting ni ngay trong bn thn mi ngi khi ni v s bin i trong qu trnh ghi v truyn tn hiu ting ni nhng iu kin khc nhau.
Tn hiu ting Tin x l

(khng, huyn, hi, ng, sc, nng). Khng ging vi nhiu ngn ng khc nh ting Anh... l ngn ng a m tit v khng c thanh iu. Thanh iu l thnh phn siu on tnh ca mi m tit, l s thay i tn s m c bn theo nhng quy lut nht nh trong qu trnh pht m m tit. Cn vic thay i tn s m c bn trong ngn ng a m tit phn nhiu thc hin trn ton b cu v th hin trng m ca cu ni. Quy lut thay i ca F0 ng vi mi loi thanh iu c nhiu cng trnh nghin cu v ng m trnh by. Theo cc ti liu ny mi m tit trong ting Vit u i lin vi mt thanh iu, l s thay i theo quy lut nht nh ca tn s m c bn ng vi mi thanh iu khi pht m m tit. Do vy, trng thi bnh thng tn s m c bn khi pht m khng ch ph thuc vo cu to sinh l ca i dy thanh mi ngi m cn ph thuc thanh iu ca m tit pht ra. Tuy nhin, khi nghe mt ngi ni ta vn cm nhn c cao trung bnh ca ging ni ngi . Do vy, tn s m c bn trung bnh theo thi gian c th l i lng c tnh n nh i vi mi ngi. c th nh gi n nh ca tn s m c bn trung bnh i vi mi ngi, chng ti tin hnh kho st mt s ging ni pht m trong khong thi gian di v c nhn xt sau: Tn s m c bn trung bnh khng phi l mt i lng hon ton bt bin i vi mi ngi. Tin hnh kho st trn 20 ngi n ng v 15 ngi n b tui 25 - 55, ni t do trong khong 3 - 5 pht. Ct ra cc khong thi gian 2 , 3 , 4 , 5 , 6 , 8 , 10 , 15 giy tnh F0 trung bnh. Phm vi thay i F0 ca mi ngi trong mi khong thi gian kho st c xc nh bng lch chun ca phn b thng k F0 trung bnh trong khong thi gian . Qua kho st chng ti thy: Thi gian tnh tn s m c bn trung bnh cng di, phm vi thay i ca F0 trung bnh (F0tb) cng c xu hng gim dn. Ging nam , thi gian tnh trung bnh t 5 giy tr ln, tn s m c bn trung bnh thay i trong phm vi khong 12 Hz.

Trch chn cc c trng Lu cc vector c trng

Cc vecter c trng ca ngi ni

Phn lp, nhn dng


Ra quyt nh nhn dng ngi ni Hnh 1. S khi h nhn dng ngi ni

Khi s dng phng php nhn dng ngi ni theo khong cch cc tham s th vn nghin cu tnh n nh ca cc tham s ng m i vi mi ngi lun phi t ra. Bi vit ny ch yu tp trung vo vic nh gi tnh n nh ca tn s m c bn trung bnh v cc tn s formant ca ngi ni ting Vit qua kho st thc t ca 35 ngi ni trong iu kin phng th nghim. Thit b s dng xc nh tn s m c bn v cc formant l phn mm phn tch ph m thanh CSL 4300B v thit b Sonagraph ca hng KAY (M). II. KT QU KHO ST TNH N NH MT S C TRNG NG M TING VIT 1. Tn s m c bn: Tn s m c bn (F0) c nh ngha l tn s dao ng ca i dy thanh khi ni, n th hin cao ca ging ni (pitch). Mt c trng c bn quan trng ca ting Vit l mt ngn ng n m tit v c 6 thanh iu

Ging n, thi gian tnh trung bnh t 6 giy tr ln, tn s m c bn trung bnh thay i trong phm vi khong 16 Hz. Kt qu kho st phm vi thay i trung bnh ca F0 ca 20 ngi n ng v 15 ngi n b trong cc khong thi gian khc nhau th hin trong bng 1 (F0tb trong bng c tnh theo khong thi gian 6 giy). 2. Cc formant: Cc formant c nh ngha l cc tn s cng hng ca tuyn pht m, do vy chng lin quan trc tip ti hnh dng, kch thc ca c quan cu m v v th, chng s cung cp nhiu thng tin c trng v ngi ni. Theo sch Ng m ting Vit, m tit trong ting Vit bao gm 3 phn on chnh: m u, m chnh, m cui. v 1 phn on ph l m lt hay m m. Mi on c th hin bng mt m t. m chnh lun l cc nguyn m cn cc thnh phn khc
Bng 1. Kho st phm vi thay i trung bnh ca F0 F0tb 2s (Hz) (Hz) Nam 132,2 43,4 N 215,3 47,5 3s (Hz) 37,6 40,2 4s (Hz) 26,5 31,4 5s (Hz) 12,3 23,6 6s (Hz) 10,7 16,3 8s (Hz) 12,1 14,3 10s (Hz) 10,9 15,6

vi m u hoc m cui. S nh hng ca m u ln cu trc formant ca m chnh t hn rt nhiu so vi nh hng ca m cui. Trong cc loi m cui s nh hng ca cc m mt li nh nh/ng ln cu trc formant ca m chnh mnh hn s nh hng ca cc m u li nh n hay m mi nh m. Do vy, khi xt n tnh n nh ca cc tn s formant chng ti ch xt trn cng mt nguyn m, trong cng mt m tit ca cng mt ngi . V d m i trong m tit bim c kho st 5 ln trong 5 ln pht m khc nhau ca cng mt ngi. Cc tn s formant c tnh theo gi tr trung bnh trong khong thi gian 100ms xung quanh im bin ln nht ca m tit. Vi mi ngi, phm vi bin i ca tng formant trong 5 ln pht m ca cng mt m tit c xc nh nh sau:

Di = Fij -max - Fij -min


Vi : i = 1 4 (4 formant u)
15s (Hz) 9,6 16,1

Fij-max , Fij-min l cc gi tr ln nht v nh nht ca Fij tnh theo j (max v min ca formant th i trong 5 ln pht m).

thng l cc ph m hay bn nguyn m. Trong mi m tit, m chnh lun lun c mt, cn cc thnh phn khc c th c hoc khng. Cu trc formant th hin r nht ti m chnh, nng lng ca m tit cng tp trung ch yu y, biu th bng vng bin ln nht trn tn hiu sng m ca mi m tit. kho st tnh n nh ca cu trc formant i vi mi ngi, chng ti tp trung vo 3 nguyn m chnh l : u, a v i. Cc nguyn m ny c kho st khi pht m ring l, khi i cng mt s m u (nh bi, ni, ba, na, bu, nu ....) v khi i cng mt s m cui (nh bin, bim, binh, ...). Trong phm vi bi vit ny, chng ti cha kho st n cc m m v nguyn m i. Nhng ngi tham gia thc nghim phi pht m cc m tit ny mt cch ri rc v lp li 5 ln nhng thi gian khc nhau. Qua kho st s b chng ti nhn thy: Cu trc formant ca nguyn m b thay i khi i

V bin i tng i tng ng c xc nh l: Qua kho st, chng ti thy cc tn s formant khng phi l n nh hon ton i vi mi ngi m dao ng trong mt phm vi nht nh. Tu thuc mi ngi, cc tn s formant c phm vi bin i khc nhau.

Cc formant vng tn s cng cao th phm vi bin i tuyt i cng ln, tuy nhin phm vi bin i tng i li c xu hng gim dn. Bng 2 th hin phm vi bin i tuyt i v phm vi bin i tng i khi chia di tn kho st ra thnh 5 on. Ta c th tm ra quy lut thay i ca phm vi bin i tng i cc formant trn trc tn s khi xy dng mt hm hi quy vi sai s nh nht t phn b thng k ca phm vi bin i tng i theo tn s. Chng

ti s quay li vn ny trong mt bi bo khc. Cn c vo kt qu trn ta c th a ra nhn nh : Vi formant th nht (F1) nm trong khong t 300 - 800 Hz, phm vi bin i tng i khng vt qu 11,8%. Vi formant th hai (F2) nm trong khong t 800 2400 Hz, phm vi bin i tng i khng vt qu 9,5%. Vi formant th ba (F3) nm trong khong t 2200 - 3300 Hz, phm vi bin i tng i khng vt qu 6,9%. Vi formant th t (F4) nm trong khong t 3100 4500 Hz, phm vi bin i tng i khng vt qu 5,8%. kim nh nhn nh ny chng ti to mt tp
Bng 2: Kt qu kho st phm vi bin i tuyt i v phm vi bin i tng i cc tn s formant Di 300 800 1600 2600 3600 tn 800 1600 2600 3600 4500 (Hz) D (Hz) 20 - 120 40 - 170 60 240 70 - 270 80 - 300 T (%) 3 9-11 8 3 5 9 5 3 3 6 6 2 5 6 9 1 8 5 8

nh v mu lun khc nhau. Kt qu kho st tnh n nh ca tn s m c bn v cc formant trn c chng ti s dng xy dng mt quy trnh gim nh ngi ni bng phng php th cng nh sau: Bc 1: S dng c quan thnh gic nhn dng ngi ni da trn cc thng tin mc cao ca ting ni nh phng ng, phong cch ni ... ca c 2 bng. Nu thy c nhiu im tng ng s chuyn sang bc 2, ngc li th kt lun ph nh (khng ng nht) v dng. Bc 2: Xc nh tn s m c bn trung bnh trong thi gian pht m lin tc khong 5 n 6 giy ca cc ting ni trong 2 bng. Nu sai khc nh hn 12 Hz (vi ging nam) v 16 Hz (vi ging n) th chuyn sang bc 3, ngc li th kt lun ph nh (khng ng nht) v dng. Bc 3: Tm nhng m tit c lp ging nhau gia ting ni trong 2 bng, xc nh cc formant ca chng. Hai m tit c coi l ng nht nu cc formant tng ng sai khc nhau khng qu 12% i vi F1, 10% i vi F2, 7% i vi F3, v 6% i vi F4. Nu s lng m tit ng nht nhiu hn 10 th kt lun khng nh (ting ni trong 2 bng l do cng mt ngi ni). Nu c t 5 n 10 m tit ng nht th kt lun kh nng (ting ni trong 2 bng c kh nng do cng mt ngi ni). Nu s m tit ng nht t hn 5 th khng a ra kt lun hoc kt lun t c kh nng ting ni trong 2 bng do cng mt ngi ni.
p dng quy trnh trn cho 107 yu cu gim nh

th gm 50 ngi ni cc t chun nh khi kho st. Khi so snh ng thi c 4 formant theo phm vi: F1 12%F1 , F2 10%F2, F3 7%F3 v F4 6%F4 ca khng qu 2 m tit bt k trong tp th chng ti thy nhn dng ra ngi ni trong tp th . T kt qu kim th ny c th suy ra rng: nhn dng ngi ni trong mt tp ln hn th s m tit cn s dng so snh cng phi tng ln. Vn cn bao nhiu m tit l chng ti s cp trong mt bi bo khc. III. NH GI KT QU Mt ng dng quan trng ca h nhn dng ngi ni l nhn dng ti pham hnh s, cn gi l gim nh ting ni t php. Trong gim nh ting ni ngi ta cn xc nh ting ni trong bng/ a cn gim nh v ting ni trong bng/ a mu c phi l do cng mt ngi ni ra khng. Bng mu thng c gng c ghi trong iu kin ging nh khi ghi bng gim nh. Ni chung, ni dung ca 2 bng cn gim

ngi ni chng ti thu c kt qu sau: S ging ni khng yu t gim nh (do s lng m tit trng nhau c th so snh c vi nhau qu t l: 32). Do vy, s ging ni thc c s dng nh gi ch cn l 75 ging. Trong : Kt lun khng nh (ng nht) : 46 Kt lun ng : 46 Kt lun sai : 0 Kt lun ph nh (khng ng nht) : 29 Kt lun ng : 21

Kt lun sai : 8 T l kt lun ng t : (46+21)/75 = 89% IV. KT LUN xy dng mt h nhn dng ngi ni cn rt nhiu vic phi lm. Trn y mi ch l mt s kt qu ban u v nh gi n nh ca tn s m c bn v bn formant u ca ngi ni ting Vit. Hin cc kt qu trn vn ang c chng ti p dng mt cch th cng trong cng tc gim nh ting ni ti Vin KHHS. Hy vng cc nghin cu tip theo v tnh n nh ca cc c trng ng m khc cng vi tng bc t ng ho quy trnh trn s gip xy dng mt h nhn dng ngi ni ting Vit trong thi gian ti. TI LIU THAM KHO
[1] ON THIN THUT - Ng m ting Vit - NXB i hc Quc gia H Ni - 1999.

[2] HONG KIM, .... Nhn dng cc phng php v ng dng, NXB Thng K , 1992 [3] F.J.OWENS, Signal Processing of University of Ulster - Macmillan - 1990. Speech,

[4] HARRY HOLLIEN, Forensic voice identifycation, Academic press, Harcount Science and Technology Company, USA - 2002 [5] EVGENY KARPOV, Real time speaker identification, Masters Thesis, Department of Computer Science, University of Joensuu, 2003

Ngy nhn bi: 17/10/2003

S LC TC GI NG MINH DNG Sinh nm 1962 ti H Ni. Tt nghip i hc Bch khoa H Ni nm 1984, nhn bng Thc s Cng ngh thng tin nm 2001. Hin ang lm nghin cu sinh ti i hc Bch khoa H Ni. Ni cng tc: Vin Khoa hc Hnh s - B Cng An. Hng nghin cu: ng dng l thuyt nhn dng trong k thut hnh s, nhn dng ging ni. Email: dung-oanh@fpt.vn NG VN CHUYT Sinh ngy 23/2/1953 ti Thi Bnh. Tt nghip i hc Bch khoa H Ni nm 1975. Tt nghip Thc s, Tin s ti i hc Grenoble, Cng ha Php nm 1984 v 1986.

Hin ang cng tc ti Khoa Cng ngh thng tin, H Bch khoa H Ni Lnh vc nghin cu: ASIC v ghp ni, x l tn hiu c bit l x l ting ni. Email: chuyetdv@it-hut.edu.vn

You might also like