You are on page 1of 75

N TT NGHIP

TI
X L TING NI
SVTH: NGUYN TH NGC DIP
n tt nghip
CNG HA X HI CH NGHA VIT NAM
c lp T do Hnh phc
LI CAM OAN
Knh gi: Hi ng bo v n tt nghip Khoa in t _ Vin thng _
Trng i hc Bch Khoa Nng.
Em tn l: Nguyn Th Ngc Dip
Hin ang hc lp 04T1- Khoa: in t - Vin thng Trng: i hc
Bch Khoa Nng.
Nhm em xin cam oan ni dung ca n ny khng phi l bn sao chp
ca bt c n hoc cng trnh c t trc.
Sinh vin thc hin
Nguyn Th Ngc Dip
SVTH: Nguyn Th Ngc Dip Trang 2
2
n tt nghip
MC LC
LI CAM OAN.....................................................................................................2
MC LC................................................................................................................3
DANH MC CC T VIT TT V CC THUT NG TING ANH...........5
M U..................................................................................................................7
CHNG 1: TNG QUAN V NNG CAO CHT LNG TING NI......11
CHNG 2 : NH GI CH T L NG TI NG NO I.....................................28
CHNG 3: THUT TON SPECTRALSUBTRACTION V WIENER
FILTERING............................................................................................................39
CHNG 4: THC HIN V NH GI CC THUT TON.....................52
KT LUN N V HNG PHT TRIN TI..................................74
PH LC...............................................................................................................75
DANH MC CC HNH V V BNG
Hnh 1.1 Tn hiu ting ni [2]...............................................................................13
Hnh 1.2 Dng v s phn b ph nng lng trung bnh nhiu trn xe [4]...........16
Hnh 1.3 Dng v s phn b ph nng lng trung bnh ca nhiu trn tu [4].. .16
Hnh 1.4 Dng v s phn b ph nng lng trung bnh ca nhiu trong nh
hng[4]....................................................................................................................17
Hnh 1.5 Mc nhiu v ting ni (c o bng SPL dB) trong cc mi trng
khc nhau [4]...........................................................................................................18
Hnh 1.6 Mu ting ni eee c ly mu vi tn s ly mu 8kHz [11]...........23
SVTH: Nguyn Th Ngc Dip Trang 3
3
n tt nghip
Hnh 1.7 Dng sng tn hiu ting ni ca cu The wife helped her husband v
dng sng ca ph m f trong t wife, dng sng ca on nguyn m er
trong t her [11]..................................................................................................25
Hnh 1.8 mt ct dc ca c quan to ting ni [11]..............................................26
Hnh 1.9 m hnh k thut to ting ni[11]...........................................................26
Hnh 1.10 bng phn loi m v trong ting Anh ca ngi M [11].....................27
Ba ng 2.1.Thang i m a nh gia ch t l ng ti ng no i theo MOS [12].....................29
Ba ng 2.4. Thang i m a nh gia ch t l ng ti n hi u ti ng no i theo CCR...............30
Ba ng 2.5. Thang a nh gia DCR..............................................................................30
Hnh 3.1 S khi cho hai thut ton SS v WF.................................................39
Hnh 3.2 S khi ca thut ton Spectral subtraction [26]................................43
Hnh 3.3 S khi ca thut ton Wiener Filtering.............................................46
Hnh 3.4 Phn tch tn hiu thnh cc frame [31]...................................................47
Hnh 3.5 qu trnh thc hin overlap v adding [32]..............................................48
Hnh 4.1. S thc hin v nh gi thut ton tng cng................................53
Hnh 4.2 Lu thut ton SS................................................................................55
Hnh 4.3 Lu thut ton WF..............................................................................56
Hnh 4.4 dng sng v spectrogram ca tn hiu sch............................................57
Hnh 4.5 Dng sng v ph ca tn hiu b nhiu xe hi vi SNR = 10dB............57
Hnh 4.6 Dng sng v spectrogram ca tn hiu sau khi x l nhiu xe hi bng
SS vi SNR = 10dB................................................................................................58
Hnh 4.7 Dng sng v spectrogram ca tn hiu sau khi x l nhiu xe hi bng
WF vi SNR = 10dB...............................................................................................58
Hnh 4.8 Quy trnh thc hin nh gi....................................................................60
Hnh 4.9. th kim tra n nh ca nh gi OE i vi nhiu xe hi ........61
SVTH: Nguyn Th Ngc Dip Trang 4
4
n tt nghip
Hnh 4.10. th kim tra n nh ca nh gi OE i vi nhiu ngi ni
xung quanh .............................................................................................................61
Hnh 4.11 th nh gi Objective vi h s IS=0.2, NoiseMargin=3................63
Hnh 4.12 th nh gi Objective vi h s IS=0.15, NoiseMargin=2..............64
Hnh 4.14 th nh gi objective vi h s alpha=0.5, 0.8,0.9 vi IS=0.15 v
NoiseMargin = 2.....................................................................................................66
Hnh 4.15 th nh gi objective vi h s gamma = 1 v gamma = 2.............67
Hnh 4.16 th nh gi vi IS=0.15 NoiMargin= 2 v alpha = 0.8 cho thut ton
WF, gama=1 cho thut ton SS...............................................................................69
Hnh 4.17 th nh gi OE vi nhiu ngi ni xung quanh............................70
DANH MC CC T VIT TT V CC THUT NG TING
ANH
T vit
tt
Ting Anh Ngha ting Vit
SNR Signal Noise Ratio T s tn hiu trn nhiu
PC Personal Computer My tnh c nhn
SVTH: Nguyn Th Ngc Dip Trang 5
5
n tt nghip
SPL Sound Pressure Level Mc p sut ca m thanh
MMSE Minium Mean-Squared Error
Ti thiu ho sai lch trung
bnh bnh phng
SVD Singular Value Decomposition Php phn tch gi tr n
DFT Discrete Fourier Transform Php bin i Fourier ri rc
FFT Fast Fourier Transform Php bin i Fourier nhanh
DTFT Discrete-Time Fourier Transform
Php bin i Fourier ca tn
hiu ri rc.
ZT Z Transform Php bin i Z
ROC Region of Convergence Min hi t
IDTFT
Inverse Discrete Fourier
Transform
Php bin i ngc Fourier
ri rc
LTI Linear Time-Invariant
H thng tuyn tnh v bt
bin theo thi gian
ITU-T
InternationalTelecommunications
Union-Telecommunication
Hip hi tiu chun vin thng
quc t
ACR Absolute Categories Rating nh gi theo gi tr tuyt i
MOS Mean Opinion Scores
nh gi theo quan im
ngi nghe
CCR Comparison Category Rating nh gi bng cch so snh
DCR Degradation Category Rating nh gi suy gim cht lng
SE Subjective Evaluation nh gi ch quan
OE Objective Evaluation nh gi khch quan
IS Itakura_Saito
LLR Log likehook Raito
WSS Weighted Spectral Slope o theo trng s ca ph
LPC Linear Prediction Coefficients H s d oa n tuy n ti nh
VAD Voice Activity Detection
Thm d s hot ng ca
ting ni
Speech Enhancement Nng cao cht lng ting ni
SS Spectral Subtraction
Thut ton gim nhiu tn hiu
ting ni bng phng php
tr ph.
WF Wiener Filter
Thut ton gim nhiu tn hiu
ting ni bng cch s dng
b lc Wiener.
SVTH: Nguyn Th Ngc Dip Trang 6
6
n tt nghip
Statistical-model-based
Thut ton gim nhiu tn hiu
ting ni da trn nguyn l
thng k
Frame
Khung tn hiu.
Hamming Ca s Hamming
Overlap v Adding Xp chng v cng
M U
Trong cuc sng, ting ni ng mt vai tr rt quan trng i vi con
ngi. Cng vi ting ni l s xut hin ca rt nhiu cc loi dch v thoi nh
ngy nay. Tuy nhin vic bo ton c tn hiu ting ni trn cc dch v ny l
iu v cng kh khn do s mt mt v suy gim tn hiu v nht l nh hng
ca nhiu s lm cho tn hiu ting ni khng cn nh ban u. V l do m cc
thut ton v Speech Enhancement ra i. Tuy khng th bo ton c y nguyn
tn hiu ban u nhng s dng cc thut ton ny ta c th tng cng c cht
lng ting ni v gim bt nhiu nn tn hiu sau khi x l n ngi nghe
vn mang y ni dung thng tin v khng gy kh chu bi nhiu i vi
ngi nghe. V vy, Speech Enhancement ng mt vai tr rt quan trng trong
lnh vc thoi.
Xut pht t thc t ny nhm bt tay vo tm hiu v Speech
Enhancement, nghin cu cc thut ton ca n thc hin v nh gi hiu qu
ca cc thut ton trong mi trng thc t.
thc hin c n, nhm phn chia thnh 3 phn tng ng vi 3
thnh vin :
SVTH: Nguyn Th Ngc Dip Trang 7
7
n tt nghip
- Nguyn Ngc Trung : nghin cu v thc hin thut ton x l ting ni s
dng phng php Spectral Subtraction.
- Nguyn Phc Nguyn : nghin cu v thc hin thut x l ting ni s
dng b lc Wiener.
- Nguyn Th Ngc Dip : nghin cu v thc hin cc phng php nh gi
t cc kt qu t c ca 2 thut ton trn trong mi trng thc t.
thc hin c ni dung phn ca em th n ca em c kt cu
thnh 2 phn, gm 5 chng :
Phn 1 : L thuyt
Chng 1 : Tng quan v nng cao cht lng ting ni. Chng ny gii
thiu mt s khi nim c bn v tn hiu s, cc php bin i, tm hiu v cc
loi nhiu , tn hiu ting ni v s hnh thnh ting ni. Bn cnh cn gii
thiu khi qut v mt s thut ton trong Speech Enhancement .
Chng 2 : nh gi cht lng ting ni. Chng ny gii thiu mt s
phng php nh gi hiu qu ca thut ton gim nhiu trong ting ni. Gm c
nh gi ch quan v nh gi khch quan.
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering. Chng
ny i su vo nghin cu nguyn l c bn ca tng thut ton.
Phn 2 : Thc hin v nh gi
Chng 4 : Thc hin v nh gi thut ton. Chng ny trnh by cc kt
qu nhm lm c gm c thc hin gim nhiu tn hiu ting ni bng hai
thut ton nghin cu chng 3. ng thi so snh kt qu thu c bng
cch dng cc phng php nh gi c gii thiu chng 2
Phng php nghin cu ca n l xy dng lu ca thut ton, thc
hin x l ting ni bng cc thut ton . Da trn cc kt qu t c sau khi
x l, sau s dng cc phng php nh gi khch quan nh gi tnh hiu
qu ca cc thut ton x l trong mi trng thc t.
n ca nhm thc hin c 2 thut ton x l ting ni trong Speech
Enhancement v a ra c cc kt qu nh gi khch quan lm c s nh
SVTH: Nguyn Th Ngc Dip Trang 8
8
n tt nghip
gi tnh hiu qu ca 2 thut ton trn. chnh l im mi trong n ca
nhm so vi cc n c trc trong cng ch nghin cu.
SVTH: Nguyn Th Ngc Dip Trang 9
9
n tt nghip
SVTH: Nguyn Th Ngc Dip Trang 10
10
Chng 1 : Tng quan v nng cao cht lng ting ni
CHNG 1: TNG QUAN V NNG CAO CHT LNG TING NI
1.1 Gii thiu chng
Ni dung ca chng trnh by mc ch ca nng cao cht lng ting ni
l g, cc loi nhiu trong ting ni, cch hnh thnh ca ting ni v cc c im
cu tn hiu ting ni. Chng ny cn gii thiu khi qut v cc thut ton s
dng trong speech enhancement.
1.2 Nng cao cht lng ting ni l g ?
Nng cao cht lng ting ni lin quan n vic ci thin cm nhn i vi
ting ni b suy gim cht lng do s c mt ca nhiu trong ting ni. Trong hu
ht cc ng dng, th mc ch ca nng cao cht lng ting ni l s ci thin
cht lng v tnh d nghe ca ting ni b suy gim do nhiu. S ci thin v
cht lng m tt th n lm gim i s kh khn cho ngi nghe khi nghe v
trong nhiu trng hp n cn gip cho ngi nghe c th nghe trong mi trng
c nhiu vi mc cao v nhiu tn ti trong thi gian di. Cc thut ton ng
cao cht lng ting ni lm gim v nn nhiu nn n mt mc no v n
c xem nh l cc thut ton nn nhiu.
Trong nhiu trng hp, s cn thit ca vic tng cng trong tn hiu ting
ni xut hin khi tn hiu ting ni hnh thnh trong vng c nhiu hoc nh hng
bi nhiu trong cc knh truyn thng. C rt nhiu kch bn yu cu t ra i vi
Speech enhancement trong nhiu trng hp khc nhau, v d i vi thng tin
thoi, trn cc h thng in thoi t bo th chu s nh hng nhiu nn t t,
nh hng,.. khi truyn n ch. Chnh v vy m cc thut ton trong nng cao
cht lng ting ni c th c s dng ci thin cht lng ca ting ni ti
im thu, mt khc, n c th c s dng trong cc khi tin x l ca h thng
m ho ting ni dng trong cc in thoi t bo chun [1]. Khi nhn dng ting
ni, ting ni b nhiu c tin x l bi cc thut ton nng cao cht lng trc
khi c nhn dng. Trong thng tin lin lc hng khng, cc k thut nng cao
ting ni cn c s dng ci thin cht lng v tnh d nghe ca ting ni
SVTH: Nguyn Th Ngc Dip Trang 11
Chng 1 : Tng quan v nng cao cht lng ting ni
ca phi cng b nh hng bi nhiu trong bung li. V vy m nng cao cht
lng ting ni cng rt cn thit trong thng tin lin lc ca qun s. Trong h
thng hi ngh qua thoi, th ngun nhiu xut hin mt vng no th n s
c truyn n tt c cc vng khc. Cc thut ton nng cao cht lng ting ni
c s dng nh tin x l hoc lm sch nhiu trong ting trc khi c
khuch i.
Nh cc v d minh ha trn th mc tiu ca cc thut ton tng cng tu
thuc vo cc ng dng m chng ta ang dng. Xt trn phng din l tng,
th chng ta mong mun Speech enhancement ci thin c c cht lng v tnh
d nghe hay s trong sut ca ting ni. Tuy nhin, xt trn phng din thc t
th cc thut ton Speech enhancement ch c th ci thin c cht lng ca
ting ni. N c th lm gim c nhiu nn trong ting ni nhng n s lm gia
tng thm mo ca tn hiu ting ni, chnh iu ny lm gim i tnh d nghe
ca ting ni. Do , yu cu chnh trong vic thit k mt thut ton Speech
enhancement phi m bo nn c nhiu v khng c gy ra mo trong s
cm nhn tn hiu ting ni.
Gii php tng qut trong cc vn ca Speech enhancement ph thuc rt
ln vo ng dng chng ta cn s dng, l cc vn nh l ngun nhiu v
giao thoa gy ra nhiu, mi lin h gia nhiu v tn hiu sch, s microphone v
cm bin c th c. S giao thoa c th xem nh l nhiu hoc c xem nh tn
hiu ting ni, n tu thuc vo mi trng ta ang xt, n c th c xem nh l
s tranh chp gia cc speaker. c tnh m nhiu c th c cng thm vo tn
hiu sch nu m thanh c hnh thnh trong cn phng b di m thanh. Hn
na, nhiu c th c tnh tng quan hoc khng tng quan v mt thng k vi
tn hiu sch. S lng microphone cng c kh nng nh hng n tnh hiu qu
ca cc thut ton Speech enhancement.
SVTH: Nguyn Th Ngc Dip Trang 12
Chng 1 : Tng quan v nng cao cht lng ting ni
1.3 L thuyt v tn hiu v nhiu
1.3.1 Tn hiu, h thng v x l tn hiu
1.3.1.1 Tn hiu
Tn hiu(signal) dng ch mt i lng vt l mang tin tc. V mt ton
hc, ta c th m t tn hiu nh mt hm theo bin thi gian, khng gian hay cc
bin c lp khc. Chng hn nh, hm: x(t) = 20t
2
m t tn hiu bin thin theo
bin thi gian t. Hay mt v d khc, hm: s(x,y) = 3x + 5xy + y
2
m t tn hiu l
hm theo hai bin c lp x v y, trong x v y biu din cho hai ta trong
mt phng [2].
Hai tn hiu trong v d trn v lp tn hiu c biu din chnh xc bng
hm theo bin c lp. Tuy nhin, trong thc t, cc mi quan h gia cc i
lng vt l v cc bin c lp thng rt phc tp nn khng th biu din tn
hiu nh trong hai v d va nu trn.
Hnh 1.1 Tn hiu ting ni [2].
Ly v d tn hiu ting ni l s bin thin ca p sut khng kh theo
thi gian. Chng hn khi ta pht m t away, dng sng ca n c biu din
nh hnh trn.
1.3.1.2 Ngun tn hiu
Tt c cc tn hiu u do mt ngun no to ra, theo mt cch thc no
. V d tn hiu ting ni c to ra bngg cch p khng kh i qua dy thanh
SVTH: Nguyn Th Ngc Dip Trang 13
Chng 1 : Tng quan v nng cao cht lng ting ni
m. Mt bc nh c c bng cch phi sng mt tm phim chp mt cnh/i
tng no . Qu trnh to tn hiu nh vy thng lin quan n mt h thng,
h thng ny p ng li mt kch thch no . Trong tn hiu ting ni, h thng
l h thng pht m, gm mi, rng, li, dy thanhKch thch lin quan n h
thng c gi l ngun tn hiu. Nh vy ta c ngun ting ni, ngun nh v cc
ngun tn hiu khc.
1.3.1.3 H thng v x l tn hiu
H thng l mt thit b vt l thc hin mt tc ng no ln tn hiu. V
d, b lc dng gim nhiu trong tn hiu mang tin c gi l mt h thng.
Khi ta truyn tn hiu qua mt h thng, nh b lc chng hn, ta ni rng x l
tn hiu . Trong trng ny, x l tn hiu lin quan n lc nhiu ra khi tn
hiu mong mun.
X l tn hiu l mun ni n mt lot cc cng vic hay cc php ton
c thc hin trn cc tn hiu nhm t mc ch no , nh l tch tin tc
cha bn trong tn hiu hoc l truyn tn hiu mang tin t ni ny n ni khc.
y ta cn lu n nh ngha h thng, n khng ch n thun l thit
b vt l m cn l phn mm x l tn hiu hoc l s kt hp gia phn cng v
phn mm. V d khi x l s tn hiu bng mch logic, h thng x l y l
phn cng. Khi x l bng my tnh s, tc ng ln tn hiu bao gm mt lot cc
php ton thc hin bi chng trnh phn mm. Khi x l bng cc b vi x l-h
thng bao gm kt hp c phn cng v phn mm, mi phn thc hin cc cng
vic ring no .
1.3.1.4 Phn loi tn hiu
Cc phng php ta s dng trong x l tn hiu ph thuc cht ch vo c
im ca tn hiu. C nhng phng php ring p dng cho mt loi tn hiu no
. Do vy, trc tin ta cn xem qua cch phn loi tn hiu lin quan n nhng
ng dng c th. Chng ta c th phn tn hiu thnh cc loi :
- Tn hiu nhiu hng v tn hiu a knh
- Tn hiu lin tc v tn hiu ri rc
SVTH: Nguyn Th Ngc Dip Trang 14
Chng 1 : Tng quan v nng cao cht lng ting ni
- Tn hiu bin lin tc v tn hiu bin ri rc
- Tn hiu xc nh v tn hiu ngu nhin
1.4 L thuyt v nhiu
1.4.1 Ngun nhiu
Nhiu mt hin thc, n tn ti mi ni, trn ng ph, trn xe, trong vn
phng, trong nh hng, trong cc to nh. N c th l ting xe chy trn ng,
ting n trn cc cng trng xy dng, ting n pht ra t cc qut chy trong
PC, chung in thoi, n tn ti vi cc hnh dng v hnh thc khc nhau
trong cuc sng hng ngy ca chng ta.
Nhiu c th hnh thnh mt ni c nh, v khng thay i theo thi gian,
v d nh l ting n pht ra t qut chy trong PC. Nhiu cng c th khng ng
yn mt ch, v d nh nhiu trong nh hng, l ting ni ca nhiu ngi xen
ln vi nhiu cch khc nhau vi ting n pht ra t nh bp. Cc c tnh v ph
cng nh thi gian ca nhiu trong nh hng thay i khng theo quy lut nn vic
nn nhiu trong cc mi trng c nhiu thay i nh vy s kh khn hn nhiu
so vi cc ngun nhiu ng yn khng thay i.
Cc c tnh c bit khc nhau ca cc loi nhiu l hnh dng ca ph
v s phn b ca nng lng nhiu trong min tn s. V d, nhiu gy ra bi gi
th nng lng ca n tp trung tn s thp di 500Hz. Nhng i vi nhiu
trong nh hng, trn xe, trn tu th khc, nng lng ca n c phn b trn
mt di tn s rng [3].
SVTH: Nguyn Th Ngc Dip Trang 15
Chng 1 : Tng quan v nng cao cht lng ting ni
Hnh 1.2 Dng v s phn b ph nng lng trung bnh nhiu trn xe [4].
Hnh 1.3 Dng v s phn b ph nng lng trung bnh ca nhiu trn tu
[4].
SVTH: Nguyn Th Ngc Dip Trang 16
Chng 1 : Tng quan v nng cao cht lng ting ni
Hnh 1.4 Dng v s phn b ph nng lng trung bnh ca nhiu trong
nh hng[4].
1.4.2 Nhiu v mc tn hiu ting ni trong cc mi trng khc nhau
im ti hn trong vic thit k cc thut ton ca Speech enhancement l s
nhn bit di bin thin ca ting ni v mc cng nhiu trong mi trng
thc t. T , chng ta c th m t min bin thin ca mc t s tn hiu
trn nhiu(SNR) c bt gp trong mi trng thc t. iu ny rt quan trng
nh gi tnh hiu qu ca cc thut ton Speech enhancement trong vic nn
nhiu v ci thin cht lng ca ting ni trong di bin thin ca mc SNR.
Mc ca ting ni v nhiu c o lng bng mc m thanh. Php
o lng y l o mc p sut ca m thanh tnh bng dB SPL(sound
pressure level)[4]. Khong cch gia ngi ni v ngi nghe cng nh hng
n mc cng m thanh, n tng ng vi php o c thc hin khi
microphone c t ti nhng v tr c khong cch khc nhau. Khong cch c
trng trong giao tip face-to-face l 1m, khi khong cch tng gp i th mc
cng m gim i 6 dB[6].
SVTH: Nguyn Th Ngc Dip Trang 17
Chng 1 : Tng quan v nng cao cht lng ting ni
Hnh bn di ny l s tng hp v mc m trung bnh gia ting ni v
nhiu trong cc mi trng khc nhau. Mc ca nhiu nh nht trong cc mi
trng nh phng hc, trong nh , trong bnh vin v trong cc to nh. Trong
cc mi trng khc nhau, th mc m ca nhiu nm trong phm vi bin thin
t 50 n 55 dB SPL, v mc m ca ting ni l 60 n 70 dB SPL. V
khuyn ngh a ra l mc t s SNR c hiu qu trong cc mi trng ny l 5
n 15 dB. Mc m ca nhiu rt cao trong cc mi trng tu in ngm,
trn my bay, n t khong 70 n 75 dB SPL. V mc m ca ting ni trong
cc mi trng ny cng t mc , nn mc t s SNR trong cc mi trng
ny gn nh l 0 dB.
Hnh 1.5 Mc nhiu v ting ni (c o bng SPL dB) trong cc mi
trng khc nhau [4].
1.5 Tn hiu ri rc theo thi gian
Tn hiu ri rc theo thi gian x(n) c th to ra bng cch ly mu tn hiu
lin tc theo thi gian x
a
(t) vi chu k ly mu l T
s
(tn s ly mu F
s
= 1/ T). Ta
c
x
a
(t)|
t=nT
= x
a
(nT) = x(n) , - < n< (1.1)
Lu n l bin nguyn, x(n) l hm theo bin nguyn, ch nh ti cc gi tr
n nguyn. Khi n khng nguyn, th x(n) khng xc nh, ch khng phi bng 0.
Trong nhiu sch v x l tn hiu s, ngi ta quy c: khi bin nguyn th bin
SVTH: Nguyn Th Ngc Dip Trang 18
Chng 1 : Tng quan v nng cao cht lng ting ni
c t trong du ngoc vung v khi bin lin tc th c t trong du ngoc
trn. T y tr i, ta k hiu tn hiu ri rc l: x[n].[7]
Mt s tn hiu ri rc c bn
1.5.1 Tn hiu bc nhy n v
u[n] =

'

<

0 , 2
0 , 1
n
n
(1.2)
Tn hiu bc nhy dch chuyn c dng sau:
u[n - n
o
] =

'

<

no n
no n
, 0
, 1
(1.3)
1.5.2 Tn hiu xung n v

'

0 , 0
0 , 1
] [
n
n
n
(1.4)
Tn hiu xung dch chuyn c dng sau

'


no n
no n
no n
, 0
, 1
] [
(1.5)
Chng ta c th biu din tn hiu ri rc theo thi gian x[n] thng qua tn
hiu xung n v nh sau
x[n] =

k
k n k x ] [ ] [
(1.6)
1.5.3 Tn hiu hm m
x[n] = C.a
n
(C,a : l nhng hng s) (1.7)
Tn hiu hm m pha phi : x[n] = C.a
n
.u[n]
Tn hiu hm m pha tri : x[n] = C.a
n
.u[-n]
1.5.4 Tn hiu hm sin ri rc
< < + n fn A n x ) 2 cos( ] [
(1.8)
A : l bin ca tn hiu sin
: pha ban u ca tn hiu sin
f : tn s s, f =
s
F
F
, F : l tn s ca tn hiu, F
s
: tn s ly mu
-0.5 < f < 0.5
SVTH: Nguyn Th Ngc Dip Trang 19
Chng 1 : Tng quan v nng cao cht lng ting ni
1.6 Php bin i Fourier ca tn hiu ri rc DTFT
Php bin i ny p dng phn tch cho c tn hiu v h thng. N c
dng trong trng hp dy ri rc di v hn v khng tun hon.
DTFT :




n
n j
e n x X ] [ ) (
(1.9)
Ta nhn xt thy rng tuy tn hiu ri rc trong min thi gian nhng DTFT
li lin tc v tun hon trong min tn s.
DTFT chnh l hm phc theo bin tn s thc. Ta gi DTFT l ph phc
(complex spectrum) hay ngn gn l ph ca tn hiu ri rc x[n].
1.6.1 S hi t ca php bin i Fourier
Khng phi l tt c DTFT u tn ti (hi t) v DTFT ch hi t khi
<



n
n j
e n x ] [
(1.10)
Ta lun lun c :


n n
n j
n
n j
n
n j
n
n j
n
n j
n x e n x
e n x e n x
e n x e n x
| ] [ | ] [
| || ] [ | ] [
] [ ] [
(1.11)
Nh vy, nu x[n] tho iu kin

n
n x | ] [ |
<

th bin i Fourier hi t [7].


1.6.2 Quan h gia bin i Z v bin i Fourier
Biu thc tnh ZT l:

n
n
z n x z X ] [ ) (
(1.12)
Gi s ROC c cha ng trn n v. Tnh X(Z) trn ng trn n v, ta
c

) ( ] [ ) (


X e n x z X
n
n j
e z
j (1.13)
Nh vy, bin i Fourier chnh l bin i Z tnh trn ng trn n v.
Da vo y, ta c th pht biu li iu kin tn ti ca DTFT nh sau :
SVTH: Nguyn Th Ngc Dip Trang 20
Chng 1 : Tng quan v nng cao cht lng ting ni
Bin i Fourier ca mt tn hiu ch tn ti khi ROC ca bin Z ca tn hiu
c cha ng trn n v.
1.6.3 Php bin i Fourier ngc
- Biu thc tnh bin i Fourier ngc
Ta thy X( ) l mt hm tun hon vi chu k 2 , do
j
e tun hon vi chu
k 2

+

j j j j j
e e e e e
2 ) 2 (
(1.14)
Do di tn s ca tn hiu ri rc l mt di tn bt k rng 2

, thng
chn l:
) , (
hay (0,2

).
Vy ta c th khai trin X( ) thnh chui Fourier trong khong (
) ,
hay
) 2 , 0 (
nu iu kin tn ti ca X( ) tho mn. Cc h s Fourier l x[n], ta c
th tnh c x[n] t X( ) theo cch sau:
Nhn 2 v ca biu thc DTFT vi
l j
e

2
1
ri ly tch phn trong khong
(
) ,
tac
] [
2
1
] [ ] [
2
1
) (
2
1
) (
l x d e n x d e e n x d e X
n l j
n
l j
n
n j l j

1
]
1


1
]
1



(1.15)
Thay l = n v thay cn tch phn, khng nht thit phi l (
) ,
m ch cn
khong gia cn trn v di l 2

, ta c biu thc tnh bin i Fourier ngc


(IDTFT) nh sau

2
) (
2
1
] [ d e X n x
n j
(1.16)
Ta c th tnh IDFT bng hai cch : mt l tnh trc tip tch phn trn, hai l
chuyn v bin i Z ri tnh nh bin i Z ngc. Tu vo tng trng hp c
th m ta chn phng php no cho thun tin.
1.6.4 Cc tnh cht ca php bin i Fourier
Tnh tuyn tnh
) ( ) ( ] [ ] [
2 1 2 1
+ + bX aX n bx n ax

(1.17)
SVTH: Nguyn Th Ngc Dip Trang 21
Chng 1 : Tng quan v nng cao cht lng ting ni
Tnh dch thi gian
(1.18)
Qua y ta thy s dch chuyn tn hiu trong min thi gian s khng nh
hng bin ca DTFT, tuy nhin pha c thm mt lng.
Tnh dch tn s / iu ch

) (
2
1
) (
2
1
] [ ) cos(
) ( ] [
) ( ] [
0 0 0
0
0
+ +

X x n x n
X n x e
X n x
n j
(1.18)
Nh vy, vic iu ch gy ra dch tn s[12].
Tnh chp thi gian
Tng t nh bin i Z, vi bin i Fourier ta cng c:
) ( ) ( ] [ * ] [
2 1 2 1
X X n x n x
F
(1.19)
Tnh nhn thi gian

2
2 1 2 1
) ( ) (
2
1
] [ ]. [ d X X n x n x
(1.20)
1.6.5 Phn tch tn s (ph) cho tn hiu ri rc
Trong min tn s, mi tn hiu u c mt c im ring ca n. V d
nh, tn hiu sin ch c duy nht mt tn s n, trong khi nhiu trng cha tt c
cc thnh phn tn s. S bin thin chm ca tn hiu l do tn s thp, trong khi
s bin thin nhanh v nhng xung nhn l do tn s cao. Nh xung vung chng
hn, n cha tt c tn s v c tn s cao.
Ph ca tn hiu l m t chi tit cc thnh phn tn s cha bn trong tn
hiu. V d nh tn hiu xung vung, ph ca n ch ra tt c cc nh nhn ca
cc sng sin ring c th kt hp li hp vi nhau to ra xung vung. Thng tin
ny quan trng v nhiu l do. V d, thnh phn tn s trong mt mu nhc ch
cho ta bit cc c trng ca loa, t khi sn xut ta li c ci tin cho hay
hn. d on cc nh hng ca b lc trn tn hiu, cn phi bit khng ch
bn cht ca b lc m cn phi bit c ph ca tn hiu na.
SVTH: Nguyn Th Ngc Dip Trang 22
Chng 1 : Tng quan v nng cao cht lng ting ni
1.6.6 Ph tn hiu v ph pha
Ph ca tn hiu gm hai phn: ph bin (magnitude spectrum) v ph
pha (phase spectrum). Ph bin ch ra ln ca tng thnh phn tn s. Ph
pha ch quan h pha gia cc thnh phn tn s khc nhau. Cng c tnh ph tn
hiu ri rc khng tun hon l DTFT.
tnh ph tn hiu , ta qua hai bc : mt l tnh DTFT ca tn hiu l
X(), hai l tnh bin v pha ca X( )
) (
) ( ) (


j
e X X
(1.21)
y |X()| l ph bin v ( ) l ph pha.
Ta d dng chng minh c rng i vi tn hiu thc, ph bin l mt
hm chn theo tn s v ph pha l mt hm l theo .
Do , nu bit ph X( ) trong khong 0 n

, ta c th suy ra ph trong
ton di tn s. d gii thch ph, tn s s t 0 n

thng c chuyn
i thnh tn s tng t t 0 n f
s
/2 nu tn s ly mu l f
s
.
Hnh 1.6 Mu ting ni eee c ly mu vi tn s ly mu 8kHz [11].
SVTH: Nguyn Th Ngc Dip Trang 23
Chng 1 : Tng quan v nng cao cht lng ting ni
1.7 Cc thut ton s dng nng cao cht lng ting ni
1.7.1 Tr ph
Spectral-subtraction (SS) hay cn gi l tr ph l mt thut ton gim
nhiu n gin nht. N da trn nguyn l c bn l n s m t v cp nht
nhiu trong tn hiu c nhiu bng cch thu nhiu khi khng c s hin din ca
tn hiu. V nhiu s c tr vi tn hiu c nhiu, kt qu l tn hiu ca
chng ta sau khi x l bng thut ton ny s c loi i nhiu v xt trn
phng din l tng th n l tn hiu sch. SS lc ban u c xut bi
Weiss[8] trong min tng quan, v sau c xut bi Boll [9] trong min
chuyn i Fourier.
1.7.2 M hnh thng k
Vn ca nng cao cht lng ting ni l phi ra c khung m t
mang tnh thng k. N l mt tp cc php o tng ng vi h chuyn i
Fourier ca tn hiu nhiu, v chng ta mong mun s tm ra c mt phng
thc c lng tuyn tnh hoc phi tuyn cc tham s c li, l h chuyn i
ca tn hiu sch. Hai thut ton c s dng l thut ton Wiener v minium
mean-squared error(MMSE)[10].
1.8 Tn hiu ting ni
Tn hiu ting ni l tn hiu lin tc v c ph nng lng thay i theo thi
gian. Tuy nhin khi kho st trong mt khong thi gian ngn (khong 10 n
30 ms) th c tnh ph ca n coi nh khng thay i.
SVTH: Nguyn Th Ngc Dip Trang 24
Chng 1 : Tng quan v nng cao cht lng ting ni
Hnh 1.7 Dng sng tn hiu ting ni ca cu The wife helped her
husband v dng sng ca ph m f trong t wife, dng sng ca on
nguyn m er trong t her [11].
Dng sng ca tn hiu c th c chia thnh mt s phn on tng ng
vi cc m/t. Trong v d trn ta thy mt s phn on c dng sng gn nh
tun hon cn s khc th khng c tnh tun hon v b nhiu
Nhng kiu ca on ting ni_chu k, nhiu, khong lng thng c
tm thy trong ting ni tri chy vi s thay i v cng , khong thi gian v
c tnh ph.
SVTH: Nguyn Th Ngc Dip Trang 25
Chng 1 : Tng quan v nng cao cht lng ting ni
1.9 C ch to ting ni
1.9.1.1 B my pht m ca con ngi
Hnh 1.8 mt ct dc ca c quan to ting ni [11].
1.9.2 M hnh k thut ca vic to ting ni
Hnh 1.9 m hnh k thut to ting ni[11].
SVTH: Nguyn Th Ngc Dip Trang 26
Chng 1 : Tng quan v nng cao cht lng ting ni
1.9.3 Phn loi m
Cc m trong ting Anh c phn loi gm: nguyn m v nguyn m i,
bn nguyn m, m mi, m stops, fricative, africatives, whisper.
Hnh 1.10 bng phn loi m v trong ting Anh ca ngi M [11]
1.9.4 Thuc tnh m hc ca ting ni
Tn hiu ting ni l tn hiu tng t, biu din cho thng tin v mt ngn
ng v c th hin bng cc m v khc nhau. S lng cc m v tu thuc vo
tng ngn ng, vo khong 20 n 30 v khng vt qu 50. i vi tng loi m
v n c c tnh m thanh khc nhau.T hp cc m v to nn m tit. m tit
ng vai tr mt t trn vn mang ng ngha.
1.10 Kt lun chng
Chng ny trnh by c mc ch chnh ca speech enhancement l
trit nhiu hoc l nn nhiu trong tn hiu ting ni b nhiu. Ngoi ra, ni
dung ca chng cng nu r c cc loi nhiu xut hin trong tng mi
trng c th t chng ta c tm ra c thut ton x l thch hp ng vi
mi trng hp c th.
SVTH: Nguyn Th Ngc Dip Trang 27
Chng 2 : nh gi cht lng ting ni
CHNG 2 : NH GI CH T LNG TI NG NOI
1.11 Gii thiu chng
Cho n nay c rt nhiu thut ton nng cao cht lng ting ni, nhng
lm th no nh gi ng hiu qu ca chng. Phn ny cung cp cc phng
php nh gi khc nhau c s dng nh gi hi u qua cu a thut ton nng
cao ting ni
nh gi cht lng c th thc hin bng cch s dng phng php a nh
gia theo ca m nhn cu a ng i nghe theo m t thang o a c xa c i nh tr c
(Subjective Evaluation_ SE) hoc da trn php o cc thuc tnh ca tn hiu
(Objective Evaluation_ OE). D OE c gi tr th n vn phi tng quan vi ca m
nh n ca ngi nghe
Phn ny s cung cp mt ci nhn tng quan v ca c phng php nh gi
cht lng ca ting ni c x l
1.12 Phng php nh gi ch quan
nh gi cht lng ch quan l nh gi cht lng da trn cm nhn
nghe ca con ngi i vi ting ni
Cht lng l mt trong cc thuc tnh ca tn hiu ting ni. V bn cht th
cht lng c tnh ch quan cao v kh c th nh gi mt cch ng tin cy.N
ch ng vai tr phn no trong k thut nh gi v mi c nhn ngi nghe c
nhng tiu chun ring v cht lng tt hay xu, cht lng l kt qu ca s
cm nhn v phn on ch quan ca ngi nghe, dn n s chnh lch ln trong
kt qu nh gi. Cht lng c rt nhiu ch tiu khng th m ht c. Ty
vo cc mc ch thc t v ty vo mi ng dng m ch tp trung vo mt s ch
tiu cht lng ting ni.
nh gi cht lng ting ni l mt cng vic y kh khn do tnh a ch
tiu v tnh ch quan cao. C mt s lng ln cc c trng nh gi khi thc
hin phng php nghe ch quan ny. kt qu nh gi l ng tin cy th s
la chn ng n cc tham s cho vic nh gi l iu cn thit. Da trn thc
SVTH: Nguyn Th Ngc Dip Trang 28
Chng 2 : nh gi cht lng ting ni
t ITU-T a ra cc khuyn ngh ban hnh trong cc chun t ITU-T Rec
P.800 n ITU-T Rec P.899. C hai loi nh gi chnh l Tuyt i v Tng i.
S nh gi da trn cc thang im chun c ra trong chun ITU-T
Rec.P.800[11]
1.12.1 Cc phng php nh gi tuyt i
1.12.1.1 Phng php nh gi tuyt i ACR
ACR c s dng rng ri. ITU-T[11] khuyn ngh dng phng php
ny trong hu ht cc ng dng. Thang o c khuyn l
Mean Opinion Scores (MOS)
c m t trong khuyn ngh P.800 ca ITU-T, MOS l mt
php o cht lng thoi ni ting. y l mt phng php o cht
lng mang tnh cht ch quan. C hai phng php kim tra l nh
gi m thoi v nh gi nghe.
Ba ng 2.1.Thang im a nh gia ch t l ng ting noi theo MOS [12]
Score Quality of the Speech Level of Distortion
5 Excellent Imperceptible
4 Good Just perceptible, but not annoying
3 Fair Perceptible and slight annoying
2 Poor Annoying but not Objectinable
1 Bad Very annoying and Objectionable
1.12.2 Cc phng php nh gi tng i
Nhn chung phng php nh gi ny c nhy cao hn i vi s suy
gim cht lng ca tn hiu qua x l
1.12.2.1 nh gi bng phng php so snh cc mu tn hiu
Dng n gin nht ca phng php ny l thch nghe mu no hn
Preference test hay cn gi l so snh nh gi theo tng cp tn hiu Paired
Comparison Test. i vi phng php ny th ngi nghe s c nghe hai mu
thoi v s nh gi thch mu tn hiu no hn
SVTH: Nguyn Th Ngc Dip Trang 29
Chng 2 : nh gi cht lng ting ni
nh gi bng cch so snh Comparison Category Rating (CCR) c
khuyn ngh bi ITU-T nh gi cc h thng dng nng cao cht lng ting
ni. [13]
Bang 2.4. Thang im a nh gia ch t l ng tin hi u ting noi theo CCR
Theshold Test hay cn gi l Isopreference Test l mt bin th ca
Preference Test. Phng php ny l so snh tn hiu qua x l vi tn hiu g c
chun m suy gim cht lng ca n c th c kim sot. c ra trong
chun ITU-T Rec.P.810
1.12.2.2 Phng php nh gi theo s suy gim cht lng
nh gi s suy gim cht lng Degradation Category Rating (DCR) S
gim st v cht lng ca tn hiu qua x l so vi tn hiu cht lng cao
cha qua x l c xc nh qua nm thang im
Bang 2.5. Thang anh gia DCR
Rating Degradation
1 Very annoying
2 Annoying
3 Sightly annoying
4 Audible but not annoying
5 Inaudible
SVTH: Nguyn Th Ngc Dip Trang 30
Rating Quality of Speech
3 Much better
2 Better
1 Slightly Better
0 About the Same
-1 Slightly Worse
-2 Worse
-3 Much Worse
Chng 2 : nh gi cht lng ting ni
1.13 Phng php anh gia khch quan
nh gi cht lng khch quan l phng php nh gi cht lng da
trn cc php o thuc tnh ca tn hiu
1.13.1 o ty s tin hi u trn nhi u trn tng khung
o SNR trn tng khung trong min thi gian l mt trong nhng phng
php nh gi v m t toa n n gin nht. phng php ny c hiu qu th
iu quan trng l tn hiu gc v tn hiu qua x l phi trong cng min thi
gian v lch pha hin ti phi c hiu chnh chnh xc. SNRseg c xc
nh nh sau
[12] (2.1)
Trong o : tn hiu gc (tn hiu sch)
: tn hiu c tng cng
N: chiu di khung (thng c chn t 15-20ms)
M: s khung ca tn hiu
Mt vn tim n vi phng php nh gi SNRseg l nng lng ca tn
hiu trong sut khong lng ca tn hiu thoi (xut hin nhiu trong cc on hi
thoi) s rt b, dn n kt qu l gi tr ca ai s SNRseg ln la m sai l ch toa n
b a nh gia . Phng a n gia i quy t duy nh t la loa i tr nh ng khung l ng trong
bi u th c trn b ng ca ch o m c nng l ng trong th i gian ng n nn gi tr
SNRseg ngng n m t gia tri be . Nu gia tri SNRseg c gi i ha n trong
khoa ng [-10dB, 35dB] [14] s tra nh c vi c c n pha i du ng b ta ch ti n hi u thoa i
va khoa ng l ng
S xa c i nh tr c cu a SNRseg d a trn ti n hi u va o g c va ti n hi u a c
x ly . Ta co th du ng ti n hi u c x ly qua b lo c d oa n th ng c s du ng
trong thu t toa n CELP [15]. Sau khi a ti n hi u g c va ti n hi u a qua x ly qua
ca c b lo c na y, ta co th ti nh toa n SNRseg da trn ti n hi u ra cu a ca c b lo c[16].
SVTH: Nguyn Th Ngc Dip Trang 31
Chng 2 : nh gi cht lng ting ni
S c ti nh SNR na y mang la i h s tng quan cao i v i ca c phng pha p
a nh gia chu quan
M t ca ch xa c i nh SNRseg kha c c xu t b i Richards [17] trong o
ha m log co thay i so v i cng th c 3.1
(2.2)
Nh v y co th tra nh c ca c gia tri sai lch l n trong su t ca c khoa ng l ng
cu a ti n hi u ti ng no i. Chu y r ng gia tri nho nh t co th a t c cu a SNRseg
R
by gi la 0 thi a t t hn nhi u so v i nh ng gia tri m v cu ng. u i m chi nh
cu a vi c xa c i nh trc phn oa n SNR la tra nh c vi c c n thi t pha i la m ro
ra ng gi a ca c khoa ng ti ng no i va khoa ng l ng
o SNR cho tng khung co th c m r ng trong mi n t n s theo
(2.3)
Trong o B
j
: Tro ng l ng ta i da i t n s th j
K : S da i t n
M : T ng s khung ti n hi u
F(m,j) : Da y ti n hi u g c qua bo lo c a c khu ch a i ta i
da i
l n th j va khung th m
: Da y ti n hi u a c tng c ng qua lo c khu ch a i
cu ng m t da i t n v i F(m,j)
u i m chi nh cu a vi c s du ng SNRseg trn mi n t n s thay vi mi n th i
gian tng thm ti nh linh ng cu a vi c phn b tro ng s ca ph kha c nhau cho
nh ng da i t n kha c nhau
M t ca ch kha c, tro ng s cu a m i da i co th thu c b ng ca ch du ng
phng pha p phn ti ch h i quy, co n go i la phng pha p a nh gia chu quan bi n
i t n s . B ng ca ch na y , tro ng s co th c cho n co h s tng quan l n
nh t gia a nh gia kha ch quan va a nh gia chu quan. V i phng pha p na y, t ng
SVTH: Nguyn Th Ngc Dip Trang 32
Chng 2 : nh gi cht lng ting ni
cu a K (cho m i da i) cu a ca c phng pha p a nh gia kha ch quan kha c nhau va D
j
c c ti nh cho m i da y, ta i D
j
c cho nh sau[12]
(2.4)
Tro ng l ng t i u cho m i D
j
cu a m i da i a t c khi du ng phng pha p
phn ti ch h i quy tuy n ti nh b c K, cho ra a nh gia chu quan bi n i t n s :
(2.5)
: Ca c h s h i quy, D
j
: c cho b i (3.4), K la s da i .Phn ti ch h i
quy khng tuy n ti nh cu ng co th c s du ng nh mt ca ch chuy n ho a a nh
gia kha ch quan bi n i t n s
1.13.2 o khoa ng cach ph da trn LPC
LPC (Linear Prediction Coefficient)s :H s d oa n tuy n ti nh
Gm cc phng pha p ph bi n la LLR (Log Likelihood Ratio) , IS (Itakura
Saito) v o theo khong cch cepstrum
1.13.2.1 Phng phap o LLR
(2.6)
:h s LPC cu a ti n hi u sa ch
:h s cu a ti n hi u a c
tng c ng ch t l ng
R
x
la (p+1)*(p+1)ma tr n t tng quan(Toeplitz) cu a ti n hi u sa ch
Bi u th c trn c vi t la i trong mi n t n s nh sau[9]
[17] (2.7)
va l n lt la ph cu a va . Bi u th c trn chi ra s kha c
nhau gi a ph ti n hi u va ph tng cng co a nh h ng nhi u hn khi
SVTH: Nguyn Th Ngc Dip Trang 33
Chng 2 : nh gi cht lng ting ni
l n, th ng g n v i i nh t n s formant. Do o , ca ch o na y xa c i nh
s kha c nhau vi tri cu a i nh t n s formant
1.13.2.2 Phng php o IS
o IS c xc nh nh sau
[12] (2.8)
va l n l t la h s khu ch a i cu a ti n hi u sa ch va ti n hi u
tng c ng. H s khu ch a i co th c ti nh nh sau:
(2.9)
ch a h s t tng quan cu a
ti n hi u sa ch (no cu ng la ha ng u tin cu a ma tr n t tng quan )
1.13.2.3 Phng php o theo khong cch cepstrum
Khng gi ng v i o LLR, IS chu tro ng s kha c nhau gi a h s khu ch a i ,
s khc nhau v mc ph ca tn hiu sch v tn hiu tng cng. Bn cnh
cng c th l hn ch ca nh gi IS, s khc nhau gia cc mc ph c tc
ng nh n cht lng[18]
H s LPC cng c th xut pht t khong cch o c da trn h s
cepstrum. Khong cch ny quy nh s c lng khong cch log ca ph ca
gia hai ph tn hiu. H s cepstrum c th thu c t php quy h s LPC
{a
j
} s dng cng thc sau
(2.10)
Vi p l bc ca phn tch LPC .Php o da trn h s cepstrum c th c
tnh nh sau [19]
(2.11)
SVTH: Nguyn Th Ngc Dip Trang 34
Chng 2 : nh gi cht lng ting ni
Vi v ln lt l h s ca tn hiu sch v tn hiu c
tng cng
1.13.3 nh gi m phng theo cm nhn nghe ca con ngi
Nhng phng php nh gi c cp trn c a dng v tnh n
gin thc hin v d dng nh gi. Tuy nhin, kh nng d on cht lng
ch quan ca chng th hn ch khi m cc phng php x l tn hiu khng
tnh n phm vi nghe ca con ngi.
1.13.3.1 Phng php o Weighted Spectral Slope
Phng php nh gi ny c tnh bi dc ph u tin c tm thy ca
mi di ph. Xt C
x
(k) l ph di ti hn ca tn hiu sch v l ca tn hiu
tng cng, xt trong n v dB. Phng trnh sai phn bc nht c dng tnh
c dc ph c cho nh sau:
(2.12)
Vi v ln lt biu din cho dc di tn th k ca tn hiu
sch v tn hiu tng cng. S khc nhau gia cc dc ph ph thuc vo
trng s mt l di tn gn vi nh hoc rnh, hai l nh l nh ln nht ca
ph. Trng s ca di th k, k hiu W(k) c tnh nh sau
[12] (2.13)
rng loga ln nht ca ph trong tt c cc bng, l gi tr
ca nh gn vi bng k nht, v , l hng s c c bng php
phn tch hi quy cc i ha s tng quan gia nh gi ch quan v gi tr
ca nh gi khch quan. Vi nhng th nghim c thc hin th ngi ta tm
c s tng quan ln nht s c c vi =20 v =1[18]
SVTH: Nguyn Th Ngc Dip Trang 35
Chng 2 : nh gi cht lng ting ni
Php o WSS tnh cho mi khung ca tn hiu thoi:
(2.14)
Vi L l s lng di ti hn
Gi tr WSS c tnh bng cch ly trung bnh cc gi tr WSS thu c t
cc khung trong cu
WSS l phng php nh gi kh hp dn bi v n khng i hi phi c
formant r rng. N ch ti v tr nh ph v t nhy cm vi cc nh xung
quanh cng nh cc chi tit ca ph cc vng thp. nh gi LPC c bn (v d
nh nh gi LLR) nhy vi cc tn s formant khc, nhng cng nhy vi s
thay i bin v s thay i nghing ph. Khng c g l bt ng khi nh
gi WSS mang li mt s tng quan ln (=0.74) hn nh gi LPC, vi s nh
gi cht lng ch quan ca ting ni b gim cht lng bi s m ha[20]
1.13.3.2 Phng php o Bark Distortion
Phng php nh gi WSS l bc u lm mu cho vic lm th no
con ngi nhn bit c ting ni, c bit l nguyn m. Cc phng php nh
gi sau ny cng da vo s x l m thanh ca tai ngi, cch m thnh gic con
ngi x l m thanh v nhiu. Nhng phng php nh gi mi ny da trn
nhng lp lun sau:
1. S phn tch tn s ca tai ngi l khng thay i, tc l s phn
tch tn s ca tn hiu m thanh khng da trn phm vi tn s tuyn
tnh
2. nhy ca tai ngi ph thuc vo tn s m thanh
3. m thanh to tng ng vi mnh ca tn hiu trong min phi tuyn
tnh
Thnh gic con ngi phng theo mt lot bin i ca tn hiu m thanh. C
tn hiu gc v tn hiu qua x l phi tri qua hng lot cc bin i ny, dn
n ci gi l ph m lng. nh gi BSD s dng khong cch gia cc ph ny
nh l nh gi cht lng ch quan
SVTH: Nguyn Th Ngc Dip Trang 36
Chng 2 : nh gi cht lng ting ni
1.13.3.3 Phng php nh gi cm nhn cht lng thoi PESQ
Trong cc phng php nh gi OE th PESQ l phng php nh gi
phc tp nht v c khuyn ngh bi ITU_T cho nh gi cht lng thoi bng
hp (3,2KHz) v l mt phng php nh gi khch quan c tnh tng quan cao
vi nh gi theo cm nhn ca ngi nghe
1.14 Kt lun chng
Chng ny trnh by mt s phng php nh gi cht lng ting ni
sau khi x l gim nhiu bng cc thut ton tng cng ting ni. Cc nh gi
SE c gii thiu gm c cc phng php nh gi tuyt i v nh gi tng
i. Cc phng php nh gi OE c trnh by chnh trong chng ny gm :
o SNRseg, nh gi LLR, IS v WSS, trong phn ny cng gii thiu s b v
BSD v PESQ
SVTH: Nguyn Th Ngc Dip Trang 37
Overlap v
adding
Chng 2 : nh gi cht lng ting ni
SVTH: Nguyn Th Ngc Dip Trang 38
Overlap v
adding
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
CHNG 3: THUT TON SPECTRALSUBTRACTION V WIENER
FILTERING
1.15 Gii thiu chng
Ni dung ca chng ny trnh by nguyn l chung ca thut ton Spectral
subtraction v Wiener filtering,nguyn l c bn ca tng thut ton, cc bc
thc hin cn thit phn tch lin kt tn hiu, cp n vn c lung
nhiu, vn ny nh hng rt ln n qu trnh x l
1.16 S khi chung ca Spectral Subtraction v Wiener Filtering
Trong n ny, chng ti da trn cc c s l thuyt ca cc thut ton c
trong speech enhancement, v la chn ra 2 thut ton l : Spectral
subtraction v Wiener filter s dng lm thut ton x l trit nhiu.
S khi chung cho c 2 thut ton :
Hnh 3.1 S khi cho hai thut ton SS v WF
C 2 thut ton Spectral subtraction v Wiener filter ch khc nhau khi
hm x l trit nhiu, tt c cc khi cn li th ging nhau.
1.17 Thut ton Spectral Subtraction
1.17.1 Gii thiu chung
Spectral subtraction l thut ton c xut sm nht trong cc thut
ton c s dng gim nhiu trong tn hiu. c rt nhiu bi lun m t cc
bin th ca thut ton ny so vi cc thut ton khc. N da trn mt nguyn tc
c bn, tha nhn s c mt ca nhiu, n c th t c mc ch c lng
ph ca ting ni sch bng cch tr i ph ca nhiu vi ph ca ting ni b
SVTH: Nguyn Th Ngc Dip Trang 39
Phn tch tn
hiu thnh cc
frame
FFT
c lng
nhiu
Hm x l
gim nhiu
Tn hiu b
nhiu
IDFT
Overlap v
adding
Tn hiu
sch
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
nhiu. Ph ca nhiu c th c c lng, cp nht trong nhiu chu k khi
khng c mt ca tn hiu. S tha nhn ch c thc hin i vi nhiu
khng i hoc c tc x l bin i chm, v khi ph ca nhiu s khng
thay i ng k gia cc khong thi gian cp nht. Vic tng cng tn hiu t
c bng cch tnh IDFT(bin i Fourier ri rc ngc) ca ph tn hiu
c c lng c s dng pha ca tn hiu c nhiu. Thut ton ny l mt php
tnh c lng n gin v n ch gm bin i DFT thun v DFT ngc.
Qu trnh x l hiu n gin nh vy phi tr mt ci gi, nu qu trnh x
l khng c thc hin mt cch cn thn th ting ni ca chng ta s b mo.
Nu nh vic ly hiu qu ln th c th loi b i mt phn thng tin ca ting
ni, cn nu vic thc hin ly hiu nh th nhiu s vn cn c gi li trong
tn hiu. C rt nhiu phng php c xut gim i hu ht mo trong qu
trnh x l ting ni bng spectral subtraction[21], v trong s cng c mt vi
trng hp b loi b.
1.17.2 Spectral subtraction i vi ph bin
Gi thit rng y[n] l tn hiu vo b nhiu, n tng ca tn hiu sch x[n]
v tn hiu nhiu d[n]:
y[n] = x[n] + d[n] (3.1)
Thc hin bin i Fourier ri rc c 2 v,ta c

) ( ) ( ) ( D X Y +
(3.2)
Chng ta c th biu din Y(

) di dng phc nh sau:


) (
| ) ( | ) (


y
j
e Y Y (3.3)
Khi |Y(

)| l bin ph, v
) (
y l pha ca tn hiu b nhiu.
Ph ca tn hiu nhiu D(

) c th c biu din dng bin v pha:


) (
| ) ( | ) (


d
j
e D D (3.4)
Bin ph ca nhiu |D(

)| khng xc nh c, nhng c th thay th


bng gi tr trung bnh ca n c tnh trong khi khng c ting ni(ting ni b
dng), v pha ca tn hiu nhiu c th thay th bng pha ca tn hiu b nhiu
SVTH: Nguyn Th Ngc Dip Trang 40
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
) (
y , vic lm ny khng nh hng n tnh d nghe ca ting ni [22], c th
nh hng n cht lng ca ting ni l lm thay i pha ca ting ni nhng
cng ch vi .
Khi chng ta c th c lng c ph ca tn hiu sch:
) (
|] ) ( | | ) ( [| ) (


y
j
e D Y X


(3.5)
y |
) (

D
| l bin ph c lng ca nhiu c tnh trong khi khng
c ting ni hot ng. K hiu
" "
ch rng gi tr l gi tr c tnh gn
ng. Tn hiu ting ni c tng cng c th t c bng cch rt n gin
l bin i IDFT ca
) (

X
.
Cn ch rng bin ph ca tn hiu c tng cng l
|) ) ( | | ) ( | ( | ) ( | D Y X
, c th b m do s sai st trong vic c lng ph ca
nhiu. Tuy nhin, bin ca ph th khng th m, nn chng cn phi m bo
rng khi thc tr hai ph th ph ca tn hiu tng cng |X(

)| lun lun khng


m. Gii php c a ra khc phc iu ny l chnh lu bn sng hiu ca
ph, nu thnh phn ph no m m th chng ta s gn n bng 0:

'

>

, 0
| ) ( | | ) ( | , | ) ( | ) (
) (
^ ^

D Y D Y
X
(3.6)
Phng php x l bng chnh lu bn sng l mt trong nhng cch m
bo cho |X(

)| khng b m.
1.17.3 Spectral subtraction i vi ph cng sut
Thut ton Spectral subtraction i vi ph bin c th c m rng
sang min ph cng sut. V trong mt vi trng hp, n c th lm vic tt vi
ph cng sut hn l vi ph bin . Ly ph cng sut ca tn hiu b nhiu
trong mt khong ngn, chng ta bnh phng |Y(

)|, ta c:
{ } ) ( ) ( Re . 2 ) ( ) (
) ( ) ( ) ( . ) ( ) ( ) ( ) (
*
2 2
* *
2 2 2


D X D X
D X D X D X Y
+ +
+ + +
(3.7)
SVTH: Nguyn Th Ngc Dip Trang 41
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
|
) ( D
|
2
, X(

). ) (

D v ) ( ). ( D X

khng th tnh c mt cch trc tip
v xp x bng E{|
) ( D
|
2
}, E{ X(

). ) (

D } v E{ ) ( ). ( D X

}, khi E[.] l
ton t k vng. Bnh thng th E{|
) ( D
|
2
} c c lng khi khng c ting
ni hot ng v c biu th l |
) (

D
|
2
. Nu chng ta tha nhn d[n] = 0 v
khng c mt s tng quan no vi tn hiu sch x[n], th E{ X(

). ) (

D } v
E{ ) ( ). ( D X

} xem l 0. Khi ph cng sut ca tn hiu sch c th tnh c
nh sau

2
^
2 2
^
| ) ( | | ) ( | | ) ( | D Y X
(3.8)
Cng thc trn biu din thut ton tr ph cng sut. Nh cng thc trn,
th ph cng sut c c lng
2
| ) ( |

X
khng c m bo lun l mt s
dng, nhng c th s dng phng php chnh lu bn sng nh trnh by
trn. Tn hiu c tng cng s thu c bng cch tnh IDFT ca
| ) ( |

X
(bng
cch ly cn bc hai ca
| ) ( |

X
2
), c s dng pha ca tn hiu ting ni b nhiu.
Ch rng, nu chng ta ly IDFT c hai v ca cng thc (4.8) trn th ta c mt
phng trnh tng t trong min t tng quan:
) ( ) ( ) (
^ ^ ^ ^
m r m r m r
dd
yy
xx

(3.9)
Khi
) (
^ ^
m r
xx
,
) (m r
yy ,
) (
^ ^
m r
dd
l cc h s t tng quan ca tn hiu sch,
tn hiu ting ni b nhiu, v tn hiu nhiu c c lng [23,24].
Cng thc (1) c th c vit theo dng sau:
2 2 2
^
| ) ( | ) ( | ) ( | Y H X
( 3.10)
Khi :
2
2
^
| ) ( |
| ) ( |
1 ) (

Y
D
H
(3.11)
Trong l thuyt h thng tuyn tnh, H(

) l hm truyn t ca h thng.
Trong l thuyt ca Speech enhancement, chng ta xem H(

) l hm li hay
SVTH: Nguyn Th Ngc Dip Trang 42
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
hm nn. V H(

) l mt s thc v lun lun dng, v c gi tr nm trong


phm v
1 ) ( 0 H
. Nu n c gi tr m l do c sai st trong qu trnh c
lng ph ca nhiu. H(

) c gi l hm nn l v n cho ta bit t s gia ph


cng sut ca tn hiu c tng cng vi ph cng sut ca tn hiu b nhiu.
Hnh dng ca hm nn l mt c trng duy nht ca mi thut ton Speech
enhancement.
Chnh v vy m chng ta thng so snh cc thut ton bng cch so snh
cc p ng ca hm nn ca chng. H s H(

) c gi tr thc nn bin i IDFT


l h[n] i xng vi nhau qua im 0 v khng nhn qu. Trong min thi gian th
h[n] c xem l mt b lc khng nhn qu [25]. Nn s c mt phng php
c xut hiu chnh hm H(

) p ng ca n tr thnh b lc nhn
qu trong min thi gian.
Trng hp chung th thut ton Spectral subtraction c th c biu din:
p p p
D Y X | ) ( | | ) ( | | ) ( |
^ ^

(3.12)
Trong p l s m cng sut, vi p = 1 l l phng php tr ph bin
in hnh, p = 2 l phng php tr ph cng sut.
S khi ca thut ton Spectral Subtraction :
Hnh 3.2 S khi ca thut ton Spectral subtraction [26].
SVTH: Nguyn Th Ngc Dip Trang 43
Tn hiu b
nhiu
FFT |.|
p
c lng, cp
nht nhiu
Pha ca
tn hiu
|.|
1/p
IFFT
Tn hiu sau khi
tng cng
+
) ( Y
-
p
D | ) ( |
^

Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering


1.18 Thut ton Wiener Filtering
1.18.1 Gii thiu chung
Thut ton Spectral Subtraction da ch yu vo trc gic v kinh
nghim.Chnh xc hn thut ton ny c pht trin da trn mt nhiu c tht
c cng vo v tn hiu sch c c lng mt cch n gin bng cch tr
i ph ca nhiu t ph ca tnh hiu ting ni c nhiu. Vi cch lm ny tn hiu
ting ni sch khng th c c bng cch ti u nht. khc phc nhc im
ny ta s dng thut ton Wiener Filtering (WF).
WF l thut ton c s dng rng ri trong nng cao cht lng ting ni.
Ngun gc c bn ca thut ton WF l to ra tn hiu ting ni sch bng cch
nn nhiu. c lng c thc hin bng cch h thp sai s bnh phng trung
bnh (Mean Square Error) gia tn hiu mong mun v tn hiu c lng.
1.18.2 Nguyn l c bn ca Wiener Filtering
Gi thit rng y[n] l tn hiu vo b nhiu, n l tng ca tn hiu sch v
tn hiu nhiu d[n]:
y[n]=x[n]+d[n] (3.13)
Thc hin bin i Fourier ri rc c 2 v,ta c
) ( ) ( ) ( D X Y +
(3.14)
Chng ta c th biu din Y(

) di dng phc nh sau:


) (
| ) ( | ) (


y
j
e Y Y (3.15)
Khi |Y(

)| l bin ph, v
) (
y l pha ca tn hiu b nhiu.
Ph ca tn hiu nhiu D(

) c th c biu din dng bin v pha:


) (
| ) ( | ) (


d
j
e D D (3.16)
Bin ph ca nhiu |D(

)| khng xc nh c, nhng c th thay th


bng gi tr trung bnh ca n c tnh trong khi khng c ting ni(ting ni b
dng), v pha ca tn hiu nhiu c th thay th bng pha ca tn hiu b nhiu
) (
y , vic lm ny khng nh hng n tnh d nghe ca ting ni [27], c th
SVTH: Nguyn Th Ngc Dip Trang 44
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
nh hng n cht lng ca ting ni l lm thay i pha ca ting ni nhng
cng ch vi .
Ta c th c lng c bin ca ph tn hiu sch
) (

X
t Y(

) bng
mt hm phi tuyn c xc nh nh sau :
) ( / ) ( ) ( Y X G

(3.17)
) ( G
c th c p dng theo Wiener Filtering [28]:
} ) ( { } ) ( {
} ) ( {
) (
2 2
2

D E S E
S E
G
+

(3.18)
Trong
) (
s
P
v
) (
d
P
l ph cng sut ca tin hiu sch.
t Priori SNR v Posteriori SNR nh sau[11]:
} ) ( {
} ) ( {
2
2

D E
S E
SNR
pri

(3.19)
} ) ( {
} ) ( {
2
2

D E
Y E
SNR
post


(3.20)
Mt kh khn trong cc thut ton nng cao cht lng ting ni l ta khng
c tn hiu trc tn hiu sch s[n] nn ta khng th bit ph ca n. Do ta
khng th tnh c pri
SNR
m trong cc h thng nng cao cht lng ging ni
th pri
SNR
l tham s rt cn thit c lng tn hiu sch.Trong cc h thng
nng cao cht lng ging ni c th c lng c pri
SNR
v post
SNR
bng cch
cho cc thng s thch hp vo cc phng trnh sau[12]:
2
1
) ( ) 1 ( ) ( ) (
t
t
d
t
d D P P +


(3.21)
) (
} ) ( {
2

t
d
post
P
Y E
SNR

(3.22)
SVTH: Nguyn Th Ngc Dip Trang 45
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
) (
) (
1 ) ( ) 1 ( ) (
2
1
^


d
t
t
post
t
pri
P
S
SNR P SNR


+
1
]
1


(3.23)
Trong P[.] l hm chnh lu bn sng c dng nh sau:

'

>

, 0
0 ,
) (
X X
X P
(3.24)
V v ch s
t
[.]
tn hiu ti khong thi gian ang x l.
Trong phng trnh nu cho h s

ta c th c lng c pri
SNR
bng
post
SNR
. Trong thc t h s

=0.98 rt tt cho cc tn hiu c SNR<4dB.


T phng trnh (3.18) v (3.19) c
) ( G
theo WF nh sau:
pri
pri
SNR
SNR
G
+

1
) (
(3.25)
S khi ca thut ton Wiener Filtering:
Hnh 3.3 S khi ca thut ton Wiener Filtering.
1.19 Overlap v Adding trong qu trnh x l tn hiu ting ni
1.19.1 Phn tch tn hiu theo tng frame
Do tn hiu cn x l ca chng ta l tn hiu lin tc, nn khi chng ta bin
i FFT trc tip tn hiu t min thi gian m khng thng qua mt qu trnh tin
SVTH: Nguyn Th Ngc Dip Trang 46
Tn hiu
b nhiu
FFT
|.|2
c lng, cp
nht nhiu
Pha ca
tn hiu
|.|
1/2
IFFT
Tn hiu sau khi
tng cng
) ( Y
Priori
SNR
Hm x l
gim
nhiuWF
p r i
S N R
p
D | ) ( |
^

Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering


x l no trc th tn hiu sau khi c bin i FFT s bin i nhanh, lc
chng ta khng th thc hin c cc thut ton x l trit nhiu trong tn hiu v
khi tn hiu c xem l ng.
Chnh v vy, tn hiu ca chng ta cn phi c phn tch thnh nhng
khung tn hiu(frame) lin tc trong min thi gian trc khi chuyn sang min tn
s bng bin i FFT. Khi tn hiu c phn tch thnh cc frame lin tc, th
trong tng frame, tn hiu ca chng ta s bin i chm v n c xem l tnh.
Nu tn hiu c phn tch theo tng frame th khi cc thut ton x l trit
nhiu trong tn hiu mi c th thc hin c mt cch hiu qu. V cch phn
tch tn hiu ca chng ta l frame by frame.
thc hin vic phn tch tn hiu thnh cc frame, cn s dng cc loi
ca s thch hp. y, chng ta s dng ca s Hamming, vi N = 256 mu
trong tng frame :
1 ,..., 0 , ) / ) 1 2 cos(( . 85185 . 0 1 + N k N k (3.27)
Hnh 3.4 Phn tch tn hiu thnh cc frame [31].
1.19.2 Overlap v Adding
Sau khi phn tch tn hiu thnh cc frame lin tc trong min thi gian bng
ca s Hamming, nu cc frame ny lin tc vi nhau v khng theo mt iu kin
no c th khi thc hin bin i FFT th v tnh chng ta lm suy gim tn hiu
do Hamming l ca s phi tuyn.
SVTH: Nguyn Th Ngc Dip Trang 47
N : kch thc ca frame
m : s lng frame
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
Nn khi thc hin phn tch tn hiu thnh cc frame th yu cu t ra l cc
frame phi sp xp chng ln nhau, gi l overlap. Vic xp chng cc frame
vi nhau s c thc hin theo mt t l chng lp thch hp, thng thng l
40% hoc 50%.
Sau khi cc frame tn hiu c x l trit nhiu trong min tn s, cc
frame ny c lin kt li nhau bng phng php thch hp vi phng php
phn tch tn hiu thnh cc frame u vo gi l adding.
Tp hp cc mu tn hiu trong cng mt frame sau khi c phn tch u
vo gi l mt segment. Vi cch thc hin phn tch v lin kt cc frame bng
phng php overlap v adding th tn hiu ca chng ta thu c sau khi x l
trit nhiu s khng b mo dng v s khng xut hin hin tng gi nhiu.
Hnh 3.5 qu trnh thc hin overlap v adding [32].
1.20 c lng v cp nht nhiu
Phng thc c lng nhiu c th nh hng ln n cht lng ca tn
hiu sau khi c tng cng. Nu nhiu c c lng qu nh th nhiu s vn
cn trong tn hiu v n s c nghe thy, cn nu nh nhiu c c lng
qu ln th ting ni s b mo, v lm s lm tnh d nghe ca ting ni b nh
hng. Cch n gin nht c lng v cp nht ph ca nhiu trong on tn
hiu khng c mt ca ting ni s dng thut ton thm d hot ng ca ting
SVTH: Nguyn Th Ngc Dip Trang 48
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
ni (voice activity detection - VAD). Tuy nhin phng php ch tho mn i
vi nhiu khng thay i(nhiu trng), n s khng hiu qu trong cc mi trng
thc t (v d nh nh hng), nhng ni c tnh ph ca nhiu thay i lin
tc. Trong mc ny chng ta s cp n thut ton c lng nhiu thay i
lin tc v thc hin trong lc ting ni hot ng, thut ton ny s ph hp mi
trng c nhiu thay i cao.
1.20.1 Voice activity detection
Qu trnh x l phn bit khi no c ting ni hot ng, khi no khng
c ting ni (im lng) c gi l s thm d hot ng ca ting ni Voice
activity detection (VAD). Thut ton VAD c tn hiu ra dng nh phn quyt
nh trn mt nn tng frame-by-frame, khi frame c th xp x 20-40 ms. Mt
on ting ni c cha ting ni hot ng th VAD = 1, cn nu ting ni khng
hot ng hay chnh l nhiu th VAD = 0.
C mt vi thut ton VAD c a ra da trn nhiu c tnh ca tn hiu.
Cc thut ton VAD c a ra sm nht th da vo cc c tnh nh mc nng
lng, zero-crossing, c tnh cepstral, php o khong cch ph Itakura LPC,
php o chu k.
Phn ln cc thut ton VAD u phi i mt vi vn l iu kin SNR
thp, c bit khi nhiu b thay i. Mt thut ton VAD c chnh xc trong
mi trng thay i khng th trong cc ng dng ca Speech enhancement,
nhng vic c lng nhiu mt cch chnh xc l rt cn thit ti mi thi im
khi ting ni hot ng [26].
1.20.2 Qu trnh c lng v cp nht nhiu
Nhiu s c c lng lc ban u bng cch ly trung bnh bin ph
ca tn hiu b nhiu

1
0
) (
1
) (
M
i
i i
Y
M
D
(3.28)
Sau , s dng phng php VAD nhn bit cc frame tip theo, frame
no l frame nhiu v s cp nht nhiu cho cc frame tip theo. c th nhn
SVTH: Nguyn Th Ngc Dip Trang 49
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
bit c frame no l nhiu th chng ta thc hin so snh bin ph ca nhiu
c c lng vi bin ph ca tn hiu b nhiu :

d
D
Y
T
i
i


|
) (
) (
|
2
1
log 20
1
(3.29)
Nu dB T 12 th frame khng phi l frame c ting ni, khi ta c
th cp nht li nhiu c c lng trc .
1.21 Kt lun chng
Ni dung ca chng gip nguyn l chung ca thut ton Spectral
Subtraction v Wiener Filtering. hai thut ton c th thc hin c th cn
phi phn tch tn hiu thnh cc frame v cc frame phi xp chng ln nhau, v
sau khi cc frame c x l trong min tn s v chuyn i v li min thi gian
th cc frame phi c lin kt li vi nhau theo ng phng php tng ng
vi phng php phn tch tn hiu u vo, qu trnh gi l overlap v
adding. Chnh iu s lm cho tn hiu ca chng ta sau khi x l trit nhiu s
khng b mo, m bo cht lng ca ting ni. Ni dung ca chng cng trnh
by vn c lng nhiu, y l ci chnh m speech enhancement cn gii
quyt, n quyt nh tnh hiu qu ca thut ton v cht lng ca ting ni sau
khi x l trit nhiu.
SVTH: Nguyn Th Ngc Dip Trang 50
Chng 3 : Thut ton Spectral Subtraction v Wiener Filtering
SVTH: Nguyn Th Ngc Dip Trang 51
Chng 4: Thc hin v nh gi cc thut ton
CHNG 4: THC HIN V NH GI CC THUT TON
1.22 Gii thiu chng
Da vo l thuyt nghin cu c, chng ny xy dng cc lu
thut ton v thc hin cc thut ton gim nhiu m phng bng Matlab, sau
nh gi cc kt qu thu c ch yu bng phng php nh gi Objective
Measure
SVTH: Nguyn Th Ngc Dip Trang 52
Chng 4: Thc hin v nh gi cc thut ton
1.23 Quy trnh thc hin v nh gi thut ton
Hnh 4.1. S thc hin v nh gi thut ton tng cng
Xy dng thut ton : da trn cc c s ton hc, cc php bin i trong
min thi gian v tn s i vi x l tn hiu s xy dng nn cc thut ton
x l nhiu trong ting ni.
SVTH: Nguyn Th Ngc Dip Trang 53
Xy dng cc thut ton
Trin khai thut ton trn Matlab
Thc hin x l ting ni bng
cc thut ton gim nhiu
Thc hin cc thut ton nh gi
da trn cc kt qu t c sau
khi x l
Nhn xt nh gi
Chng 4: Thc hin v nh gi cc thut ton
Trin khai trn Matlab: t thut ton xy dng c, thc vit m ngun
bng ngn ng lp trnh v s dng cc cng c trn Matlab to nn chng trnh
thc hin x l nhiu trong ting ni trn nn Matlab.
Thc hin x l ting ni bng cc thut ton: thc hin x l trit nhiu
trong cc file m thanh b nhiu bng chng trnh xy dng trn.
Thc hin cc phng php nh gi da trn cc kt qu t c sau khi
x l : sau khi cc file m thanh b nhiu vi cc mc v loi nhiu khc nhau
c x l trit nhiu, cng vi cc file m thanh sch tng ng, ta s dng
cc phng php nh gi ca Speech enhancement thc kim tra, nh gi tnh
hiu ca thut ton.
Nhn xt nh gi: t cc kt qu sau khi thc hin cc phng php nh
gi c trn, a ra cc kt lun nh gi : thut ton no thch hp cho loi
nhiu no, vi mc bao nhiu, thut ton no c kh x l nhiu tt hn trong
mi trng hp.
SVTH: Nguyn Th Ngc Dip Trang 54
Chng 4: Thc hin v nh gi cc thut ton
1.24 Lu thut ton Spectral Subtraction
Hnh 4.2 Lu thut ton SS
SVTH: Nguyn Th Ngc Dip Trang 55
Tnh li mc nhiu N
End
I=I+1;nhp frame tip theo
Begin
Phn chia Frame tn hiu u
vo
Tinh cong suat nhieu trung binh N ban
u
I=0;Nhp frame u tin
VAD
X(:,i)=Beta*Y(:,i)
D=YS(:,i)-N; % Thc hin tr ph
X(:,i)=max(D,0);
Y=bin i FFT cho cc frame
X
=
X =

=
X
=
S

SpeechFlag==0?
S
I<number of frame
Thc hin IFFT v ni cc
frame

Chng 4: Thc hin v nh gi cc thut ton


1.25 Lu thut ton Wiener Filtering
Hnh 4.3 Lu thut ton WF
SVTH: Nguyn Th Ngc Dip Trang 56
Tnh li mc nhiu trung bnh
N
End
I=I+1;nhp frame tip theo
Begin
Phn chia Frame tn hiu u
vo
Tinh cong suat nhieu trung bnh N ban
u
SpeechFlag==0?
I=0;Nhp frame u tin
VAD
Tnh Priori SNR
Y=bin i FFT cho cc frame
Tnh Gain Function G
X(:,i)=G.*Y(:,i);tin hiu sch
S
X
= X
=

I<number of frame
Thc hin IFFT v ni cc
frame X
Chng 4: Thc hin v nh gi cc thut ton
1.26 Thc hin thut ton
Chng ta thc hin x l cc file m thanh b nhiu, vi 2 loi nhiu l
nhiu do ting xe hi v nhiu do ngi ni xung quanh tng ng vi SNR
=10dB
Dng sng v ph ca tn hiu sch:
Hnh 4.4 dng sng v spectrogram ca tn hiu sch
Dng sng v spectrogram ca tn hiu b nhiu xe hi vi SNR = 10dB
- Trc khi x l nhiu:
Hnh 4.5 Dng sng v ph ca tn hiu b nhiu xe hi vi SNR = 10dB
- Sau khi x l trit nhiu bng thut ton Spectral Subtraction
SVTH: Nguyn Th Ngc Dip Trang 57
Chng 4: Thc hin v nh gi cc thut ton
Hnh 4.6 Dng sng v spectrogram ca tn hiu sau khi x l nhiu xe hi
bng SS vi SNR = 10dB.
- Sau khi x l bng thut ton Wiener filtering
Hnh 4.7 Dng sng v spectrogram ca tn hiu sau khi x l nhiu xe hi
bng WF vi SNR = 10dB.
Nhn xt s b
Sau khi nghe cc file m thanh ca tn hiu sch, tn hiu sau khi x l nhiu,
da trn dng sng v spectrogram ca tn hiu sch, tn hiu sau khi x l trit
nhiu bng 2 thut ton SS v WF, ta c th a ra mt s nhn xt nh sau
C hai thut ton u c th x l trit nhiu tt hn mi trng c
SNR cao hn, v x l tt hn i vi tn hiu b nhiu bin i chm
v c phn b u.
SVTH: Nguyn Th Ngc Dip Trang 58
Chng 4: Thc hin v nh gi cc thut ton
C hai thut ton u c tnh hiu qu ging nhau i vi nhiu
mc SNR thp, nhng i vi mi trng c SNR cao hn thi thut
ton Wiener x l trit nhiu tt hn.
Nhn chung th thut ton WF x l trit nhiu tt hn so vi SS
1.27 nh gi cht lng ting ni c x l
1.27.1 C s d liu cho vic nh gi
La 30 cu thoa i c ghi m trong pho ng thi nghi m theo chu n cu a IEEE
[32] la ti n hi u thoa i sa ch. Mi cu trung bnh khong 2s. Ni dung cc cu u c
s cn bng v mt ng m nn c th thy c s tc ng ca thut ton ln tt
c cc m v c th c trong tn hiu thoi
Ca c ti n hi u thoa i o sau o c c ng nhi u va o ( g m co loa i nhi u co
trong th gi i th c, v i ca c ty s SNR kha c nhau. Nh v y ta a co s n ti n hi u
sa ch va ti n hi u bi nhi u theo chu n chung.
Hai loi nhiu c dng l: nhiu xe hi (car noise) c dng lm d liu
chnh x l v nh gi, v nhiu do nhng ngi ni xung quanh (babble
noise) kim tra tc ng ca thut ton trong mi trng nhiu khc, vi cc
SNR 0dB, 5dB, 10dB, 15dB.
Sau khi tng ch t l ng ti ng no i t ca c ti n hi u ti ng no i bi nhi u b ng ca c
thu t toa n a nghin cu l SS v WF, co c ti n hi u ti ng no i a c tng
c ng. Nh v y ta co c c s d li u cho vi c a nh gia ch t l ng cu a ti n hi u
ti ng no i sau khi a c tng c ng.
1.27.2 Tng quan v quy trnh nh gi
nh gi cht lng ting ni sau khi x l s dng c hai phng
php nh gi da trn cht lng do ngi nghe cm nhn c (SE) v nh gi
da trn cc php o thuc tnh ca tn hiu (OE). Trong n ny phng php
nh gi chnh c dng l OE, SE c dng lm phng php nh gi b
sung v c thc hin bi cc thnh vin trong nhm thc hin .
Do c tnh ca cc thut ton gim nhiu c s dng trong ti l c
cc thng s nh hng n cch thc x l nu chnh cc thng s ny ta s c
SVTH: Nguyn Th Ngc Dip Trang 59
Chng 4: Thc hin v nh gi cc thut ton
cc kt qu khc nhau c th tt, c th xu i vi mt file m thanh. c th
c cc thng s tt nht v c cc nhn xt v tnh n nh, thut ton tt hay xu
ta phi thc hin qu trnh tinh chnh thng s c cc kt qu khc nhau t
so snh v a ra cc thng s ti u nht c th. Qu trnh ny l thc hin
nh gi thut ton.
Hnh 4.8 Quy trnh thc hin nh gi
1.27.3 Kim tra tin cy ca cc phng php nh gi
Cc nh gi OE c dng l : SNRseg, IS, LLR, WSS.
Kim tra n nh ca cc phng php nh gi trn bng cch so snh
tn hiu ting ni b nhiu xe hi v nhiu ngi ni xung quanh cha c x l
vi tn hiu sch
SVTH: Nguyn Th Ngc Dip Trang 60
Cc thut ton gim nhiu
Nhn xt
nh gi SE
nh gi OE
Chnh sa cc thng s ca
thut ton gim nhiu
Ting ni c
gim nhiu
Tn hiu sch
Chng 4: Thc hin v nh gi cc thut ton
Hnh 4.9. th kim tra n nh ca nh gi OE i vi nhiu xe hi
Hnh 4.10. th kim tra n nh ca nh gi OE i vi nhiu ngi
ni xung quanh
Kt qu kim tra cho thy
SVTH: Nguyn Th Ngc Dip Trang 61
Chng 4: Thc hin v nh gi cc thut ton
i vi nh gi SNRseg th i ln theo chiu tng dn ca SNR
i vi nh gi LLR, IS v WSS th th c hng i xung v
variance cng gim dn theo chiu tng dn ca SNR chng t ph
ca tn hiu c SNR cao gn vi ph tn hiu sch hn
Qua kim tra thy c cc phng php nh gi trn u n nh v tin
cy thc hin nh gi i vi cc tn hiu ting ni qua x l.
1.27.4 Thc hin nh gi
Trong qu trnh nghin cu v trin khai thut ton ta nhn thy cc thng s
sau nh hng ln n thut ton:
- NoiseMargin :l ngng nhn bit nhiu trong VAD .Mc nh ca
thut ton Noise margin s l 3db.
- IS :h s ch thi gian khng c ting ni u tin trong mi file m thanh
c dng tnh ton nhiu ban u. Do khi kim tra nhng on im lng ban
u trong cc file sch ta nhn thy rng i vi tng file th t 0.15s n 0.2s l
nhng on im lng.Ta la gi tr IS l 0.2
- i vi thut ton WF th ta c thm h s alpha l h s lm trn trong
phng php c lng t s Priori SNR.
-i vi thut ton SS th c h s Gramma l h s quyt nh nhiu s
c tr theo bin hay nng lng. Ta chn gi tr Gramma l 1 tc l thut
ton Subtraction s tr nhiu theo bin .
1.27.4.1 nh gi thut ton vi cc h s d on ban u
H s IS=0.2, NoiseMargin=3
nh gi OE
Sau khi thc hin thut ton SS v WF vi cc thng s alpha=0.9,
gamma=1, NoiseMargin=3,IS=0.2 ta c th ca nh gi bng SNR, LLR, IS,
WSS nh sau
SVTH: Nguyn Th Ngc Dip Trang 62
Chng 4: Thc hin v nh gi cc thut ton
Hnh 4.11 th nh gi Objective vi h s IS=0.2, NoiseMargin=3
Theo th ta c cc nhn xt nh sau :
i vi thng s nh gi SNR cho ta thy t s SNR c tng hn so vi
file cha x l. Chng t thut ton loi tr mt phn nhiu ra khi file sch.
Nhng i vi so snh IS, LLR, WSS th ta li thy file cha x l li c kt qu
tt hn file x l. Do nh gi IS, LLR, WSS l so snh khong cch ph gia
file x l v file sch ri tnh gi tr trung bnh nn ta c th d on l nng
lng ca file x l lch rt nhiu vi file sch c th do thut ton ti hoc l
nng lng tn hiu sch b nn mt phn .
nh gi SE
Sau khi kim tra cc file u ra bng phng php nghe th ta c cc nhn
xt sau y: Mt s file u ra ca cc thut ton SS v WF c mc nn nhiu
khc cao dn ti vic mt mt phn ting ni.
Kt lun v ti u cc thng s cho thut ton VAD
Qua cc nhn xt v nh gi OE v SE ta rt ra kt lun nh sau:
Do thut ton VAD vi cc thng s ra l IS=0.2 v NoiseMargin=3 l
khng tt nn mt phn m thanh b c lng l nhiu nn b thut ton nn i
dn ti vic mt nng lng ca phn m thanh sch.
SVTH: Nguyn Th Ngc Dip Trang 63
Chng 4: Thc hin v nh gi cc thut ton
i vi thng s IS ta phi thay i nh sau : Do on lng trong file sch
ch nm trong khong 0.15s n 0.2s. Nu ta 0.2 l qu ln i vi mt s file
nn mt phn nng lng ting ni trong nhng file nay s c thut ton VAD
xem l nhiu v th mt phn ting ni s b loi b. l mt hn ch ca thut
ton VAD c dng trong ti : gi cng gi tr IS( on im lng) ci t
nhiu l khng ph hp cho tt c mi file m thanh.
i vi thng s NoiseMargin: V ta chn mc ngng nhn bit nhiu l
3dB l kh ln nn tng t nh gi tr IS vi mc ngng nh vy mt phn tn
hiu sch s b loi b do khc gn vi nhiu d IS c ti u th no i na. Qua
thc nghim ta c h s NoiseMargin ti u l 2. l gi tr m tn hiu sch
khng b c lng l nhiu.
Vy cc gi tr ti u cho thut ton VAD l : h s IS phi iu chnh li l
0.15s, h s NoiseMargin l 2.
H s IS=0.15 ,h s NoiseMargin=2
nh gi OE
Sau khi thut hin li thut ton SS v WF vi h s IS=0.15,h s
NoiseMargin=2 ta c th nh gi IS, SNR, WSS, LLS nh sau :
Hnh 4.12 th nh gi Objective vi h s IS=0.15, NoiseMargin=2.
SVTH: Nguyn Th Ngc Dip Trang 64
Chng 4: Thc hin v nh gi cc thut ton
Ta nhn thy thng s SNR tng t nh trng hp IS = 0.2 v
NoiseMargin=2. Nhng ta cc gi tr LLR v IS ca so snh tn hiu x l bng
SS v WF gim, trong gi tr IS gim ng k.c bit vi thut ton SS
gi tr IS xung di ngng ca file nhiu. iu chng t cc thng s ny
tht s tt. Nhng cc gi tr IS cn rt ln i vi thut ton WF v cc mc SNR
0dB v 10dB v cc gi tr IS ca thut ton Wiener vn cn nm trn gi tr IS
ca file cha x l v file sch.
nh gi SE
Sau khi nghe th cc file u ra ca thut ton SS v thut ton WF. Ta nhn
thy thut ton SS tht s lm vic tt h c mc nhiu ca cc file m
thanh. Nhng i vi thut ton WF mc d h c mc nhiu ca cc file m
thanh nhng mt s file vn b mt ting ni iu chng t h s ca thut ton
WF cha tt.
Kt lun
Kt hp gia nhn xt trong OE v SE ta c kt lun l vi h s IS=0.15 v
NoiseMargin=2 th thut ton VAD lm vic tht s ti u cho nhiu xe hi. V h
s ca thut ton Wiener cha ti u chnh l h s alpha.
1.27.4.2 Ti u h s alpha cho thut ton WF
Ta nh gi h s alpha cho thut ton WF qua cc trng hp h s
alpha=0.5, 0.8,0.9 vi IS=0.15 v NoiseMargin = 2 chn ra trng hp tt nht.
nh gi objective
SVTH: Nguyn Th Ngc Dip Trang 65
Chng 4: Thc hin v nh gi cc thut ton
Hnh 4.14 th nh gi objective vi h s alpha=0.5, 0.8,0.9 vi
IS=0.15 v NoiseMargin = 2
Qua th SNR ta nhn thy h s alpha cng ln th mc nhiu b nn cng
ln (t s SNR ln). Qua th IS ta thy h s alpha cng nh th tc ng vo
file t s SNR cng ln cng tt. Gi tr alpha=0.9 tc ng vo file c SNR=10 dB
cho ra file output c khong cch ph xa hn so vi file sch v file nhiu. Cn li
cc gi tr alpha khc v alpha=0.9 vi cc mc file nhiu c t s SNR khc u
cho ra kt qu tt hn so vi file sch v file nhiu.V h s alpha bng 0.5 c v
rt tt trn th is c bit l vi file nhiu c t s SNR=15dB tc ng rt n
nh (variant nh).
nh gi subjective
Qua vic kim tra subjective ta nhn thy vi h s alpha=0.5 tc ng rt n
nh v tt vi file nhiu c mc SNR=15dB cho ra file rt sch. Nhng vi cc
mc dB khc th ko tt bng so vi cc h s alpha khc, nhiu cn tng i
nhiu.i vi h s alpha l 0.9 th vi mc file nhiu c SNR=10dB tc ng
khng tt, mt s file c tn hiu sch cng b nn.
Kt lun
SVTH: Nguyn Th Ngc Dip Trang 66
Chng 4: Thc hin v nh gi cc thut ton
Qua cc nhn xt v nh gi SE v OE ta rt ra kt lun l h s alpha=0.8
l h s ti u nht cho tt c cc trng hp c th n nn nhiu khng nhiu
bng h s alpha nhng khng nn lun tn hiu sch, bo m tn hiu vn cn
nghe tt, nhiu b h xung tng i nhiu.
Ta c thm nhn xt v cch nh gi OE l khng phi lc no cng hon
ton chnh xc nh i vi h s alpha=0.5 trn th IS n l tt nht nhng vi
vic kim tra bng SE th n ch tt nht trong trng hp 15dB hay i vi th
SNR th h s alpha tt nht nhng c mt s trng hp tn hiu sch b nn
lun.
1.27.4.3 H s gamma cho thut ton SS
V thut ton SS l thut ton tr nhiu nn ta c 2 cch tr nhiu l tr theo
nng lng v tr theo bin nn ta cung cp h s gamma nu gamma=1 th tr
theo bin gamma =2 th tr theo nng lng. Sau y ta s nh gi v tm ra
cch tr no l tt nht( gamma=1 hay 2).
nh gi OE
Hnh 4.15 th nh gi objective vi h s gamma = 1 v gamma = 2.
SVTH: Nguyn Th Ngc Dip Trang 67
Chng 4: Thc hin v nh gi cc thut ton
Ta nhn thy i vi c thng s SNR v IS th h s gamma=2 tc l tr
theo nng lng u tt hn ngoi tr i vi file nhiu c SNR l 10dB.V
gamma=1 hay gamma = 2 u a ra th tt hn th gia file nhiu vi file
sch
nh gi SE
Sau khi kim tra SE ta nhn thy rng i vi h s gamma=2 tc l tr theo
nng lng th nhiu b nn rt t, file u ra khng tt bng h s gamma=1.
Kt lun
Sau khi so snh OE v SE ta c kt lun l mc d trn th phn nh h s
gamma=2 tt hn nhng trn thc t th h s gamma=1 mi tt hn.Chng t
vic nh gi OE nh ni trn khng phi lc no cng ng.
Ta chn h s gamma ti u l 1.
1.27.4.4 nh gi thut ton sau khi ti u
Sau khi thc hin mt lot cc h s th nghim ta chn ra h s ti u l :
-Thut ton VAD: h s IS=0.15, NoiseMargin = 2.
-Thut ton WF h s alpha=0.8.
-Thut ton SS tr theo bin .
V vic nh gi OE ch nh gi v mt ton khng phi lc no cng
ng , nh gi OE phi i km vi nh gi SE.
SVTH: Nguyn Th Ngc Dip Trang 68
Chng 4: Thc hin v nh gi cc thut ton
Hnh 4.16 th nh gi vi IS=0.15 NoiMargin= 2 v alpha = 0.8 cho
thut ton WF, gama=1 cho thut ton SS.
1.27.4.5 nh gi n nh ca thut ton trong mi trng nhiu
khc
nh gi OE
Thc hin nghe i vi tn hiu qua x l thy rng mt s file tn hiu c
nhng on ch nghe c nhiu ch khng nghe c ting ni. iu ny c
gii thch l do nhiu ngi ni c nng lng nhiu tng ng vi nng lng
ting ni, trong mt s file th tn hiu ting ni c mc nng lng thp hn mc
nng lng ca nhiu nn on ting ni b tr mt ch cn li nhiu.
th
p dng cc thng s ti u i vi nhiu xe hi cho nhiu ngi ni xung
quanh c th nh gi nh sau
SVTH: Nguyn Th Ngc Dip Trang 69
Chng 4: Thc hin v nh gi cc thut ton
Hnh 4.17 th nh gi OE vi nhiu ngi ni xung quanh.
Nhn xt
Nhn xt theo th i vi c bn php nh gi ta thy i vi nhiu
ngi ni xung quanh th SS c v x l tt hn WF.
Nhng i vi c ba phng php nh gi u tin th c ba gi tr WSS,
LLR, IS ca cc tn hiu c x l so vi tn hiu sch li khng tt bng gi
tr ca tn hiu nhiu cha x l so vi tn hiu sch (so snh ca tn hiu x l
c gi tr ln hn).
Ring vi php nh gi IS ta thy thut ton x l nhiu c tc ng tt i
vi nhiu 0dB v 5dB. Bn cnh variant cn ln v c mt s file c gi tr so
snh ln hn gi tr ca cc file khc rt nhiu (iu ny cng xy ra i vi car
noise) c th hin trong bng gi tr IS [matlab file]. L gii cho iu ny l do
mt s tn hiu b nhiu t bin.
nh gi SE
Khi thc hin nghe i vi cc file m thanh b nhiu ngi ni xung quanh
c x l bng SS v WF th c mt s on ting ni b mt, ch nghe c
nhiu ch khng nghe c ting ni.
SVTH: Nguyn Th Ngc Dip Trang 70
Chng 4: Thc hin v nh gi cc thut ton
iu ny c l gii l do nhiu ngi ni xung quanh c mc nng lng
tng ng vi mc nng lng ca ting ni nn mt s file m thanh c on
ting ni c mc nng lng thp hn mc nng lng ca nhiu th ting ni
s b tr mt ch cn li nhiu.
Nhn xt chung
Khi em cc thng s ti u x l nhiu xe hi p dng vi ngi ni
xung quanh th kt qu khng tt.
i vi nhiu ngi ni xung quanh th thut ton SS tc ng tt hn WF.
1.27.5 Kt lun chng
Qua kt qu nh gi bng OE v SE a ra c kt lun l :
- i vi tng loi nhiu khc nhau th tc ng ca cc thut ton tng
cng l khc nhau.
- i vi tng mc nhiu khc nhau th thut ton cng tc ng cng
khc nhau.
SVTH: Nguyn Th Ngc Dip Trang 71
n tt nghip
TI LIU THAM KHO
[1]. Ramabadran, T.,Ashley, J., and McLaughin, M.(1997), Background noise
suppression for speech enhancement and coding, Proc. IEEE Workshop Speech
Coding Telecommun.
[2]. Ths.Hong L Uyn Thc, Gio trnh x l tn hiu s, i hc Bch Khoa
i hc Nng.
[3].Hu, Y. and Loizou, P(2006), Subjective comparison of speech enhancement
algorithms, Proc. IEEE Int.Conf. Acoust. Speech Signal Process, I.
[4]. Philippos C.Loizou, Speech Enhancement Theory and Practice,pp. 2-7.
[5]. Long, M. (2005), Dinner Conversation (An oxymoron?), Acoustics Today,l(1),
pp. 25-27.
[6]. Lombard, E.(1911), Le signe de lelevation de la voix, Ann. Mal. Oreil.
Larynx.,37, 101-119.
[7]. Nguyn Quc Trung, X l tn hiu s - tp 1, NXB Khoa hc k thut.
[8]. Lim, J. and Oppenheim, A.V.(1979), Enhancement and bandwidth
compression of noisy speech, Proc. IEEE, 67(12),pp. 1586-1604.
[9]. Weiss, M., Aschkenasy, E., and Parsons, T.(1974), Study and the development
of the INTEL technique for improving speech intelligibility, Technical Report
NSC-FR/ 4023.
[10]. Boll, S.F. (1979), Suppression of acoustic noise in speech using spectral
subtraction, IEEE Trans, Acoust. Speech Signal Process.,27(2), 113-120.
[10]. Philippos C.Loizou, Speech Enhancement Theory and Practice,pp. 46-57.
[11] Methods for Subjective Determination of Transmission Quality, ITU_T
Recommendation P.800, August 1996.
[12] Philipos C.Loizou, Speech Enhancement Theory and Practice, CRC Press,
Taylor and Francis Group.
[13] Friedrich Schafer, Artificial Bandwidth Extension of Narrowband Speech,
Signal Processing and Speech Communication Lab, Technical University Graz.
SVTH: Nguyn Th Ngc Dip Trang 72
n tt nghip
[14] Hansen J. and Pellon B. , An effective quality evaluation protocol for Speech
Enhancement algorithms, Proc. Int Conf. Spoken Language Process, 1998.
[15] http://en.wikipedia.org/wiki/Code_Excited_Linear_Prediction
[16] Beey Y. , Shpiro Z. , Simchony T. , Shatz L. and Piasetzky J., An efficient
variable_bit_rate_low_delay (VBR_LP_CELP) code , New York, Marcel Pekker,
1990.
[17] Yi Hu and Philipos C. Loizou, Evaluation of Objective Quality Measures for
Speech Enhancement, IEEE.
[18] Klatt D., Prediction of perceived phonetic distance from critical band
spectra, Proc IEEE Int. Conf. Acoust. Speech Signal Process.
[19] Kitawaki N., Nagabuchi H., and Itoh K., Objective Evaluation for low
bit_rate Speech Coding systems, IEEE J, Sel. Areas Commun.
[20] Quackenbush S., Barnwell T. and Clements M., Objective Measure of
Speech Quality, Englewood Cliffs NJ: Prentic Hall.
[21]. Boll, S.F(1979), Suppression of acoustic noise in speech using spectral
subtraction, IEEE Trans. Acoust. Speech Signal Process., 27(2), 113-120.
[22]. Paliwal, K. and Alsteris, L.(2005), On the usefulness of STFT phase
spectrum in human listening tests, Speech Commun., 45(2), 153-170.
[23]. Weiss, M., Aschkenasy, E., and Parsons, T., (1974), Study and the
Development of the INTEL Technique for Improving Speech Intelligibility,
Technical Report NSC-FR/4023, Nicolet Scientific Corporation.
[24]. Deller, J., Hansen, J.H.L., and Proakis, J. (2000), Discrete time Processing
of Speech Signals, New York : IEEE Press.
[25]. Guastafsson, H., Nordholm, S., and Claesson, I.(2001), Spectral subtraction
using reduced delay convolution and adaptive averaging, IEEE Trans. Speech
Audio Process., 9(8), 799-807.
[26]. Philippos C.Loizou, Speech Enhancement Theory and Practice,pp. 100.
[27]. Paliwal, K. and Alsteris, L.(2005), On the usefulness of STFT phase
spectrum in human listening tests, Speech Commun., 45(2), 153-170.
SVTH: Nguyn Th Ngc Dip Trang 73
n tt nghip
[28]. Lim, Oppenheim, Speech Enhancement Using a Soft-Decision noise
Suppression EEE Trans. Acoustics, Speech and Signal Processing, vol. assp-28,
no. 2, april 1980.
[29]. Y. Ephraim and D. Malah, Speech Enhancement Using a Minimum Mean-
Square Error Short-Time Spectral Amplitude Estimator, IEEE Trans. Acoustics,
Speech and Signal Processing, vol. 32, no. 6, pp. 11091121, December 1984.
[30]. P. Scalart and J. Vieira-Filho, Speech enhancement based on a priori signal
to noise estimation, in Proc. 21st IEEE Int. Conf. Acoust. Speech Signal
Processing, Atlanta, GA, May 1996, pp. 629632.
[31]. Dominic K. C. Ho, Speech Enhancement : concept and methodology, Demo
prepared by Tong Wang, University of Missouri-Columbia.
[32] http://www.utdallas.edu/~loizou/speech/noizeus/
KT LUN N V HNG PHT TRIN TI
Cht lng ca ting ni b suy gim do s tc ng ca nhiu trong mi
trng xung quanh l mt vn quan trng cn phi c gii quyt. Vic tm ra
cc phng php trit nhiu v gim nhiu trong ting ni lun lun ti c
SVTH: Nguyn Th Ngc Dip Trang 74
n tt nghip
quan tm rt nhiu. Trong cc dch v truyn thng vi phng tin ngn ng l
ting ni th vic tng cng, ci thin cht lng ting ni b nhiu l rt thit,
gip cho ngi nghe c th nghe r v ng nhng g ngi ni ni.
n thc hin c cc vn :
- Tm hiu v nghin cu cc phng php ci thin cht lng ting ni,
nhng tp trung vo 2 thut ton c trong Speech enhancement l :
Spectral Subtraction v Wiener Filtering
- Xy dng c chng trnh thc hin x l nhiu trong cc file m
thanh b nhiu da trn 2 thut ton : Spectral Subtraction v Wiener
Filtering.
- Thc hin v nh gi tnh hiu qu ca 2 thut ton trong cc mi
trng nhiu v mc nhiu khc nhau, t a ra cc bin php ti
u ha cc thut ton. Kt qu t c cho thy WF l thut ton gim
nhiu tt hn SS. Cc thut ton gim nhiu c hiu qu khc nhau i
vi tng mi trng nhiu khc nhau
Tuy nhin n vn cha gii quyt ht c cc vn trong Speech
enhancement nn hng pht trin ca ti trong tng lai s l :
- Tm hiu, nghin cu v xy dng cc chng trnh thc hin x l
nhiu trong ting ni da trn cc thut ton khc trong Speech
enhancement.
- Nghin cu v a ra thut ton mi v x l nhiu v trit nhiu trong
Speech enhancement.
Pht trin chng trnh thc hin i vi cc dch v ng dng thi gian
thc v cc dch v trong lnh vc truyn thng a phng tin nh : thoi, m
nhc, truyn hnh hi ngh.
PH LC
Ton b m ngun ca chng trnh thc hin c lu tr trn a CD nh
km.
SVTH: Nguyn Th Ngc Dip Trang 75

You might also like