You are on page 1of 83

TCVN

TI U CHUN QUC GIA

TCVN ............... : 2010


Xut bn ln 1

CNG NGH THNG TIN


B K T M HO CH VIT
Information Technology
Vietnamese Encoded Characters Sets

H NI - 2010

TCVN ................ : 2010


Ni dung

Trang

Li ni u ............................................................................................................................................................ 5
Gii thiu ............................................................................................................................................................... 6
Cng ngh thng tin - B k t m ho ch Vit (UCS).................................................................................... 7
1
Phm vi ...................................................................................................................................................... 7
2
Tun th..................................................................................................................................................... 8
2.1
Chung ................................................................................................................................................... 8
2.2
Tun th trao i thng tin ................................................................................................................... 8
2.3
Tun th ca thit b............................................................................................................................. 8
3
Tham chiu qui chun............................................................................................................................... 9
4
Thut ng v nh ngha......................................................................................................................... 10
5
Cu trc chung ca UCS ........................................................................................................................ 17
6
Cu trc v danh mc c s .................................................................................................................. 18
6.1
Cu trc .............................................................................................................................................. 18
6.2
M ho k t........................................................................................................................................ 19
6.3
Kiu im m...................................................................................................................................... 19
6.3.1
Phn loi........................................................................................................................................ 19
6.3.2
K t ho................................................................................................................................... 20
6.3.3
K t nh dng.............................................................................................................................. 20
6.3.4
K t iu khin ............................................................................................................................. 20
6.3.5
K t dng t ................................................................................................................................. 20
6.3.6
Cc im m thay th................................................................................................................... 20
6.3.7
im m phi k t.......................................................................................................................... 21
6.3.8
im m dnh ring...................................................................................................................... 21
6.4
t tn k t ........................................................................................................................................ 21
6.5
Tn gi ngn cho im m ................................................................................................................ 21
6.6
Tn gi dy UCS................................................................................................................................ 22
6.7
Tn gi dy byte................................................................................................................................. 22
7
Sa i v cp nht UCS ....................................................................................................................... 23
8
Tp con .................................................................................................................................................... 23
8.1
Tp con hn ch................................................................................................................................. 23
8.2
Tp con c la............................................................................................................................... 23
9
Dng m ho UCS.................................................................................................................................. 23
9.1
UTF-8.................................................................................................................................................. 23
9.2
UTF-16................................................................................................................................................ 25
9.3
UTF-32 (UCS-4)................................................................................................................................. 25
10
Lc m ho UCS............................................................................................................................. 25
10.1 UTF-8.................................................................................................................................................. 26
10.2 UTF-16BE........................................................................................................................................... 26
10.3 UTF-16LE ........................................................................................................................................... 26
10.4 UTF-16................................................................................................................................................ 26
10.5 UTF-32BE........................................................................................................................................... 26
10.6 UTF-32LE ........................................................................................................................................... 26
10.7 UTF-32................................................................................................................................................ 27
11
Dng chc nng iu khin vi UCS..................................................................................................... 27
12
Khai bo nhn din tnh nng................................................................................................................. 28
12.1 Mc ch v hon cnh ca nhn din ............................................................................................. 28
2

VNPF

TCVN ................ : 2010


12.2 Nhn din dng m ho ca UCS.................................................................................................... 29
12.3 Nhn din tp con cc k t ho .................................................................................................. 29
12.4 Nhn din tp chc nng iu khin ................................................................................................ 30
12.5 Nhn din h thng m ho ca ISO/IEC 2022............................................................................... 30
13
Cu trc ca s m v danh sch .................................................................................................... 31
14
Tn khi v tuyn tp.............................................................................................................................. 31
14.1 Tn khi .............................................................................................................................................. 31
14.2 Tn tuyn tp...................................................................................................................................... 32
15
K t soi gng trong ng cnh song hng ....................................................................................... 32
15.1 K t soi gng .................................................................................................................................. 32
15.2 Chiu ca vn bn song hng ....................................................................................................... 32
16
Cc k t c bit .................................................................................................................................... 32
16.1 K t du cch .................................................................................................................................... 32
16.2 K hiu tin t...................................................................................................................................... 33
16.3 K t nh dng................................................................................................................................... 33
16.4 K t m t ch biu ....................................................................................................................... 34
16.5 B la bin th v dy bin th............................................................................................................. 34
17
Dng trnh by ca cc k t................................................................................................................... 36
18
K t tng hp....................................................................................................................................... 36
19
Th t ca cc k t................................................................................................................................ 37
20
K t t hp.............................................................................................................................................. 37
20.1 Th t ca k t t hp...................................................................................................................... 37
20.2 Lp t hp v sp th t chnh tc .................................................................................................. 37
20.3 S xut hin trong s m............................................................................................................. 37
20.4 Cc biu din m thay th................................................................................................................. 37
20.5 a k t t hp ................................................................................................................................... 38
20.6 Tuyn tp cha cc k t t hp....................................................................................................... 39
20.7 B ni t v t hp.............................................................................................................................. 39
21
Dng chun ho...................................................................................................................................... 39
22
Tn k t v ch gii................................................................................................................................ 39
22.1 Tn thc th ....................................................................................................................................... 39
22.2 Hnh thnh tn.................................................................................................................................... 40
22.3 Tn n .............................................................................................................................................. 40
22.4 Tnh duy nht ca tn ........................................................................................................................ 41
22.5 Cc tn k t cho ch biu CJK ..................................................................................................... 41
23
Cu trc ca Mt phng a ng c s.................................................................................................. 42
24
Cu trc Mt phng a ng b sung (SMP) ......................................................................................... 44
25
Cu trc ca Mt phng ch biu b sung (SIP)............................................................................... 45
26
Cu trc ca mt phng chuyn dng b sung (SSP) ......................................................................... 45
27
Cc bng k t m ho ch Vit ........................................................................................................... 46
27.1 Ch Quc ng ................................................................................................................................... 46
27.2 Ch Khmer......................................................................................................................................... 52
27.3 Ch Chm.......................................................................................................................................... 53
27.4 Ch Thi (TaiViet)............................................................................................................................ 546
27.5 Ch Hn Nm.................................................................................................................................... 55
28
Tn quc t ca cc k t ch Vit ...................................................................................................... 157
28.1 Tn quc t ca ch Quc ng ...................................................................................................... 157
28.2 Tn quc t ca ch Khmer............................................................................................................ 162
28.3 Tn quc t ca ch Chm............................................................................................................. 165
VNPF

TCVN ................ : 2010


28.4 Tn quc t ca ch Thi (TaiViet)................................................................................................. 167
28.5 Tn quc t ca ch Hn Nm ....................................................................................................... 169
Ph lc A K t m t ch biu ........................................................................................................................ 176
I.1.1 C php ca dy m t ch biu ..................................................................................................... 176
I.1.2 nh ngha ring v k t m t ch biu ......................................................................................... 176
Ph lc B Hng dn t tn k t...................................................................................................................... 179
Ph lc C Th tc thng nht ho v thu xp ch biu CJK ......................................................................... 183
C.1 Th tc thng nht ho.............................................................................................................................. 183
C.1.1 Phm vi thng nht............................................................................................................................. 183
C.1.2 Phn loi hai mc............................................................................................................................... 184
C.1.3 Th tc................................................................................................................................................. 184
C.1.4 V d v khc bit ca cc hnh tru tng ...................................................................................... 185
C.1.5 Khc hnh dng thc ti ..................................................................................................................... 185
C.1.6 Qui tc tch ngun.............................................................................................................................. 186
C.2 Th tc sp xp.......................................................................................................................................... 187
C.2.1 Phm vi sp xp ................................................................................................................................. 187
C.2.2 Th tc................................................................................................................................................. 187
C.3 V d v tch m ngun............................................................................................................................. 188

VNPF

TCVN ................ : 2010

Li ni u
Trung tm Tiu chun Vit Nam chu trch nhim t chc xt duyt v ngh B
Khoa hc Cng ngh ban hnh tiu chun quc gia. Gip vic cho Trung tm tiu
chun Vit Nam v mt cng ngh thng tin l Ban K thut Cng ngh thng tin.
Ban K thut Cng ngh thng tin c Trung tm Tiu chun ngh, Tng cc Tiu
chun o lng Cht lng cng b quyt nh thnh lp.
Tiu chun quc gia c son tho tng ng vi cc qui tc c Trung tm tiu
chun thit lp.
Tiu chun Vit Nam l nhng tiu chun quc gia c nh nc chnh thc ban
hnh v c hiu lc thi hnh trn ton lnh th Vit Nam.
Cc chun quc gia c chnh thc ban hnh u c tn TCVN - s hiu : nm.

VNPF

TCVN ................ : 2010

Gii thiu
TCVN... : 2010 xc nh cc b k t m ho ch Vit ph dng c s dng trn
lnh th Vit Nam v trn ton th gii. N p dng c cho biu din, truyn, trao
i, x l, a vo v trnh by dng vit ca cc ch vit c dng Vit Nam
cng nh cc k hiu ph.
Bng vic xc nh mt cch nht qun cch m ho a ng tun th theo chun
quc t ISO 10646, chun ny to kh nng cho vic trao i d liu ch Vit trn qui
m quc t. Cho ti nay, cc b k t m ho c xc nh trong TCVN ...: 2010
c chp nhn rng ri trong cc giao thc quc t v c ci t trong cc h
iu hnh hin i v ngn ng my tnh. Chun ny bao qut hn 10 000 k t t
cc b ch c bit ti trn lnh th Vit Nam.
Chun ny l mt phn ca tiu chun quc t ISO 10646:2009 v tun theo mi qui
nh ca ISO 10646. Ni ring ton b phn vn bn ca ISO 10646 c dch v
chuyn sang ting Vit. Ton b phn cc b ch m ho ca Vit Nam c trong ISO
10646 u c a vo chun ny. Ch mt s mc v ph lc khng lin quan ti
cc ch Vit Nam l khng c a vo chun ny, nhng chun ny khng ph
nhn cc phn v vn tng hp vi chng nh trong ISO 10646. Tt c cc b
ch khc c xc nh trong ISO 10646 vn hon ton c gi tr s dng chung vi
cc b ch Vit c xc nh trong chun ny, trong khun kh ca chun quc t
ISO 10646 v Unicode.

VNPF

TCVN ................ : 2010

Cng ngh thng tin B k t m ho ch Vit (UCS)


1

Phm vi

TCVN...: 2010 xc nh b k t m ho ch Vit (UCS). n p dng c cho biu


din, truyn, trao i, x l, lu gi, a vo v trnh by dng vit ca cc ngn ng
c dng Vit Nam cng nh cc k hiu ph.
Ti liu ny:

xc nh kin trc ca TCVN...: 2010,

nh ngha cc thut ng c dng trong TCVN...: 2010,

m t cu trc chung ca khng gian m UCS;

xc nh Mt phng a ng c s (BMP) ca UCS,

xc nh cc mt phng b sung ca UCS: Mt phng a ng b sung (SMP), Mt


phng ch biu b dung (SIP), mt phng ch biu th ba (TIP) v mt phng
chuyn dng b sung (SSP),

nh ngha cc k t ho c dng trong cc b ch v dng vit ca cc


ngn ng c dng Vit Nam;

xc nh tn ca cc k t ho v k t dng thc ca BMP, SMP, SIP, TIP,


SSP v cc biu din m ho ca chng trong khng gian m UCS;

xc nh dng biu din m ho cho cc k t iu khin v k t dng ring;

xc nh ba dng m ho ca UCS: UTF-8, UTF-16, v UTF-32;

xc nh by lc m ho ca UCS: UTF-8, UTF-16, UTF-16BE, UTF-16LE,


UTF-32, UTF-32BE, UTF-32LE

xc nh vic qun l nhng b sung tng lai cho b k t m ho ny.

UCS l h thng m ho khc vi h thng m ho c xc nh trong ISO/IEC


2022. Phng php ch ra UCS t ISO/IEC 2022 c xc nh trong 12.2.
K t ho s c gn ch mt im m duy nht trong chun ny, c t hoc
trong BMP hoc mt trong cc mt phng ph.
LU Chun Unicode, Phin bn 5.2 cha tp cc k t, tn v cc biu din m ho ng nht
vi cc k t, tn v cc biu din m ho c xc nh trong chun quc gia ny. N cn cung
cp thm cc chi tit v thuc tnh k t, thut ton x l, v cc nh ngha c ch cho ngi ci
t.

VNPF

TCVN ................ : 2010

Tun th

2.1

Chung

Bt k khi no k t s dng ring c dng nh c xc nh trong TCVN...: 2010,


bn thn cc k t ny s khng phi tun theo cc yu cu tun th ny.

2.2

Tun th trao i thng tin

Phn t d liu k t m ho (phn t d liu CC) trong thng tin c m ho cho


trao i l tun th theo TCVN...:2010 nu
a)

tt c cc biu din m ho ca k t ho bn trong phn t d liu CC tun


th mc 6, tun th mt dng c nh danh, c chn t mc 9, v tun
th lc m ho c nh danh, c chn t mc 10;

b)

tt c cc k t ho c biu din trong phn t d liu CC c ly t


nhng phn t bn trong mt tp con c nhn din (xem mc 8);

c)

tt c cc biu din m ho cho chc nng iu khin bn trong phn t d


liu CC tun theo mc 11.

Tuyn b tun th s nhn din dng c chp nhn, mc ci t c chp nhn


v tp con c chp nhn bng danh sch cc tuyn tp v/hoc cc k t.

2.3

Tun th ca thit b

Mt thit b l tun th theo TCVN...: 2010 nu n tun th cc yu cu ca mc a)


di y, v hoc mt hoc c hai mc b) v c).
Tuyn b tun th s nhn din ti liu c cha m t c xc nh trong a) di
y, v s nhn din (cc) dng m ho c chp nhn, cc lc m ho c
chp nhn, v cc tp con c chp nhn (bng danh sch cc tuyn tp v/hoc k
t), v tuyn la cc chc nng iu khin c chp nhn tng ng vi mc 11.
a)

M t thit b: Mt thit b tun th TCVN...: 2010 s l ch ca mt m t


nhn din phng tin qua ngi dng c th cung cp k t cho thit b
ny v/hoc c th nhn dng chng khi chng c lm thnh sn c cho
ngi dng, khi c xc nh tng ng, trong cc mc con b), v c) di
y.

b)

Thit b ngun gi: Thit b ngun gi s cho php ngi dng cung cp bt
k k t no t mt tp con c chp nhn, v c kh nng truyn cc biu
din m ho ca chng trong phn t d liu CC tng ng vi dng thc m
ho c chp nhn v lc m ho c chp nhn. Nh vy thit b
ngun gi s khng pht i cc phn t d liu CC sai qui cch.

c)

Thit b nhn: Thit b nhn s c kh nng nhn v din gii bt k biu din
m ho no ca k t trong phn t d liu CC tng ng vi dng thc m
ho c chp nhn v lc m ho c chp nhn, v s lm ra bt k k
t tng ng no t tp con c chp nhn sn c cho ngi dng theo
cch ngi dng c th nhn din c chng. Thit b nhn s x l cc phn
t d liu CC sai qui cch nh iu kin li v s khng din gii d liu nh
vy l dy k t.

Bt k k t tng ng no khng trong tp con c chp nhn s c ch ra cho


8

VNPF

TCVN ................ : 2010


ngi dng. Cc thc c dng ch ra chng khng cn phn bit chng vi
nhau.
LU 1 Cch thc theo ngi dng c lu v iu kin li hay k t khng bn trong tp
con c chp nhn khng c xc nh bi chun ny.

Tham chiu qui chun

Cc ti liu tham chiu sau y cha cc iu khon m, qua tham chiu trong vn
bn ny, thit lp nn cc iu khon ca TCVN-...:2010. Vi cc tham chiu c ngy
thng, cc tu chnh v sau, hay cc ci bin, bt k xut bn no trong nhng xut bn
ny u khng p dng. Tuy nhin, cc bn ca nhng tho thun da trn ISO/IEC
10646 c khuyn khch nghin cu v kh nng p dng cho cc ln xut bn gn
nht ca ti liu qui chun c ch ra di y. V cc tham chiu c cp nht,
ln xut bn mi nht ca ti liu c tham chiu s p dng.
ISO/IEC 2022:1994, Information technology Character code structure and
extension techniques.
ISO/IEC 6429:1992, Information technology Control functions for coded character
sets.
Unicode Standard Annex, UAX#9, The Unicode Bidirectional Algorithm, Version 6.0.0.
Unicode Standard Annex, UAX#15, Unicode Normalization Forms, Version 6.0.0.
Unicode Standard Annex, UAX#34, Unicode Named Character Sequences, Version
6.0.
Unicode Technical Standard, UTS#37, Ideographic Variation Database, Version 1.0,
January 2006.
Unicode Standard Annex, UAX#44, Unicode Character Database, Version 6.2.

VNPF

TCVN ................ : 2010

Thut ng v nh ngha

Vi mc ch ca TCVN-....:2010, cc thut ng v nh ngha sau y c p dng.

4.1
K t c s
K t ho khng phi l k t t hp.
Lu - Phn ln cc k t ho u l k t c s. Ngha ny ca t hp ho khng ngn cn
vic trnh by k hiu c s t cc dng ng cnh khc hay t vic tham gia vo nt ch.

4.2
Mt phng a ng c s , BMP
Mt phng 00 ca Nhm 00.

4.3
Khi
Mt min lin tc cc im m c cp cho mt tp cc k t c cc c trng
chung, nh mt b ch; khi n khng chm lp ln khi khc, mt hay nhiu im
m bn trong khi c th khng c k t c cp cho chng.

4.4
Biu din chnh tc
Biu din m vi n cc k t ca b k t m ho ny c xc nh dng bn im
m bn trong khng gian m UCS.

4.5
Phn t d liu CC , Phn t d liu k t m ho , Dy n v m
Phn t thng tin c trao i, c xc nh bao gm mt dy cc n v m,
tng ng vi mt hay nhiu chun c nhn din cho cc tp k t m ho; dy
nh vy c th cha cc n v m lin kt vi bt k kiu im m no.

4.6
K t
Thnh vin ca mt tp cc phn t c dng t chc, iu khin hay biu din
d liu vn bn; mt k t c th c biu din bng mt dy mt hay nhiu k t m
ho.

4.7
Bin k t
Trong mt phn t d liu CC c gii hn gia n v m cui cng ca mt k t
m ho v n v m ho u tin ca k t m ho tip sau.

10

VNPF

TCVN ................ : 2010

4.8
S m, Bng m
Bng ch nht ch ra biu din ca cc k t m ho c cp pht bn trong min
ca khng gian m UCS.

4.9
K t m ho
Lin kt gia mt k t v mt im m.

4.10
Tp k t m ho
Tp cc k t c m ho.

4.11
im m, V tr m
Bt k gi tr no trong khng gian m UCS; thut ng im m c a chung hn.

4.12
n v m
T hp bit ti thiu c th biu din mt n v ca vn bn c m ho dnh cho
x l hay trao i.
LU - V d v cc n v m l byte (n v m 8 bit) c dng trong dng m ho UTF-8, cc
n v m 16 bit trong dng m ho UTF-16, v cc n v m 32 bit trong dng m ho UTF-32.

4.13
Tuyn tp
Tp cc thc th c nh s v t tn; vi mt tuyn tp khng m rng, cc
thc th ny ch bao gm nhng k t m ho c im m nm bn trong mt hay
nhiu min c nhn din (xem 4.24 v tuyn tp m rng).
LU Nu bt k min c nhn din no bao gm cc im m m khng k t no c cp
pht, kho ca tuyn tp ny s thay i nu mt k t b sung c gn cho bt k im m no
trong cc im m vic iu chnh tng lai ca Chun quc gia ny. Tuy nhin iu c d
nh l s hiu v tn gi tuyn tp s vn cn khng i trong cc ln bin tp tng lai ca
Chun quc gia ny.

4.14
K t t hp
Cc k t c gi tr Phn loi chung ca Du t hp dn cch (Mc), Du khng dn
cch (Mn), v Du bao (Me) tng ng vi c s d liu k t Unicode (xem 3).
LU Cc k t ny c d nh cho vic t hp vi k t ho khng t hp ng trc
hay vi dy cc k t t hp c ng trc bi k t khng t hp (xem 4.17).

VNPF

11

TCVN ................ : 2010

4.15
Lp t hp
Gi tr lin kt vi tng k t t hp xc nh tng tc loi hnh ca n v th t
chnh tc ca n bn trong dy cc k t t hp.

4.16
K t tng hp
K t ho c bao hm trong tp k t m ho ca TCVN...: 2010 ch yu dnh
cho vic tng hp vi cc k t m ho c.

4.17
Dy hp thnh
Dy cc k t ho bao gm k t c s theo sau bi mt hay nhiu k t t hp,
ZERO WIDTH JOINER, hay ZERO WIDTH NON-JOINER (cng xem c 4.14).
LU 1 K hiu ho cho mt dy hp thnh ni chung bao gm t hp ca cc k hiu
ho ca tng k t trong dy.
LU 2 Dy hp thnh c th c dng biu din cc k t khng c m ho trong kho
ch ca TCVN...: 2010.

4.18
K t iu khin
Chc nng iu khin c biu din m ho biu th bng mt im m.
LU Mc du k t iu khin thng "c t tn" bng cc thut ng nh DELETE, FORM
FEED, ESC, nhng lng t ny khng tng ng vi tn k t chnh thc. Xem 11 v danh sch
cc tn di c dng bi ISO/IEC 6429 trong lin kt vi cc k t iu khin.

4.19
Chc nng iu khin
Mt hnh ng nh hng ti vic ghi, x l, truyn, hay din gii d liu, v iu
c biu din bi mt phn t d liu CC.

4.20
Trng thi mc nh
Trng thi c gi nh khi khng trng thi no c xc nh tng minh.

4.21
Thit b
Mt cu phn ca thit b x l thng tin m c th truyn v/hoc nhn thng tin m
ho bn trong cc phn t d liu CC. (N c th l thit b vo/ra theo ngha qui c,
hay qui trnh nh chng trnh ng dng hay chc nng ca khu.)

4.22
Dng m ho

12

VNPF

TCVN ................ : 2010


Dng m ho xc nh cch tng im m UCS cho mt k t UCS c din t nh
mt hay nhiu n v m c dng bi dng m ho ny. TCVN-...:2010 xc nh
UTF-8, UTF-16, v UTF-32.

4.23
Lc m ho
Lc m ho xc nh cch chui ho cc n v m t dng m ho thnh tng
byte.
LU Mt s cc lc m ho UCS c cng nhn nh dng m ho UCS. Tuy nhin chng
c dng trong hon cnh khc nhau. Cc dng m ho UCS ni ti biu din trong b nh v
giao din ng dng ca d liu vn bn. Cc lc m ho UCS ni ti d liu vn bn chui
ho theo byte.

4.24
Tuyn tp m rng
Tuyn tp theo cc thc th cng c th bao gm dy cc im m dng chun
ho NFC (xem 21); dy cc im m c tham chiu ti bi Danh nh dy UCS c
tn (NUSI) (xem 12.5).
LU Mt s tuyn tp nh 3 LATIN EXTENDED-A, 4 LATIN EXTENDED-B, 15 ARABIC
EXTENDED v nhiu na c thut ng "extended" trong tn ca chng. iu ny khng lm cho
chng thnh tuyn tp m rng.

4.25
Tuyn tp c nh
Tuyn tp trong mi im m bn trong cc min c nhn din u c k t
c cp pht cho n, v k t ny c d nh vn cn khng i trong cc ln
bin tp tng lai ca Chun ny.

4.26
K t nh dng
K t c chc nng chnh l nh hng ti vic b tr hay x l k t quanh n; ni
chung n khng c biu din thy c ca ring n.

4.27
Phn loi chung, GC
Gi tr c gn cho tng im m UCS, xc nh ra lp chnh ca n, nh ch ci,
du ngt, v k hiu; tng gi tr u c xc nh nh cch vit tt hai ch ci trong
C s d liu Unicode (xem 3).
LU Khi c tham chiu nh mt nhm tt c cc gi tr GC c chung cng ch ci u tin,
nhm ny c th c m t bng vic ch dng ch ci u tin ny. Chng hn, 'L' vit tt cho tt
c cc ch ci 'Lu', 'LI', 'Lt', 'Lm', v Lo'.

4.28
K t ho
Mt k t, khc chc nng iu khin hay k t nh dng, c cch biu din trc quan

VNPF

13

TCVN ................ : 2010


thng c vit tay, c in hay c hin th.

4.29
K hiu ho
Cch biu din trc quan ca k t ho hay ca dy hp thnh.

4.30
im m thay th cao
im m trong min D800 ti DBFF c dnh ring dng UTF-16.

4.31
n v m thay th cao
n v m 16-bit trong min D800 ti DBFF c dng trong UTF-16 nh n v m
i u ca cp thay th (xem 9.2).

4.32
Phn t d liu CC c lp km
Phn t d liu CC ca UCS ng trong dng m ho UCS m khng tun th c
t ca dng m ho (chng hn, n v m thay th khng c cp l mt phn t
d liu m CC c lp km).

4.33
Tp con phn t d liu CC c lp km
Tp con khc rng ca phn t d liu CC X khng cha n v m no m thuc
vo tp con phn t d liu CC c lp tt ti thiu ca X.
LU Tp con phn t d liu CC c lp km khng th chm lp ln phn t d liu CC
c lp tt.

4.34
Trao i ln nhau
Vic truyn d liu m ho k t t ngi dng ny sang ngi dng khc, dng
phng tin vin thng hay phng tin trao i ln nhau c; trao i ln nhau ng
tun t ho d liu v dng lc m ho UCS.

4.35
Lm vic ln nhau
Qui trnh cho php hai hay nhiu h thng, tng h thng s dng cc tp k t m
ho khc nhau, trao i c ngha cc d liu m ho k t; c th bao gm vic
chuyn i gia hai b m.

4.36
ISO/IEC 10646-1
c tham chiu ti l Phn 1 ca ISO/IEC 10646 v cha c t v kin trc tng

14

VNPF

TCVN ................ : 2010


th v Mt phng a ng c s (BMP). C ln xut bn th nht v th hai ca
ISO/IEC 10646-1

4.37
ISO/IEC 10646-2
c tham chiu ti l Phn 2 ca ISO/IEC 10646 v cha c t v Mt phng a
ng b sung (SMP), Mt phng ch biu b sung (SIP) v Mt phng chuyn dng
b sung (SSP). Ch c ln xut bn th nht ISO/IEC 10646-2.

4.38
im m thay th thp
im m trong min DC00 ti DFFF c dnh ring cho vic dng UTF-16.

4.39
n v m thay th thp
n v m 16-bit trong min DC00 ti DFFF c dng trong UTF-16 nh n v m
i sau ca cp thay th (xem 9.2)

4.40
Phn t d liu CC c lp tt ti thiu
Phn t d liu CC c lp tt nh x ti mt gi tr v hng UCS

4.41
K t soi gng
K t c hnh nh c soi gng theo chiu ngang trong vn bn c b tr t phi
sang tri

4.42
Byte
n v m 8-bit; gi tr c biu din theo k php thp lc phn t 00 ti FF trong
UCS (xem Ph lc K)

4.43
Mt phng
Vic phn chia con ca khng gian m UCS cha 65536 im m. Khng gian m
UCS cha 17 mt phng.

4.44
Vic trnh by; trnh by
Qui trnh vit, in hay hin th k hiu ho.

4.45
Dng trnh by
VNPF

15

TCVN ................ : 2010


Trong trnh by mt s ch vit, mt dng k hiu ho biu din cho mt k t m
ph thuc vo v tr ca k t ny tng i vi cc k t khc

4.46
Mt phng s dng t
Mt phng bn trong tp k t m ho ny; ni dung ca n khng c xc nh
trong TCVN-...:2010. Mt phng 0F v 10 l mt phng s dng t

4.47
Kho
Mt tp xc nh cc k t c biu din trong tp cc k t m ho

4.48
Hng
Vic phn chia nh mt mt phng; bng bi s ca 256 im m

4.49
Ch vit
Tp cc k t ho dnh cho dng vit ca mt hay nhiu ngn ng

4.50
Mt phng b sung
Mt phng khc Mt phng 00 ca khng gian m UCS; mt phng cha cc k t
khng c cp pht cho Mt phng a ng c s

4.51
Mt phng a ng b sung cho cc ch vit v k hiu , SMP
Mt phng 01 ca khng gian m UCS

4.52
Mt phng ch biu b sung , SIP
Mt phng 02 ca khng gian m UCS

4.53
Mt phng chuyn dng b sung , SSP
Mt phng 0E ca khng gian m UCS

4.54
Cp thay th
Mt biu din cho ring mt k t c cha dy hai n v m 16-bit, vi gi tr th nht
ca cp l n v m thay th cao v gi tr th hai l n v m thay th thp

16

VNPF

TCVN ................ : 2010

4.55
Mt phng ch biu th ba TIP
Mt phng 03 ca khng gian m UCS

4.56
Khng gian m UCS
Khng gian m UCS bao gm cc s nguyn t 0 ti 10FFFF (h thp lc phn) sn
c cho vic gn kho cc k t UCS

4.57
Gi tr v hng UCS
Bt k im m UCS no ngoi tr cc im m thay th cao v thay th thp

4.58
n v m thay th khng theo cp
n v m thay th trong phn t d liu CC m hoc l n v m thay th cao
khng c theo sau ngay n mt n v m thay th thp, hay n v m thay th thp
khng c trc n ngay mt n v m thay th cao

4.59
Ngi dng
Ngi hay thc th khc gi ti dch v c mt thit b cung cp (Thc th ny c
th l mt qui trnh nh chng trnh ng dng nu "thit b" l b chuyn m hay mt
chc nng ca khu, chng hn.)

4.60
Phn t d liu CC c lp tt
Phn t d liu CC ca UCS c ng trong dng m ho UCS tun th theo c t
ca dng m ho v khng cha tp con phn t d liu CC c lp km

Cu trc chung ca UCS

Cu trc chung ca Tp k t m ho ph dng (sau y c gi l tp k t m


ho ny) c m t trong mc gii thch ny, v c minh ho trong hnh 1. c t
qui chun ca cu trc ny c cho trong cc mc sau.
Dng chnh tc ca tp k t m ho ny cch theo n c quan nim dng
khng gian m UCS c cha cc s nguyn t 0 ti 10FFFF.
TCVN...: 2010 xc nh cc k t ho v biu din m ho ca chng cho cc mt
phng sau:

Mt phng a ng c s (BMP, Mt phng 00).

Mt phng a ng b sung cho cc b ch v k hiu (SMP, Mt phng 01).

Mt phng ch biu b sung (SIP, Mt phng 02).

VNPF

17

TCVN ................ : 2010

Mt phng chuyn dng b sung (SSP, Mt phng 0E).

Mt phng ch biu th ba (TIP, Mt phng 03) c dnh ring v hin thi ang
trng. Cc mt phng t 04 ti 0D c dnh ring cho vic chun ho tng lai.
Mt phng 0F v 10 c dnh cho s dng t.
Cc tp con v khng gian m ho c th c dng cho kho con cc k t
ho.

Cu trc v danh mc c s

6.1

Cu trc

B k t m ho ph dng c xc nh trong TCVN....:2010 s c coi l mt thc


th duy nht c to nn t 17 mt phng.
Cc mt phng b sung (16)

Mt phng s dng t

(Mt phng 10)

Mt phng s dng t

(Mt phng 0F)

Mt phng chuyn dng b sung


Mt phng dnh ring

(Mt phng 04 ti 0D)

Mt phng ch biu th ba TIP

(Mt phng 3)

Mt phng ch biu b sung SIP


Mt phng a ng b sung SMP
Mt phng a ng c s BMP

0000

(Mt phng 0E)

(Mt phng 2)
(Mt phng 1)

(Mt phng 0)

0080

00FF

D7FF
D800..DFFF

Vng thay th

E000..F8FF

Vng s dng t

F900-FFFF

Hnh 1. Mt phng ca tp k t m ho ph dng

18

VNPF

TCVN ................ : 2010

6.2

M ho k t

Tng k t bn trong khng gian m UCS c biu din bng mt s nguyn gia 0
v 10FFF c nhn din nh im m.
Khi mt k t c nhn din di dng n v m ca n, n c biu din bng
mt s nguyn c dng su ch s nh
000030
000041
010000

cho DIGIT ZERO


cho LATIN CAPITAL LETTER A
cho LINEAR B SYLLABLE B008A

Khi tham chiu ti cc k t bn trong mt phng 00, hai ch s u c th c b


i; vi cc k t trong mt phng 01 ti 0F, mt ch s ng u c th c b i
nh
0030
0041
10000

6.3

cho DIGIT ZERO.


cho LATIN CAPITAL LETTER A
cho LINEAR B SYLLABLE B008 A

Kiu im m

6.3.1 Phn loi


Cc im m UCS c phn loi theo cc kiu c s, tng ng vi gi tr phn
loi chung ca chng. Bng 1 tm tt cc kiu:
Bng 1: Kiu im m
Kiu c s

M t vn
tt

Phn
loi
chung

ho

Ch ci,
du hiu,
s, ngt
cu, k
hiu v
du cch

L,M,
N, P,
S, Zs

Dng thc : Khng


thy c nhng nh
hng ti k t bn
cnh
Iu khin: Chc nng
iu khin bao gm
mt im m
S dng t: Vic s
dng c xc nh
bi tho thun t bn
ngoi chun ny
Thay th

VNPF

Trng
thi k
t

Trng
thi
im
m

phn
b
cho k
t

im
m
c
phn
b

Cc
Co

Dnh
ring vnh
vin cho
UTF 16

Cs

19

TCVN ................ : 2010


Phi k t: dnh ring
vnh vin cho s dng
ni b

Cn

Dnh ring

Dnh
ring cho
phn b
tng lai

Khng
phn
b
cho k
t
im
m
khn
g
phn
b

Cc im m thay th, phi k t, v dnh ring khng c phn b cho cc k t v


l ch hn ch trong trao i. Chng hn, im m thay th khng c biu din
c lp tt theo bt k biu din no trong bt k dng m ho UCS no.

6.3.2 K t ho
Cng k t ho s khng c phn b cho qu mt im m. C nhng k t
ho vi hnh dng tng t trong tp k t m ho; chng c dng cho cc mc
ch khc nhau v c cc tn k t khc nhau.

6.3.3 K t nh dng
Cc im m 2060 ti 206F, FFF0 ti FFFC, v E0000 ti E0FFF c dnh ring
cho K t nh dng (xem 16.3 v Ph lc F).
LU Cc im m khng c phn b trong nhng min ny c th c b qua trong x l
v hin th thng thng.

6.3.4 K t iu khin
Cc im m 0000 ti 001F, 007F ti 009F trong BMP c dnh ring cho cc k t
iu khin (xem 11).

6.3.5 K t dng t
Cc im m t E000 ti F8FF trong BMP c dnh ring cho vic s dng t. Tt
c cc im m ca Mt phng 0F v Mt phng 10, ngoi tr FFFFE, FFFFF,
10FFFE, v 10FFFF c dnh ring cho vic dng t.
K t dng t khng b TCVN-...:2010 rng buc theo bt k cch no. K t dng t
c th c dng cung cp cc k t do ngi dng xc nh. Chng hn, y l
yu cu thng thng cho ngi dng ch vit biu .
LU trao i c ngha cc k t dng t, mt tho thun, c lp vi TCVN-...: 2010 l cn
thit gia ngi gi v ngi nhn.

6.3.6 Cc im m thay th
Cc im m D800 ti DFFF c dnh ring cho vic dng dng m ho UTF-16
(xem 9.2). Na th nht (D800 ti DBFF) cha cc im m thay th cao v na th
hai (DC00 ti DFFF) cha cc im m thay th thp.
20

VNPF

TCVN ................ : 2010

6.3.7 im m phi k t
Trng thi ca im m phi k t khng th b thay i bi nhng sa i tng lai.
Cc phi k t bao gm FDD0-FDEF v bt k im m no kt thc bi gi tr FFFE
hay FFFF.
LU im m FFFE c dnh ring cho du hiu. im m FDD0 ti FDEF, v FFFF c
th c dng cho vic dng x l ni b yu cu gi tr s c m bo khng phi l k t c
m ho, nh trong bng kt thc, hay bo hiu ht vn bn. Hn na, v FFFF l gi tr BMP ln
nht, n cng c th c dng nh gi tr cui cng trong ch s tm kim nh phn hay tun t
trong ng cnh ca UTF-16.

6.3.8 im m dnh ring


Cc im m dnh ring c gi cho vic chun ho tng lai v s khng c
dng cho bt k mc ch no khc. Cc ln bin tp tng lai ca TCVN-...:2010 s
khng phn b k t no cho cc im m c dnh ring cho k t dng t hay
dng thc bin i.

6.4

t tn k t

TCVN...:2010 gn tn duy nht cho tng k t ho v k t nh dng. Tn ca mt


k t hoc:
a) k hiu cho ngha theo phong tc ca k t ny, hay
b) m t hnh dng ca k hiu ho tng ng, hoc
c) tun theo qui tc c nu trong mc 22.5 cho ch biu thng nht Trung Quc
/Nht Bn/Hn Quc (CJK),
Qui tc ph c dng xy dng tn k t c cho trong 22.2.
Danh sch cc tn k t, ngoi tr cc ch biu CJK, c cung cp t c s d
liu k t Unicode ti http://www.unicode.org/Public/UNIDATA/NamesList.txt vi c
php c m t trong http://www.unicode.org/Public/UNIDATA/NamesList.html

6.5

Tn gi ngn cho im m

TCVN-....:2010 inh ngha tn gi ngn cho tng im m, k c im m c dnh


ring (cha c phn b). Tn gi ngn cho bt k im m no u c phn bit
vi tn gi ngn cho bt k im m no khc. Nu mt k t c phn b mt
im m, tn gi ngn cho im m c th c dng tham chiu ti k t
c cp ti im m .
LU 1 V d, U+DC00 nhn din im m c dnh ring vnh vin cho UTF-16, v U+FFFF
nhn din im m c dnh ring vnh vin. U+0025 nhn din im m m mt k t s c
cp; U+0025 cng nhn din k t (c tn PERCENT SIGN).
LU 2 Nhng tn gi ngn ny l c lp ngn ng m chun ny c vit, v do vy vn
cn c gi li trong mi bn dch vn bn ny.

Cc dng thay th sau v k php tn gi ngn c nh ngha y:

VNPF

21

TCVN ................ : 2010


a) Dng su ch s ca tn gi ngn s bao gm mt dy su ch s thp lc phn
biu din cho im m ca k t (xem mc 6.2).
b) Dng bn ti nm ch s ca tn gi ngn s bao gm bn ti nm ch s cui
cng ca dng su ch s. Cc s khng ng u bn ngoi bn ch s c
ct b.
c) K t + (PLUS SIGN) c th, nh mt tu chn, ng trc dng ch s ca tn
gi ngn.
d) K t tin t U (LATIN CAPITAL LETTER U) c th, nh mt tu chn, ng
trc bt k dng ba no ca tn gi ngn c xc nh trong a) ti c) trn.
Cc ch hoa t A ti F, v U xut hin bn trong tn gi ngn c th c thay th
bng ch thng tng ng.
C php y ca k php cho tn gi ngn, di dng Backus-Naur, l:
{ U | u } [ {+}(xxxx | xxxxx | xxxxxx) ]
vi x i din cho mt ch s thp lc phn (0 ti 9, A ti F, hay a ti f).
V D:
Tn gi ngn cho LATIN SMALL LETTER LONG c th c vit theo bt k dng no sau y
017F

+017F

U017F

U+017F

Bt k ch hoa no cng c th c thay th bng ch thng tng ng

6.6

Tn gi dy UCS

TCVN-...:2010 xc nh mt tn gi cho bt k dy cc im m c ly ra t chun


ny. Tn gi nh vy c bit ti l Tn gi dy UCS (USI). Vi mt dy gm n
im m n c dng sau:
<UID1, UID2, ..., UIDn>
vi UID1, UID2, v.v. i din cho tn gi ngn ca im m tng ng, theo cng th
t nh cc im m xut hin trong dy. Nu tng im m trong dy c mt k
t c cp cho n, USI c th c dng nhn din dy cc k t c cp ti
cc im m . C php cho UID1, UID2, v.v. c xc nh trong mc 6.5. K t
COMMA (thng c tu chn i theo sau l k t SPACE) phn tch cc UID. Tn gi
dy UCS s bao gm t nht hai UID; n s bt u bng du LESS-THAN SIGN (du
nh hn) v c kt thc bng du GREATER-THAN SIGN (du ln hn).
LU Tn gi dy UCS khng th c dng cho c t tp con v ni dung tuyn tp. Chng
c th c dng bn ngoi chun ny nhn din: dy hp thnh vi mc ch i chiu, kho
font v.v.

6.7

Tn gi dy byte

biu din by c tun t ho trong ng cnh nh ngha lc m ho (xem


10), TCVN-....:2010 xc nh tn gi cho dy cc byte c tun t ho. Vi dy gm
n byte n c dng sau:
<xx1 xx2 xxn>
vi xx1, xx2, v xxn, biu din cho cc byte th nht, th hai v th n bng vic dng
cc ch s h thp lc phn cho tng byte.

22

VNPF

TCVN ................ : 2010

Sa i v cp nht UCS

Vic sa i v cp nht tp k t m ho ny s c thc hin bi TCVN/JTC1.


LU D nh l trong cc ln cng b tng lai ca TCVN-....:2010, tn v vic phn b k t
trong ln cng b ny s vn cn khng thay i.

Tp con

TCVN-...:2010 cung cp c t v cc tp con ca cc k t ho m ho dng


trong trao i, gia thit b gc v thit b nhn.
C hai phng n cho c t tp con: tp con hn ch v tp con c la. Mt tp
con c chp nhn c th bao gm mt trong chng, hay t hp ca c hai.

8.1

Tp con hn ch

Tp con hn ch bao gm danh sch cc k t trong tp con xc nh. c t ny


cho php cc ng dng v thit b c pht trin dng cc m khc lin tc
vi tp k t m ho ny.
Tuyn b v tun th ni ti mt tp con hn ch s lit k ra cc k t ho trong
tp con ny theo tn ca cc k t ho hay im m nh c xc nh trong
TCVN-....:2010.

8.2

Tp con c la

Tp con c la bao gm danh sch cc tuyn tp cc k t ho nh c xc


nh trong TCVN-....:2010. Tp con c la bao gi cng t ng cha cc im m
t 0020 ti 007E.
Tuyn b v tun th ni ti mt tp con c la s lit k ra cc tuyn tp c
chn nh c xc nh trong ISO/IEC 10646.

Dng m ho UCS

TCVN-....:2010 cung cp ba dng m ho bng cch din t tng gi tr v hng


UCS thnh mt dy duy nht gm mt hay nhiu im m. Cc dng m ho ny c
tn l UTF-8, UTF-16, v UTF-32 tng ng.

9.1

UTF-8

UTF-8 l dng m ho UCS gn cho tng gi tr v hng UCS mt dy cc byte


gm mt cho ti bn byte, nh c xc nh trong bng 2.

Cc k t UCS t tuyn tp BASIC LATIN c biu din trong UTF-8 tng ng


vi ISO/IEC 4873, tc l cc byte n vi gi tr chy t 20 ti 7E.

Cc k t iu khin nm trong cc im m t 0000 ti 001F, v k t iu khin


im m 007F, c biu din m khng c cc byte b sung thm nh c
xc nh trong mnh 11, tc l cc byte n c gi tr chy t 00 ti 1F, v 7F
tng ng vi ISO/IEC 4873 v vi cu trc 8-bit ca ISO/IEC 2022.
VNPF

23

TCVN ................ : 2010

Cc gi tr byte 00 ti 7F khng xut hin ch no khc trong biu din m ho


UTF-8 ca bt k k t no. iu ny cung cp s tng hp vi cc h thng x
l tp hin c v cc h con truyn thng vn phn tch cc dy CC thnh cc gi
tr byte ny.

Byte u tin trong biu din m ho UTF-8 ca bt k k t no cng u c th


c nhn din trc tip khi phn t d liu CC c xem xt, mi byte mt lc,
bt u t v tr bt k. N ch ra s cc byte lin tc (nu c) trong dy a byte
thit lp nn biu din n v m ca k t .

Bng 2 xc nh phn b bit cho dng m ho UTF-8, ch ra cc min ca gi tr v


hng UCS tng ng vi cc dy mt, hai, ba v bn bytes.
Bng 2 - Phn b bit UTF-8
Gi tr v hng

Byte th nht

Byte th hai

Byte th ba By th t

00000000
0xxxxxxx

0xxxxxxx

00000yyy
yyxxxxxx

110yyyyy

10xxxxxx

zzzzyyyy yyxxxxx

1110zzzz

10yyyyyy

10xxxxxx

000uuuuu
zzzzyyyy yyxxxxxx

11110uuu

10uuzzzz

10yyyyyy

10xxxxxx

Bi v cc im m thay th khng phi l cc gi tr v hng UCS, bt k dy UTF-8


no m nh x vo cc im m D800-DFFF u l c lp km.
Bng 3 lit k tt c cc min (k c) cc dy byte c lp tt trong UTF-8. Bt k
dy UTF-8 no khng sng ng cc mu c lit k trong bng 3 u l lp km
Bng 3: Dy byte UTF-8 c lp tt
im m

Byte th nht

Byte th hai

Byte th ba By th t

0000-007F

00-7F

0080-07FF

C2-DF

80-BF

0800-0FFF

E0

A0-BF

80-BF

1000-CFFF

E1-EC

80-BF

80-BF

D000-D7FF

ED

80-9F

80-BF

E000-FFFF

EE-EF

80-BF

80-BF

10000-3FFFF

F0

90-BF

80-BF

80-BF

40000-FFFFF

F1-F3

80-BF

80-BF

80-BF

100000-10FFFF

F4

80-8F

80-BF

80-BF

Xem nh h qu ca iu kin lp tt c xc nh trong bng 9.2, cc gi tr byte


sau y l khng c php trong UTF-8: C0-C1, F5-FE

24

VNPF

TCVN ................ : 2010

9.2

UTF-16

UTF-16 l dng m ho UCS gn cho tng gi tr v hng UCS mt dy gm mt


hay hai n v m 16-bit khng du, nh c xc nh trong bng 4.
Trong dng m ho UTF-16, cc im m trong min 0000-D7FF v E000-FFFF c
biu din nh mt n v m 16-bit ring; cc im m trong min 10000-10FFFF
c biu din nh mt cp cc n v m 16-bit. Cc cp ny ca cc n v m
c bit c bit ti nh cc cp thay th.
Gi tr ca cc n v m c dng cho cc cp thay th l khng trng nhau vi cc
n v m c dng cho biu din n v m ring, do vy duy tr vic khng chm
lp cho mi biu din im m trong UTF-16.
UTF-16 ti u vic biu din cc k t trong BMP c cha i a s cc k t thng
c dng.
Bi v cc im m thay th khng phi l cc gi tr v hng UCS, cc n v m
thay th khng i cp i u l lp km.
Bng 4 xc nh phn b bit cho dng m ho UTF-16. Tnh ton v cc gi tr cp
thay th bao gm vic tr i 10000 tnh khong chnh bt u ca gi tr v hng
(c din t l wwww = uuuuu-1 trong bng).
Bng 4: Phn b bit UTF-16
Gi tr v hng
xxxxxxxxxxxxxxxxx

UTF-16
xxxxxxxxxxxxxxxxx

000uuuuuxxxxxxxxxxxxxxxxx 110110wwwwxxxxxx 110111xxxxxxxxxx


LU Phin bn trc ca ISO 10646 bao gm cc tham chiu ti dng BMP hai byte c gi
l UCS-2 m chnh l tp con ca dng m ho UTF-16 c hn ch cho cc gi tr v hng
UCS BMP. Dng UCS-2 nay b phn i.

9.3

UTF-32 (UCS-4)

UTF-32 (hay UCS-4) l dng m ho UCS gn cho tng gi tr v hng UCS mt


n v m 32-bit khng du duy nht. Thut ng UTF-32 v UCS-4 c th c dng
i ln cho nhau ch dng m ho ny.
Bi v cc im m thay th khng phi l cc gi tr v hng UCS, cc n v m
UTF-32 trong min 0000D800 - 0000DFFF u l lp km.

10

Lc m ho UCS

Cc lc m ho l cc tun t ho theo byte chuyn dng cho tng dng m ho


UCS, k c c t ca du hiu, nu c php. Du hiu l dy n v m tng
ng vi im m FEFF ZERO WIDTH NO-BREAK SPACE trong dng m ho tng
ng. Khi c dng, du hiu u lung cc byte tun t ho ch ra trt t ca cc
byte bn trong dng m ho c dng cho vic biu din cc k t.
TCVN-....:2010 xc nh cc lc m ho: UTF-8, UTF-16BE, UTF-16LE, UTF-16,
UTF-32BE, UTF-32LE, v UTF-32.

VNPF

25

TCVN ................ : 2010

10.1 UTF-8
Lc m ho UTF-8 tun t ho mt n v m UTF-8 theo ch xc cng th t
nh bn thn dy n v m.
Khi c biu din trong UTF-8, du hiu bin thnh dy byte <EF BB BF>. Vic s
dng ca n ch bt u lung d liu UTF-8 c cn ti hay khuyn co nhng
khng nh hng ti s tun th.

10.2 UTF-16BE
Lc m ho UTF-16BE tun t ho phn t d liu CC UTF-16 theo cch byte
c ngha hn i trc byte km ngha hn (cng cn c bit ti nh l trt t
u ln).
Trong UTF-16BE, dy byte khi u ca <FE FF> c din gii l FEFF ZERO
WIDTH NO-BREAK SPACE v khng truyn ngha ca du hiu.

10.3 UTF-16LE
Lc m ho UTF-16LE tun t ho phn t d liu CC UTF-16 theo sp th t
cc byte theo cch byte t ngha hn i trc by nhiu ngha hn (cng c bit
ti nh th t u b).
Trong UTF-16LE, dy byte khi u ca <FF FE> c din gii l FEFF ZERO
WIDTH NO-BREAK SPACE v khng truyn t ngha du hiu.

10.4 UTF-16
Lc m ho UTF-16 tun t ho mt phn t d liu CC UTF-16 bng vic sp
th t cc byte theo cch hoc byte c ngha t i trc hay theo sau byte c ngha
hn.
Trong lc m ho UTF-16, du hiu khi u c c l <FE FF> ch ra rng
byte c ngha hn i trc byte t ngha hn, cn <FF FE> ch ra iu ngc li.
Du hiu ny khng phi l mt phn ca d liu vn bn.
Nu khng c du hiu, th t byte ca lc m ho UTF-16 l byte c ngha
hn i trc byte t ngha hn.

10.5 UTF-32BE
Lc UTF-32BE tun t ho mt phn t d liu CC bng vic sp th t cc
byte theo cch byte c ngha hn i trc byte t ngha hn (cng cn c bit
nh th t u ln).
Trong UTF-32BE, dy byte khi u <00 00 FE FF> c din gii l FEFF ZERO
WIDTH NOBREAK SPACE v khng truyn t ngha du hiu.

10.6 UTF-32LE
Lc UTF-32LE tun t ho mt phn t d liu CC bng vic sp th t cc byte
theo cch byte t ngha hn i trc byte nhiu ngha hn (cng cn c bit
nh th t u b).
Trong UTF-32BE, dy byte khi u <FE FF 00 00> c din gii l FEFF ZERO
26

VNPF

TCVN ................ : 2010


WIDTH NOBREAK SPACE v khng truyn t ngha du hiu.

10.7 UTF-32
Lc m ho UTF-32 tun t ho dy n v m UTF-32 bng vic sp th t cc
byte theo cch hoc byte t ngha hn i trc hay i sau byte nhiu ngha hn.
Nu khng c du hiu, th t byte ca lc m ho UTF-32 l byte c ngha
hn i trc byte t ngha hn.

11

Dng chc nng iu khin vi UCS

Tp k t m ho ny cung cp vic dng chc nng iu khin c m ho tng


ng theo ISO/IEC 6429 hay cc chun c cu trc tng t i vi chc nng iu
khin, v cc chun c suy dn ra t nhng chun ny. Tp hay tp con ca
nhng chc nng iu khin c m ho nh vy c th c dng gn vi tp k
t m ho ny. Cc chun ny m ho chc nng iu khin nh mt dy gm mt
hay nhiu byte.
Khi mt k t iu khin ca ISO/IEC 6429 c dng vi tp k t m ho ny, biu
din m ho ca n nh c xc nh trong ISO/IEC 6429 s c b sung thm
tng ng vi s byte trong n v m ca dng m ho c chp nhn (xem 9).
Vy, byte t ngha s l t hp bit c xc nh trong ISO/IEC 6429, v cc byte
nhiu ngha hn s l cc s khng.
Chng hn, k t iu khin FORM FEED c biu din bi 000C trong dng m
ho UTF-16, v 0000 000C trong dng m ho UTF-32.
Vi cc dy thot, dy iu khin, v xu iu khin (xem ISO/IEC 6429) bao gm
mt k t iu khin c m ho theo sau bi cc t hp bit ph trong min 20 ti
7F, tng t hp bit s c b sung thm cc byte c gi tr 00.
Chng hn, dy thot ESC 02/00 04/00 c biu din bi 1B 20 40 trong dang
m ho UTF-8, bi 001B 0020 0040 trong dng m ho UTF-16, v 0000001B
00000020 00000040 trong dng m ho UTF-32.
LU 1 Thut ng k t xut hin trong nh ngha ca nhiu chc nng iu khin c xc
nh trong ISO/IEC 6429, nhn din cc phn t trn chc nng iu khin s tc ng. Khi
cc chc nng iu khin nh vy c p dng cho cc k t m ho tng ng vi TCVN...:2010 hnh ng ca nhng chc nng iu khin s ph thuc vo kiu phn t t TCVN...:2010 c chn, bi ng dng, l phn t (hay k t) trn chc nng iu khin tc
ng. Cc phn t ny c th c chn l cc k t (k t phi t hp v/hoc k t t hp) hay c
th c chn theo cc cch khc (nh dy hp thnh) khi p dng c.

Cc chc nng iu khin m rng m cho cc k thut m rng m ISO/IEC 2022


(nh dy thot ch nh, dch chuyn n, v dch chuyn kho) s khng c dng
vi tp k t m ho ny.
LU 2 Danh sch sau y cung cp cc tn di t ISO/IEC 6429 c dng trong lin h vi
cc k t iu khin.

0000 NULL

0003 END OF TEXT

0001 START OF HEADING

0004 END OF TRANSMISSION

0002 START OF TEXT

0005 ENQUIRY

VNPF

27

TCVN ................ : 2010


0006 ACKNOWLEDGE

0084 INDEX

0007 BELL

0085 NEXT LINE

0008 BACKSPACE

0086 START OF SELECTED AREA

0009 CHARACTER TABULATION

0087 END OF SELECTED AREA

000A LINE FEED

0088 CHARACTER TABULATION SET

000B LINE TABULATION

0089 CHARACTER TABULATION WITH


JUSTIFICATION

000C FORM FEED


000D CARRIAGE RETURN
000E SHIFT-OUT
000F SHIFT-IN
0010 DATA LINK ESCAPE
0011 DEVICE CONTROL ONE
0012 DEVICE CONTROL TWO
0013 DEVICE CONTROL THREE
0014 DEVICE CONTROL FOUR
0015 NEGATIVE ACKNOWLEDGE
0016 SYNCHRONOUS IDLE
0017 END OF TRANSMISSION BLOCK
0018 CANCEL
0019 END OF MEDIUM
001A SUBSTITUTE
001B ESCAPE
001C INFORMATION SEPARATOR
FOUR
001D INFORMATION SEPARATOR
THREE
001E INFORMATION SEPARATOR TWO
001F INFORMATION SEPARATOR ONE
007F DELETE
0082 BREAK PERMITTED HERE

008A LINE TABULATION SET


008B PARTIAL LINE FORWARD
008C PARTIAL LINE BACKWARD
008D REVERSE LINE FEED
008E SINGLE-SHIFT TWO
008F SINGLE-SHIFT THREE
0090 DEVICE CONTROL STRING
0091 PRIVATE USE ONE
0092 PRIVATE USE TWO
0093 SET TRANSMIT STATE
0094 CANCEL CHARACTER
0095 MESSAGE WAITING
0096 START OF GUARDED AREA
0097 END OF GUARDED AREA
0098 START OF STRING
009A SINGLE CHARACTER
INTRODUCER
009B CONTROL SEQUENCE
INTRODUCER
009C STRING TERMINATOR
009D OPERATING SYSTEM COMMAND
009E PRIVACY MESSAGE
009F APPLICATION PROGRAM
COMMAND

0083 NO BREAK HERE


K t iu khin 0084 INDEX c loi b khi ISO/IEC 6492:1992. Thm vo , cc k t iu
khin 000E v 000F c t tn l SHIFT-OUT v SHIFT-IN tng ng trong mi trng 7-bit v
LOCKING-SHIFT ONE v LOCKINGSHIFT ZERO tng ng trong mi trng 8-bit.

12

Khai bo nhn din tnh nng

12.1 Mc ch v hon cnh ca nhn din


Phn t d liu CC tun th theo TCVN-...:2010 c d nh hnh thnh tt c hay
mt phn ca n v hp thnh ca thng tin m ho c trao i gia ngun pht
v ngun nhn. Vic nhn din ca TCVN-...:2010 (k c dng m ho v lc m

28

VNPF

TCVN ................ : 2010


ho) v bt k tp con no ca khng gian m ho m c ngun pht chp nhn
cng u phi c sn cho ngun nhn. Con ng qua vic nhn din nh vy
c trao i cho bn nhn l ngoi phm vi ca TCVN-...:2010.
Tuy nhin, mt s chun cho trao i thng tin m ho c th cho php, hay yu cu,
rng biu din m ho ca vic nhn din p dng c cho phn t d liu CC to
thnh mt phn ca thng tin c trao i. Mc ny xc nh mt biu din m ho
cho vic nhn din UCS v tp con ca TCVN-...:2010, v cng ca tp C0 v C1 cc
chc nng iu khin t ISO/IEC 6429 dng tip ni vi TCVN-...:2010. Cc biu
din m ho nh vy cung cp tt c hay mt phn ca phn t d liu nhn din,
m c th c a vo trong trao i thng tin tng ng vi chun c lin quan.
Trong ng cnh ca nhng nhn din ny, bi v cc byte c ngha hn i trc cc
byte t ngha hn khi c tun t ho, cc lc m ho duy nht m c th
c la l UTF-8, UTF-16BE, v UTF-32BE tng ng vi cc dng m ho c lin
quan (UTF-8, UTF-16, v UTF-32 tng ng).
Nu hai hay nhiu nhn din ang c, trt t ca nhng nhn din ny s tun theo
th t nh c xc nh trong mc ny.
LU Phng php thay phin cho nhn din c m t trong ph lc N.

12.2 Nhn din dng m ho ca UCS


Khi dy thot t ISO/IEC 2022 c s dng, vic nhn din dng m ho UCS (xem
9) c xc nh bi TCVN-....:2010 s l dy ch nh c chn t danh sch sau:
ESC 02/05 02/15 04/09
Dng m ho UTF-8; lc m ho UTF-8
ESC 02/05 02/15 04/12
Dng m ho UTF-16; lc m ho UTF-16BE
ESC 02/05 02/15 04/06
Dng m ho UTF-32; lc m ho UTF-32BE
LU 1 Dy ch nh ch c h tr theo nh ngha ni dung phn t d liu CC.
LU 2 Dy thot sau cng c th c dng:
ESC 02/05 04/07
Dng m ho UTF-8; lc m ho UTF-8
Dy thot c dng cho li h thng m ho ca ISO/IEC 2022 khng c chp thm (xem
12.5).

Nu dy thot nh vy xut hin bn trong phn t d liu CC tun th theo ISO/IEC


2022, n s ch bao gm cc dy t hp bit nh c nu trn.
Nu dy thot nh vy xut hin bn trong mt phn t d liu CC tun th theo
TCVN-....:2010, n s c chp thm tng ng vi mc 11.

12.3 Nhn din tp con cc k t ho


Khi dy iu khin ca ISO/IEC 6429 c dng, vic nhn din cc tp con (xem 8)
c xc nh bi TCVN-....:2010 s l bi dy iu khin IDENTIFY UNIVERSAL
CHARACTER SUBSET (IUCS) nh c nu di y.

VNPF

29

TCVN ................ : 2010


CSI Ps... 02/00 06/13
Ps... ngha l c th c bt k s cc tham bin la chn no. Cc tham bin c ly
t s tuyn tp cc tp con nh c ch ra trong Ph lc A ca TCVN-....:2010. Khi
c nhiu tham bin, tng gi tr tham bin c phn tch bi mt byte c gi tr
03/11.
Cc gi tr tham bin c biu din bi cc ch s c gi tr byte 03/00 ti 03/09
biu din cho cc ch s 0 ti 9.
Nu dy thot nh vy xut hin trong phn t d liu CC tun th theo ISO/IEC
2022, n s ch bao gm dy cc t hp bit nh c nu trn.
Nu dy thot nh vy xut hin trong phn t d liu CC tun th theo TCVN....:2010, n s c chp thm tng ng vi mc 11.

12.4 Nhn din tp chc nng iu khin


Khi cc dy thot t ISO/IEC 2022 c dng, vic nhn din cho tng tp cc chc
nng iu khin (xem mc 11) ca ISO/IEC 6429 c dng tip ni vi TCVN....:2010 s l mt dy tn gi c kiu c nu di y.
ESC 02/01 04/00

nhn din ton b tp C0 ca ISO/IEC 6429

ESC 02/02 04/03

nhn din ton b tp C1 ca ISO/IEC 6429

Vi cc tp C0 hay C1 khc, byte cui cng F s thu c t ng k quc t cho


Tp k t m ho - International Register of Coded Character Sets. Dy tn gi cho
cc tp ny s l
ESC 02/01 F

ng nht vi tp C0

ESC 02/02 F

ng nht vi tp C1

Nu dy thot nh vy xut hin bn trong phn t d liu CC tun th theo ISO/IEC


2022, n s ch bao gm dy cc t hp bit nh c nu trn.
Nu dy thot xut hin bn trong phn t d liu CC tun th theo TCVN-....:2010,
n s c chp thm tng ng vi mc 11.

12.5 Nhn din h thng m ho ca ISO/IEC 2022


Khi cc dy thot t ISO/IEC 2022 c dng, vic nhn din tr li, hay truyn, t
UCS sang h thng m ho ca ISO/IEC 2022 s bng dy thot ESC 02/05 04/00.
Nu dy thot nh vy xut hin bn trong phn t d liu CC tun th theo TCVN....:2010, n s c chp thm tng ng vi mc 11.
Nu dy thot nh vy xut hin bn trong phn t d liu CC tun th ISO/IEC
2022, n s ch bao gm dy cc t hp bit nh c nu trn.
LU Dy thot ESC 02/05 04/00 thng thng c dng tr li trang thi c khi
phc ca ISO/IEC 2022. Dy thot ESC 02/05 04/00 xc nh y i khi khng ch xc nh
c xc nh trong ISO/IEC 2022 do s hin din ca cc byte chp thm. Bi l do ny dy thot
trong mc 12.2 nhn din UCS c cha byte 02/15 ch ra rng vic tr li khng phi bao gi
cng tun th chun .

30

VNPF

TCVN ................ : 2010

13

Cu trc ca s m v danh sch

Mc 30 lp ra s m chi tit v danh sch cc tn k t cho k t ho. N xc


nh cc k t ho, biu din m ho ca chng, v tn k t cho tng k t.
LU Mc 30 cng c cha c thng tin ph v cc k t lm r rng mt s tnh nng ca k t,
nh vic t tn v cch dng n hay k hiu ho lin kt ca n.

K hiu ho c xem nh biu din trc quan in hnh ca cc k t. TCVN....:2010 khng nh m t trc ch xc hnh dng ca tng k t. Hnh dng ny b
nh hng bi thit k ca font c s dng, iu bn ngoi phm vi ca TCVN....:2010.
Cc k t c xc nh trong TCVN-....:2010 c nhn din duy nht theo tn ca
chng. iu ny khng ng rng k hiu ho m chng thng c to nh bao
gi cng khc nhau. V d v k t ho vi cc k hiu ho tng t l LATIN
CAPITAL LETTER A, GREEK CAPITAL LETTER ALPHA v CYRILLIC CAPITAL
LETTER A.
Ngha c gn cho mi k t u khng c TCVN-....:2010 xc nh; n c th
khc gia cc loi ch vit, hay gia cc ng dng.
Vi cc b ch theo ch ci, nguyn tc chung l sp xp cc k t bn trong hng
theo trnh t ch ci xp x; nhng b ch c ch hoa v ch thng, cc ch ny
c sp theo cp. Tuy nhin, nguyn tc chung ny trong vi trng hp cng b b
qua. Chng hn, vi cc b ch c trong cc chun c lin quan, cc k t c
cp pht tng ng theo chun . Vic b tr ny bn trong s m s h tr cho
chuyn i gia cc chun hin c v tp k t m ho ny. Tuy nhin, ni chung
ngi ta d kin rng chuyn i gia tp k t m ho ny v bt k tp k t m ho
no khc s dng k thut bng tra.
iu khng c d nh, m cng khng thng xy ra, l cc k t c cn ti
bi bt k ngi dng no s c tm thy tt c c gp nhm cng nhau trong
mt phn ca s m ny.
Hn na, ngi dng bt k b ch no s thy rng cc k t c cn ti c th
c m ho u trong tp k t m ho ny. iu ny c bit p dng cho
ch s, cho k hiu, v cho vic dng cc ch Latin trong cc ng dng bng ch
kp.
Do , trong khi dng tp k t m ho ny, c gi c khuyn nn tham chiu ti
danh sch cc tn khi trong tng quan v cc mt phng trong hnh 3 ti 7, v ri
quay sang s m ring cho b ch lin quan v cho cc k hiu v ch s.

14

Tn khi v tuyn tp

14.1 Tn khi
Cc khi c tn cc im m lin tc c xc nh bn trong mt mt phng vi mc
ch cp pht cc k t c chung mt s c trng chung, nh mt b ch vit. Cc
khi c xc nh bn trong BMP, SMP, SIP v SSP c minh ho trong cc hnh
2 ti 6.
Qui tc c dng xy dng cc tn khi c nu trong 24.4.1.

VNPF

31

TCVN ................ : 2010

14.2 Tn tuyn tp
Qui tc c dng xy dng tn ca tuyn tp c nu trong 24.4.2.

15

K t soi gng trong ng cnh song hng

15.1 K t soi gng


Lp cc k t c ngha c bit trong ng cnh ca vn bn song hng. Din gii
v ti to bt k trong nhng k t ny tu thuc vo chiu ca k t ang c ti to
m c hiu lc ti im trong phn t d liu CC, ti biu din m ho ca k t
ny xut hin. Danh sch cc k t ny c xc nh bng vic c thuc tnh
Bidi_Mirrored c t l Y trong c s d liu k t Unicode (xem 3).
LU 1 V cn bn, k t soi gng c nh ca n c soi gng theo chiu ngang trong vn
bn c t t phi sang tri. Tuy nhin, vi mt s k hiu ton hc, dng soi gng khng phi
l hnh nh hng ch xc. Xem Bo co k thut Unicode #25, H tr ca Unicode cho Ton
hc bit thm chi tit.

Vic soi gng k t ny khng b gii hn vo cc k t c cp v s c p dng


cho mi k t thuc vo lp .
V D
Trong on vn bn phi sang tri, GREATER-THAN SIGN (c ti to l ">" trong vn bn tri
sang phi) c th c ti to l k hiu ho "<".
LU 2 Nhiu b ch vit c v mt s b ch vit ang dng hin i c th c vit hoc t
phi sang tri hoc t tri sang phi. Thng tc l cho mt trong nhng b ch ny l dng k
hiu ho soi gng thch hp cho bt k k t no c biu din bi k hiu ho m khng
i xng quanh trc ng. Trong nhng trng hp nh vy, iu tu thuc vo h thng ti
to cho hin th hnh nh ho thch hp vi chiu vit c s dng. Tnh chiu ca k hiu
ho biu din c nu trong s m k t snh ng vi chiu vit mc nh cho b ch vit .
Cc k t thuc vo nhng b ch vit ny u c tnh cht Bidi_Mirrored c t l N trong c
s d liu Unicode (xem 3).
Cc v d v nhng b ch nh vy bao gm, nhng khng b gii hn, vo Old Italic, b ch c
m vi chng chiu vit mc nh trong chun ny l tri sang phi, v Cypriot, b ch c m vi
n chiu vit mc nh trong chun ny l phi sang tri.

15.2 Chiu ca vn bn song hng


Thut ton song hng Unicode (xem 3) m t thut ton c dng xc nh
chiu cho vn bn song hng.

16

Cc k t c bit

C cc k t khng c k hiu ho in c hay l c bit theo cch no .

16.1 K t du cch
Cc k t sau y l cc k t du cch. Chng biu din cho mi k t c gi tr phn
loi chung c t l Zs.
im m

32

Tn

0020

SPACE

VNPF

TCVN ................ : 2010


00A0

NO-BREAK SPACE

2006

SIX-PER-EM SPACE

1680

OGHAM SPACE MARK

2007

FIGURE SPACE

180E

MONGOLIAN VOWEL
SEPARATOR

2008

PUNCTUATION SPACE

2009

THIN SPACE

200A

HAIR SPACE

202F

NARROW NO-BREAK SPACE

205F

MEDIUM MATHEMATICAL
SPACE

3000

IDEOGRAPHIC SPACE

2000

EN QUAD

2001

EM QUAD

2002

EN SPACE

2003

EM SPACE

2004

THREE-PER-EM SPACE

2005

FOUR-PER-EM SPACE

LU 1 K t 180E MONGOLIAN VOWEL SEPARATOR c th c dng gia MONGOLIAN


LETTER A hay MONGOLIAN LETTER E ti cui ca t v ch ci ph m i trc. N ch ra mt
dng c bit ca k hiu ho cho ch A hay E v ph m i trc. Khi c ti to di dng
trc quan n ni chung ch ra khong cch hp gia casdc ch ci, nhng i khi n c th c
ch ra nh mt k hiu ho phn bit tr gip cho ngi dng.
LU 2 K t 202F NARROW NO-BREAK-SPACE l du cch khng ngt. N tng t vi
00A0 NO-BREAK SPACE, ngoi tr rng n c ti to vi chiu rng hp hn. Khi c dng
vi b ch Mng C k t ny thng c ti to mt phn ba chiu rng ca du cch thng
thng, v n tch phn hu t khi chn t Mng C. iu ny cho php cc qui tc thng
thng ca vic hnh thnh k t Mng C c p dng, trong khi vn ch ra rng khng c bin
t ti v tr .

16.2 K hiu tin t


K hiu tin t trong TCVN-....:2010 khng nht thit ng nht vi k hiu tin t ca
mt nc. Chng hn, YEN SIGN c th c dng cho ng Yn Nht Bn v Yuan
Trung Quc. Cng vy, DOLLAR SIGN c dng trong mt s nc k c M.
Ni ring, k hiu tin t cho ng ca Vit Nam c cp im m 20AB.

16.3 K t nh dng
Cc k t sau y l cc k t nh dng (xem 6.3.3). Chng biu din cho tt c cc
k t c gi tr Phn loi chung c t l Cf, Zl, v Zp. Xem ph lc F.
im m Tn

200B

ZERO WIDTH SPACE

00AD

SOFT HYPHEN

200C

ZERO WIDTH NON-JOINER

0600

ARABIC NUMBER SIGN

200D

ZERO WIDTH JOINER

0601

ARABIC SIGN SANAH

200E

LEFT-TO-RIGHT MARK

0602

ARABIC FOOTNOTE MARKER

200F

RIGHT-TO-LEFT MARK

0603

ARABIC SIGN SAFHA

2028

LINE SEPARATOR

06DD

ARABIC END OF AYAH

2029

PARAGRAPH SEPARATOR

070F

SYRIAC ABBREVIATION MARK

202A

LEFT-TO-RIGHT EMBEDDING

17B4

KHMER VOWEL INHERENT AQ

202B

RIGHT-TO-LEFT EMBEDDING

17B5

KHMER VOWEL INHERENT AA

202C

POP DIRECTIONAL FORMATTING

1A60

TAI THAM SIGN SAKOT

202D

LEFT-TO-RIGHT OVERRIDE

1CBF

MEITEI MAYEK SIGN VIRAMA

202E

RIGHT-TO-LEFT OVERRIDE

VNPF

33

TCVN ................ : 2010


2060

WORD JOINER

2061

FUNCTION APPLICATION

2062

INVISIBLE TIMES

2063

INVISIBLE SEPARATOR

2064

INVISIBLE PLUS

206A

INHIBIT SYMMETRIC SWAPPING

206B

ACTIVATE SYMMETRIC
SWAPPING

206C

INHIBIT ARABIC FORM SHAPING

206D

ACTIVATE ARABIC FORM


SHAPING

FFFA

INTERLINEAR ANNOTATION
SEPARATOR

FFFB

INTERLINEAR ANNOTATION
TERMINATOR

110BD

KAITHI NUMBER SIGN

1D173

MUSICAL SYMBOL BEGIN BEAM

1D174

MUSICAL SYMBOL END BEAM

1D175

MUSICAL SYMBOL BEGIN TIE

1D176

MUSICAL SYMBOL END TIE

1D177

MUSICAL SYMBOL BEGIN SLUR

1D178

MUSICAL SYMBOL END SLUR

206E

NATIONAL DIGIT SHAPES

1D179

MUSICAL SYMBOL BEGIN PHRASE

206F

NOMINAL DIGIT SHAPES

1D17A

MUSICAL SYMBOL END PHRASE

2D7F

TIFINAGH CONSONANT JOINER

E0001

LANGUAGE TAG

FEFF

ZERO WIDTH NO-BREAK SPACE

E0020-E007F

FFF9

INTERLINEAR ANNOTATION
ANCHOR

TAG SPACE to CANCEL

16.4 K t m t ch biu
K t m t ch biu - Ideographic Description Character (IDC) l mt k t ho,
c dng vi mt dy cc k t ho khc to nn Dy m t ch biu Ideographic Description Sequence (IDS). Dy nh vy c th c dng m t k
t ch biu khng c xc nh bn trong chun ny. Ph lc I m t chng chi
tit hn. Danh sch IDC l nh sau:
im m

Tn

2FF0

IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RIGHT

2FF1

IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO BELOW

2FF2

IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO MIDDLE AND RIGHT

2FF3

IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO MIDDLE AND BELOW

2FF4

IDEOGRAPHIC DESCRIPTION CHARACTER FULL SURROUND

2FF5

IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM ABOVE

2FF6

IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM BELOW

2FF7

IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LEFT

2FF8

IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM UPPER LEFT

2FF9

IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM UPPER RIGHT

2FFA

IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LOWER LEFT

2FFB

IDEOGRAPHIC DESCRIPTION CHARACTER OVERLAID

16.5 B la bin th v dy bin th


B la bin th l lp c bit cc k t t hp i ngay sau k t c s khng hp
thnh v n ch ra dng bin th c bit ca k hiu ho cho k t . K t phn
34

VNPF

TCVN ................ : 2010


r c l k t m c tn ti mt dy hp thnh tng cng cho n. Dy k t ny
bao gm k t c s khng phn r c c theo sau bi b la bin th c gi l
dy bin th.
LU 1 Mt s b la bin th l ring cho b ch vit, nh b la bin th t do Mng C, s
khc c dng vi cc k t c s khc a dng nh k hiu ton hc.

Ch dy bin th c xc nh hay c tham chiu trong mc ny mi ch ra dng


bin th c bit ca k hiu ho; tt c cc dy nh vy u khng xc nh. Hn
na, cc b la bin th i theo sau cc k t c s khc v bt k k t khng c s
no u khng c tc dng ln vic la k hiu ho cho k t .
Khng dy bin th no dng cc k t t VARIATION SELECTOR-2 ti VARIATION
SELECTOR-16 c xc nh lc ny. Dy bin th bao gm mt ch biu thng
nht nh k t c s v mt trong cc VARIATION SELECTOR-17 ti VARIATION
SELECTOR-256 trong Mt phng chuyn dng b sung (SSP) c ng k trong c
s d liu bin th ch biu c xc nh bi chun k thut Unicode #37 (xem 3).
LU 2 Phin bn ny ca chun ny t hp theo tham chiu dy bin th c lit k trong
phin bn 2007-12-14 ca c s bin th ch biu , nh c m t ti
http://www.unicode.org/ivd/data/2007-12-14/ .

Danh sch sau y cung cp m t v s xut hin bin thin tng ng vi vic
dng b la bin th thch hp vi mi k hiu ton hc c s c php.
LU 3 VARIATION SELECTOR-1 l b la bin th duy nht c dng cho cc k hiu ton
hc.

Dy

M t dy cc s xut hin bin thin

(k php UID)
<2229, FE00>

INTERSECTION with serifs

<222A, FE00>

UNION with serifs

<2268, FE00>

LESS-THAN BUT NOT EQUAL TO with vertical stroke

<2269, FE00>

GREATER-THAN BUT NOT EQUAL TO with vertical stroke

<2272, FE00>

LESS-THAN OR EQUIVALENT TO following the slant of the lower leg

<2273, FE00>

GREATER-THAN OR EQUIVALENT TO following the slant of the lower leg

<228A, FE00>

SUBSET OF WITH NOT EQUAL TO with stroke through bottom members

<228B, FE00>

SUPERSET OF WITH NOT EQUAL TO with stroke through bottom members

<2293, FE00>

SQUARE CAP with serifs

<2294, FE00>

SQUARE CUP with serifs

<2295, FE00>

CIRCLED PLUS with white rim

<2297, FE00>

CIRCLED TIMES with white rim

<229C, FE00>

CIRCLED EQUALS equal sign touching the circle

<22DA, FE00>

LESS-THAN EQUAL TO OR GREATER-THAN with slanted equal

<22DB, FE00>

GREATER-THAN EQUAL TO OR LESS-THAN with slanted equal

<2A3C, FE00>

INTERIOR PRODUCT tall variant with narrow foot

<2A3D, FE00>

RIGHTHAND INTERIOR PRODUCT tall variant with narrow foot

<2A9D, FE00>

SIMILAR OR LESS-THAN with similar following the slant of the upper leg

VNPF

35

TCVN ................ : 2010


<2A9E, FE00>

SIMILAR OR GREATER-THAN with similar following the slant of the upper leg

<2AAC, FE00>

SMALLER THAN OR EQUAL TO with slanted equal

<2AAD, FE00>

LARGER THAN OR EQUAL TO with slanted equal

<2ACB, FE00>

SUBSET OF ABOVE NOT EQUAL TO with stroke through bottom members

<2ACC, FE00>

SUPERSET OF ABOVE NOT EQUAL TO with stroke through bottom members

17

Dng trnh by ca cc k t

Tng dng trnh by ca mt k t cung cp mt dng thay phin, dng trong ng


cnh c bit, theo dng chnh tc ca k t hau dy cc k t t vng k t ho
khc. Bin i t dng chnh tc sang dng trnh by c th bao gm vic thay th,
chng ln, hay t hp.
Cc qui tc cho chng ln, chn cc k t c hnh khc, hay t hp thnh nt ch, hay
ghp ni, iu thng cc k phc tp, khng c xc nh trong TCVN-....:2010.
Ni chung, dng trnh by khng c d nh dng nh ci thay th cho dng
chnh tc ca k t ho c xc nh u bn trong tp k t m ho ny.
Tuy nhin, cc ng dng ring c th m ho dng trnh by thay v dng chnh tc v
nhng l do ring m trong d c s tng hp vi cc thit b hin c. Qui tc tm
kim, sp xp v cc thao tc x l khc v dng trnh by nm ngoi phm vi ca
TCVN-....:2010.
Bn trong BMP cc k t ny c cp pht ch yu cho cc im m bn trong cc
hng t FB ti FF.

18

K t tng hp

K t tng hp c a vo trong TCVN-....:2010 ch yu tng hp vi cc


tp k t m ho hin c cho php chuyn i m hai chiu khng mt thng tin.
Bn trong BMP nhiu trong cc k t ny c cp pht cho cc im m bn trong
cc hng F9, FA, FE, v FF, v bn trong cc hng 31 v 33. Mt s k t tng hp
cng c cp pht bn trong cc hng khc.
LU 1 C mi hai im m trong cc hng FA ca BMP c cp pht cho cc ch biu
thng nht.

Bn trong Mt phng ch biu b sung (SIP) cc k t ny c cp pht trong cc


hng F8 ti FA.
Ch biu tng hp CJK l cc ch biu ng phi c thng nht vi mt
trong cc ch biu thng nht CJK, theo qui tc thng nht c m t trong ph
lc S. Tuy nhin, chng c a vo trong chun ny nh cc k t tch bit, bi v,
da trn cc l do lch s, vn ho hay quc gia a dng vi mt nc hay vng ring,
mt s chun quc gia v vng gn cc im m tch bit cho chng.
LU 2 Bi l do ny, cc ch biu tng hp ch nn c dng duy tr v m bo
chuyn i vng trn vi cc chun quc gia, vng hay chun khc. Vic dng khc khng c
khuyn khch nhiu.

36

VNPF

TCVN ................ : 2010

19

Th t ca cc k t

Thng thng, cc k t m ho xut hin trong phn t d liu CC theo th t logic


(th t logic hay th t lu sau tng ng xp x vi th t theo cc k t c
a vo t bn phm, sau khi sa cha nh chn thm, xo, v g xy ra).
iu ny p dng ngay c khi cc k t ca hng chi phi khc c trn ln: tri
sang phi (Hi Lp, Latin, Thai) vi phi sang tri (Arabic, Do Thi), hay vi ch vit
hng ng (Mng C).
Mt s k t c th khng xut hin tuyn tnh trong vn bn c ti to. Chng hn,
dng trung gian ca DEVANAGARI VOWEL SIGN I c hin th trc k t m n i
sau v mt logic trong phn t d liu CC.

20

K t t hp

Mnh ny xc nh vic dng cc k t t hp (xem 14.4).

20.1 Th t ca k t t hp
Cc biu din m ho ca cc k t t hp s i theo sau k t ho m chng c
lin kt (chng hn, biu din m ca LATIN SMALL LETTER A c i theo sau bi
COMBINING TILDE biu din cho dy hp thnh ch Latin ).
Nu mt k t t hp c coi nh dy hp thnh theo quyn ring ca n, n s
c m ho nh dy hp thnh bi vic lin kt vi k t 00AD NO-BREAK SPACE.
Chng hn, du thanh huyn c th c to ra khi 00AD NO-BREAK SPACE c i
theo sau bi 0300 COMBINING GRAVE ACCENT.

20.2 Lp t hp v sp th t chnh tc
Tng k t t hp u c gi tr lp t hp c xc nh bi c s d liu Unicode
(xem 3). Lp t hp c dng xc nh th t chnh tc m l mt phn ca qu
trnh chun ho (xem 21). Th t chnh tc bao gm sp th t cc k t t hp theo
th t tng ca lp t hp ca chng. Cc k t t hp c gi tr lp t hp l khng
l khng c quan h sp th t li i vi cc k t khc.

20.3 S xut hin trong s m


Cc k t t hp c d nh t tng i vi k t lin kt c m t bn trong
bng m k t trn, di, bn phi, bn tri, gia, bao quanh hay xuyn qua mt
vng trn chm ch ra v tr ca chng tng i vi k hiu c s. Trong trnh by,
cc k t ny c d nh t tng i vi k t c s i trc theo cch no ,
v khng ng mt mnh hay vn hnh nh k t c s. y l ng c cho thut
ng "t hp".
LU Du ph l lp chnh cc k t t hp c dng trong cc bng ch chu u. Vi nhiu
ch vit khc c dng n v ng Nam , cc k t t hp m cho cc ch nguyn m;
nh vy chng ni chung khng c tham chiu l du ph.

20.4 Cc biu din m thay th


Cc biu din m ho thay th ca vn bn c sinh ra bng vic dng a k t t
hp theo th t khc nhau, hay dng cc t hp k t tng ng a dng v dy
VNPF

37

TCVN ................ : 2010


hp thnh. Nhng biu din m ho thay th ny lm ny sinh a biu din ca cng
mt vn bn. Vic chun ho (xem mc 24) nhng biu din m ho ny lm gim
ng k, nhng khng xo b, s xut hin ca a biu din ny.
LU Chng hn, trong ci t mc 3 t ting Php l c th c biu din bi cc k t
LATIN SMALL LETTER L theo sau bi LATIN SMALL LETTER A WITH GRAVE, hay c th c
biu din bi k t LATIN SMALL LETTER L theo sau bi LATIN SMALL LETTER A theo sau bi
COMBINING GRAVE ACCENT. Khi cc dng chun ho c p dng cho cc biu din m ho
thay th , ch mt biu din cn li. Dng ca biu din cn li ph thuc vo dng chun ho
c dng.

20.5 a k t t hp
C nhng trng hp nhiu hn mt k t t hp c p dng cho mt k t ho.
TCVN...:2010 khng hn ch s cc k t t hp m c th i sau k t c s. Cc qui
tc sau p dng cho vic trnh by cc k t ny:
a) Nu k t t hp c th tng tc trong trnh by (chng hn, COMBINING
MACRON v COMBINING DIAERESIS), th v tr ca k t t hp trong hin th
ho kt qu c xc nh bi th t ca biu din m ho ca cc k t t hp.
Cc trnh by ca k t t hp c nh v t k t c s hng ra ngoi. Chng
hn, cc k t t hp c t ln trn k t c s c chng ln theo chiu
ng, bt u vi k t u tin c gp trong dy cc biu din m ho v tip
tc cho nhiu du trn nh c yu cu bi cc k t t hp m ho i sau k
t c s m ho. Vi cc k t t hp c t bn di k t c s, tnh hung
o ngc li, vi cc k t t hp bt u t k t c s v chng ngc xung.
Mt v d v a k t t hp trn k t c s c thy trong ch Thi, ni ch
ph m c th c trn n mt hay nhiu nguyn m 0E34 ti 0E37 v, trn ,
mt trong bn du thanh 0E48 ti 0E4B. Th t ca biu din m ho l: ph m
c s, i sau bi mt nguyn m, i sau bi mt du thanh.
b) Mt s k t t hp ring ghi ln hnh vi xp chng mc nh bng vic nh v
theo chiu ngang thay v chiu chng ln, hay bng vic hnh thnh nt ch vi k
t t hp lin k. Khi c nh v theo chiu ngang, th t ca cc biu din m
ho c phn nh bi vic nh v theo th t chi phi ca ch vit m chng
c dng. Chng hn, du ging chiu ngang trong ch vit tri qua phi c
m ho tri qua phi.
Cc k t ni bt ch ra hnh vi ghi nh vy c lin kt vi cc ch vit hay
bng ch ring. Chng hn, COMBINING GREEK KORONIS (0343) yu cu
rng, cng vi mt du sc hay huyn, chng c ti to ngay bn cnh ch,
thay v du ging c chng ln trn COMBINING GREEK KORONIS. Th t
ca cc biu din m ho l: bn thn ch , tip sau l du bt hi, tip sau l
du ging. Hai thanh ting Vit c cng s xut hin ho nh du ging sc v
huyn latin khng chng ln trn ba ch nguyn m Vit Nam m cha du
ph m (, , ). Thay v th, chng to nn nt ch cng cu phn m ca ch
nguyn m.
c) Nu cc k t t hp khng tng tc trong trnh by (chng hn, khi mt k t t
hp trn mt k t ho v k t khc di), k hiu ho kt qu t k t
c s v k t t hp cc th t khc nhau c th xut hin nh nhau. Chng
hn, biu din m ho ca LATIN SMALL LETTER A, theo sau bi COMBINING
CARON, theo sau bi COMBINING OGONEK c th lm pht sinh cng k hiu
ho nh biu din m ho ca LATIN SMALL LETTER A, theo sau bi

38

VNPF

TCVN ................ : 2010


COMBINING OGONEK, theo sau bi COMBINING CARON.
Cc k t t hp trong ch Do Thi hay A rp thng khng c tng tc. Do ,
dy cc biu din m ho ca chng trong dy hp thnh khng nh hng ti k
hiu ho ca n. Cc qui tc to ra k hiu ho t hp bn ngoi phm
vi ca TCVN...:2010.

20.6 Tuyn tp cha cc k t t hp


Trong mt s tuyn tp cc k t c lit k trong ph lc A, nh cc tuyn tp 14
(BASIC ARABIC) hay 25 (THAI), c k t t hp v k t khng t hp u c bao
hm.
Cc tuyn tp khc ca cc k t c lit k trong ph lc A ch bao gm cc k t t
hp, chng hn tuyn tp 7 (COMBINING DIACRITICAL MARKS).

20.7 B ni t v t hp
K t 034F COMBINING GRAPHEME JOINER c dng ch ra rng k t k l
c x l nh mt n v vi mc ch sp xp v tm kim nhy cm ngn ng.
Trong sp xp v tm kim nhy cm ngn ng, b ni t v t hp nn c b qua
tr phi n xut hin c bit vi nh x phn t th t c iu chnh. ti to, b
ni t v t hp l v hnh.
LU 1 B ni t v t hp c th c dng lm khc bit hai cch dng ca k t t hp
bng vic dng n cho mt trong hai trng hp. Chng hn, ni cn ti phn bit gia umlaut ca
c v trma, COMBINING GRAPHEME JOINER (034F) theo sau bi COMBINING DIAERESIS
(0308) nn c dng biu din cho trma trong khi COMBINING DIAERESIS (0308) mt mnh
nn c dng biu din cho umlaut c.

21

Dng chun ho

Dng chun ho l c ch cho php chn la mt cch biu din m ho duy nht
trong nhiu cch biu din, nhng cc biu din vn bn m ho u l tng ng
ca cng mt vn bn. Dng chun ho cho vic dng vi TCVN...:2010 c xc
nh trong Chun Unicode UAX#15 (xem mc 3).
LU 1 Theo nh ngha, kt qu ca vic p dng bt k dng no trong cc dng chun ny
u n nh qua thi gian. N ngha l biu din c chun ho ca vn bn vn cn c chun
ho khi chun ny c tu chnh.
LU 2 Mt s dng chun ho thin v dy hp thnh i vi cc biu din ngn hn ca vn
bn, mt s khc thin v cc biu din ngn hn. Yu cu tng hp ngc c cung cp bng
vic thit lp TCVN...:2010 nh phin bn tham chiu cho nh ngha v cc biu din ngn hn
ca vn bn. Vic thng nht kho ca chng l ng nht vi tuyn tp c nh UNICODE 3.2.
LU 3 Mc ch ca chun ho l cung cp mt kt qu c chun ho duy nht cho bt k
dy vn bn cho no to iu kin, trong s nhng iu khc, nhn din vic i snh. Dng
chun ho khng nht thit biu din dy ti u t quan im ngn ng.

22

Tn k t v ch gii

22.1 Tn thc th
Chun ny xc nh tn cho cc kiu thc th sau y

VNPF

39

TCVN ................ : 2010

k t
danh nh dy UCS c tn (xem 25)
khi (xem 14)
tuyn tp

Tn c cho trong chun ny vi cc thc th ny s tun theo cc qui tc to thnh


tn v tnh duy nht ca tn c xc nh trong mc ny. c t ny p dng cho
ton b tn trong ting Anh ca chun ny.
LU 1 Trong mt phin bn ca chun ny trong ngn ng khc a) cc qui tc ny c th
c tu chnh cho php tn c sinh ra dng cc t v c php c xem xt thch hp bn
trong ngn ng ; b) tn thc th t phin bn ny ca chun ny c th c thay th bng tn
quy nht tng ng c xy dng theo cc qui tc c tu chnh nh trong a) trn.
LU 2 Cc hng dn ph cho vic xy dng tn thc th c cho trong ph lc L.

22.2 Hnh thnh tn


Tn thc th s ch bao gm cc k t sau

LATIN CAPITAL LETTER A ti LATIN CAPITAL LETTER Z,

DIGIT ZERO ti DIGIT NINE,

SPACE,

HYPHEN-MINUS, v

FULL STOP nu thc th c tn l tuyn tp

K t u tin trong tn thc th s l ch Latin hoa. K t cui cng trong tn thc


th s hoc l ch Latin hoa hoc ch s.
Tn thc th s khng cha hai hay nhiu k t SPACE lin tip hay k t HYPHENMINUS lin tip. Tn tuyn tp s khng cha hai hay nhiu k t FULL STOP lin
tip.
Dy SPACE theo sau l HYPHEN-MINUS hay dy HYPHEN-MINUS theo sau l
SPACE c th xut hin ch trong tn k t hay danh nh dy UCS c tn.
V D 1 Mi mt trong hai tn k t sau y cha mt SPACE v HYPHEN-MINUS lin tip:
TIBETAN LETTER -A
TIBETAN MARK BKA- SHOG YIG MGO

FULL STOP c th xut hin ch gia hai k t ch-s (LATIN CAPITAL LETTER A
ti LATIN CAPITAL LETTER Z, DIGIT ZERO ti DIGIT NINE) trong tn tuyn tp.
V D 2 Tn tuyn tp sau y cha FULL STOP gia hai ch s, DIGIT FOUR v DIGIT ONE:
UNICODE 4.1
V D 3 Tn tuyn tp sau y cha FULL STOP gia mt ch Latin, LATIN CAPITAL LETTER
D, v mt ch s, DIGIT SEVEN: BMP-AMD.7

22.3 Tn n
Tng thc th c tn trong chun ny s c cho duy nht mt tn.
LU iu ny khng ngn cn vic dng thng tin ca tn bit hiu hay vit tt vi mc hc
lm sng t. Tuy nhin, thc th qui chun s l duy nht.

40

VNPF

TCVN ................ : 2010

22.4 Tnh duy nht ca tn


Tng tn thc th cng phi duy nht bn trong khng gian tn thch hp, nh c
xc nh y.
22.4.1 Tn khi

Tn khi to nn mt khng gian tn. Tng tn khi s l duy nht v phn bit vi
cc tn khi khc c xc nh trong chun ny.
22.4.2 Tn tuyn tp

Tn tuyn tp thit lp nn khng gian tn. Tng tn tuyn tp s l duy nht v phn
bit vi mi tn tuyn tp khc c xc nh trong chun ny.
22.4.3 Tn k t v danh nh dy UCS c tn

Tn k t v danh nh dy UCS c tn, c ly cng nhau, to nn khng gian tn.


Tng tn k t hay danh nh dy UCS c tn s l duy nht v phn bit vi tt c
cc tn k t khc hay danh jnh dy UCS c tn.
22.4.4 Xc nh tnh duy nht

Vi cc tn khi v tn tuyn tp, hai tn s c coi l duy nht v phn bit nu


chng khc nhau ngay c khi cc k t SPACE v HYPHEN-MINUS trung gian b b
qua trong so snh cc tn.

22.5 Cc tn k t cho ch biu CJK


Vi ch biu CJK cc tn c xy dng theo thut ton bng vic gn thm biu
din m ho ca chng theo k php h thp lc phn vo CJK UNIFIED
IDEOGRAPH- cho cc ch biu thng nht CJK v CJK COMPATIBILITY
IDEOGRAPH- cho cc ch biu tng hp CJK.
Vi cc ch biu CJK bn trong BMP, biu din m ho l gi tr hai byte ca chng
c din t nh bn ch s h thp lc phn. Chng hn, k t ch biu CJK u
tin trong BMP c tn l CJK UNIFIED IDEOGRAPH-3400.
Vi cc ch biu CJK bn trong SIP, biu din m ho l gi tr nm ch s thp lc
phn ca chng. Chng hn, k t ch biu CJK u tin trong SIP c tn l CJK
UNIFIED IDEOGRAPH-20000.
Gi tr USI tng ng vi tng NUSI c vit bng vic dng biu din m ho
c xc nh bi dng chun NFC (xem 21). Tng dy UCS c tn u c biu din
m duy nht. Tt c cc danh nh dy UCS c tn c php dng vi TCVN....:2010 c xc nh trong Unicode Standard UAX#34 (xem 3); tt c cc dy c
tn khc u khng c xc nh.

VNPF

41

TCVN ................ : 2010

23

Cu trc ca Mt phng a ng c s

Mt tng quan v Mt phng a ng c s c nu trong hnh 3 v tng quan chi


tit hn v cc hng 00 ti 33 c nu trong hnh 4. Mt phng ng c s bao
gm cc k t s dng chung trong cc b ch c bng ch ci, c m tit v cc ch
biu vi cc k hiu v ch s khc nhau.
Byte Hng
00
Hng 00 ti 33
..
(xem hnh 4)
..
..
33
34
Extension A v ch biu thng nht
..
..
4D
4E
..
Ch biu thng nht CJK
..
9F
A0
m tit Yi
..
A3
A4
B Yi
Lisu
A5
Vai
A6
Cyrillic Extended-B
Bamum
A7
Latin Extended-D
Modifier T L
Syloti Nagri
CINF
Phags-Pa
Saurashtra
Dev Ext
A8
Kayah Li
Rejang
Hangul Jamo Ext-A
Javanese
A9
Cham
Myanmar Ext-A
Tai Viet
AA
Meetei Mayek
AB
AC
m tit Hangul
..
D7
D8..
DF
E0
..
F8
F9
FA
FB
FC
FD
FE
FF

Thay th (ch dng cho UTF-16)


Vng s dng t
Ch biu tng hp CJK
Dng trnh by ch ci
Dng trnh by A rp A
VS

VF

CHM

CJK CF
Samll Form Variants
Arabic Presentation Forms-B
Halfwidth and Fullwidth Form
Sp.

= c dnh ring vnh vin

= dnh cho chun ho tng lai

Hnh 5 - Tng quan v Mt phng a ng c s

42

VNPF

TCVN ................ : 2010


Byte hng
00
01
02
03
04
05
06
07
08
09
0A
0B
0C
0D
0E
0F
10
11
12
13
14
15
16
17
18
19
1A
1B
1C
1D
1E
1F
20
21
22
23
24
25
26
27
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33

Control

Basic latin
Control
Latin-1 Supplement
Latin Extended A
Latin Extended B
Latin Extended B
IPA Extension
Spacing Modifier Letters
Combination Diacritic Marks
Greek and Coptic
Cyrlic
Cyrlic Supplement
Armenia
Hebrew
Arabic
Syriac
Arabic Sup.
Thaana
Nko
Samaritan
Mandaic
Devanagari
Bengali
Gumukhi
Gujarati
Oriya
Tamil
Telugu
Kannada
Malayalam
Sinhala
Thai
Lao
Tibetan
Myanmar
Georgian
Hangul Jamo
Ethiopic
Ethiopi Sup.
Cherokee
Unified Canadian Aboriginal Syllabics
Orgham
Runic
Tagbanwa
Buhid
Khmer
Mongolian
UCAS Extended
Limbu
Tai Le
New Tai Lue
Khmer symbols
Buginese
Thai Tham
Balinese
Sudanese
Batak
Lepcha
Ol Chiki
Vedic Extensions
Phonetic Extension Sup.
Phonetic Extension
Comb. Mks. Symb.
Latin Extended Additional
Greek Extended
Com. Mrk. Symb.
Genereral Punctuation
Super-/Subscripts Current Symbols
Letterlike Symbols
Number Forms
Arrows
Mathematical Operations
Miscellaneous Technical
Control Pictures
O.C.R
Enclosed Alphanumerics
Box Drawing
Blocck Element
Geometric Shapes
Miscellaneous Symbols
Dingbats
Misc. Math. Symb. A
S. Arrows-A
Braille Pattern
Supplement Arrows-B
Miscellaneous Mathematical Symbol B
Supplemental Mathematical Operrators
Miscellaneous Symbols and Arrows
Glagolitic
Latin Ext-C
Coptic
Georgian Sup.
Tifinagh
Ethiopic Extended
Cyrillic Ext-A
Supplemental Punctuation
CJK Radical Suplement
Kangxi Radicals
Ideo. Descr.
CJK Symbols and Punctuation
Hiragana
Katakana
Hangul Compatible Jamo
Bopomofo
Kanbun
Bopomofo Ext.
K.P.E
Enclosed CJK Letters and Months
CJK Compatibility
Tagalog

Hanunoo

Dnh cho chun ho tng lai


Lu - Bin ng trong cc hng c ch ra theo v tr xp x

Hnh 6 - Tng quan v Hng 00 ti 33 ca Mt phng a ng c s


VNPF

43

TCVN ................ : 2010

24

Cu trc Mt phng a ng b sung (SMP)

Bi v mt phng b sung khc c dnh ring cho ch biu CJK b sung, SMP
(mt phng 1) khng c dng cho ti ny m ho cc ch biu CJK. Thay v
th, SMP c dng m ho cc k t ho c dng trong cc ch vit khc
trn th gii m cn cha c m ho trong BMP. Phn ln, nhng khng phi tt
c, cc ch vit c m ho trong SMP khng phi l cc ch vit sng ang dng
bi cc cng ng ngi dng hin i.
Tng quan v Mt phng a ng b sung cho cc ch vit v k hiu c v trong
hnh 7.

= dnh cho chun ho tng lai


LU 2 Bin ng bn trong cc hng c ch ra v tr xp x.

Hnh 7 Tng quan v Mt phng a ng b sung

44

VNPF

TCVN ................ : 2010

25

Cu trc ca Mt phng ch biu b sung (SIP)

SIP (mt phng 2) c dng cho cc ch biu CJK (ch biu ng thng
nht) vn khng c m ho trong BMP. Th tc thng nht v cc qui tc cho
b tr chng c m t trong Ph lc S.
SIP cng c dng cho cc ch biu CJK tng hp. Cc ch biu ny l cc k
t tng hp nh c xc nh trong 18.
Hnh sau ch ra mt tng quan v Mt phng ch biu b sung.
Hng
00
..
..
A6
A7
..
B7
B7
B8

CJK Unified Ideographs Extension C

..
F8
..
FA

CJK Compatible Ideographs Supplement

CJK Unified Ideographs Extension B

FB
..
FF
= dnh cho chun ho tng lai
LU Bin ng bn trong cc hng ch v tr xp x.

Hnh 8 Tng quan v Mt phng ch biu b sung

26

Cu trc ca mt phng chuyn dng b sung (SSP)

SSP (mt phng 0E) c dng cho cc k t ho chuyn dng. Cc im m t


E0000 ti E0FFF c dnh ring cho K t nh dng (xem 16).
LU 1 Mt s trong cc k t ny khng c biu din trc quan v khng c k hiu ho in
c. Cc k t Tag l v d v nhng k t nh vy.

Tng quan v Mt phng chuyn dng b sung c nu trong hnh 9.


LU 2 Cc im m khng c gn trong min ny nn c b qua trong x l v hin th
thng thng.

VNPF

45

TCVN ................ : 2010

Byte-hng

= dnh cho chun ho tng lai


LU 3 Bin ng bn trong hng c ch ra v tr xp x.

Hnh 9 Tng quan v Mt phng chuyn dng b sung

27

Cc bng k t m ho ch Vit

27.1 Bng k t m ho ch Quc ng


27.2 Bng k t m ho ch Khmer
27.3 Bng k t m ho ch Chm
27.4 Bng k t m ho ch Thi (TaiViet)
27.5 Bng k t m ho ch Hn Nm

46

VNPF

TCVN ................ : 2010

28

Tn quc t ca cc k t ch Vit

28.1 Tn quc t ca ch Quc ng


M

Ch

Tn

0018

0018 CANCEL

0000

0000 NULL

0019

0019 END OF MEDIUM

0001

0001 START OF HEADING

001A

001A SUBSTITUTE

0002

0002 START OF TEXT

001B

001B ESCAPE

0003

0003 END OF TEXT

001C

001C FILE SEPARATOR

0004

0004 END OF TRANSMISSION

001D

001D GROUP SEPARATOR

0005

0005 ENQUIRY

001E

001E RECORD SEPARATOR

0006

0006 ACKNOWLEDGE

001F

001F UNIT SEPARATOR

0007

0007 BELL

0020

0020 SPACE

0008

0008 BACKSPACE

0021

EXCLAMATION MARK

0009

0009 HORIZONTAL TABULATION

0022

"

QUOTATION MARK

000A

000A LINE FEED

0023

NUMBER SIGN

000B

000B VERTICAL TABULATION

0024

DOLLAR SIGN

000C

000C FORM FEED

0025

PERCENT SIGN

000D

000D CARRIAGE RETURN

0026

&

AMPERSAND

000E

000E SHIFT OUT

0027

'

APOSTROPHE

000F

000F SHIFT IN

0028

LEFT PARENTHESIS

0010

0010 DATA LINK ESCAPE

0029

RIGHT PARENTHESIS

0011

0011 DEVICE CONTROL ONE

002A

ASTERISK

0012

0012 DEVICE CONTROL TWO

002B

PLUS SIGN

0013

0013 DEVICE CONTROL THREE

002C

COMMA

0014

0014 DEVICE CONTROL FOUR

002D

HYPHEN-MINUS

0015

0015 NEGATIVE ACKNOWLEDGE

002E

FULL STOP

0016

0016 SYNCHRONOUS IDLE

002F

SOLIDUS

0017

0017 END OF TRANSMISSION BLOCK

0030

DIGIT ZERO

VNPF

157

TCVN ................ : 2010


0031

DIGIT ONE

004E

LATIN CAPITAL LETTER N

0032

DIGIT TWO

004F

LATIN CAPITAL LETTER O

0033

DIGIT THREE

0050

LATIN CAPITAL LETTER P

0034

DIGIT FOUR

0051

LATIN CAPITAL LETTER Q

0035

DIGIT FIVE

0052

LATIN CAPITAL LETTER R

0036

DIGIT SIX

0053

LATIN CAPITAL LETTER S

0037

DIGIT SEVEN

0054

LATIN CAPITAL LETTER T

0038

DIGIT EIGHT

0055

LATIN CAPITAL LETTER U

0039

DIGIT NINE

0056

LATIN CAPITAL LETTER V

003A

COLON

0057

LATIN CAPITAL LETTER W

003B

SEMICOLON

0058

LATIN CAPITAL LETTER X

003C

<

LESS-THAN SIGN

0059

LATIN CAPITAL LETTER Y

003D

EQUALS SIGN

005A

LATIN CAPITAL LETTER Z

003E

>

GREATER-THAN SIGN

005B

LEFT SQUARE BRACKET

003F

QUESTION MARK

005C

REVERSE SOLIDUS

0040

COMMERCIAL AT

005D

RIGHT SQUARE BRACKET

0041

LATIN CAPITAL LETTER A

005E

CIRCUMFLEX ACCENT

0042

LATIN CAPITAL LETTER B

005F

LOW LINE

0043

LATIN CAPITAL LETTER C

0060

GRAVE ACCENT

0044

LATIN CAPITAL LETTER D

0061

LATIN SMALL LETTER A

0045

LATIN CAPITAL LETTER E

0062

LATIN SMALL LETTER B

0046

LATIN CAPITAL LETTER F

0063

LATIN SMALL LETTER C

0047

LATIN CAPITAL LETTER G

0064

LATIN SMALL LETTER D

0048

LATIN CAPITAL LETTER H

0065

LATIN SMALL LETTER E

0049

LATIN CAPITAL LETTER I

0066

LATIN SMALL LETTER F

004A

LATIN CAPITAL LETTER J

0067

LATIN SMALL LETTER G

004B

LATIN CAPITAL LETTER K

0068

LATIN SMALL LETTER H

004C

LATIN CAPITAL LETTER L

0069

LATIN SMALL LETTER I

004D

LATIN CAPITAL LETTER M

006A

LATIN SMALL LETTER J

158

VNPF

TCVN ................ : 2010


006B

LATIN SMALL LETTER K

0323

006C

LATIN SMALL LETTER L

006D

LATIN SMALL LETTER M

0102

006E

LATIN SMALL LETTER N

00C2

006F

LATIN SMALL LETTER O

00CA

0070

LATIN SMALL LETTER P

00D4

0071

LATIN SMALL LETTER Q

01A0

0072

LATIN SMALL LETTER R

01AF

0073

LATIN SMALL LETTER S

0110

0074

LATIN SMALL LETTER T

00C0

0075

LATIN SMALL LETTER U

1EA2

0076

LATIN SMALL LETTER V

00C3

0077

LATIN SMALL LETTER W

00C1

0078

LATIN SMALL LETTER X

1EA0

0079

LATIN SMALL LETTER Y

1EB0

007A

LATIN SMALL LETTER Z

1EB2

007B

LEFT CURLY BRACKET

1EB4

007C

VERTICAL LINE

1EAE

007D

RIGHT CURLY BRACKET

1EB6

007E

TILDE

1EA6

007F

DELETE

1EA8

NO-BREAK SPACE

1EAA

00A0
0300

COMBINING GRAVE ACCENT

1EA4

0301

COMBINING ACUTE ACCENT

1EAC

0302

COMBINING CIRCUMFLEX
ACCENT

00C8

0303

COMBINING TILDE

1EBA

COMBINING BREVE

1EBC

COMBINING HOOK ABOVE

00C9

COMBINING HORN

1EB8

0306
0309
031B

VNPF

COMBINING DOT BELOW

LATIN CAPITAL LETTER A WITH


BREVE
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX
LATIN CAPITAL LETTER E WITH
CIRCUMFLEX
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX
LATIN CAPITAL LETTER O WITH
HORN
LATIN CAPITAL LETTER U WITH
HORN
LATIN CAPITAL LETTER D WITH
STROKE
LATIN CAPITAL LETTER A WITH
GRAVE
LATIN CAPITAL LETTER A WITH
HOOK ABOVE
LATIN CAPITAL LETTER A WITH
TILDE
LATIN CAPITAL LETTER A WITH
ACUTE
LATIN CAPITAL LETTER A WITH
DOT BELOW
LATIN CAPITAL LETTER A WITH
BREVE AND GRAVE
LATIN CAPITAL LETTER A WITH
BREVE AND HOOK ABOVE
LATIN CAPITAL LETTER A WITH
BREVE AND TILDE
LATIN CAPITAL LETTER A WITH
BREVE AND ACUTE
LATIN CAPITAL LETTER A WITH
BREVE AND DOT BELOW
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX AND GRAVE
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX AND HOOK ABOVE
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX AND TILDE
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX AND ACUTE
LATIN CAPITAL LETTER A WITH
CIRCUMFLEX AND DOT BELOW
LATIN CAPITAL LETTER E WITH
GRAVE
LATIN CAPITAL LETTER E WITH
HOOK ABOVE
LATIN CAPITAL LETTER E WITH
TILDE
LATIN CAPITAL LETTER E WITH
ACUTE
LATIN CAPITAL LETTER E WITH
DOT BELOW

159

TCVN ................ : 2010


1EC0

1EC2

1EC4

1EBE

1EC6

00CC

1EC8

0128

00CD

1ECA

00D2

1ECE

00D5

00D3

1ECC

1ED2

1ED4

1ED6

1ED0

1ED8

1EDC

1EDE

1EE0

1EDA

1EE2

00D9

1EE6

0168

00DA

160

LATIN CAPITAL LETTER E WITH


CIRCUMFLEX AND GRAVE
LATIN CAPITAL LETTER E WITH
CIRCUMFLEX AND HOOK ABOVE
LATIN CAPITAL LETTER E WITH
CIRCUMFLEX AND TILDE
LATIN CAPITAL LETTER E WITH
CIRCUMFLEX AND ACUTE
LATIN CAPITAL LETTER E WITH
CIRCUMFLEX AND DOT BELOW
LATIN CAPITAL LETTER I WITH
GRAVE
LATIN CAPITAL LETTER I WITH
HOOK ABOVE
LATIN CAPITAL LETTER I WITH
TILDE
LATIN CAPITAL LETTER I WITH
ACUTE
LATIN CAPITAL LETTER I WITH
DOT BELOW
LATIN CAPITAL LETTER O WITH
GRAVE
LATIN CAPITAL LETTER O WITH
HOOK ABOVE
LATIN CAPITAL LETTER O WITH
TILDE
LATIN CAPITAL LETTER O WITH
ACUTE
LATIN CAPITAL LETTER O WITH
DOT BELOW
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX AND GRAVE
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX AND HOOK ABOVE
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX AND TILDE
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX AND ACUTE
LATIN CAPITAL LETTER O WITH
CIRCUMFLEX AND DOT BELOW
LATIN CAPITAL LETTER O WITH
HORN AND GRAVE
LATIN CAPITAL LETTER O WITH
HORN AND HOOK ABOVE
LATIN CAPITAL LETTER O WITH
HORN AND TILDE
LATIN CAPITAL LETTER O WITH
HORN AND ACUTE
LATIN CAPITAL LETTER O WITH
HORN AND DOT BELOW
LATIN CAPITAL LETTER U WITH
GRAVE
LATIN CAPITAL LETTER U WITH
HOOK ABOVE
LATIN CAPITAL LETTER U WITH
TILDE
LATIN CAPITAL LETTER U WITH
ACUTE

1EE4

1EEA

1EEC

1EEE

1EE8

1EF0

1EF2

1EF6

1EF8

00DD

1EF4

0103

00E2

00EA

00F4

01A1

01B0

0111

00E0

1EA3

00E3

00E1

1EA1

1EB1

1EB3

1EB5

1EAF

1EB7

LATIN CAPITAL LETTER U WITH


DOT BELOW
LATIN CAPITAL LETTER U WITH
HORN AND GRAVE
LATIN CAPITAL LETTER U WITH
HORN AND HOOK ABOVE
LATIN CAPITAL LETTER U WITH
HORN AND TILDE
LATIN CAPITAL LETTER U WITH
HORN AND ACUTE
LATIN CAPITAL LETTER U WITH
HORN AND DOT BELOW
LATIN CAPITAL LETTER Y WITH
GRAVE
LATIN CAPITAL LETTER Y WITH
HOOK ABOVE
LATIN CAPITAL LETTER Y WITH
TILDE
LATIN CAPITAL LETTER Y WITH
ACUTE
LATIN CAPITAL LETTER Y WITH
DOT BELOW

LATIN SMALL LETTER A WITH


BREVE
LATIN SMALL LETTER A WITH
CIRCUMFLEX
LATIN SMALL LETTER E WITH
CIRCUMFLEX
LATIN SMALL LETTER O WITH
CIRCUMFLEX
LATIN SMALL LETTER O WITH
HORN
LATIN SMALL LETTER U WITH
HORN
LATIN SMALL LETTER D WITH
STROKE
LATIN SMALL LETTER A WITH
GRAVE
LATIN SMALL LETTER A WITH
HOOK ABOVE
LATIN SMALL LETTER A WITH
TILDE
LATIN SMALL LETTER A WITH
ACUTE
LATIN SMALL LETTER A WITH
DOT BELOW
LATIN SMALL LETTER A WITH
BREVE AND GRAVE
LATIN SMALL LETTER A WITH
BREVE AND HOOK ABOVE
LATIN SMALL LETTER A WITH
BREVE AND TILDE
LATIN SMALL LETTER A WITH
BREVE AND ACUTE
LATIN SMALL LETTER A WITH
BREVE AND DOT BELOW

VNPF

TCVN ................ : 2010


1EA7

1EA9

1EAB

1EA5

1EAD

00E8

1EBB

1EBD

00E9

1EB9

1EC1

1EC3

1EC5

1EBF

1EC7

00EC

1EC9

0129

00ED

1ECB

00F2

1ECF

00F5

00F3

1ECD

1ED3

1ED5

1ED7

VNPF

LATIN SMALL LETTER A WITH


CIRCUMFLEX AND GRAVE
LATIN SMALL LETTER A WITH
CIRCUMFLEX AND HOOK ABOVE
LATIN SMALL LETTER A WITH
CIRCUMFLEX AND TILDE
LATIN SMALL LETTER A WITH
CIRCUMFLEX AND ACUTE
LATIN SMALL LETTER A WITH
CIRCUMFLEX AND DOT BELOW
LATIN SMALL LETTER E WITH
GRAVE
LATIN SMALL LETTER E WITH
HOOK ABOVE
LATIN SMALL LETTER E WITH
TILDE
LATIN SMALL LETTER E WITH
ACUTE
LATIN SMALL LETTER E WITH
DOT BELOW
LATIN SMALL LETTER E WITH
CIRCUMFLEX AND GRAVE
LATIN SMALL LETTER E WITH
CIRCUMFLEX AND HOOK ABOVE
LATIN SMALL LETTER E WITH
CIRCUMFLEX AND TILDE
LATIN SMALL LETTER E WITH
CIRCUMFLEX AND ACUTE
LATIN SMALL LETTER E WITH
CIRCUMFLEX AND DOT BELOW
LATIN SMALL LETTER I WITH
GRAVE
LATIN SMALL LETTER I WITH
HOOK ABOVE
LATIN SMALL LETTER I WITH
TILDE
LATIN SMALL LETTER I WITH
ACUTE
LATIN SMALL LETTER I WITH
DOT BELOW
LATIN SMALL LETTER O WITH
GRAVE
LATIN SMALL LETTER O WITH
HOOK ABOVE
LATIN SMALL LETTER O WITH
TILDE
LATIN SMALL LETTER O WITH
ACUTE
LATIN SMALL LETTER O WITH
DOT BELOW
LATIN SMALL LETTER O WITH
CIRCUMFLEX AND GRAVE
LATIN SMALL LETTER O WITH
CIRCUMFLEX AND HOOK ABOVE
LATIN SMALL LETTER O WITH
CIRCUMFLEX AND TILDE

LATIN SMALL LETTER O WITH


CIRCUMFLEX AND ACUTE
LATIN SMALL LETTER O WITH
CIRCUMFLEX AND DOT BELOW
LATIN SMALL LETTER O WITH
HORN AND GRAVE
LATIN SMALL LETTER O WITH
HORN AND HOOK ABOVE
LATIN SMALL LETTER O WITH
HORN AND TILDE
LATIN SMALL LETTER O WITH
HORN AND ACUTE
LATIN SMALL LETTER O WITH
HORN AND DOT BELOW
LATIN SMALL LETTER U WITH
GRAVE
LATIN SMALL LETTER U WITH
HOOK ABOVE
LATIN SMALL LETTER U WITH
TILDE
LATIN SMALL LETTER U WITH
ACUTE
LATIN SMALL LETTER U WITH
DOT BELOW
LATIN SMALL LETTER U WITH
HORN AND GRAVE
LATIN SMALL LETTER U WITH
HORN AND HOOK ABOVE
LATIN SMALL LETTER U WITH
HORN AND TILDE
LATIN SMALL LETTER U WITH
HORN AND ACUTE
LATIN SMALL LETTER U WITH
HORN AND DOT BELOW
LATIN SMALL LETTER Y WITH
GRAVE
LATIN SMALL LETTER Y WITH
HOOK ABOVE
LATIN SMALL LETTER Y WITH
TILDE
LATIN SMALL LETTER Y WITH
ACUTE
LATIN SMALL LETTER Y WITH
DOT BELOW

1ED1

1ED9

1EDD

1EDF

1EE1

1EDB

1EE3

00F9

1EE7

0169

00FA

1EE5

1EEB

1EED

1EEF

1EE9

1EF1

1EF3

1EF7

1EF9

00FD

1EF5

201C

201D

20AB

DONG SIGN

25CC

DOTTED CIRCLE

LEFT DOUBLE QUOTATION


MARK
RIGHT DOUBLE QUOTATION
MARK

161

TCVN ................ : 2010

28.2 Tn quc t ca ch Khmer


Tn

1799

KHMER LETTER YO

KHMER LETTER KA

179A

KHMER LETTER RO

1781

KHMER LETTER KHA

179B

KHMER LETTER LO

1782

KHMER LETTER KO

179C

KHMER LETTER VO

1783

KHMER LETTER KHO

179D

KHMER LETTER SHA

1784

KHMER LETTER NGO

179E

KHMER LETTER SSO

1785

KHMER LETTER CA

179F

KHMER LETTER SA

1786

KHMER LETTER CHA

17A0

KHMER LETTER HA

1787

KHMER LETTER CO

17A1

KHMER LETTER LA

1788

KHMER LETTER CHO

17A2

KHMER LETTER QA

1789

KHMER LETTER NYO

17A3

178A

KHMER LETTER DA

17A4

178B

KHMER LETTER TTHA

17A5

KHMER INDEPENDENT VOWEL QI

178C

KHMER LETTER DO

17A6

KHMER INDEPENDENT VOWEL QII

178D

KHMER LETTER TTHO

17A7

KHMER INDEPENDENT VOWEL QU

178E

KHMER LETTER NNO

17A8

178F

KHMER LETTER TA

17A9

1790

KHMER LETTER THA

17AA

1791

KHMER LETTER TO

17AB

KHMER INDEPENDENT VOWEL RY

1792

KHMER LETTER THO

17AC

KHMER INDEPENDENT VOWEL


RYY

1793

KHMER LETTER NO

17AD

KHMER INDEPENDENT VOWEL LY

1794

KHMER LETTER BA

17AE

KHMER INDEPENDENT VOWEL


LYY

1795

KHMER LETTER PHA

17AF

KHMER INDEPENDENT VOWEL QE

1796

KHMER LETTER PO

17B0

1797

KHMER LETTER PHO

17B1

1798

KHMER LETTER MO

17B2

Ch

1780

162

KHMER INDEPENDENT VOWEL


QAQ
KHMER INDEPENDENT
VOWELQAA

KHMER INDEPENDENT
VOWELQUK
KHMER INDEPENDENT VOWEL
QUU
KHMER INDEPENDENT VOWEL
QUUV

KHMER INDEPENDENT VOWEL


QAI
KHMER INDEPENDENT VOWEL
QOOO TYPE ONE
KHMER INDEPENDENT VOWEL
QOOO TYPE TWO

VNPF

TCVN ................ : 2010


17B3

KHMER INDEPENDENT VOWEL


QAU

17D0

KHMER SIGN SAMYOK SANNYA

17B4

KHMER VOWEL INHERENT AQ

17D1

KHMER SIGN VIRIAM

17B5

KHMER VOWEL INHERENT AA

17D2

KHMER SIGN COENG

17B6

KHMER VOWEL SIGN AA

17D3

KHMER SIGN BATHAMASAT

17B7

KHMER VOWEL SIGN I

17C4

KHMER SIGN KHAN

17B8

KHMER VOWEL SIGN II

17D5

KHMER SIGN BARIYOOSAN

17B9

KHMER VOWEL SIGN Y

17D6

KHMER SIGN CAMNUC PII KUUH

17BA

KHMER VOWEL SIGN YY

17D7

KHMER SIGN LEK TOO

17BB

KHMER VOWEL SIGN U

17D8

17BC

KHMER VOWEL SIGN UU

17D9

17BD

KHMER VOWEL SIGN UA

17DA

17BE

KHMER VOWEL SIGN OE

17DB

KHMER CURRENCY SYMBOL RIEL

17BF

KHMER VOWEL SIGN YA

17DC

KHMER SIGN AVAKRAHASANYA

17C0

KHMER VOWEL SIGN IE

17DD

17C1

KHMER VOWEL SIGN E

17E0

KHMER DIGIT ZERO

17C2

KHMER VOWEL SIGN AE

17E1

KHMER DIGIT ONE

17C3

KHMER VOWEL SIGN AI

17E2

KHMER DIGIT TWO

17C4

KHMER VOWEL SIGN OO

17E3

KHMER DIGIT THREE

17C5

KHMER VOWEL SIGN AU

17E4

KHMER DIGIT FOUR

17C6

KHMER SIGN NIKAHIT

17E5

KHMER DIGIT FIVE

17C7

KHMER SIGN REHMUK

17E6

KHMER DIGIT SIX

17C8

KHMER SIGN YUUKALEAPINTU

17E7

KHMER DIGIT SEVEN

17C9

KHMER SIGN Y MUUSIKATOAN

17E8

KHMER DIGIT EIGHT

17CA

KHMER SIGN TRIISAP

17E9

KHMER DIGIT NINE

17CB

KHMER SIGN BANTOC

17F0

KHMER SYMBOL LEK ATTAK SON

17CC

KHMER SIGN ROBAT

17F1

KHMER SYMBOL LEK ATTACK


MUOY

17CD

KHMER SIGN TOANDAKHIAT

17F2

KHMER SYMBOL LEK ATTACK PII

17CE

KHMER SIGN KAKABAT

17F3

KHMER SYMBOL LEK ATTACK BEI

17CF

KHMER SIGN AHSDA

17F4

KHMER SYMBOL LEK ATTACK


BUON

VNPF

KHMER SIGN BEYYAL


KHMER SIGN PHNAEK MUAN
KHMER SIGN KOOMUUT

KHMER SIGN ATTHACAN

163

TCVN ................ : 2010


17F5

17F6

17F7

KHMER SYMBOL LEK PRAM-BII

19F0

17F8

KHMER SYMBOL LEK PRAM-BEI

19F1

KHMER SYMBOL MUOY ROC

17F9

KHMER SYMBOL LEK PRAM-BUON

19F2

KHMER SYMBOL PII ROC

19E0

KHMER SYMBOL PATHAMASAT

19F3

KHMER SYMBOL BEI ROC

19E1

KHMER SYMBOL MUOY KOET

19F4

KHMER SYMBOL BUON ROC

19E2

KHMER SYMBOL PII KOET

19F5

KHMER SYMBOL PRAM ROC

19E3

KHMER SYMBOL BEI KOET

19F6

KHMER SYMBOL PRAM-MUOY


ROC

19E4

KHMER SYMBOL BUON KOET

19F7

KHMER SYMBOL PRAM-PII ROC

19E5

KHMER SYMBOL PRAM KOET

19F8

KHMER SYMBOL PRAM-BEI ROC

19E6

KHMER SYMBOL PRAM-MUOY


KOET

19F9

KHMER SYMBOL PRAM-BUON


ROC

19E7

KHMER SYMBOLPRAM-PII KOET

19FA

KHMER SYMBOL DAP ROC

19E8

KHMER SYMBOL PRAM-BEI KOET

19FB

KHMER SYMBOL DAP-MUOY ROC

19E9

KHMER SYMBOL PRAM-BUON


KOET

19FC

KHMER SYMBOL DAP-PII ROC

19EA

KHMER SYMBOL DAP KOET

19FD

KHMER SYMBOL DAP-BEI ROC

19EB

KHMER SYMBOL DAP-MUOY KOET

19FE

KHMER SYMBOL DAP-BUON ROC

19EC

KHMER SYMBOL DAP-PII KOET

19FF

KHMER SYMBOL DAP-PRAM ROC

19ED

KHMER SYMBOL DAP-BEI KOET

164

KHMER SYMBOL LEK ATTACK


PRAM
KHMER SYMBOL LEK PRAMMUYON

19EE

KHMER SYMBOL DAP-BUON KOET

19EF

KHMER SYMBOL DAP-PRAM KOET


KHMER SYMBOL TUTEYASAT

VNPF

TCVN ................ : 2010

28.3 Tn quc t ca ch Chm


AA1A

CHAM LETTER PA

CHAM LETTER A

AA1B

CHAM LETTER PPA

CHAM LETTER I

AA1C

CHAM LETTER PHA

AA02

CHAM LETTER U

AA1D

CHAM LETTER BA

AA03

CHAM LETTER E

AA1E

CHAM LETTER BHA

AA04

CHAM LETTER AI

AA1F

CHAM LETTER MUE

AA05

CHAM LETTER O

AA20

CHAM LETTER MA

AA06

CHAM LETTER KA

AA21

CHAM LETTER BBA

AA07

CHAM LETTER KHA

AA22

CHAM LETTER YA

AA08

CHAM LETTER GA

AA23

CHAM LETTER RA

AA09

CHAM LETTER GHA

AA24

CHAM LETTER LA

AA0A

CHAM LETTER NGUE

AA25

CHAM LETTER VA

AA0B

CHAM LETTER NGA

AA26

CHAM LETTER SSA

AA0C

CHAM LETTER CHA

AA27

CHAM LETTER SA

AA0D

CHAM LETTER CHHA

AA28

CHAM LETTER HA

AA0E

CHAM LETTER JA

AA29

CHAM VOWEL SIGN AA

AA0F

CHAM LETTER JHA

AA2A

CHAM VOWEL SIGN I

AA10

CHAM LETTER NHUE

AA2B

CHAM VOWEL SIGN II

AA11

CHAM LETTER NHA

AA2C

CHAM VOWEL SIGN EI

AA12

CHAM LETTER NHJA

AA2D

CHAM VOWEL SIGN U

AA13

CHAM LETTER TA

AA2E

CHAM VOWEL SIGN OE

AA14

CHAM LETTER THA

AA2F

CHAM VOWEL SIGN O

AA15

CHAM LETTER DA

AA30

CHAM VOWEL SIGN AI

AA16

CHAM LETTER DHA

AA31

CHAM VOWEL SIGN AU

AA17

CHAM LETTER NUE

AA32

CHAM VOWEL SIGN UE

AA18

CHAM LETTER NA

AA33

CHAM CONSONANT SIGN YA

AA19

CHAM LETTER DDA

AA34

CHAM CONSONANT SIGN RA

Ch

AA00

AA01

VNPF

Tn

165

TCVN ................ : 2010


AA35

CHAM CONSONANT SIGN LA

AA4D

CHAM CONSONANT SIGN


FINAL H

AA36

CHAM CONSONANT SIGN WA

AA50

CHAM DIGIT ZERO

AA40

CHAM LETTER FINAL K

AA51

CHAM DIGIT ONE

AA41

CHAM LETTER FINAL G

AA52

CHAM DIGIT TWO

AA42

CHAM LETTER FINAL NG

AA53

CHAM DIGIT THREE

AA43

CHAM CONSONANT SIGN


FINAL NG

AA54

CHAM DIGIT FOUR

AA44

CHAM LETTER FINAL CH

AA55

CHAM DIGIT FIVE

AA45

CHAM LETTER FINAL T

AA56

CHAM DIGIT SIX

AA46

CHAM LETTER FINAL N

AA57

CHAM DIGIT SEVEN

AA47

CHAM LETTER FINAL P

AA58

CHAM DIGIT EIGHT

AA48

CHAM LETTER FINAL Y

AA59

CHAM DIGIT NINE

AA49

CHAM LETTER FINAL R

AA5C

CHAM PUNCTUATION SPIRAL

AA4A

CHAM LETTER FINAL L

AA5D

CHAM PUNCTUATION DANDA

AA4B

CHAM LETTER FINAL SS

AA5E

AA4C

CHAM CONSONANT SIGN


FINAL M

AA5F

166

CHAM PUNCTUATION DOUBLE


DANDA
CHAM PUNCTUATION TRIPLE
DANDA

VNPF

TCVN ................ : 2010

28.4 Tn quc t ca ch Thi (TaiViet)


AA99

TAI VIET LETTER HIGH NO

TAI VIET LETTER LOW KO

AA9A

TAI VIET LETTER LOW BO

TAI VIET LETTER HIGH KO

AA9B

TAI VIET LETTER HIGH BO

AA82

TAI VIET LETTER LOW KHO

AA9C

TAI VIET LETTER LOW PO

AA83

TAI VIET LETTER HIGH KHO

AA9D

TAI VIET LETTER HIGH PO

AA84

TAI VIET LETTER LOW KHHO

AA9E

TAI VIET LETTER LOW PHO

AA85

TAI VIET LETTER HIGH KHHO

AA9F

TAI VIET LETTER HIGH PHO

AA86

TAI VIET LETTER LOW GO

AAA0

TAI VIET LETTER LOW FO

AA87

TAI VIET LETTER HIGH GO

AAA1

TAI VIET LETTER HIGH FO

AA88

TAI VIET LETTER LOW NGO

AAA2

TAI VIET LETTER LOW MO

AA89

TAI VIET LETTER HIGH NGO

AAA3

TAI VIET LETTER HIGH MO

AA8A

TAI VIET LETTER LOW CO

AAA4

TAI VIET LETTER LOW YO

AA8B

TAI VIET LETTER HIGH CO

AAA5

TAI VIET LETTER HIGH YO

AA8C

TAI VIET LETTER LOW CHO

AAA6

TAI VIET LETTER LOW RO

AA8D

TAI VIET LETTER HIGH CHO

AAA7

TAI VIET LETTER HIGH RO

AA8E

TAI VIET LETTER LOW SO

AAA8

TAI VIET LETTER LOW LO

AA8F

TAI VIET LETTER HIGH SO

AAA9

TAI VIET LETTER HIGH LO

AA90

TAI VIET LETTER LOW NYO

AAAA

TAI VIET LETTER LOW VO

AA91

TAI VIET LETTER HIGH NYO

AAAB

TAI VIET LETTER HIGH VO

AA92

TAI VIET LETTER LOW DO

AAAC

TAI VIET LETTER LOW HO

AA93

TAI VIET LETTER HIGH DO

AAAD

TAI VIET LETTER HIGH HO

AA94

TAI VIET LETTER LOW TO

AAAE

TAI VIET LETTER LOW O

AA95

TAI VIET LETTER HIGH TO

AAAF

TAI VIET LETTER HIGH O

AA96

TAI VIET LETTER LOW THO

AAB0

TAI VIET MAI KANG

AA97

TAI VIET LETTER HIGH THO

AAB1

TAI VIET VOWEL AA

AA98

TAI VIET LETTER LOW NO

AAB2

TAI VIET VOWEL I

Ch

AA80

AA81

VNPF

Tn

167

TCVN ................ : 2010


AAB3

TAI VIET VOWEL UE

AABE

AAB4

TAI VIET VOWEL U

AABF

TAI VIET TONE MAI EK

AAB5

TAI VIET VOWEL E

AAC0

TAI VIET TONE MAI NUENG

AAB6

TAI VIET VOWEL O

AAC1

TAI VIET TONE MAI THO

AAB7

TAI VIET MAY KHIT

AAC2

TAI VIET TONE MAI SONG

AAB8

TAI VIET VOWEL IA

AAC3

TAIVIET

AAB9

TAI VIET VOWEL UEA

AADB

TAI VIET SYMBOL KON

AABA

TAI VIET VOWEL UA

AADC

TAI VIET SYMBOL NUENG

AABB

TAI VIET VOWEL AUE

AADD

TAI VIET SYMBOL SAM

AABC

TAI VIET VOWEL AY

AADE

TAI VIET SYMBOL HO HOI

AABD

TAI VIET VOWEL AN

AADF

TAI VIET SYMBOL KOI KOI

168

TAI VIET VOWEL AM

VNPF

TCVN ................ : 2010

28.5 Tn quc t ca ch Hn Nm
28.5.1 B th
M

Ch

2E80

2E99

2E9A

2E9B

2E81
2E82
2E83
2E84
2E85
2E86
2E87
2E88
2E89
2E8A
2E8B
2E8C
2E8D
2E8E
2E8F
2E90
2E91
2E92
2E93
2E94
2E95
2E96
2E97
2E98

2E9C
2E9D
2E9E
2E9F

VNPF

Tn
CJK RADICAL REPEAT
CJK RADICAL CLIFF
CJK RADICAL SECOND ONE
CJK RADICAL SECOND TWO
CJK RADICAL SECOND THREE
CJK RADICAL PERSON
CJK RADICAL BOX
CJK RADICAL TABLE
CJK RADICAL KNIFE ONE
CJK RADICAL KNIFE TWO
CJK RADICAL DIVINATION
CJK RADICAL SEAL
CJK RADICAL SMALL ONE
CJK RADICAL SMALL TWO
CJK RADICAL LAME ONE
CJK RADICAL LAME TWO
CJK RADICAL LAME THREE
CJK RADICAL LAME FOUR
CJK RADICAL SNAKE
CJK RADICAL THREAD
CJK RADICAL SNOUT ONE
CJK RADICAL SNOUT TWO
CJK RADICAL HEART ONE
CJK RADICAL HEART TWO
CJK RADICAL HAND
CJK RADICAL RAP

CJK RADICAL CHOKE


CJK RADICAL SUN
CJK RADICAL MOON
CJK RADICAL DEATH
CJK RADICAL MOTHER

2EA0
2EA1
2EA2
2EA3
2EA4
2EA5
2EA6
2EA7
2EA8
2EA9
2EAA
2EAB
2EAC
2EAD
2EAE
2EAF
2EB0
2EB1
2EB2
2EB3
2EB4
2EB5
2EB6
2EB7
2EB8
2EB9
2EBA
2EBB
2EBC
2EBD
2EBE
2EBF

CJK RADICAL CIVILIAN


CJK RADICAL WATER ONE
CJK RADICAL WATER TWO
CJK RADICAL FIRE
CJK RADICAL PAW ONE
CJK RADICAL PAW TWO
CJK RADICAL SIMPLIFIED HALF
TREE TRUNK
CJK RADICAL COW
CJK RADICAL DOG
CJK RADICAL JADE
CJK RADICAL BOLT OF CLOTH
CJK RADICAL EYE
CJK RADICAL SPIRIT ONE
CJK RADICAL SPIRIT TWO
CJK RADICAL BAMBOO
CJK RADICAL SILK
CJK RADICAL C-SIMPLIFIED SILK
CJK RADICAL NET ONE
CJK RADICAL NET TWO
CJK RADICAL NET THREE
CJK RADICAL NET FOUR
CJK RADICAL MESH
CJK RADICAL SHEEP
CJK RADICAL RAM
CJK RADICAL EWE
CJK RADICAL OLD
CJK RADICAL BRUSH ONE
CJK RADICAL BRUSH TWO
CJK RADICAL MEAT
CJK RADICAL MORTAR
CJK RADICAL GRASS ONE
CJK RADICAL GRASS TWO

169

TCVN ................ : 2010


2EC0
2EC1
2EC2
2EC3
2EC4
2EC5
2EC6
2EC7
2EC8
2EC9
2ECA
2ECB
2ECC
2ECD
2ECE
2ECF
2ED0
2ED1
2ED2
2ED3
2ED4
2ED5
2ED6
2ED7
2ED8
2ED9
2EDA
2EDB
2EDC
2EDD
2EDE
2EDF
2EE0
2EE1
2EE2
2EE3
2EE4

170

CJK RADICAL GRASS THREE

2EE5

CJK RADICAL TIGER

2EE6

CJK RADICAL CLOTHES

2EE7

CJK RADICAL WEST ONE

2EE8

CJK RADICAL WEST TWO

2EE9

CJK RADICAL C-SIMPLIFIED SEE

2EEA

CJK RADICAL SIMPLIFIED HORN

2EEB

CJK RADICAL HORN


CJK RADICAL C-SIMPLIFIED
SPEECH
CJK RADICAL C-SIMPLIFIED SHELL

2EEC

CJK RADICAL FLOOT

2EEF

CJK RADICAL C-SIMPLIFIED CART

2EF0

CJK RADICAL SIMPLIFIED WALK

2EF1

CJK RADICAL WALK ONE

2EF2

CJK RADICAL WALK TWO

2EF3

CJK RADICAL CITY

2F00

CJK RADICAL C-SIMPLIFIED GOLD

2F01

CJK RADICAL LONG ONE

2F02

CJK RADICAL LONG TWO

2F03

CJK RADICAL C-SIMPLIFIED LONG

2F04

CJK RADICAL C-SIMPLIFIED GATE

2F05

CJK RADICAL MOUND ONE

2F06

CJK RADICAL MOUND TWO

2F07

CJK RADICAL RAIN

2F08

CJK RADICAL BLUE


CJK RADICAL C-SIMPLIFIED
TANNED LEATHER
CJK RADICAL C-SIMPLIFIED LEAF

2F09
2F0A

CJK RADICAL C-SIMPLIFIED WIND

2F0C

CJK RADICAL C-SIMPLIFIED FLY

2F0D

CJK RADICAL EAT ONE

2F0E

CJK RADICAL EAT TWO

2F0F

CJK RADICAL EAT THREE

2F10

CJK RADICAL C-SIMPLIFIED EAT

2F11

CJK RADICAL HEAD

2F12

CJK RADICAL C-SIMPLIFIED HORSE

2F13

CJK RADICAL BONE

2F14

CJK RADICAL GHOST

2F15

2EED
2EEE

2F0B

CJK RADICAL C-SIMPLIFIED FISH


CJK RADICAL C-SIMPLIFIED BIRD
CJK RADICAL C-SIMPLIFIED SALT
CJK RADICAL SIMPLIFIED WHEAT
CJK RADICAL SIMPLIFIED YELLOW
CJK RADICAL C-SIMPLIFIED FROG
CJK RADICAL J-SIMPLIFIED EVEN
CJK RADICAL C-SIMPLIFIED EVEN
CJK RADICAL J-SIMPLIFIED TOOTH
CJK RADICAL C-SIMPLIFIED TOOTH
CJK RADICAL I-SIMPLIFIED DRAGON
CJK RADICAL C-SIMPLIFIED
DRAGON
CJK RADICAL TURTLE
CJK RADICAL J-SIMPLIFIED TURTLE
CJK RADICAL C-SIMPLIFIED TURTLE
KANGXI RADICAL ONE
KANGXI RADICAL LINE
KANGXI RADICAL DOT
KANGXI RADICAL SLASH
KANGXI RADICAL SECOND
KANGXI RADICAL HOOK
KANGXI RADICAL TWO
KANGXI RADICAL LID
KANGXI RADICAL MAN
KANGXI RADICAL LEGS
KANGXI RADICAL ENTER
KANGXI RADICAL EIGHT
KANGXI RADICAL DOWN BOX
KANGXI RADICAL COVER
KANGXI RADICAL ICE
KANGXI RADICAL TABLE
KANGXI RADICAL OPEN BOX
KANGXI RADICAL KNIFE
KANGXI RADICAL POWER
KANGXI RADICAL WRAP
KANGXI RADICAL SPOON
KANGXI RADICAL RIGHT OPEN BOX

VNPF

TCVN ................ : 2010


2F16
2F17
2F18
2F19
2F1A
2F1B
2F1C
2F1D
2F1E
2F1F
2F20
2F21
2F22
2F23
2F24
2F25
2F26
2F27
2F28
2F29
2F2A
2F2B
2F2C
2F2D
2F2E
2F2F
2F30
2F31
2F32
2F33
2F34
2F35
2F36
2F37
2F38
2F39
2F3A

VNPF

KANGXI RADICAL HIDING


ENCLOSURE
KANGXI RADICAL TEN

2F3C

KANGXI RADICAL DIVINATION

2F3D

KANGXI RADICAL SEAL

2F3E

KANGXI RADICAL CLIFF

2F3F

KANGXI RADICAL PRIVATE

2F40

KANGXI RADICAL AGAIN

2F41

KANGXI RADICAL MOUTH

2F42

KANGXI RADICAL ENCLOSURE

2F43

KANGXI RADICAL EARTH

2F44

KANGXI RADICAL SCHOLAR

2F45

KANGXI RADICAL GO

2F46

KANGXI RADICAL GO SLOWLY

2F47

KANGXI RADICAL EVENING

2F48

KANGXI RADICAL BIG

2F49

KANGXI RADICAL WOMAN

2F4A

KANGXI RADICAL CHILD

2F4B

KANGXI RADICAL ROOF

2F4C

KANGXI RADICAL INCH

2F4D

KANGXI RADICAL SMALL

2F4E

KANGXI RADICAL LAME

2F4F

KANGXI RADICAL CORPSE

2F50

KANGXI RADICAL SPROUT

2F51

KANGXI RADICAL MOUNTAIN

2F52

KANGXI RADICAL RIVER

2F53

KANGXI RADICAL WORK

2F54

KANGXI RADICAL ONESELF

2F55

KANGXI RADICAL TURBAN

2F56

KANGXI RADICAL DRY

2F57

KANGXI RADICAL SHORT THREAD

2F58

KANGXI RADICAL DOTTED CLIFF

2F59

KANGXI RADICAL LONG STRIDE

2F5A

KANGXI RADICAL TWO HANDS

2F5B

KANGXI RADICAL SHOOT

2F5C

KANGXI RADICAL BOW

2F5D

KANGXI RADICAL SNOUT

2F5E

KANGXI RADICAL BRISTLE

2F5F

2F3B

KANGXI RADICAL STEP


KANGXI RADICAL HEART
KANGXI RADICAL HALBERD
KANGXI RADICAL DOOR
KANGXI RADICAL HAND
KANGXI RADICAL BRANCH
KANGXI RADICAL RAP
KANGXI RADICAL SCRIPT
KANGXI RADICAL DIPPER
KANGXI RADICAL AXE
KANGXI RADICAL SQUARE
KANGXI RADICAL NOT
KANGXI RADICAL SUN
KANGXI RADICAL SAY
KANGXI RADICAL MOON
KANGXI RADICAL TREE
KANGXI RADICAL LACK
KANGXI RADICAL STOP
KANGXI RADICAL DEATH
KANGXI RADICAL WEAPON
KANGXI RADICAL DO NOT
KANGXI RADICAL COMPARE
KANGXI RADICAL FUR
KANGXI RADICAL CLAN
KANGXI RADICAL STEAM
KANGXI RADICAL WATER
KANGXI RADICAL FIRE
KANGXI RADICAL CLAW
KANGXI RADICAL FATHER
KANGXI RADICAL DOUBLE X
KANGXI RADICAL HALF TREE
TRUNK
KANGXI RADICAL SLICE
KANGXI RADICAL FANG
KANGXI RADICAL COW
KANGXI RADICAL DOG
KANGXI RADICAL PROFOUND
KANGXI RADICAL JADE

171

TCVN ................ : 2010


2F60
2F61
2F62
2F63
2F64
2F65
2F66
2F67
2F68
2F69
2F6A
2F6B
2F6C
2F6D
2F6E
2F6F
2F70
2F71
2F72
2F73
2F74
2F75
2F76
2F77
2F78
2F79
2F7A
2F7B
2F7C
2F7D
2F7E
2F7F
2F80
2F81
2F82
2F83
2F84
2F85

172

KANGXI RADICAL MELON

2F86

KANGXI RADICAL TILE

2F87

KANGXI RADICAL SWEET

2F88

KANGXI RADICAL FILE

2F89

KANGXI RADICAL USE

2F8A

KANGXI RADICAL FIELD

2F8B

KANGXI RADICAL BOLT OF CLOTH

2F8C

KANGXI RADICAL SICKNESS

2F8D

KANGXI RADICAL DOTTED TENT

2F8E

KANGXI RADICAL WHITE

2F8F

KANGXI RADICAL SKIN

2F90

KANGXI RADICAL DISH

2F91

KANGXI RADICAL EYE

2F92

KANGXI RADICAL SPEAR

2F93

KANGXI RADICAL ARROW

2F94

KANGXI RADICAL STONE

2F95

KANGXI RADICAL SPIRIT

2F96

KANGXI RADICAL TRACK

2F97

KANGXI RADICAL GRAIN

2F98

KANGXI RADICAL CAVE

2F99

KANGXI RADICAL STAND

2F9A

KANGXI RADICAL BAMBOO

2F9B

KANGXI RADICAL RICE

2F9C

KANGXI RADICAL SILK

2F9D

KANGXI RADICAL JAR

2F9E

KANGXI RADICAL NET

2F9F

KANGXI RADICAL SHEEP

2FA0

KANGXI RADICAL FEATHER

2FA1

KANGXI RADICAL OLD

2FA2

KANGXI RADICAL AND

2FA3

KANGXI RADICAL PLOW

2FA4

KANGXI RADICAL EAR

2FA5

KANGXI RADICAL BRUSH

2FA6

KANGXI RADICAL MEAT

2FA7

KANGXI RADICAL MINISTER

2FA8

KANGXI RADICAL SELF

2FA9

KANGXI RADICAL ARRIVE

2FAA

KANGXI RADICAL TONGUE


KANGXI RADICAL OPPOSE
KANGXI RADICAL BOAT
KANGXI RADICAL STOPPING
KANGXI RADICAL COLOR
KANGXI RADICAL GRASS
KANGXI RADICAL TIGER
KANGXI RADICAL INSECT
KANGXI RADICAL BLOOD
KANGXI RADICAL WALK
ENCLOSURE
KANGXI RADICAL CLOTHES
KANGXI RADICAL WEST
KANGXI RADICAL SEE
KANGXI RADICAL HORN
KANGXI RADICAL SPEECH
KANGXI RADICAL VALLEY
KANGXI RADICAL BEAN
KANGXI RADICAL PIG
KANGXI RADICAL BADGER
KANGXI RADICAL SHELL
KANGXI RADICAL RED
KANGXI RADICAL RUN
KANGXI RADICAL FOOT
KANGXI RADICAL BODY
KANGXI RADICAL CART
KANGXI RADICAL BITTER
KANGXI RADICAL MORNING
KANGXI RADICAL WALK
KANGXI RADICAL CITY
KANGXI RADICAL WINE
KANGXI RADICAL DISTINGUISH
KANGXI RADICAL VILLAGE
KANGXI RADICAL GOLD
KANGXI RADICAL LONG
KANGXI RADICAL GATE
KANGXI RADICAL MOUND
KANGXI RADICAL SLAVE

KANGXI RADICAL MORTAR

VNPF

TCVN ................ : 2010


2FAB
2FAC
2FAD
2FAE
2FAF
2FB0
2FB1
2FB2
2FB3
2FB4
2FB5
2FB6
2FB7
2FB8
2FB9
2FBA
2FBB
2FBC
2FBD
2FBE
2FBF
2FC0
2FC1
2FC2
2FC3
2FC4
2FC5
2FC6
2FC7
2FC8
2FC9
2FCA
2FCB
2FCC

VNPF

KANGXI RADICAL SHORT TAILED


BIRD
KANGXI RADICAL RAIN

2FCD

KANGXI RADICAL BLUE

2FCF

KANGXI RADICAL WRONG

2FD0

KANGXI RADICAL FACE

2FD1

KANGXI RADICAL LEATHER


KANGXI RADICAL TANNED
LEATHER
KANGXI RADICAL LEEK

2FD2

KANGXI RADICAL SOUND


KANGXI RADICAL LEAF
KANGXI RADICAL WIND
KANGXI RADICAL FLY
KANGXI RADICAL EAT
KANGXI RADICAL HEAD

2FF0

2FF1

2FF2

2FF3

2FF4

2FF5

2FCE

2FD3
2FD4
2FD5

KANGXI RADICAL FRAGRANT


KANGXI RADICAL HORSE
KANGXI RADICAL BONE
KANGXI RADICAL TALL
KANGXI RADICAL HAIR

2FF6

KANGXI RADICAL FIGHT


KANGXI RADICAL SACRIFICIAL
WINE
KANGXI RADICAL CAULDRON

2FF7

KANGXI RADICAL GHOST

2FF8

2FF9

2FFA

2FFB

KANGXI RADICAL FISH


KANGXI RADICAL BIRD
KANGXI RADICAL SALT
KANGXI RADICAL DEER
KANGXI RADICAL WHEAT
KANGXI RADICAL HEMP

KANGXI RADICAL TRIPOD


KANGXI RADICAL DRUM
KANGXI RADICAL RAT
KANGXI RADICAL NOSE
KANGXI RADICAL EVEN
KANGXI RADICAL TOOTH
KANGXI RADICAL DRAGON
KANGXI RADICAL TURTLE
KANGXI RADICAL FLUTE
IDEOGRAPHIC DESCRIPTION
CHARACTER LEFT TO RIGH
IDEOGRAPHIC DESCRIPTION
CHARACTER ABOVE TO BELOW
IDEOGRAPHIC DESCRIPTION
CHARACTER LEFT TO MIDDLE AND
RIGHT
IDEOGRAPHIC DESCRIPTION
CHARACTER ABOVE TO MIDDLE
AND BELOW
IDEOGRAPHIC DESCRIPTION
CHARACTER FULL SURROUND
IDEOGRAPHIC DESCRIPTION
CHARACTER SURROUND FROM
ABOVE
IDEOGRAPHIC DESCRIPTION
CHARACTER SURROUND FROM
BELOW
IDEOGRAPHIC DESCRIPTION
CHARACTER SURROUND FROM
LEFT
IDEOGRAPHIC DESCRIPTION
CHARACTER SURROUND FROM
UPPER LEFT
IDEOGRAPHIC DESCRIPTION
CHARACTER SURROUND FROM
UPPER RIGHT
IDEOGRAPHIC DESCRIPTION
CHARACTER SURROUND FROM
LOWER LEFT
IDEOGRAPHIC DESCRIPTION
CHARACTER OVERLAID

KANGXI RADICAL YELLOW


KANGXI RADICAL MILLET
KANGXI RADICAL BLACK
KANGXI RADICAL EMBROIDERY
KANGXI RADICAL FROG

173

TCVN ................ : 2010


28.5.2 Tn cc ch Nm

Vic t tn cc ch Nm tun theo nguyn tc t tn ca cc ch biu , da trn


m ca chng. V d:
Trong BMP
EXTENSION A & CJK
340C
3431
343B
344D
344F
346B
3499
34B7
34DF
34E0

CJK UNIFIED IDEOGRAPH-340C

CJK UNIFIED IDEOGRAPH-4E00

CJK UNIFIED IDEOGRAPH-4E0B

CJK UNIFIED IDEOGRAPH-9F90

CJK UNIFIED IDEOGRAPH-3431


CJK UNIFIED IDEOGRAPH-343B
CJK UNIFIED IDEOGRAPH-344D
CJK UNIFIED IDEOGRAPH-344F
CJK UNIFIED IDEOGRAPH-346B
CJK UNIFIED IDEOGRAPH-3499
CJK UNIFIED IDEOGRAPH-34B7
CJK UNIFIED IDEOGRAPH-34DF
CJK UNIFIED IDEOGRAPH-34E0

...
4E00
4E01
4E03
4E07
4E08
4E09
4E0A
4E0B
4E0D
4E0E
4E10
4E11

CJK UNIFIED IDEOGRAPH-4E01


CJK UNIFIED IDEOGRAPH-4E03
CJK UNIFIED IDEOGRAPH-4E07
CJK UNIFIED IDEOGRAPH-4E08
CJK UNIFIED IDEOGRAPH-4E09
CJK UNIFIED IDEOGRAPH-4E0A

CJK UNIFIED IDEOGRAPH-4E0D


CJK UNIFIED IDEOGRAPH-4E0E
CJK UNIFIED IDEOGRAPH-4E10
CJK UNIFIED IDEOGRAPH-4E11

...
9F90
9F95
9F9C
FA24

174

CJK UNIFIED IDEOGRAPH-9F95


CJK UNIFIED IDEOGRAPH-9F9C
CJK UNIFIED COMPATIBILITY
IDEOGRAPH-FA24

VNPF

TCVN ................ : 2010


Trong SIP
EXTENSION B
20016
20017
20027
20028
2002A
2002B
20032
20033
20034
2003F
20040
20042

CJK UNIFIED IDEOGRAPH-20016

CJK UNIFIED IDEOGRAPH-2A69A

CJK UNIFIED IDEOGRAPH-20017


CJK UNIFIED IDEOGRAPH-20027
CJK UNIFIED IDEOGRAPH-20028
CJK UNIFIED IDEOGRAPH-2002A
CJK UNIFIED IDEOGRAPH-2002B
CJK UNIFIED IDEOGRAPH-20032
CJK UNIFIED IDEOGRAPH-20033
CJK UNIFIED IDEOGRAPH-20034
CJK UNIFIED IDEOGRAPH-2003F
CJK UNIFIED IDEOGRAPH-20040
CJK UNIFIED IDEOGRAPH-20042

...
2A69A
2A6A4
2A6C5
2A6C7

CJK UNIFIED IDEOGRAPH-2A6A4


CJK UNIFIED IDEOGRAPH-2A6C5
CJK UNIFIED IDEOGRAPH-2A6C7

EXTENSION C
2A700
2A964
2B52C
2A709
2A712
2A715
2A718
2A71A

CJK UNIFIED IDEOGRAPH-2A700

CJK UNIFIED IDEOGRAPH-2B6CE

CJK UNIFIED IDEOGRAPH-2A964


CJK UNIFIED IDEOGRAPH-2B52C
CJK UNIFIED IDEOGRAPH-2A709
CJK UNIFIED IDEOGRAPH-2A712
CJK UNIFIED IDEOGRAPH-2A715
CJK UNIFIED IDEOGRAPH-2A718
CJK UNIFIED IDEOGRAPH-2A71A

...
2B6CE
2B6D0
2B6D5
2B708
2B70D
2B717
2B727

VNPF

CJK UNIFIED IDEOGRAPH-2B6D0


CJK UNIFIED IDEOGRAPH-2B6D5
CJK UNIFIED IDEOGRAPH-2B708
CJK UNIFIED IDEOGRAPH-2B70D
CJK UNIFIED IDEOGRAPH-2B717
CJK UNIFIED IDEOGRAPH-2B727

175

TCVN ................ : 2010

Ph lc A
K t m t ch biu
(thng tin)
K t m t ch biu - Ideographic Description Character (IDC) l mt k t ho,
c dng cng vi mt dy cc k t ho khc to ra Dy m t ch biu
Ideographic Description Sequence (IDS). Dy nh vy c th c dng m t k
t ch biu m cha c xc nh trong chun ny v cc chun quc t.
IDS m t cho ch biu di dng tru tng. N khng c din gii nh mt k
t hp thnh v khng ng bt k dng ti to no.
LU IDS khng phi l k t v do khng phi l thnh vin ca kho ch ISO/IEC 10646.

I.1.1 C php ca dy m t ch biu


IDS bao gm mt IDC theo sau bi mt s c nh cc cu phn m t - Description
Components (DC). DC c th l bt k phn t no sau y:

ch biu m ho

b th m ho

IDS khc
LU 1 M t trn ng rng bt k IDS no cng c th c lng bn trong IDS khc.

tng IDC c bn tnh cht nh c tm tt trong bng I.1 di y;

s cc DC c dng trong IDS bt u vi IDC ,

nh ngha v vit tt ch u ca n,

c php ca IDS tng ng,

v tr tng i ca cc DC trong biu din trc quan ca ch biu c m t


trong dng tru tng ca n.

C php ca IDS c a ra bi tng IDC c ch ra trong ct vit tt v c php


ca IDS ca bng ny bng tn vit tt ca IDC (tc l. IDC-LTR) c tip theo sau
bi s tng ng ca DCs, chng hn (D1 D2) hay (D1 D2 D3).
LU 2 IDS b hn ch khng qu 16 k t chiu di. Cng khng c qu su ch biu v/hoc
b th c th xut hin gia bt k hai th nghim no ca mt k t IDC bn trong mt IDS.

I.1.2 nh ngha ring v k t m t ch biu


IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RIGHT (2FF0): IDS ny
c a vo bi k t ny m t cho dng tru tng ca ch biu vi D1 bn
tri v D2 bn phi.
IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO BELOW (2FF1): IDS ny
c a vo bi k t ny m t cho dng tru tng ca ch biu vi D1 trn D2.

176

VNPF

TCVN ................ : 2010


IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO MIDDLE AND RIGHT
(2FF2): IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu
vi D1 bn tri ca D2, v D2 bn tri ca D3.
IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO MIDDLE AND BELOW
(2FF3): IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu
vi D1 trn D2, v D2 trn D3.
IDEOGRAPHIC DESCRIPTION CHARACTER FULL SURROUND (2FF4): IDS ny
c a vo bi k t ny m t cho dng tru tng ca ch biu vi D1 bao
quanh D2.
IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM ABOVE (2FF5):
IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu vi
D1 trn D2, v bao quanh D2 c hai bn.
IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM BELOW (2FF6):
IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu vi
D1 di D2, v bao quanh D2 c hai bn.
IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LEFT (2FF7): IDS
ny c a vo bi k t ny m t cho dng tru tng ca ch biu vi D1
bn tri D2, v bao quanh D2 trn v di.
IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM UPPER LEFT
(2FF8): IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu
vi D1 gc trn bn tri ca D2, v bao quanh mt phn D2 trn v bn tri.
IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM UPPER RIGHT
(2FF9): IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu
vi D1 gc trn bn phi ca D2, v bao mt phn D2 trn v bn phi.
IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LOWER LEFT
(2FFA): IDS ny c a vo bi k t ny m t cho dng tru tng ca ch biu
vi D1 gc di bn tri ca D2, v bao mt phn D2 di v bn tri.
IDEOGRAPHIC DESCRIPTION CHARACTER OVERLAID (2FFB): IDS ny c a
vo bi k t ny m t cho dng tru tng ca ch biu vi D1 v D2 chm lp
ln nhau.
Bng I.1: Tnh cht ca cc k t m t ch biu
Tn k t:
K T M T CH
BIU

s
cc
DC

Vit tt v c
php ca IDS

TRI SANG PHI

IDC-LTR D1 D2

TRN XUNG DI

TRI SANG GIA SANG


PHI

VNPF

IDC-ATB D1 D2

IDC-LMR D1 D2 D3

V tr tng
i ca cc
DC

D1 D2

D1 D2

D1D2D3

V d v IDS

V d
IDS
biu
din:

177

TCVN ................ : 2010

TRN GIA V DI

IDC-AMB D1 D2 D3

D1 D2

D3

BAO Y

BAO T TRN

BAO T DI

BAO QUANH T TRI

BAO PHN TRN BN TRI

BAO PHN TRN BN PHI

BAO PHN DI BN TRI

CHM LP

IDC-FSD D1 D2

IDC-SAV D1 D2

IDC-SBL D1 D2

IDC-SLT D1 D2

IDC-SUL D1 D2

IDC-SUR D1 D2

IDC-SLL D1 D2

IDC-OVL D1 D2

D2

D2 D1

D1 D2

D2 D1

D2D1

D2 D1

D2 D1

D2 D1

* D1

LU D1 v D2 chm lp ln nhau. Biu ny khng ng rng D1 gc trn bn tri v D2


gc di bn phi.

178

VNPF

TCVN ................ : 2010

Ph lc B
Hng dn t tn k t
(thng tin)
Mc 22 ca chun ny xc nh cc qui tc hnh thnh tn v tnh duy nht ca
tn. Cc qui tc ny c dng trong cc chun tp k t m ho cng ngh thng tin
khc nh ISO/IEC 646, ISO/IEC 6937, ISO/IEC 8859, v ISO/IEC 10367. Ph lc ny
cung cp hng dn ph cho vic to ra cc tn thc th duy nht ny.
Nhng hng dn ny khng p dng cho tn ch biu CJK v m tit Hangul
c hnh thnh bng vic dng cc qui tc c xc nh trong mc 22.5 v 22.6
tng ng.
Hng dn 1

Tn ca thc th bt k ch no c th c u k hiu cho ngha thng tc ca n


(chng hn, tn k t: PLUS SIGN hay tn khi: BENGALI).
Mt s thc th, nh k t, c th c tn m t hnh dng, khng phi l cch dng,
(chng hn, tn k t: UPWARDS ARROW).
Tn ca thc th khng c d nh nhn din c tnh hay thuc tnh ca n,
hay cung cp thng tin v c trng ngn ng, ngoi tr nh c xc nh trong
hng dn 4 di y.
Hng dn 2

Tn theo ch u ca cc ch ci Latin A ti Z v ch s c lin kt vi tn.


Tn theo ch u c th c dng trong tn thc th ni vic s dng c ri v
yu cu r rng v n. Chng hn, tn ca chc nng iu khin c i km vi tn
vit tt theo ch u.
V D

Tn

Vit tt ch u
LOCKING-SHIFT TWO RIGHT

LS2R

SOFT HYPHEN

SHY

INTERNATIONAL PHONETIC ALPHABET

IPA

LU Trong ISO/IEC 6429, cc tn ca phng thc cng c trnh by theo cng cch nh
chc nng iu khin.

Hng dn 3

Tn k t v danh nh dy UCS ch cha ch s 0 ti 9 nu chnh t tn ca ch s


tng ng l khng thch hp.
LU Xem nh v d v tn ca k t ti im m gi tr 201A l SINGLE LOW-9 QUOTATION
MARK; k hiu cho ch s 9 c a vo trong tn ny minh ho cho hnh dng ca k t, v
khng c ngha s.

Hng dn 4

VNPF

179

TCVN ................ : 2010


Cc tn k t v danh nh dy UCS c tn c xy dng t mt tp thch hp ca
thut ng p dng c ca li sau y v c sp th t theo trnh t ca li
ny. Cc ngoi l c xc nh theo hng dn 9 ti 11. T WITH v AND c th
c a vo cho sng t thm khi cn.
1

Script

Attribute

Case

Designation

Type

Mark(s)

Language

Qualifier

V D V CC THUT NG NH VY
Script Latin, Cyrillic, Arabic
Case capital, small
Type letter, ligature, digit
Language Ukrainian
Attribute final, sharp, subscript, vulgar
Designation customary name, name of letter
Mark acute, ogonek, ring above, diaeresis
Qualifier sign, symbol
V D V CC TN
LATIN CAPITAL LETTER A WITH ACUTE
1
2
3
6
7
DIGIT FIVE
3
6
LEFT CURLY BRACKET
5
5
6
LU 1 Nt ch l mt k hiu ho trong hai hay nhiu k hiu ho khc c to nh
nh mt k hiu ho n.

Vi cc tn k t, ni mt k t cha mt ch c s vi nhiu du, dy cc du


trong tn ny l th t m theo cc du c nh v tng i vi ch c s. Dy
ny c th bt u bng cc du trn ch c ly theo trnh t ngc ln, v tip
theo bng cc du di ch c ly theo trnh t xung di, hay theo o
ngc (di/trn).
Vi danh nh dy UCS (UCS Sequence Identifiers), ni dy ny cha ch c s vi
nhiu du, tn m t cho cc k t ring theo trnh t m chng c m ho trong
dy.
V D

LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND DOT BELOW


LATIN CAPITAL LETTER C WITH CEDILLA AND ACUTE
LATIN CAPITAL LETTER U WITH OGONEK AND ACUTE

Hng dn 5

Cc ch ci ca b ch Latin script c biu din bn trong tn ca chng bng cc


k hiu ho c s (A, B, C, v.v.). Ch ci ca tt c cc b ch khc c biu
din bng phin m theo ngn ng ca chun c xut bn u tin.

180

VNPF

TCVN ................ : 2010


V D
K

LATIN CAPITAL LETTER K


CYRILLIC CAPITAL LETTER YU

Hng dn 6

V nguyn tc khi mt k t ca b ch cho c dng trong nhiu ngn ng,


khng tn ngn ng no c xc nh. Ngoi l c dung th l ch s nhp
nhng c th l kt qu.
V D

CYRILLIC CAPITAL LETTER I


CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I

Hng dn 7

Cc ch ci l phn t ca nhiu b ch c coi l khc nhau cho d hnh dng ca


chng nh nhau; chng c tn khc nhau.
V D
A

LATIN CAPITAL LETTER A


GREEK CAPITAL LETTER ALPHA
CYRILLIC CAPITAL LETTER A

Hng dn 8

Mi ch c th, Danh nh dy UCS c tn c xy dng bng vic chp thm tn


ca yu t cu thnh cng vi cc phn t p khi c nut vn. Nu qui trnh ny to
ra kt qu trong tn c ri, tn ny b thay i cho ph hp m bo tnh duy
nht trong cc tn k t v cc danh nh dy UCS c tn. Cc t WITH v AND c
th c a vo lm sng t thm khi cn.
Hng dn 9

Mt k t ca mt b ch c dng c lp vi b ch khc, chng hn nh mt k


hiu ho c quan h vi cc n v vt l v chiu, c coi l mt k t khc vi k
t ca b ch gc ca n.
V D

MICRO SIGN

Hng dn 10

Mt s cc k t c tn truyn thng bao gm mt hay hai t. Khng c nh thay i


cch s dng ny.
V D
'
:
@
_
~

APOSTROPHE
COLON
COMMERCIAL AT
LOW LINE
TILDE

Hng dn 11

Trong mt s trng hp, cc k t ca b ch cho, thng l du ngt, c

VNPF

181

TCVN ................ : 2010


dng trong b ch khc cho vic s dng khc. Trong nhng trng hp ny tn tc
l phn nh vic dng chung nht c trao cho k t . Tn tc l c th c vit
theo sau trong danh sch cc k t ca chun c bit bng tn trong du ngoc trn,
m k t ny c trong b ch c xc nh bi chun c bit ny.
V D

182

UNDERTIE (Enotikon)

VNPF

TCVN ................ : 2010

Ph lc C
Th tc thng nht ho v thu xp ch biu
CJK
(thng tin)
Tuyn tp k t ho ca ch biu thng nht trong ISO/IEC 10646 c xc nh
trong 30. Chng c suy dn ra t nhiu ch biu c tm thy trong a dng
chun quc gia v vng khc nhau vi cc tp k t m ho ("ngun").
Ph lc ny m t cch ch biu trong chun ny c suy dn ra t cc ngun
bng vic p dng tp cc th tc thng nht. N cng m t cch cc ch biu
trong chun ny thu xp vo dy cc im m k tip m chng c gn cho.
Tham chiu ngun cho cc ch biu thng nht CJK c xc nh trong 22.1.
Bn trong ng cnh ca ISO/IEC 10646 qui trnh thng nht c p dng cho cc k
t ch biu c ly ra t cc m trong cc nhm ngun. Trong qui trnh ny, mt
ch biu t hai hay nhiu nhm c lin kt vi nhau, mt im m duy nht c
gn cho chng trong chun ny. Cc lin kt c thc hin tng ng vi tp cc
th tc c m t di y. Cc ch biu c lin kt s c m t y l
"c thng nht.
LU Qui trnh thng nht khng p dng cho cc tuyn tp sau ca cc ch biu :
CJK RADICALS SUPPLEMENT ( 2F00 - 2EFF)
KANGXI RADICALS (2F00 - 2FDF)
CJK COMPATIBILITY IDEOGRAPHS (F900 - FAFF vi ngoi l FA0E, FA0F, FA11, FA13, FA14,
FA1F, FA21, FA23, FA24, FA27, FA28 v FA29)
CJK COMPATIBILITY IDEOGRAPHS SUPPLEMENT (2F800-2FA1F).

C.1 Th tc thng nht ho


C.1.1 Phm vi thng nht
Cc ch biu khng c quan h trong suy dn lch s (cc k t khng cng gc)
khng c thng nht.
V d

,
LU S khc bit v hnh dng gia hai ch biu trong v d trn l chiu di ca ng
ngang di thp. iu ny c xem xt l s khc bit thc t ca hnh ch. Hn na cc ch
biu ny c ngha khc nhau. Ngha ca ch th nht l "S" cn ngha ca ch th hai la "Th ".

Lin kt gia cc ch biu t cc ngun khc nhau c thc hin y nu hnh


dng ca chng l tng t, theo h thng phn loi sau.

VNPF

183

TCVN ................ : 2010

C.1.2 Phn loi hai mc


H thng phn loi hai mc c dng lm khc bit (a) gia cc hnh tru tng
v (b) gia cc hnh thc ti c xc nh bi cc mt ch c th. Cc dng bin
th ca ch biu , m khng th c thng nht, c nhn din da trn s khc
bit gia cc hnh tru tng ca chng.

C.1.3 Th tc
Th tc thng nht c dng xc nh liu hai ch biu c cng mt hnh tru
tng hay l cc ch khc bit. Th tc thng nht c hai giai on, c p dng
theo th t sau:
a) Phn tch cu trc cu phn;
b) Phn tch tnh nng cu phn;
C.1.3.1 Phn tch cu trc cu phn

Trong giai on u ca th tc ny, cu trc cu phn ca tng ch biu c


xem xt. Cu phn ca ch biu l t hp hnh hc ca cc yu t nguyn thu. cc
ch biu khc c th c lp cu hnh t cng tp cc cu phn. Cu phn c th
c t hp to ra cu phn mi vi cu trc phc tp hn. Ch biu , do , c
th c xc nh nh cy cu phn, vi nt nh l bn thn ch biu , v nt y
l cc yu t nguyn thu. iu ny c v trong Hnh S.1.

Hnh C.1 - Cu trc cu phn

C.1.3.2 Phn tch tnh nng cu phn

Trong giai on th hai ca th tc ny, cc cu phn c nh v ti cc nt tng


ng ca hau ch biu c so snh, bt u t nt cao nht, nh c v trong
hnh S.2.

Hnh C.2 - Nt cao nht ca cu phn

Tnh nng sau ca tng ch biu c a ra so snh c xem xt:


a) s cc cu phn,
b) v tr tng i ca cu phn trong tng ch biu y ,
c) cu trc ca cu phn tng ng.
184

VNPF

TCVN ................ : 2010

Nu mt hay nhiu tnh nng a) ti c) trn l khc nhau gia cc ch biu trong
so snh, cc ch biu ny c coi l c hnh tru tng khc nhau v do
khng c thng nht.
Nu tt c cc tnh nng a) ti c) trn u l nh nhau gia cc ch biu , cc ch
biu ny c coi l c cng hnh trwcu tng v do c thng nht.

C.1.4 V d v khc bit ca cc hnh tru tng


minh ho cc qui tc c dn ra t a) ti c) trong S.1.3.2, mt s v d in hnh
v ch biu khng c thng nht, c s khc bit v cc hnh tru tng, c
nu di y.
C.1.4.1 S cu phn khc nhau

Cc v d di y minh ho cho qui tc a) v hai ch biu trong tng cp c s cu


phn khc nhau.

, ,
C.1.4.2 V tr tng i ca cc cu phn khc nhau

V d di y minh ho cho qui tc b). Mc du hai ch biu trong tng cp c


cng s cu phn, v tr tng i ca cc cu phn l khc nhau.

,
C.1.4.3 Cu trc khc nhau ca cc cu phn tng ng

V d di y minh ho cho qui tc c). Cu trc ca mt (hay nhiu) cu phn tng


ng bn trong hai ch biu trong tng cp l khc nhau.

, , , , , , ,
, , , , , ,

, , , , , ,
C.1.5 Khc hnh dng thc ti
minh ho cho phn lp c m t trong S.1.2, mt s v d in hnh v cc ch
biu c thng nht c nu di y. Hai hay ba ch biu trong tng nhm
di y c hnh dng thc t khc nhau, nhng chng c coi l c cng hnh tru
tng, v do c thng nht.

,
VNPF

185

TCVN ................ : 2010


Nhng khc bit c phn loi thm na tng ng vi cc v d sau.
a) Khc bit trong nt/chm c quay

, , , , ,
b) Khc bit trong phng i ch khi u nt v/hoc ch kt thc

, , , , ,
c) Khc bit ch tip xc ca nt

, ,
d) Khc bit ch nh ra ti gc gp ca nt

e) Khc bit cc nt un


f) Khc bit phn sau gp ch kt thc nt

g) Khc bit du nhn ti im u ca nt

,
h) Khc bit trong thay i "nc"

,
i) T hp ca cc khc bit trn

Nhng khc bit ny trong hnh dng thc t ca ch biu thng nht c trnh
by trong cc ct ngun tng ng cho tng im m trong s m trong mc 30
ca chun ny.

C.1.6 Qui tc tch ngun


gn gi tnh ton vn qua nhiu giai on chuyn i m (thng vn c bit l
ton vn i trn), mi ch biu c m ho tch bit trong bt k mt chun
ngun c lit k di y u khng b thng nht.
Ngun G: GB2312-80, GB12345-90, GB7589-87*, GB7590-87*, GB8565-88*,
Danh sch ch Hanzi vn nng cho ngn ng ting Trung hin i *
Ngun T: TCA-CNS 11643-1986/ mt phng 1, TCA-CNS 11643-1986/ mt phng 2,
TCA-CNS 11643-1986/ mt phng 14*
J-source: JIS X 0208-1990, JIS X 0212-1990

186

VNPF

TCVN ................ : 2010


K-source: KS C 5601-1989, KS C 5657-1991
LU Du " * " sau s hiu tham chiu ca chun ch ra rng mt s ch biu c cha
trong chun khng c a vo trong tuyn tp thng nht.

Tuy nhin, mt s ch biu c m ho trong hai chun thuc vo cng nhm


ngun (nh GB231280 v GB12345-90) c thng nht trong qu trnh thu thp
ch biu t nhm ngun.
Qui tc tch ngun c m t trong mc ny ch p dng cho khi CJK UNIFIED
IDEOGRAPHS c xc nh trong Mt phng a ng c s.
LU cc ch biu tng hp CJK c to ra cng tun theo qui tc rt ging qui tc tch
ngun. Tuy nhin, kt qu cui cng l t hp ca mt ch biu thng nht CJK v mt hay mt
vi ch biu tng hp CJK. Khi qui tc tch ngun c p dng, tt c cc ch biu CJK
ngun 'tng t' ny sinh trong cc ch biu thng nht CJK tch bit.

C.2 Th tc sp xp
C.2.1 Phm vi sp xp
Sp xp cho CJK UNIFIED IDEOGRAPHS trong s m ca mc 30 ca chun ny
c da trn vic sp th t theo cc t in sau.
u tin
1
2
3
4

T in Kangxi Dictionary
Beijing
Daikanwa Jiten
Hanyu Dazidian
Daejaweon

Xut bn ln th 9
xut bn ln th 9
xut bn ln th nht
xut bn ln th nht

cc t in ny c dng theo th t u tin c cho trong bng trn. u tin 1 l


cao nht. Nu ch biu c tm thy trong mt t in, cc t in c u tin thp
hn s khng c xem xt.

C.2.2 Th tc
C.2.2.1 Ch biu c tm thy trong cc t in

a)
Nu mt ch biu c tm thy trong t in Kangxi Dictionary, n c
nh v theo s m tng ng vi th t ca Kangxi Dictionary.
b)
Nu mt ch biu khng c tm thy trong Kangxi Dictionary nhng c
tm thy trong Daikanwa Jiten, n c cho v tr cui ca nhm b th-nt m di
n c ly ch s gn nht vi k t Daikanwa Jiten i trc, cng xut hin trong t
in Kangxi dictionary.
c)
Nu mt ch biu c tm thy khng c trong c Kangxi ln Daikanwa, cc
t in Hanyu Dazidian v Daejaweon c tham chiu ti theo th tc tng t.
C.2.2.2 Ch biu khng tm thy trong cc t in

Nu mt ch biu khng tm thy trong bt k bn cun t in no trn, n c


cho v tr cui ca nhm b th-nt (sau ksi t hin c trong cc t in) v n c
nh ch s di cng s m b th-nt.

VNPF

187

TCVN ................ : 2010

C.3 V d v tch m ngun


Cp (hay b ba) cc ch biu c ch ra di y l ngoi l vi qui tc thng nht
c m t trong S.1. Chng khng c thng nht bi v qui tc tch ngun c
m t trong S.1.6.
LU Nhm (hay cc nhm) ngun c bit gy ra vic p dng qui tc tch ngun c ch ra
theo ch ci (G, J, K, hay T) xut hin bn phi ca tng cp (hay b ba) cc ch biu . Nhm
ngun tng ng vi cc ch ny c nhn din ch bt u ca ph lc ny.
T


4E1F 4E22

4FF1 5036
T

4E48 5E7A

5024 503C

TJ

4E89 722D

5077 5078

5204
TJ

4EDE 4EED

507D 50DE

520B
T

4F75 5002

514C 5151

522A

514E 5154

522B
TJ

4FC1 4FE3

5156 5157

52B5

53C3 53C4

5759 5DE0

541E 5451

5433 5434 5449

188

5716 5717

5415 5442

TJ

57D2 57D3

TJ

5848 588D

5239

53C1

GT

53C2

598D 59F8

5BDC 5BE7

GT

5A1B 5A2F 5A31

GTJ

5BDD 5BE2

59EB 59EC

5377

5DFB

59CD 59D7

TJ

5238

524E

5373

537D

518A 518C

TK

5225

5355

5220

TJ

4FA3 4FB6

5358

4FDE 516A

52FB

5300

520A

TJK

5292

5294

5203

TJ

525D

5265

51E2

51E3

GTJ

524F

5259

51C0 51C8

GT

5C02 5C08

GTJ

5C06 5C07

VNPF

TCVN ................ : 2010

5436 5450

5861 586B

543F 544A

5897 589E

55A9 55BB

5618 5653

5910 657B

568F 5694

5932 672C

56EF 56FD

5965 5967

TJ

5708 570F

TJ

5968 596C 734E

570E 5713

5986 599D
T

5E76 5E77

TJ

60E0

60A6

60AE

609E

5BAB 5BAE

5E21 5E32

5F39 5F3E

614D

6120

TJ

5E2F 5E36

TJ 633F

66FD

66FE
TJ

634F

67B4

67FA
TJ

635C 641C

67FB

67E5

63B2

67F5

6805

GT

63ED

VNPF

5DD3 5DD4

60B3

60EA

5F37 5F3A

5B73 5B76

63D1

5F11 5F12

6085

5C4F 5C5B

63D2 63F7

5CE5 5D22

5C36 5C37

5B24 5B37

5EC4 5ECF

5BDB 5BEC

6075

5C2A 5C2B

GT

GT

5B0E 5B14

GTJ

5C19 5C1A

TK

GTJ

5AAF 5B00

58FD 5900

5C13 5C14

5AAA 5ABC

5A55 5AAB

GTJ

58EE 58EF

5A7E 5AAE

5527 559E

TJ

6416 6447

TJ 63FA

68B2

68C1

189

TCVN ................ : 2010


TJ

TJ

5F50 5F51

614E

613C

63FE 6435

6986

GT

5F54 5F55

622C

5F59 5F5A

6231

622F

5F5B 5F5C

6237 6238

T 6236

5F5D 5F5E

623E

62CB

663B

5FB3 5FB7

62D4

6B72 6B73

6A23

6329 635D

66C1

6DF8 6E05
T

665A

6669

69D8

629C

TJ

6602

TJ

69C7

69D9

T
629B

65E2

65E3

699D

6A27
T

623B

5FB4 5FB5

6553

655A

6985

69B2
T

5F65 5F66

654E

6559

6982

69EA

6483

64CA

6961

TJ

6229

6A2A

6A6B

66A8

6B65

TJ

6B7F 6B81

6E07 6E34

74F6 7501

7BE1 7C12

6BBB 6BBC

7522 7523
T

6E29 6EAB
T

7CA4 7CB5
J

6BC0 6BC1

6E88 6F59

75E9 7626

7D55 7D76

6BCE 6BCF

6E89 6F11

76A1 76A5

7DA0 7DD1

TJ

6C32 6C33

6EDA 6EFE

771E 771F

7DD2 7DD6

190

7BB3 7C08
T

GTJ

6B69

7464 7476
T

VNPF

TCVN ................ : 2010


T

TJK

GTJK

6C5A 6C61

6F5B 6FF3

773E 8846

TJ

7DE3 7E01
T

6C92 6CA1

7028 702C

7814 784F

7DFC 7E15

TJ

GTJ

TJ

6D44 6DE8

70BA 7232

797F 7984

7E48 7E66

GTJK

TJ

6D89 6E09

712D 7162

79BF 79C3

7FAE 7FB9

6D97 6D9A

7155 7199

7A05 7A0E

7FF6 7FFA

TJ

6D99 6DDA

7174 7185

7A42 7A57

80FC 8141

T
86FB

GJ

885B

885E

812B 8131
T

8FBE 8FD6
TJK

GT

8203


7B5D 7B8F

8715

817D 8183

GT

72B6 72C0

6DE5 6E0C

T
95B1

95B2
TJ

8FF8 902C

9667 9689

8204
TJ

TK

886E

889E

820D 820E
J

9059 9065

9752
T

GJK

GTJ

8216 8217

88C5 88DD

90A2 90C9

975C

TJ

8358 838A

8A2E 8A7D

90CE 90DE

9771

976D

83D1 8458

8AAA 8AAC

9109 9115

TJ

T 90F7

9839

983D
T

TJ

8480 8495

8ACC 8AEB

9196 919E

9854

VNPF

9759

TJ

9751

984F

191

TCVN ................ : 2010

GJ

848B

8B20

8B21

91A4 91AC

985B

985A

8523
T

848D 853F

8C5C 8C63

9292

9203

8570 8580

8D71

92B3 92ED
T

85AB 85B0

8EFF 8F27

9332

TJK

9304

TK

TK

85F4 860A

8F1C 8F3A

932C 934A

9A08

865A 865B

8F3C 8F40

TJ

93AD 93AE

99B1

99C4

9905

9920

TJ

8D70

98EE

98F2

TJ

99E2

9AA9

9AAB

TJJ T

9AD8 9AD9

9C1B 9C2E 9DC6 9DCF 9EC3 9EC4

TJ

9AEA 9AEE 9CEF 9CF3 9EAA 9EAB 9ED1 9ED2

JT

9B2C 9B2D 9D87 9DAB 9EBC 9EBD

Theo th tc thng nht c m t trong S.1 cp (hay b ba) cc ch biu c


nu di y kaf khng c thng nht. L do cho khng thng nht c ch ra bi
tham chiu xut hin bn phi mi cp (hay b ba). V "khng cng gc" xem S.1.1.
LU L do cho khng thng nht trong cc v d ny l khc vi qui tc tch ngun c m t
trong mc S.1.6

5191 80C4

192

non cognate


S.1.4.3

non cognate

S.1.4.3

5BF3 5BF6 6710 80CA 7A32 7A3B

VNPF

TCVN ................ : 2010


S.1.4.3

S.1.4.1

non cognate


S.1.4.3

S.1.4.1

non cognate

S.1.4.3

51B2 6C96 5EF0 5EF3 6713 8101 7FF1 7FF6

S.1.4.3

51B3 6C7A 61D0 61F7 6718 8127 8007

8008 8009

S.1.4.3

S.1.4.3S.1.4.1



S.1.4.3

S.1.4.2

5B7C 5B7D

VNPF

non cognate

non cognate

S.1.4.3

S.1.4.3

S.1.4.2

51B5 6CC1 6560 656A 6723 81A7 8074 807C 807D

579B 579C 670C 80A6 6735 6736 8346 834A

S.1.4.3

non cognate 7054 7067 8EB1 8EB2 670F 80D0

193

You might also like