Professional Documents
Culture Documents
Automatic Grapheme-to-Phoneme
Conversion of Arabic text
Belal Al-Daradkah and Bashir Al-Diri
School of Computer Science
University of Lincoln
Lincoln, UK
Baldaradkah, Baldiri @ Lincoln.ac.uk
AbstractAn automated computerized system is
presented that converts Arabic graphemes to
phonemes (G2P), by using the phonology rules of
the Arabic language, supported by a dictionary of
exceptions.. The system was evaluated on a
publicly available dataset, consisting of 620 fully
diacritic Arabic sentences, which were manually
segmented. The validation process was applied to
three different cases: sentences, words, and
phonemes, yielding very promising results. The
performance of the proposed system, using the
precision and the sensitivity as metrics, was
99.19% and 99.42%, respectively. All Arabic
language rules were applied and tested. The
developed system can be applied to any diacritic
Arabic text.
KeywordsArabic; phonemes; graphemes; phonology;
rules.
I.
INTRODUCTION
BACKGROUND
1|Page
www.conference.thesai.org
PROPOSED METHOD
X
s
x d D R Z s
S
d` t` D`
\
`
Grapheme
(G)
Phoneme
?
u
G F q k L m n H W J i:
i
u
a:
a
(F)
`
:
In our system we split sentences into words; each word has to
be checked against the exception word dictionary. If a word is
It is worth mentioning that not all words can be converted
to correct phonemic outcomes, using phonetization rules. There
are some distinct exceptions, so we created a special dictionary
found, then the mapping transcript will be retrieved; otherwise
for these words. This dictionary includes the most frequent
the word has to be processed, using the structured rules,
words that require special pronunciation attributes and do not
according to the proper rule sequence, taking into consideration
follow certain rules, including also their correct phonemic
the possibility of handling an adjacent pair of words. A sample
transcriptions. The exception words that belong to this special
of exception words from the dictionary is shown in Table 2.
dictionary, sometimes contain graphemes not pronounced or
has to be articulated and did not occur in the word. This usually
TABLE II. EXCEPTIONAL WORD SAMPLE
happens with words containing unseen Alif or Hamzah
TRANSCRIPTIONS.
e.g., /?rraXma:nu/.
?
B t
T Z
Special Words
Phonemic
2|Page
www.conference.thesai.org
Transcription
?rraXma:nu
?ula:?ika
ha:Da:
Da:lika
la:kin
ha:?ula:?i
A. System Limitations
Our proposed system requires the input Arabic text data to
be fully vowelized (diacritic). Arabic phonology and
pronunciation are highly affected by the short vowels, which
are used to punctuate the Arabic pronunciation and to add
semantic information to the words. In general, short vowels do
not appear in Arabic text, since native speakers can recognize
these marks by context, meaning and sense. Therefore, it is
necessary to use diacritics to obtain the correct pronunciation.
This is an important issue for all text-to-speech and automatic
speech recognition applications [20], as well as for those
applications that are designed to teach Arabic language to nonnative Arabic speakers. Therefore, fully diacritics Arabic text is
highly needed in the process of phonetization.
B. System Description
After analyzing the Arabic phonemic rules, a clear and
well-structured algorithm was implemented, in order to
maintain the high priority rules sequence. Our framework of
phonetization depends on the rule-based method, which mainly
utilizes the Arabic phonology rules, augmented by the
exception word dictionary. Figure 1 shows the workflow of our
system.
G F / XGY ,
(1)
where G is the target grapheme, F is the corresponding
phoneme, X and Y are the right and left grapheme of G
respectively (X and Y might be null). The order of applying
these rules is very important, so the additional rules should be
applied, before the converting rules and the omitting rules. The
set C represents the Arabic consonants, whereas sets V l
and V s represent the Arabic long and short vowels.
C= ,
V = {V l Vs } , V l= { } , V s ={ } .
The following are our definitions of the implemented rules:
R 01:{Gi GiGi C } .
R 02:{Gi Fi Fi +1|Gi= } ,
(2)
(3
)
R 03 :{Gi Gi { }} ,
R 04 : { Gi | Gi= i=1 }
(5)
3|Page
www.conference.thesai.org
(6
)
R 05 : { Gi |Gi= G i+1= } .
R 06 : { Gi
|Gi=
}.
R 08 : { Gi |Gi= Gi1=
R 09: { Gi | Gi= Gi +1 {V , }}
(9
)
(1
0)
R 14 :{Gi | G i=
},
R 15:{Gi | Gi=
}.
(8
)
},
vowel
R 13:{Gi | Gi=
short
V
R 17: { G F|G Vl V s { }} .
l
R 18: {G i |Gi { }} .
(19)
(1
1)
R 10: {G i | Gi= Gi +1 {V , }} .
4|Page
www.conference.thesai.org
TABLE III. SAMPLE OF BOTH OUR SYSTEM OUTPUT AND THE MANUAL SEGMENTED PHONETIZATION
1
2
3
4
5
6
7
8
9
1
0
Arabic sentence
Manual
System
i:amSilX\is`a:nu+?`ala:+?arba?`ati+?arZul
i:amSilX\is`a:nu+?`ala:+?arba?`ati+?arZul
?`amilna:+?`ala:+qadamin+u:asa:q
?alku:?
`u+u:arrukbatu+mafa:s`ilun+muhimmatun
+fil?at`ra:f
nasX\abuddama+minalka?`bi+?`indal?
at`fa:lirrud`d`a?`
?`amilna:+?`ala:+qadamin+u:asa:q
?alku:?
`u+u:arrukbatu+mafa:s`ilun+muhimmatun
+fil?at`ra:f
nasX\abuddama+minalka?`bi+?`indal?
at`fa:lirrud`d`a?`
?abun+?ummun+?
una:di:kum+fahal+tus`Gu:na+lau:
+marrah
?abi:+?ummi:+?uxti:+u:a?axi:+?
uX\ibbukum+minal?a?`ma:q
?`ut`latussabti+u:al?aX\adi+?`a:lami:i:ah
s`i:a:mul?
iTnai:ni+mustaX\abbun+u:akaDa:lika+s`i:
a:mulxami:s
sa?azu:ruka+i:au:maTTula:Ta:?i+?au:il?
arbi?`a:?
?abi:+?ummi:+?uxti:+u:a?axi:+?
uX\ibbukum+minal?a?`ma:q
?`ut`latussabti+u:al?aX\adi+?`a:lami:i:ah
s`i:a:mul?
iTnai:ni+mustaX\abbun+u:akaDa:lika+s`i:a
:mulxami:s
sa?azu:ruka+i:au:maTTula:Ta:?i+?au:il?
arbi?`a:?
?usbu:?`un+u:a:X\idun+la:+i:akfi:lissafar
?usbu:?`un+u:a:X\idun+la:+i:akfi:lissafar
?abun+?ummun+?
una:di:kum+fahal+tus`Gu:na+lau:+marrah
5|Page
www.conference.thesai.org
Frequency number
of rule
1066
969
961
669
477
368
239
95
30
Short vowels
Shaddah
Lam
Hamza
Taa
Tanween
Alif
Long vowels
Last Word and
Skoon
V.
CONCLUSION
[2] AHMED, M.E., 1991. Toward an Arabic text-tospeech system. The Arabian Journal for Science and
Engineering, 16(4).
[3] AINSWORTH, W., 1973. A system for converting
English text into speech. Audio and Electroacoustics,
IEEE Transactions on, 21(3), pp. 288-290.
[4] ANDERSEN, O., KUHN, R., LAZARIDS, A.,
DALSGAARD, P., HAAS, J. and NOTH, E., 1996.
Comparison of two tree-structured approaches for
grapheme-to-phoneme
conversion,
Spoken
Language, 1996. ICSLP 96. Proceedings., Fourth
International Conference on 1996, IEEE, pp. 17001703.
[5] MEIHAMI, H., 2013. Text-To-Speech Software: a
New Perspective in Learning and Teaching Word
Stress, Word Intonation, Pitch Contour, and Fluency
of English Reading. International Letters of Social
and Humanistic Sciences, (08), pp. 24-33.
[6] GIBSON, E.J., PICK, A., OSSER, H. and
HAMMOND, M., 1962. The role of graphemephoneme correspondence in the perception of words.
The American Journal of Psychology, , pp. 554-570.
[7] TAJCHMAN, G., FOSTER, E. and JURAFSKY, D.,
1995. Building multiple pronunciation models for
novel words using exploratory computational
phonology. EUROSPEECH 1995, Citeseer.
[8] EL-IMAM, Y.A., 2004. Phonetization of Arabic:
rules and algorithms. Computer Speech & Language,
18(4), pp. 339-373.
[9] KUJALA, J.V., 2013. A probabilistic approach to
pronunciation by analogy. Computer Speech &
Language, 27(5), pp. 1049-1067.
[10] RIZK, M., MOHANNA, Y., MOUGNIEH, H.,
HAMAD, M., KHALIL, F. and GHADDAR, A.,
2011. Arabic Text to Speech Synthesizer: Arabic
Letter to Sound Rules. International Review on
Computers & Software, 6(1), pp. 72-85.
[11] ALOTAIBI, Y.A., 2012. Comparing ANN to HMM in
implementing limited Arabic vocabulary ASR
systems. International Journal of Speech Technology,
15(1), pp. 25-32.
[12] HABASH, N., SOUDI, A. and BUCKWALTER, T.,
2007.
On
Arabic
Transliteration.
Arabic
Computational Morphology. Springer, pp. 15-22.
[13] AL-GHAMDI, M.M., AL-MUHTASIB, H. and
ELSHAFEI, M., 2004. Phonetic Rules in Arabic
Script. Journal of King Saud University-Computer
and Information Sciences, 16, pp. 85-115.
[14] SELIM, H. and ANBAR, T., 1987. A phonetic
transcription system of Arabic text, Acoustics,
Speech, and Signal Processing, IEEE International
Conference on ICASSP'87. 1987, IEEE, pp. 14461449.
[15] EL-IMAM, Y.A., 1989. An unrestricted vocabulary
Arabic speech synthesis system. Acoustics, Speech
and Signal Processing, IEEE Transactions on,
37(12), pp. 1829-1845.
6|Page
www.conference.thesai.org
7|Page
www.conference.thesai.org