Professional Documents
Culture Documents
SEMINAR
II.
I.
IA
B
N
S
T
T
R
R
O
Table of Contents
A
D
C
U
T
C
T.
III.
I.. TYPES OF TELECOM
DATA
..................................
..
O
1.
Call
Summary
............................................................................................................................
Data
.............................................
..
N
..................................
4
.. 2. Network 4
...
..VI.Data ..........
...
Customer
..C 3.
...
3.1
...................
..O Data
...
Ch............
IV....................
Mining
Sequential Patterns In Telecommunication Database Using
..N ....................
...
ro
...................
Genetic
Algorithm ................. 5
..CL....................
...
1.
Sequential Patterns
...............................................................................................................
Mining
5
m
...................
..US....................
...
os
...................
2.
....................
..VII.
...
IO...................
oGenetic
....................
..RE
...
N Algorithm
. me
........
4
3.
Mining
Sequential Patterns in Telecommunication Database Using
.................
..................
..FE
...
.....GA. ...
.......................................... 6
4 .....
..RE
...
......................
..NC
...
......................
.....
3.2 Genetic...........................................................................................................................
Operators
.................
..ES
...
..... .....
7
........
...
......................
..... Fitness
3.3
........
...
......................
.....
Function.
.
.................
3.4
SPT-GA
............................................................................................................................
Algorithm
........
...
..... .................
.....
5
8
.. .......................
...
.....
......
4. .....
Experiment
..
...
.....
......
..... ..........
.................
Results
.. Discovering
...
V.
Structural Patterns In Telecommunications
.....
......
.....
.................
.......................
..
...
Data
........................................................... 11
.....
......
.....
.................
.......................
..
...
.....
......
.....
.................
.......................
..
...
.....
......
.....
.................
.......................
..
...
.....
......
..... 7
.....................
..
...
.....
......
9 .....
..
...
.....
......
.....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
........
...
..... .....
..13 .. 6
...
I.
ABSTRACT
II.
INTRODUCTION
III.
The Initial step in the data mining process is to understand the data.
Here
discuss three
types we
of telecom
data. main
They are as follows:
2. Network Data
Telecommunication networks are extremely complex configurations of
equipment,
comprised
of Each network element is capable of
interconnected
components.
generating
error and
status
messages, which
leads
to a tremendous amount of network data.
3. Customer Data
Telecommunication companies, like other businesses have millions of
customers.
necessitya database of information on these customers.
they have toFor
maintaining
This
information
will and may include other information such as
include
name, address
service plan,
credit score,
contract
information,
family income and payment history.
IV.
The goal is to find all sub sequences from the given sets of transactions;
this
whenapproach
the data istouseful
be mined have some sequential nature to deal with
databases
that have a timeseries
characteristics.
Sequential Pattern can be defined as follows.
Definition : Let I ={x1...xn} be a set of items. An itemset is a non-empty subset of
andisan
itemset withitems,
k items
called k-itemset. A sequence s=(X
) is an1 ...X
order
list of item sets, and
l
an item set Xi (1= i = l) in a sequence is called a transaction. In a set of
sequences,
s
is maximal aifsequence
s is not contained
in other sequences.
2. Genetic Algorithm
Genetic Algorithm (GA) is a part of evolutionary computing, which is a
artificial
chromosomes)
new
Best
their
population
solutions
best
intelligence.
fitness.
which
called
by
This
mutation
are
population.
Genetic
is repeated
selected
and
algorithm
crossover.
to
Solutions
until
form
some
starts
new
This
from
condition
with
solutions
isone
motivated
a set
population
(for
(offspring)
of solutions
example
by aare
hope,
are
rapidly
growing
area
of
(represented
taken
that
will
selected
number
the
beand
better
new
of
according
used
populations
population
by
than
to form
the
to old
or
a one.
IN
4.
Experiment
Results
transactions
The
follows.
population
A
Telecommunication
results
SPT-GA
size
ofand
some
(N),
60
algorithm
country
experiments
generations
Database
is
codes.
written
is
(G),
ontaken
The
telecommunication
confidence
with
crossover
from
MATLAB
a Telecommunication
priority
probability
programming
database
(a) and
usedisis
presented
language.
minimum
Company
0.8,
while and
The
fitness
the mutation
analyzed
ituser
has
(minF).
can
1091
tune
is
as
0.001. The output of this experiment is a text file that includes the
interesting
rules that
represent
the most suitable
telecommunication
sequences. For example, the rule
of
Figure
5 told
us that
when
country
code
91 is called, that means (40%) of callers will call
country code 92 afterwards.
The tests showed that when the generation increases the time will be
around
each
other butincreases the time will be increased. However,
when
the
population
experiments,
From
increasing
the
experiments,
theas
of
population
population.
shown itinsize.
isFigure
Moreover,
observed
But,6,either
GA
that
thetakes
tests
ways,
increasing
less
also
GA
time
showed
does
of when
generation
notthat
take
increasing
when
awill
long
comparing
the
two
the generation
population
take
time;
less
it istime
only
increase,
than
than
and
a matter
the best fitness will be around.
V.
VI.
CONCLUSION
VII. REFERENCES
1. Data Mining In Telecommunications; Gary M. Weiss
2. Data Mining In Telecommunications And Studying Its Status In Iran
Telecom
Companies And Operator; Jamal Sophieh
3. Data Mining And CRM In Telecommunications; D. Camilovic
4. Genetic Algorithms; William H. Hsu
5. A Fraud Detection Approach in Telecommunication using Cluster GA;
V.Umayaparvathi
& Dr.K.Iyakutti