Professional Documents
Culture Documents
Diwaker Singh
Reg.Id.10808015
Punjab
2
There are several important problems We first introduce decision trees. A
where AI approaches are particularly decision tree has the following
promising properties: each node is connected to a
set of possible answers; each non-leaf
Prediction of Protein node is connected to a test which splits
Structure its
Semiautomatic drug design
Knowledge acquisition from set of possible answers into subsets
genetic data corresponding to different test results;
and each branch carries a particular
test result’s subset to another node.To
see how decision trees are useful for
nearest neighbour calculations, let us
consider 8 blocks ofknown width,
IV. FUNCTIONS OF AI IN height and colour . A new block then
BIOINFORMATICS appears of known size but unknown
colour. On the basis of existing
a. Data Mining information, can we make an informed
b. Bio-Medical Informatics guess as to what the colour of the new
block is?
AI provides several powerful
algorithms and techniques for solving To answer this question, we need to
important problems in bioinformatics assume a consistency heuristic, as
and chemo-informatics. Approaches follows. Find the most similar case, as
measured by known properties, for
like Neural Networks, Hidden Markov
which the property is known; then
Models, Bayesian Networks and guess that the unknown property is the
Kernel Methods are ideal for areas same as the known property. This is
with lots of data but very little theory. the basis of all nearest neighbour
The goal in applying AI to calculations. Although such nearest
bioinformatics and chemo-informatics neighbour calculations can be
is to extract useful information from performed by keeping all samples in
memory and calculating the nearest
the wealth of available data by
neighbour of a new sample only when
building good probabilistic models. required by comparing the new sample
with previously stored samples, there
Data Mining is an AI powered tool are advantages in storing the
that can discover useful information information about how to calculate
within a database that can then be used nearest neighbours in the form of a
to improve actions. decision tree, as will be seen later.
3
provably correct answer. That is, in which, the 8 blocks of known size are
many applications it is important to first divided into two equal subsets
attribute a property of some sort to an using, say, the height attribute. The
object when the object’s real property tallest of the shorter subset has height
is not known. Rather than having to 2 and the shortest of the taller subset
leave the object out of consideration, has height 5. The midpoint 3.5 is
an attribution of a property to the therefore chosen as the dividing line.
object with unknown property may be
useful and desirable, even if it cannot
be proved that the attributed property
is the real property. At least, with
nearest neighbour calculations, there is
some systematicity (the consistency
heuristic) in the way that an unknown
property is attributed.
Figure 1(a)
6
Units are connected by weights which output node values can be compared
propagate signals from one unit to the with the known class values to
next (usually from layer to layer). determine the validity of the network.
These connections are variable in Network validity can be measured in
nature and determine the strength of terms of how many unseen samples
the activation which is passed from were falsely classified as positive by
one unit to the next. The modification the trained ANN when they are in fact
of these weights is the negative (‘false positive’ rate) and vice
versa (‘false negative’ rate). If the
primary way in which the neural class of an unseen sample is not
network learns the data. This is known, then the output node values
accomplished in supervised learning make a prediction as to the class of
by a method known as output into which the sample falls.
backpropagation where the output Such predictions may need to be tested
from the neural network is compared empirically.
with the desired output from the data
and the error from this is used to
change the weights to minimise the
error. More formally and very generally,
the training phase of an ANN starts
by allocating random
weights w1, w2,… wn to the
The task of the ANN is to modify the connections between the n input
weights between the input nodes and units and the output units. Second,
the output node (or output nodes if we feed in the first pattern p of bits
there is more than one output node in x (p), x (p)… xn(p) to the network
1 2
7
Typically, an ANN is said to have
converged or learned when the sum
of squared errors (SSE) on
the output nodes for all patterns in
one epoch is sufficiently small
(typically, 0.001 or below).
The equations above constitute the
delta learning rule which can be
used to train single-layer
networks. A slightly more complex
set of equations exists for learning in
ANNs with more than one layer and
making use of the sigmoid function
described earlier. Figure 4
8
identified in the AI literature. Neural approach yields a 90% classification
networks can perform with better rate (therefore only 10% error) from
accuracy than equivalent symbolic the final network, which is in contrast
techniques (for instance decision trees) to 75% for each expert.
on the same data. Also, while
identification tree approaches such as
See5 can identify dominant factors the
importance of which can be Another approach has been to operate
represented by their positions high up many different feature selection
in the tree, ANNs may be able to algorithms and a variety of neural
detect non-dominant relationships (i.e. networks on this leukaemia dataset.
relationships involving several factors, Feature selection is a method of
each of which by itself may not be reducing the number of features or
dominant) among the attributes. attributes within the dataset which
However, there are also a number of reduces the time taken to train any
disadvantages. Data often has to be algorithm, but can be very important
pre-processed to conform to the for neural networks. The standard
requirements of the input nodes of neural network with a three layers and
ANNs (e.g. normalised and converted 5-15 hidden units performed very well,
into binary form). Training times can achieving an error rate of just 2.8% on
be long in comparison with symbolic the test dataset, when coupled with a
techniques. Finally, and perhaps most Pearson rank correlation feature
importantly, solutions are encoded in selection.
the weights and therefore are not as
immediately obvious as the rules and
It is obvious from these approaches
trees produced by symbolic
that neural methods are capable of
approaches.
classifying this data with very good
accuracy. The problem with using
neural networks in this manner is that
whilst the network itself can be used a
method for predicting classifications,
Leukaemia Dataset the attributes which have been used to
make the classifications cannot be
The leukaemia dataset consists of 72 easily determined. That is, the model
individuals and 7129 gene expression can be used to predict the class, but we
values is collected from each of them. cannot easily find the genes which are
Classifications consist of individuals responsible for that classification. This
suffering from a type of leukaemia, is especially the case where networks
ALL or AML. Distinguishing between with hidden layers have been used (as
these two types of data is very in the previous experiments) because
important as they respond very the hidden layer can find non-linear
differently to different types of therapy relationships between combinations of
and therefore the survival rate may attributes and classes. This non-linear
well be improved by the correct relationship cannot be easily
classification of this leukaemia. The described in the rule format that has
outward symptoms are very similar, so been shown earlier.
a method which can differentiate
between the two based on gene These were only a few examples.
expression profiles could help patients There a lot more examples which
of the disease. An approach uses prove the importance if AI in
several neural networks trained on Bioinformatics.
different aspects of the data in
conjunction to give a final
classification. It uses the original data
and two fourier transforms of the data
to train three separate "experts", a
gating network then uses a majority
VII. WORKING EXAMPLE
vote to determine the final
classification of these networks. This
9
Functional Genomics and the Robot
Scientist:
Robot scientist developed by
University of Wales researchers
Designed for the study of
functional genomics
Tested on yeast metabolic
pathways
Utilizes logical and association
to knowledge representation
schemes
Figure 6.b
CONCLUSION
The Robot scientist
In the nutshell I want to conclude
Yeast Metabolic Pathways about the basics of the term paper
that though Bioinformatics is a
totally different field, it depends
on Artificial Intelligence for
speed, accuracy, inference and
prediction. Artificial intelligence
aims at providing tools and
techniques which can help
bioinformatics accomplish its
tasks with convenience.
ACKNOWLEDGMENT
10
REFERENCES
[1]www.wikipedia.com
[2]www. iiconference.org
[3] www.sciencedirect.com
[4] www.bioplanet.com
[5] www.aaai.org
11