Professional Documents
Culture Documents
NEURAL NETWORK
BASED
ADAPTIVE
CONTROL
Dpto.
Ingenieria
Abstract.This
paper presents differents ways of using artificial neural networks in adaptive control. A classification of architectures for control using neural networks is presented, showing the existing paralelism with Adaptive Control techniques.
Key
Words-
Adaptive
control;
Automatic
control;
1. INTRODUCTION
Nonlinear
control
systems.
In Robust Control the process is usually modeled by a linear model and some of the problems mentioned above are treated by considering uncertainties
about the model. The main
assumption
in most cases is that the underlaying process is linear. In Adaptive Control the
main idea is that by an appropriate
adaptation mechanism, the controller and/o1 model of
the process, linear in most cases, will cope with
unknown, changing and possibly nonlinear
dynamics. Advanced control strategies,
normally
based on an exact cancellation
of the nonlinear dynamics (Craig, 1988) have to be used for
nonlinear processes such as robots. The uncertainties on the dynamic parameters
of the processes, such as inertias and payload conditions
in robots, have motivated
the design of adaptive controllers
(Slotine and Li, 1990; Kelly,
Carelli and Ortega, 1989; Ortega and Spong,
1988). This type of controller is designed assuming an exact knowledge of the model structure and does not include aspects, such as nonlinear frictions, elasticity in the joints and links,
backlash and torque perturbations,
which can
be found in robots.
a) A linear mathematical
model of the plant is
needed and finding one is not a trivial problem
in many cases.
These
nets;
b) Mathematical
models of
not take all aspect of reality
plifying assumptions
have to
els are only approximations
Neural
treated
13
The AI type of approaches try somehow to reproduce the behavior of hzlman controllers that
are able to use natural intelligence
to control
processes exhibiting all the problems described.
A further difference in both approaches
has
been that while the Linear Control Community approach seemed to be more interested in
demonstrating
results about stability
of proposed control schemes, the AI Control Community seemed more interested in showing that
the technique worked on particular
processes.
This is however changing lately and there are
a number of works relating both types of disciplines. Stability analysis is one of the converging fields and some results have appeared in
Iiterature establishing
conditions to ensure the
stability
of AI controllers
(Aracil, Ollero and
Garcia-Cerezo,
1989). Adaptive Control is another field where there is a strong confluence
with AI controllers.
The idea of adaptation
is
strongly linked to the idea of learning which is
fundamental
to NN.
2. NEURAL NETWORK
BASED CONTROL
the plant. The output of the net is an estimation of the plants next state (using a discretetime representation).
Backpropagating
the error made in the prediction,
the net learns the
behavior
of the plant.
Once the emulator
knows the plants dynamics with a certain accuracy, the training of the controller begins. It
is commissioned
by using b~ckpropagation
of
error at the final state, The aim is to find the
weights that minimize a measure of the states
error at each time step. But, as the error is only
available in the final state, it has to be backpropagated
through the plant emulator in order
to estimate the controllers error at each step.
From trial to trial, each one having different initial conditions,
the controller is driven by the
emulator to give the correct control law. The
fact that the real plant carmot be used to backpropagate the errors made by the controller explains the need for an emulator.
Another
way of viewing NN in control is as
look-up tables. The NN stores control signals,
given the state of the plant and the next-step
desired state. In (Kraft and Campagna,
1990) a
NN was used to control three types of systems:
linear, linear + noise and nonlinear.
The performance of the NN controller was compared
with a couple of adaptive controllers:
STR and
MRAC showing good characteristics.
ear plants
eters.
with unkno~v~
the effectiveness of
and control nonlin-
structure
and param-
15
time
FIG. 1. Adaptation
(a)
works
mechThese
In adaptive
control,
the adaptation
is performed in a single trajectory.
Normally at the
beginning of the trajectory, while the controller
is not properly tuned, the process trajectory
differs substantially
from the reference trajectory. Once the parameters
are properly tuned,
the process follows the desired trajectory
with
greater accuracy.
Learning is performed by modifying the NN parameters during repeated performance
trials of
the desired trajectory.
It is like practicing
the
same stroke of, lets say tennis, a number of
times until success has been achieved.
This
idea is illustrated
by figure lb where the different trajectories
obtained at different learning stages are shown. It can be seen that process trajectories
reproduce the reference trajectory with more accuracy when learning
progresses. A practice strategy has been suggested
which instead of using the reference trajectory
in each learning period a sequence of trajectories is used. The first element of the sequence is
a previously learned trajectory
and the last element is the desired trajectory.
This approach
could solve some problems found in NN learning when the desired trajectory
is far from any
of the previously learned ones.
The training signals used for adaptation
and
learning are of great importance
and some parallelism can be established
between NN learning and adaptive control regarding this issue.
chitectures
will be classified according
role played by the network.
Feed Forwnrd
controller
uff
L---.-_____________:
4. NN BASED ADAPTIVE
CLASSIFICATION
to the
I
plant.
CUNT~OLLERS
Many controllers
have been proposed including NN as a part of them, Most of them are
neural versions of classical adaptive controllers
(&t&m
and Wittenmark,
1989). The way the
NN is incorporated
in the system differs from
one to the others.
Most frequently
used ar-
17
TOT.
Rd.
FIG. 3. Indirect
MGi-j
ym
adaptive control.
In Fig. 3, the TDL blocks represent tapped delay lines whose function is to provide delayed
values of the plants inputs and outputs.
In some structures,
the NN is given the task of
generating a small part of the control signal. In
these cases the network counts for structured
and non-structured
uncertainties
of a model.
The main part of the command
signal is produced by a conventional
controller based on the
model. Examples can be found in (Iiguni, Sakai
and Tokumaru,
1991) and (Zomaya, 1993).
/l~I/
I J
I
4.2. NN as estimator,
Internal Model Control.
The Internal
Model
Control
scheme
proposed
in (Economou,
Morari and Palsson, 1986) uses a system forThe system
ward and an inverse model.
models output is compared to the plants output and the difference is fed back to a controller, This structure
can incorporate
NN for
the identification
of nonlinear plants (Hunt and
Sbarbaro, 1991).
Predictive Control. A predictive controller produces a command
signal that minimizes
the
squared error between the predicted output of
the plant and the reference over a certain temporal horizon at every time step k. A predictian can be computed for linear plants by using a Diophantine
equation
(Clarke, Mohtadi
and Tuffs, 1987). To extend the idea to nonlinear plants a predictor has to be developed.
In (Takahashi,
1993) a NN is used to produce
a prediction of a nonlinear plants output.
I
I
Inferential
Control.
In some industrial
processes control is difficult due to the fact that the
plants output is non measurable
at a proper
frequency. This is the case of quality measurements in chemical processes. Inferential control
uses secondary measurements
to estimate the
plants output.
The mapping from secondary
18
pressed
as?
r(t)
= ~(~)~
(5-I)
The dynamic
structure
can be expressed
(Khoda and Kanade,
1935) as a linear function of a suitable selected set of robot and load
parameters:
FIG.
5s NN ADAPTIVE
CONTROL
ADAPTATION SPEED
WITH
FAST
This section
describes
a structure
for the
adaptive
control of manipulators
proposed by
Carelli,
Camacho
and Patifio,
(1993, 1994)
which does not require a long adaptation
period
{see Fig. 6). Although
the control structure
has been designed for robot manipulator
it can
be applied to any nonlinear process that can be
reparameterized
in the way described. The controller uses a set of fixed feedforward NN which
are trained in an off-line manner.
This structure allows the adaptation
of the controller to
deal with dynamic uncertainties,
such as link
inertias or payloads, minimizing
the amount of
computation
that has to be performed on-line.
As the number of parameters
to be adapted is
small, the adaptation
to changes in robot parameters is faster than when using the learning
capabilities
of the NN to adapt.
(5.3)
where,
Bi =
[et,@,, I I. , OrIT,
1,2, * * ,N and z = [q,i,ilT
with
=5
+ a?zBz + 1 4 + ap$arJ
(5.4)
19
Thus, the inverse robot dynamic can be approximated to any payload condition by,
The adaptive
& = -r
where,
= al@&)
Equation
+ a&(z)
+.a.
+ WV%+$
(5.6)
I? is a positive
[@I (xd)@2(xd)
* * $+d)]=
8 = (cirpIi2 - CiJq]
as,
there
is a
definite,
W(zd)
ANALYSIS
= W(qd,
id, id)
~3
control
y(r) = -Kp~-K~,~+W(~d,~d,~~)~
with the parameter
Assume
bounds,
law is:
y(t) = Ypbd)+ fc
w%dh zd = fed, &, &IT the desired
-k(t) =
-Kp.@-
K,*$
(5.12)
state
vector
with d= Q-Qd
(5.13)
as,
(6.16)
where IV(*) is completely known and CKis an unknown but constant vector. Suppose the adaptive control law
update
that
yp(.)
(6.17)
law
li = -P * ~~~d~ * ^ic(tf
The proposed
and
6. STABILITY
(54
(5.I5)
satisfies
the
(6.18)
following
-t-[@$?,
id) - fb(h id)]i- c(q, i&d
hlldl + ~211irl12
+ ~3ll~llll!ill
b.
5
(6.20)
+ ~4ll!illllw
where a; and bi, i = 1, a. -, 4, are functions of time only and are independent
of i
and (ii. Furthermore, assume that a; and bi
< 0, XN A?
jlI;i,H(q)ll,
then
given in (Carelli,
Camacho
and Patiiio,
IL(t)=
z&(t) = f(XQ),
upd(t)
+ (1 -
+>>%d(t>
+ m(t)%l(t)
(6.22)
where z+,b is a pd-like control term, aad
is an
adaptive component
provided by the network
to count for the nonlinear functions and 2~~1
(t} is
the contribution
of the sliding mode controller.
Notice that m(t) is a modulation
function that
mixes adaptive and sliding control modes depending upon the situation
of the state of the
plant in the set A. When the operating
point
is next to the boFder, the sliding component
is
preferred; so that, the state is driven back to
the set A.
Lyapunov
theory was used by Sanner and Slotine (1992) to derive a stable adaptive system
for control of nonlinear
plants.
Lets describe
briefly the network architecture:
for a SISC
plant of order n whose dynamics are
+(t), ...,X-l(t))
Jt- bz@)
(6.21)
where u(t) is the control input, functions
f
If x =
and b aFe nonlinear
and unknown.
fx, i, .*., z (n-1)] is the state of the plant, the
objective
of the control is to track a desired
state trajectory
included in a set A. The role
of the NN, consisting in a single layer of nodes
possessing radial Gaussian characteristics,
is to
provide an estimation
of functions f and b at
any time.
That is, the net has to uniformly
approximate
a continuous
function with a pre
specified accuracy on a compact subset of R
using a finite number of nodes. The controller
proposed by Sanner and Slotine has three parts:
a linear combination
of the tracking error, a
feedforward
component
of t,he n-th derivative
of the desired trajectory
and an adaptive control law that attempt
to cancel the unknown
nonlinear
function that governs the plant. Every node in the Gaussian
(RBF) network provides an output that is a Gaussian function of
the distance from the current state to the input weight of the node in the input space. The
summation
of all nodes gives an approximation
of the function to be estimated,
provided t,hat
the output, weights are correctly set.
7. IMPLEMENTATIONS
Most of the control applications
of NN are implemented
by programs
in digital computers
which simulate the behavior of the neural nets.
One of the potential advantages of NN which is
the inherent pardelism
is therefore lost. Hardware implementations
are consequently
convenient in OFdeF to use NN to its full potential,
In order to use NN for adaptive control, implementations
must perform some kind of parameter adjustment.
This is more easily achieved
when using some types of circuits but at the
The conditions
that function f has to meet in
order to be approximate
by a Gaussian NN are
21
ACKNOWLEDGEMENT
The authors would like to acknowledge CICYT
for funding the work under grant TAP93-0804
and project CYTED-D.
Digital implementations
are more robust than
analog ones against dispersion
in the characR,asure et al.
teristics
of the components.
designed a feedforward
net that was able to
classify hand-written
digits.
A 3-D structure
of NETSIM boards was proposed in (Garth,
1987).
Each board contains
communication
buses, control circuits and neural coprocessors.
To reduce the surface needed in digital circuits
synchronous
stochastic implementations
can be
used (Janer, 1994; Janer, Quero and Franquelo,
1993).
Akhyar,
8. CONCLUSIONS
Neural networks have the ability of learning a
nonlinear
model without a prior knowledge of
its structure
and are adequate for working in
real time because of the high parallelism.
The
use of NN seems therefore to be a way of implementing
adaptive
controllers
for processes
where standard
adaptive
control is not adequate, hat is, non linear processes with no necessarily known model structure
and/or changing dynamics.
22
Neural
are universal
approximators.
Networks, 2, 359-366.
Hunt, K. J. and D. Sbarbaro
(1991). Neural
networks for non-linear
internal
model
control. Proc. IEE Pt. D, 138,431-438.
Kelly, R., R. Carelli and R. Ortega (1989).
Adaptive motion control design of robot
An input-output
apmanipulators:
proach. Int. J. Control, 50, 2563-2581.
Iiguni, Y., H. Sakai and H. Tokumaru
(1991).
A nonlinear regulator design in the presence of system uncertainties
using multilayered neural networks.
IEEE Trans.
on NN, 2, 410-417.
Janer, C. L. (1994). Paralell stochastic architectures for the microelectronic
implementation
of neural networks, (in spanish).
Ph.
D. Thesis,
University
of
Seville, Spain.
Janer, C. L., J. Quero and L. G. Franquelo
(1993) Fully Parallel summation
in a new
stochastic
neural network architecture.
IEEE Int.
Conf on Neural Networks
ICNN93, 1498-1503.
Kawato M., Y. Uno, M. Isobe and R. Suzuki
(1987).
A hierarchical
model for voluntary movement and its application
to
robotics.
Proc. IEEE Control Systems
Magazine, 8, 8-17.
Kraft, L. G. and D. P. Campagna
(1990). A
comparison
between CMAC neural network and two traditional
adaptive control systems.
IEEE Controb Systems
Magazine, pp. 36-43.
Ligtbody
G. and G. Irwing (1992).
Neural
networks for nonlinear
adaptive control
IFAC Workshop on Algorithms and Ar&tectures
for Rent Time Controt, pp. l13.
Luo, R. F., H. H. Shao and Z. J. Zhang (1993).
Fuzzy neural nets based inferen&& control for a high purity distillation
column
Automatica To appear.
Mead, C. A. and M. A. Mahowald
(1988) A
silicon model of early visual processing.
Neural Networks, 1, 91-97.
McCulloch, W. S. and W. Pitts (1943). Analogical calculus of the ideas inmanent
in
nervous activity. Bulletin of Mathematical Biophysics, 9, 127-147.
Minsky, M. L. and S. A. Papert (1969). Perceptrons. The MIT Press, Cambridge,
MA.
Morari, A. J. and K. W. Fung (1982). Nonlinear inferential
control.
Comput. Chem.
Eng., 6, 271-281.
Murray, A. F., D. Del Corso and L. Tarassenko
(1991) Pulse-stream
VLSI neural networks mixing analog and digital techniques.
IEEE Transactions on Neural
Networks, 2.
C. Mohtadi
and P.S. Tuffs
D.W.,
(1987b). Generalized
predictive control.
Part II Extensions
and Interpretations.
Automatica 23, 149-160.
Clarke, D.W. and C. Mohtadi (1989). Properties of generalized
predictive
control.
Automatica 25, 859-875.
Craig, J.J. (1988).
Adaptive Control of Mechanical Manipulators Addison - Wesley
Publishing
Co.
Cutler, C.R. and B.L. Ramaker (1980). Dynamic matrix control. A computer cont,rol algorithm.
Proc. JACC80.
Cybenko, G. (1989). Approximation
by superpositions
of sigmoidal function.
Math,.
Controt Signal System.s, 2, 303-314.
Economou, C. G., M. Morari and B. 0. Palsson
(1986). Internal
model control.
5. Extension to nonlinear
systems. Ind. Eng.
Chem. Process Des. Den., 25, 403-411.
Feild, W. B., and J. K. Navlakha
(1988).
Transputer
implementation
of Hopfield
NN. IEEE Int. Conf. on, NN. ICNN88.
Funahashi,
K. I. (1989). On t,he approximate
realization
of continuous
mappings
by
neural networks.
New& Netloo&,
2,
183-192.
Garcia, C.E., D.M. Prett and M. Morari (1989).
Model predictive
control:
theory and
practice - a survey. Automatica 25, 335348.
Garth, S (1987). A chipset for high speed simulation of neural network systems. IEEE
Int. Conf. on, NN. ICNN87 443-452.
Glinski, S., T. Lalumia, D. Cassiday, T. Koh,
C. Gerveshi,
G. Wilson and J. Kumar
(1987).
The graph search machine:
A
VLSI architecture
for connected
word
speech recognition
and other applications. Proc. IEEE, 75, 1172-1184.
G6mez-Ortega,
J. and E. F. Camacho (1994)
Neural network GPC for mobile robots
path tracking.
EURISCON9.j, To appear.
G6mez-Ortega,
J., E. F. Camacho and J. Quero
(1994) Neural network local navigation
of mobile robots in a moving obstacles
environment.
PreprinGs of -Intelligent
Componen,ts and Instrum,ents for Control Applications SICICA94. 263-268.
Guez A. and I. Bar-Kana
(1990). Two-degreeof-freedom robot neurocontroller.
Proc.
2gfh Conference on Decision and Control, pp 3260-3264.
Hopfield,
J. J. (1982) Neural Networks and
physical systems with emergent collective computational
abilities.
Proc. of
the National Academy of Sciences, 79,
2554-2558.
Hornik,
K., M. Stinchcombe
and H. White
(1989). Multilayer feedforward networks
Clarke,
23
24