
Intelligent Agents

Russell and Norvig, Chapter 2
CMSC421 – Fall 2003

[Figure: an agent interacting with its environment: percepts flow from the environment through sensors into the agent (marked "?"), and actions flow back out through actuators.]

Definition: An intelligent agent perceives its environment via sensors and acts rationally upon that environment with its actuators.

e.g., Humans

Sensors: eyes (vision), ears (hearing), skin (touch), tongue (gustation), nose (olfaction), neuromuscular system (proprioception)
Percepts:
- At the lowest level: electrical signals
- After preprocessing: objects in the visual field (location, textures, colors, ...), auditory streams (pitch, loudness, direction), ...
Actuators: limbs, digits, eyes, tongue, ...
Actions: lift a finger, turn left, walk, run, carry an object, ...

Notion of an Artificial Agent

[Figure: an agent in its environment, with robot sensors such as a laser range finder, sonars, and touch sensors in place of human senses.]

Notion of an Artificial Agent

[Figure: the generic agent diagram again: sensors, agent ("?"), actuators, environment.]

Vacuum Cleaner World

[Figure: two squares, A and B; the agent occupies one square, and each square may contain dirt.]

Percepts: location and contents, e.g. [A, Dirty]
Actions: Left, Right, Suck, NoOp
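
To make the world concrete, here is a minimal Python sketch of the vacuum environment (not from the slides); the names VacuumWorld, percept, and step are illustrative, and actions are assumed deterministic, as the PEAS slide below states.

class VacuumWorld:
    def __init__(self):
        self.location = "A"
        self.dirt = {"A": True, "B": True}   # both squares start dirty

    def percept(self):
        status = "Dirty" if self.dirt[self.location] else "Clean"
        return (self.location, status)       # e.g. ("A", "Dirty")

    def step(self, action):
        if action == "Suck":
            self.dirt[self.location] = False
        elif action == "Right":
            self.location = "B"
        elif action == "Left":
            self.location = "A"              # NoOp changes nothing

world = VacuumWorld()
print(world.percept())  # -> ('A', 'Dirty')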

Vacuum Agent Function

Percept Sequence                 Action
[A, Clean]                       Right
[A, Dirty]                       Suck
[B, Clean]                       Left
[B, Dirty]                       Suck
[A, Clean], [A, Clean]           Right
[A, Clean], [A, Dirty]           Suck

Rational Agent

What is rational depends on:
- Performance measure: the performance measure that defines the criterion of success
- Environment: the agent's prior knowledge of the environment
- Actuators: the actions that the agent can perform
- Sensors: the agent's percept sequence to date
We'll call all this the Task Environment (PEAS).
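
The agent function above is literally a lookup from percept sequences to actions. A rough Python rendering of the table (the names AGENT_TABLE and vacuum_agent_function are illustrative, not from the slides):

AGENT_TABLE = {
    (("A", "Clean"),): "Right",
    (("A", "Dirty"),): "Suck",
    (("B", "Clean"),): "Left",
    (("B", "Dirty"),): "Suck",
    (("A", "Clean"), ("A", "Clean")): "Right",
    (("A", "Clean"), ("A", "Dirty")): "Suck",
    # ... in principle, one entry per possible percept sequence
}

percept_sequence = []  # the percept sequence observed so far

def vacuum_agent_function(percept):
    percept_sequence.append(percept)
    return AGENT_TABLE.get(tuple(percept_sequence), "NoOp")

print(vacuum_agent_function(("A", "Clean")))  # -> Right
print(vacuum_agent_function(("A", "Dirty")))  # -> Suck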

Vacuum Agent PEAS

Performance Measure: minimize energy consumption, maximize dirt pick-up. Making this precise: one point for each clean square over a lifetime of 1000 steps.
Environment: two squares, dirt distribution unknown; assume actions are deterministic and the environment is static (clean squares stay clean).
Actuators: Left, Right, Suck, NoOp
Sensors: the agent can perceive its location and whether that location is dirty.

Automated Taxi Driving System

Performance Measure: maintain safety, reach destination, maximize profits (fuel, tire wear), obey laws, provide passenger comfort, ...
Environment: U.S. urban streets, freeways, traffic, pedestrians, weather, customers, ...
Actuators: steer, accelerate, brake, horn, speak/display, ...
Sensors: video, sonar, speedometer, odometer, engine sensors, keyboard input, microphone, GPS, ...
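
One illustrative way to record a PEAS description is as plain data; the PEAS dataclass below is an assumption for this sketch, and its fields simply restate the vacuum agent's task environment from the slide above.

from dataclasses import dataclass

@dataclass
class PEAS:
    performance: str
    environment: str
    actuators: str
    sensors: str

vacuum_peas = PEAS(
    performance="one point per clean square over a lifetime of 1000 steps",
    environment="two squares, unknown dirt distribution, static, deterministic",
    actuators="Left, Right, Suck, NoOp",
    sensors="current location and whether that location is dirty",
)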

Autonomy

A system is autonomous to the extent that its own behavior is determined by its own experience. Therefore, a system is not autonomous if it is guided by its designer according to a priori decisions.
To survive, agents must have:
- Enough built-in knowledge to survive.
- The ability to learn.

Properties of Environments

Fully Observable / Partially Observable
- If an agent's sensors give it access to the complete state of the environment needed to choose an action, the environment is fully observable.
- Such environments are convenient, since the agent is freed from the task of keeping track of changes in the environment.
Deterministic / Stochastic
- An environment is deterministic if the next state of the environment is completely determined by the current state of the environment and the action of the agent.
- In an accessible and deterministic environment, the agent need not deal with uncertainty.

Properties of Environments (continued)

Static / Dynamic
- A static environment does not change while the agent is thinking.
- The passage of time as an agent deliberates is irrelevant.
- The agent doesn't need to observe the world during deliberation.
Discrete / Continuous
- If the number of distinct percepts and actions is limited, the environment is discrete; otherwise it is continuous.

Environment Characteristics

                     Fully Observable   Deterministic   Static   Discrete
Solitaire            No                 Yes             Yes      Yes
Backgammon           Yes                No              Yes      Yes
Taxi driving         No                 No              No       No
Internet shopping    No                 No              No       No
Medical diagnosis    No                 No              No       No

→ Lots of real-world domains fall into the hardest case!

Some agent types

(0) Table-driven agents
- Use a percept sequence/action table in memory to find the next action. They are implemented by a (large) lookup table.
(1) Simple reflex agents
- Are based on condition-action rules, implemented with an appropriate production system. They are stateless devices which do not have memory of past world states.
(2) Model-based reflex agents
- Have internal state, which is used to keep track of past states of the world.
(3) Goal-based agents
- Are agents that, in addition to state information, have goal information that describes desirable situations. Agents of this kind take future events into consideration.
(4) Utility-based agents
- Base their decisions on classic axiomatic utility theory in order to act rationally.

(0) Table-driven agents

Table lookup of percept-action pairs mapping from every possible perceived state to the optimal action for that state.
Problems (a size estimate follows this list):
- Too big to generate and to store (chess has about 10^120 states, for example).
- No knowledge of non-perceptual parts of the current state.
- Not adaptive to changes in the environment; the entire table must be updated if changes occur.
- Looping: can't make actions conditional on previous actions/states.
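
A quick back-of-the-envelope computation shows why the table is too big to generate and store even without chess-sized state spaces: the vacuum world has only 4 distinct percepts (2 locations x 2 statuses), yet the table needs one row per percept sequence. The helper below is illustrative, not from the slides.

def table_rows(num_percepts, lifetime):
    # one row per percept sequence of length 1..lifetime
    return sum(num_percepts ** t for t in range(1, lifetime + 1))

print(table_rows(4, 10))              # 1398100 rows after only 10 steps
print(len(str(table_rows(4, 1000))))  # the 1000-step row count has 603 digits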

(0/1) Table-driven/reflex agent architecture

[Figure: architecture diagram.]

(1) Simple reflex agents

Rule-based reasoning to map from percepts to optimal action; each rule handles a collection of perceived states.
Problems:
- Still usually too big to generate and to store.
- Still no knowledge of non-perceptual parts of state.
- Still not adaptive to changes in the environment; the collection of rules must be updated if changes occur.
- Still can't make actions conditional on previous state.
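
A minimal Python sketch of the production-system idea, using the vacuum world's percepts; the rule representation (a condition predicate paired with an action, first match fires) is an assumption for illustration.

RULES = [
    (lambda loc, status: status == "Dirty", "Suck"),
    (lambda loc, status: loc == "A", "Right"),
    (lambda loc, status: loc == "B", "Left"),
]

def simple_reflex_agent(percept):
    location, status = percept
    for condition, action in RULES:
        if condition(location, status):
            return action  # first matching rule fires
    return "NoOp"

print(simple_reflex_agent(("A", "Dirty")))  # -> Suck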

Simple Vacuum Reflex Agent

function Vacuum-Agent([location, status]) returns Action
  if status = Dirty then return Suck
  else if location = A then return Right
  else if location = B then return Left

(2) Model-based reflex agents

Encode "internal state" of the world to remember the past as contained in earlier percepts.
Needed because sensors do not usually give the entire state of the world at each input, so perception of the environment is captured over time. "State" is used to encode different "world states" that generate the same immediate percept.
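
As an illustration of internal state, here is a hedged Python sketch of a model-based vacuum agent that remembers the last known status of each square and models the effect of Suck; the stopping behavior (NoOp once both squares are known clean) is added for the example and is not in the slides.

class ModelBasedVacuumAgent:
    def __init__(self):
        self.known = {"A": None, "B": None}  # internal model: last known status per square

    def act(self, percept):
        location, status = percept
        self.known[location] = status        # update the model from the percept
        if status == "Dirty":
            self.known[location] = "Clean"   # model how the world evolves: Suck cleans
            return "Suck"
        if all(s == "Clean" for s in self.known.values()):
            return "NoOp"                    # model says both squares are clean
        return "Right" if location == "A" else "Left"

agent = ModelBasedVacuumAgent()
print(agent.act(("A", "Dirty")))  # -> Suck
print(agent.act(("A", "Clean")))  # -> Right (B's status still unknown)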

(2) Model-based agent architecture

[Figure: architecture diagram.]

(3) Goal-based agents

Choose actions so as to achieve a (given or computed) goal.
A goal is a description of a desirable situation.
Keeping track of the current state is often not enough; we need to add goals to decide which situations are good.
Deliberative instead of reactive.
May have to consider long sequences of possible actions before deciding if the goal is achieved; this involves consideration of the future: "what will happen if I do ...?"
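
A small sketch of "what will happen if I do ...?": simulate actions with a transition model and search for a sequence that reaches the goal. The vacuum-world model and the breadth-first search below are assumptions for illustration, not the slides' method.

from collections import deque

def simulate(state, action):
    # Transition model: state = (location, status of A, status of B)
    loc, a, b = state
    if action == "Suck":
        return (loc, "Clean", b) if loc == "A" else (loc, a, "Clean")
    if action == "Right":
        return ("B", a, b)
    if action == "Left":
        return ("A", a, b)
    return state  # NoOp

def plan(start, goal_test, actions=("Suck", "Right", "Left")):
    # Breadth-first search over simulated futures; returns an action list
    frontier = deque([(start, [])])
    seen = {start}
    while frontier:
        state, path = frontier.popleft()
        if goal_test(state):
            return path
        for action in actions:
            nxt = simulate(state, action)
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, path + [action]))
    return None

goal = lambda s: s[1] == "Clean" and s[2] == "Clean"  # both squares clean
print(plan(("A", "Dirty", "Dirty"), goal))  # -> ['Suck', 'Right', 'Suck']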

(3) Architecture for goal-based agent

[Figure: architecture diagram.]

Example: Tracking a Target

- The robot must keep the target in view.
- The target's trajectory is not known in advance.
- The robot may not know all the obstacles in advance.
- A fast decision is required.

[Figure: a robot pursuing a moving target among obstacles.]

(4) Architecture for a complete utility-based agent

[Figure: architecture diagram.]

(4) Utility-based agents

When there are multiple possible alternatives, how do we decide which one is best?
A goal specifies only a crude distinction between happy and unhappy states; often we need a more general performance measure that describes "degree of happiness."
A utility function U: State → Reals indicates a measure of success or happiness at a given state.
Utility allows decisions that compare conflicting goals, and that trade off likelihood of success against the importance of a goal (if achievement is uncertain).
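
A minimal Python sketch of utility-based choice under uncertainty: pick the action with the highest expected utility. The outcome model and utility values below are invented purely for illustration.

def expected_utility(action, state, outcomes, utility):
    # sum of P(outcome) * U(outcome) over the action's possible results
    return sum(p * utility(s2) for s2, p in outcomes(state, action))

def choose_action(state, actions, outcomes, utility):
    return max(actions, key=lambda a: expected_utility(a, state, outcomes, utility))

# Toy model: "risky" usually fails, "safe" reliably reaches an OK state.
def outcomes(state, action):
    if action == "risky":
        return [("great", 0.3), ("bad", 0.7)]
    return [("ok", 1.0)]

U = {"great": 10.0, "ok": 4.0, "bad": -5.0}.get

print(choose_action("start", ["risky", "safe"], outcomes, U))  # -> safe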

Summary: Agents

An agent perceives and acts in an environment, has an architecture, and is implemented by an agent program.
Task environment – PEAS (Performance, Environment, Actuators, Sensors).
An ideal agent always chooses the action that maximizes its expected performance, given its percept sequence so far.
An autonomous agent uses its own experience rather than built-in knowledge of the environment provided by the designer.
An agent program maps from percept to action and updates internal state.
- Reflex agents respond immediately to percepts.
- Goal-based agents act in order to achieve their goal(s).
- Utility-based agents maximize their own utility function.
Representing knowledge is important for successful agent design.
The most challenging environments are inaccessible, nondeterministic, dynamic, and continuous.
