
Intelligent Systems

Lecture 2

Bicol University College of Science


1st Semester, 2019-2020
Intelligent Agents

[Diagram: the agent receives percepts from the environment through its sensors and acts on the environment through its effectors.]
What is an agent?
An agent is a system that...

How to design this?

• perceives its environment through its sensors
• acts on its environment through actuators or effectors
What is an agent?
Human-Robot Analogy

Agent    | Sensors                            | Effectors
Human    | eyes, ears, nose, etc.             | arms, legs, mouth, etc.
Robot    | camera, microphone, infrared, etc. | motors, wheels, robot arm, etc.
Software | keystrokes, file contents, etc.    | graphic display, file output, etc.
Agent’s Behavior

Some agents tend to behave better than others.
An agent's behavior depends on the nature of its environment.
Environments vary in difficulty.
Agent’s Percept, Actions and Behavior

Percept refers to what the agent perceives at any given instant.
Percept sequence is the complete history of everything the agent has ever perceived.
An agent's choice of action at any given instant can depend on the entire percept sequence observed to date.
An agent's behavior is described by the agent function that maps any given percept sequence to an action.
An agent program is a concrete implementation of the agent function.
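
As a minimal Python sketch (all names here are illustrative, not prescribed by the lecture), the agent program is the concrete code that runs, while the agent function is the abstract mapping it realizes:

class Agent:
    """Sketch: an agent program implementing an agent function."""

    def __init__(self):
        self.percept_sequence = []  # complete history of percepts

    def program(self, percept):
        # Agent program: record the new percept, then choose an action.
        self.percept_sequence.append(percept)
        return self.agent_function(tuple(self.percept_sequence))

    def agent_function(self, percept_sequence):
        # Agent function: maps any percept sequence to an action.
        raise NotImplementedError  # supplied by a concrete agent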
Example

The vacuum-cleaner agent

• Environment: squares A and B
• Percepts: location and content, e.g. (A, Dirty)
• Actions: Left, Right, Suck, and NoOp
Example

The vacuum-cleaner agent

Percept sequence        | Action
[A, Clean]              | Right
[A, Dirty]              | Suck
[B, Clean]              | Left
[B, Dirty]              | Suck
[A, Clean], [A, Clean]  | Right
[A, Clean], [A, Dirty]  | Suck
…                       | …
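
The first rows of this table can be written directly as a Python dictionary, keyed by the percept sequence observed so far (a sketch; the full table would need an entry for every possible sequence):

# Partial action table for the vacuum world.
table = {
    (("A", "Clean"),): "Right",
    (("A", "Dirty"),): "Suck",
    (("B", "Clean"),): "Left",
    (("B", "Dirty"),): "Suck",
    (("A", "Clean"), ("A", "Clean")): "Right",
    (("A", "Clean"), ("A", "Dirty")): "Suck",
    # ... and so on for every possible percept sequence
}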
Example

The vacuum-cleaner agent

function REFLEX-VACUUM-AGENT([location, status]) returns an action
  if status == Dirty then return Suck
  else if location == A then return Right
  else if location == B then return Left
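
A direct Python translation of this pseudocode might look as follows (a sketch; location and status are plain strings here):

def reflex_vacuum_agent(location, status):
    # Decide from the current percept only, ignoring all history.
    if status == "Dirty":
        return "Suck"
    elif location == "A":
        return "Right"
    elif location == "B":
        return "Left"

# reflex_vacuum_agent("A", "Dirty")  ->  "Suck"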
Rational Agents (RA)

A rational agent is an agent that does the right thing given what it knows.

The right action is the one that will cause the agent to be most successful.
Therefore, a way to measure the agent's success is needed.
RA Performance Measure

The performance measure embodies the criterion for success of an agent's behavior.

The agent's sequence of actions causes the environment to go through a sequence of states.
If the resulting sequence is desirable, then the agent has performed well.
It is also important to decide when to evaluate performance.
RA Performance Measure

Going back to the vacuum-cleaner agent...

What is the most suitable performance measure for this agent?
Is the amount of dirt cleaned up in a single 8-hour shift a good performance measure?
Why or why not?
RA Performance Measure

IN GENERAL...

It is better to design performance measures according to what one actually wants in the environment, rather than according to how one thinks the agent should behave.
What is Rationality?

Rationality at any given time depends on four things:

the performance measure
prior knowledge of the environment
the actions that the agent can perform
the percept sequence to date, i.e. everything the agent has perceived so far
What is Rationality?

Definition of an Ideal Rational Agent

For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has.
What is Rationality?

Rationality vs. Omniscience

An omniscient agent knows the actual outcome of its actions and can act accordingly.

→ Omniscience is impossible to achieve
What is Rationality?

Being rational is NOT being...

Clairvoyant
Successful
Perfect
What is Rationality?

Rationality involves LEA...

Learning - as the agent gains experience, its prior knowledge of the environment may be modified and augmented → the agent then modifies its behavior accordingly
Exploration - must be undertaken by an agent in an initially unknown environment
Autonomy - learning what it can to compensate for partial or incorrect prior knowledge, so that after sufficient experience of its environment, its behavior can become effectively independent of its prior knowledge
Agents
How does the inside of the agent work?
• Agent = architecture + program

All agents have the same skeleton:
• Input = current percepts
• Output = action
• Program = manipulates input to produce output
Specification of Task Environment

Designing a rational agent requires specification of the task environment (PAGE):

Percepts
Actions
Goals
Environment
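
As a quick illustration, the PAGE specification of the vacuum-cleaner agent from the earlier example could be written down as plain data (a sketch; the wording of each entry is illustrative):

# Illustrative PAGE specification for the vacuum-cleaner agent.
vacuum_page = {
    "Percepts": ["location (A or B)", "content (Clean or Dirty)"],
    "Actions": ["Left", "Right", "Suck", "NoOp"],
    "Goals": ["keep both squares clean"],
    "Environment": ["two squares, A and B, that may contain dirt"],
}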
Table-Driven Agent (TDA)

A table-driven agent stores the percept sequence in memory and uses it as an index into a table that contains the appropriate action for all possible sequences.
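
A minimal Python sketch of such an agent (names are illustrative): the remembered percept sequence is the lookup key.

def table_driven_agent(table):
    percepts = []  # everything perceived so far

    def program(percept):
        percepts.append(percept)
        # The whole percept sequence indexes the table;
        # returns None for sequences the table does not cover.
        return table.get(tuple(percepts))

    return program

Fed the partial vacuum table sketched earlier, the first call program(("A", "Dirty")) would return "Suck".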
Table-Driven Agent (TDA)

This approach is doomed to failure


Table-Driven Agent (TDA)

What's wrong with TDAs?

The table can be extremely large (e.g. all possible move sequences in chess)
Time constraints (e.g. the agent's lifetime)
Autonomy - since the table is built-in knowledge, there is no autonomy at all; if the environment changes in a way that was not foreseen by the designer, the agent will not be able to act rationally
Even if there is a learning mechanism, it would take a very long time to learn the right value for all table entries

Despite all this, a table-driven agent does do what we want: it implements the desired agent function.
The key challenge in AI is to find out how to write programs that produce rational behavior from a small amount of code rather than from a large number of table entries.
Agent types
Four basic kinds of agent programs:

Simple reflex agents
Model-based reflex agents
Goal-based agents
Utility-based agents

All of these can be turned into learning agents.
Agent types
Simple Reflex

• Selects an action on the basis of only the current percept
• E.g. the vacuum agent
• Large reduction in possible percept/action situations
• Implemented through condition-action rules
• If dirty then suck
Agent types
The vacuum-cleaner world

function REFLEX-VACUUM-AGENT([location, status]) returns an action
  if status == Dirty then return Suck
  else if location == A then return Right
  else if location == B then return Left

Reduction from 4^T table entries to just 4 condition-action rules

Agent types
Simple reflex

function SIMPLE-REFLEX-AGENT(percept) returns an action
  static: rules, a set of condition-action rules

  state ← INTERPRET-INPUT(percept)
  rule ← RULE-MATCH(state, rules)
  action ← RULE-ACTION[rule]
  return action

This will only work if the environment is fully observable; otherwise infinite loops may occur.
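
In Python, this skeleton might be sketched as follows, with rules given as (condition, action) pairs (illustrative; it assumes the percept can be interpreted into some state object):

def simple_reflex_agent(rules, interpret_input):
    def program(percept):
        state = interpret_input(percept)   # state <- INTERPRET-INPUT(percept)
        for condition, action in rules:    # rule  <- RULE-MATCH(state, rules)
            if condition(state):
                return action              # action <- RULE-ACTION[rule]
        return None  # no rule matched

    return program

# Vacuum-world example: the percept itself serves as the state.
agent = simple_reflex_agent(
    rules=[(lambda s: s[1] == "Dirty", "Suck"),
           (lambda s: s[0] == "A", "Right"),
           (lambda s: s[0] == "B", "Left")],
    interpret_input=lambda percept: percept,
)
# agent(("A", "Dirty"))  ->  "Suck"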
Agent types
Model-based reflex agents

• To tackle partially observable environments
• Maintain internal state
• Over time, update the state using world knowledge:
  • How does the world change?
  • How do actions affect the world?
→ Model of the World
Agent types
Model-based reflex agents

function REFLEX-AGENT-WITH-STATE(percept) returns an action
  static: rules, a set of condition-action rules
          state, a description of the current world state
          action, the most recent action

  state ← UPDATE-STATE(state, action, percept)
  rule ← RULE-MATCH(state, rules)
  action ← RULE-ACTION[rule]
  return action
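
A Python sketch of the same skeleton (illustrative; update_state stands in for the agent's model of how the world evolves):

class ModelBasedReflexAgent:
    def __init__(self, rules, update_state):
        self.rules = rules                # condition-action rules
        self.update_state = update_state  # world model
        self.state = None                 # description of the current world state
        self.action = None                # most recent action

    def program(self, percept):
        # Fold the new percept, and what our last action did, into the state.
        self.state = self.update_state(self.state, self.action, percept)
        for condition, action in self.rules:
            if condition(self.state):
                self.action = action
                return action
        return None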
Agent types
Goal-based

• The agent needs a goal to know which situations are desirable
• Things become difficult when long sequences of actions are required to reach the goal
• Typically investigated in search and planning research
• Major difference: the future is taken into account
• More flexible, since knowledge is represented explicitly and can be manipulated
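
A hedged one-step sketch of the idea in Python (illustrative; result stands for a model predicting the state an action would lead to):

def goal_based_agent(actions, result, goal_test):
    def program(state):
        # Look ahead: predict the outcome of each action and pick one
        # whose predicted state satisfies the goal.
        for action in actions:
            if goal_test(result(state, action)):
                return action
        return None  # no single action reaches the goal; longer sequences need search

    return program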
Agent types
Utility-based

• Certain goals can be reached in different ways
• Some ways are better: they have a higher utility
• A utility function maps a (sequence of) state(s) onto a real number
• Improves on goals:
  • Selecting between conflicting goals
  • Selecting appropriately between several goals based on likelihood of success
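
Replacing the binary goal test with a utility function gives a sketch like this (illustrative names, following the goal-based sketch above):

def utility_based_agent(actions, result, utility):
    def program(state):
        # Choose the action whose predicted resulting state scores highest.
        return max(actions, key=lambda a: utility(result(state, a)))

    return program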
Agent types
Learning

• All the previous agent programs describe methods for selecting actions
• But they do not explain the origin of these programs
• Learning mechanisms can be used to perform this task
• Teach agents instead of instructing them
• Advantage: robustness of the program toward initially unknown environments
Agent types
Learning agent

• Learning element: introduces improvements in the performance element
• Critic: provides feedback on the agent's performance based on a fixed performance standard
• Performance element: selects actions based on percepts
  • Corresponds to the previous agent programs
• Problem generator: suggests actions that will lead to new and informative experiences
  • Exploration vs. exploitation
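
One way the four components could be wired together, as a loose Python sketch (the names and calling convention are assumptions for illustration only):

class LearningAgent:
    def __init__(self, performance_element, critic, learning_element,
                 problem_generator):
        self.performance_element = performance_element  # selects actions from percepts
        self.critic = critic                            # scores behavior vs. a fixed standard
        self.learning_element = learning_element        # improves the performance element
        self.problem_generator = problem_generator      # proposes informative experiments

    def step(self, percept):
        feedback = self.critic(percept)  # how well are we doing?
        self.learning_element(self.performance_element, feedback)
        exploratory = self.problem_generator(percept)
        # Exploration vs. exploitation: try something new, or act as learned.
        if exploratory is not None:
            return exploratory
        return self.performance_element(percept)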
Environment

The environment provides percepts to agents and receives actions from them.
Environment types

                 Solitaire   Backgammon   Internet shopping   Taxi
Accessible??
Deterministic??
Episodic??
Static??
Discrete??
Single-agent??

Each dimension is defined below; the completed table appears at the end.
Environment types

Accessible vs. inaccessible: an environment is accessible when the sensors can detect all aspects that are relevant to the choice of action.
Environment types

Deterministic vs. nondeterministic: if the next environment state is completely determined by the current state and the executed action, then the environment is deterministic.
Environment types

Episodic vs. nonepisodic: in an episodic environment, the agent's experience can be divided into atomic episodes in which the agent perceives and then performs a single action. The choice of action depends only on the episode itself.
Environment types

Static vs. dynamic: if the environment can change while the agent is choosing an action, the environment is dynamic. It is semi-dynamic if the agent's performance score changes with time even when the environment remains the same.
Environment types

Discrete vs. continuous: this distinction can be applied to the state of the environment, to the way time is handled, and to the percepts and actions of the agent.
Environment types

Single-agent vs. multi-agent: does the environment contain other agents that are also maximizing some performance measure that depends on the current agent's actions?

                 Solitaire   Backgammon   Internet shopping   Taxi
Accessible??     FULL        FULL         PARTIAL             PARTIAL
Deterministic??  YES         NO           YES                 NO
Episodic??       NO          NO           NO                  NO
Static??         YES         YES          SEMI                NO
Discrete??       YES         YES          YES                 NO
Single-agent??   YES         NO           NO                  NO
Environment types
The simplest environment is:
• fully observable, deterministic, episodic, static, discrete, and single-agent.

Most real situations are:
• partially observable, stochastic, sequential, dynamic, continuous, and multi-agent.
