Lecture 2: Intelligent Agents

AIMA Chapter 2 — 1 hour

Learning Objectives

  • Define agents and environments

  • Understand rationality and performance measures

  • Classify task environments by key properties

  • Describe agent architectures: reflex, model-based, goal-based, utility-based, learning

Agents and Environments

  • Agent: Entity that perceives and acts

  • Environment: Everything outside the agent

  • Percept: Agent’s perceptual input at any instant

  • Percept sequence: Complete history of percepts

The Agent-Environment Loop

Figure: the agent–environment loop

  • Agent receives percepts, chooses actions

  • Environment changes in response to actions

  • Agent’s next percept depends on the new state
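The loop above can be sketched in a few lines of Python. The `Environment`/`Agent` classes and the `percept`, `step`, and `act` method names below are illustrative, not from AIMA's code:

```python
class Environment:
    """A toy one-dimensional world: the state is a single position."""
    def __init__(self, position):
        self.position = position

    def percept(self):
        # The agent's perceptual input at this instant.
        return self.position

    def step(self, action):
        # The environment changes in response to the action.
        self.position += action

class Agent:
    def act(self, percept):
        # Choose an action based on the current percept: move toward 0.
        return -1 if percept > 0 else (1 if percept < 0 else 0)

env = Environment(position=3)
agent = Agent()
for _ in range(5):
    p = env.percept()   # agent receives a percept
    a = agent.act(p)    # agent chooses an action
    env.step(a)         # environment changes; next percept reflects new state

print(env.position)  # → 0
```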

Good Behavior: Rationality

  • Rational agent: For each possible percept sequence, selects action expected to maximize performance measure

  • Performance measure: Criterion for success (e.g., score, safety)

  • Omniscience: Knowing actual outcome of actions — we assume agents don’t have this

Rationality vs. Omniscience

  • Rational ≠ omniscient

  • Rational agent acts on available information

  • Learning: Improve from experience

  • Autonomy: Operate correctly without constant human intervention

Specifying the Task Environment

PEAS framework:

  • Performance measure

  • Environment

  • Actuators

  • Sensors

Example: Autonomous taxi

  • P: Safe, fast, legal, comfortable

  • E: Streets, traffic, pedestrians

  • A: Steering, accelerator, brake, display

  • S: Cameras, sonar, GPS, odometer

Properties of Task Environments

Property                        | Key question
--------------------------------|-----------------------------------------------------
Fully vs. partially observable  | Can the agent sense the complete state?
Single vs. multi-agent          | Are other agents acting in the environment?
Deterministic vs. stochastic    | Is the next state fully determined by state + action?
Episodic vs. sequential         | Does the current choice affect future decisions?
Static vs. dynamic              | Does the environment change while the agent deliberates?
Discrete vs. continuous         | Are states, percepts, and actions finite?
Known vs. unknown               | Is the transition model known to the agent?

Fully vs. Partially Observable

  • Fully observable: Sensors give access to complete state

  • Partially observable: Noisy/incomplete sensors (e.g., poker, medical diagnosis)

  • Unobservable: No sensors — agent must act blindly

Single vs. Multi-Agent

  • Single-agent: Only one agent (e.g., crossword puzzle)

  • Multi-agent: Other agents (competitive or cooperative)

  • Competitive: Zero-sum games

  • Cooperative: Team goals

The Structure of Agents

  • Agent program: Implementation of the agent function

  • Agent function: Maps percept sequences to actions

  • Agent = architecture + program
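In principle, the agent function could be implemented as a literal lookup table over percept sequences (AIMA's table-driven agent). A sketch of that idea, with a toy table invented for illustration:

```python
def make_table_driven_agent(table):
    percepts = []  # the complete percept sequence so far

    def agent(percept):
        percepts.append(percept)
        # Look up the action for the entire percept history.
        return table.get(tuple(percepts))

    return agent

# Toy table mapping percept sequences to actions (invented example).
table = {
    ("dirty",): "suck",
    ("clean",): "move",
    ("clean", "dirty"): "suck",
}

agent = make_table_driven_agent(table)
print(agent("clean"))  # → move
print(agent("dirty"))  # → suck
```

The table grows exponentially with the length of the percept sequence, which is why the architectures below replace it with compact programs.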

Simple Reflex Agents

  • Select action based on current percept only

  • Condition–action rules: if percept then action

  • No memory of past percepts

  • Limitation: Cannot handle partial observability
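For AIMA's two-square vacuum world, a simple reflex agent reduces to a handful of condition–action rules on the current percept alone (a sketch; the percept is assumed to be a `(location, status)` pair):

```python
def reflex_vacuum_agent(percept):
    location, status = percept
    # Condition-action rules: if <condition> then <action>.
    if status == "Dirty":
        return "Suck"
    elif location == "A":
        return "Right"
    else:  # location == "B"
        return "Left"

print(reflex_vacuum_agent(("A", "Dirty")))  # → Suck
print(reflex_vacuum_agent(("A", "Clean")))  # → Right
```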

Model-Based Reflex Agents

  • Maintain internal state to track world

  • Model: How world evolves, effect of actions

  • Update state from: (1) how world changes, (2) how actions affect world

  • Can handle partially observable environments
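A minimal sketch of the model-based idea, again in the vacuum world: the agent's sensor reports only the current square, so it keeps an internal record of which squares it believes are clean (the state-tracking scheme below is illustrative):

```python
def make_model_based_vacuum_agent():
    cleaned = set()  # internal state: squares believed clean

    def agent(percept):
        location, status = percept
        # (1) Update state from how the world is...
        if status == "Clean":
            cleaned.add(location)
        if status == "Dirty":
            # (2) ...and from the predicted effect of our own action:
            # after sucking, this square will be clean.
            cleaned.add(location)
            return "Suck"
        if cleaned >= {"A", "B"}:
            return "NoOp"  # the model says everything is clean
        return "Right" if location == "A" else "Left"

    return agent

agent = make_model_based_vacuum_agent()
print(agent(("A", "Dirty")))  # → Suck
print(agent(("B", "Clean")))  # → NoOp (both squares now believed clean)
```

Unlike the simple reflex agent, this one can stop once its internal state says the job is done, even though no single percept reveals that fact.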

Goal-Based Agents

  • Have goals — desirable states

  • Consider future: “What if I do action A?”

  • Search and planning to achieve goals

  • More flexible than reflex agents
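The "what if I do action A?" question is answered by search: simulate actions against a model instead of executing them. A sketch using breadth-first search over a toy state graph (the map and goal are invented for illustration):

```python
from collections import deque

def plan(start, goal, successors):
    """Return a list of actions leading from start to goal, or None."""
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        state, actions = frontier.popleft()
        if state == goal:
            return actions
        for action, next_state in successors(state):
            # "What if I do action A?" -- simulate, don't execute.
            if next_state not in visited:
                visited.add(next_state)
                frontier.append((next_state, actions + [action]))
    return None

# Toy map: rooms connected by doors.
edges = {"hall": [("east", "kitchen"), ("south", "study")],
         "kitchen": [("south", "garden")],
         "study": [("east", "garden")],
         "garden": []}

print(plan("hall", "garden", lambda s: edges[s]))  # → ['east', 'south']
```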

Utility-Based Agents

  • Utility function: Maps states to real numbers (degree of happiness)

  • Handles trade-offs (e.g., fast vs. safe)

  • Handles uncertainty (expected utility)

  • Generalizes goal-based agents: a goal is equivalent to a binary utility (high in goal states, low elsewhere)
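Under uncertainty, the rational choice maximizes expected utility: each action's utilities are weighted by the probabilities of its outcomes. A sketch of the fast-vs.-safe trade-off, with invented probabilities and utilities:

```python
def expected_utility(outcomes):
    """outcomes: list of (probability, utility) pairs for one action."""
    return sum(p * u for p, u in outcomes)

# Invented numbers: a fast route with a small crash risk vs. a slow sure one.
actions = {
    "highway":    [(0.9, 100), (0.1, -500)],  # EU = 0.9*100 + 0.1*(-500) = 40
    "back_roads": [(1.0, 60)],                # EU = 60
}

best = max(actions, key=lambda a: expected_utility(actions[a]))
print(best)  # → back_roads
```

The riskier action has the higher payoff in the best case, yet the expected-utility calculation prefers the certain, moderate outcome.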

Learning Agents

  • Performance element: Selects actions (like previous agents)

  • Learning element: Modifies performance element

  • Critic: Feedback on how well agent is doing

  • Problem generator: Suggests exploratory actions
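The four-component decomposition can be sketched in miniature. Everything below is invented for illustration: the performance element has one tunable parameter, the critic scores each action, and the learning element nudges the parameter after mistakes:

```python
import random

random.seed(0)
threshold = 0.0  # the performance element's single parameter

def performance_element(percept):
    # Selects actions, like the agents above.
    return "act" if percept > threshold else "wait"

def critic(percept, action):
    # Feedback: acting is deemed correct exactly when percept > 0.5.
    return 1.0 if (action == "act") == (percept > 0.5) else -1.0

def learning_element(percept, reward):
    # Modifies the performance element after negative feedback.
    global threshold
    if reward < 0:
        threshold += 0.5 * (percept - threshold)

for _ in range(1000):
    p = random.random()  # random percepts stand in for a problem generator
    a = performance_element(p)
    learning_element(p, critic(p, a))

print(f"learned threshold: {threshold:.2f}")  # drifts toward 0.5
```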

Summary: Agent Types

Agent type         | Key feature
-------------------|------------------------------------
Simple reflex      | Current percept → action
Model-based reflex | Internal state + world model
Goal-based         | Goals + search/planning
Utility-based      | Utility function + expected utility
Learning           | Improves from experience

Questions?

Next lecture: Solving Problems by Searching (Chapter 3)