Lecture 4: Search in Complex Environments
Why Complex Environments Matter
Many problems have huge or continuous state spaces
Actions can be stochastic; we may not see the full state
Online settings: we discover the world while acting
Local search and approximation often beat systematic search
Learning Objectives
Apply local search: hill-climbing, simulated annealing, genetic algorithms
Handle continuous and nondeterministic search spaces
Search in partially observable environments
Design online search agents for unknown environments
Local Search
State space: Set of configurations
Objective: Find goal state or maximize objective function
Local search: Keep single current state, move to neighbors
Complete-state formulation: Every state assigns values to all variables; search moves between complete configurations
Hill-Climbing Search
Move to neighbor with highest value
Greedy: No backtracking
Problems: Local maxima, plateaus, ridges
Variants: Stochastic HC, first-choice HC, random-restart HC
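Greedy hill-climbing can be sketched in a few lines. This is an illustrative toy (the objective, neighbor function, and names are assumptions, not from the slides):

```python
def hill_climb(start, neighbors, value, max_steps=1000):
    """Greedy local search: repeatedly move to the best neighbor.

    Stops at a local maximum: no neighbor improves on the current state."""
    current = start
    for _ in range(max_steps):
        candidates = neighbors(current)
        if not candidates:
            break
        best = max(candidates, key=value)
        if value(best) <= value(current):
            break  # local maximum (or plateau): greedy search cannot escape
        current = best
    return current

# Toy landscape: maximize f(x) = -(x - 7)^2 over the integers,
# with neighbors x - 1 and x + 1. This landscape is unimodal,
# so greedy climbing reaches the global maximum x = 7.
f = lambda x: -(x - 7) ** 2
result = hill_climb(0, lambda x: [x - 1, x + 1], f)  # 7
```

On a landscape with several peaks, the same code would stop at whichever local maximum is uphill from `start`; that is exactly what random-restart hill climbing works around.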
Simulated Annealing
Allow downhill moves with probability decreasing over time
Temperature T: High → explore, Low → exploit
Schedule: T decreases (e.g., T = T × 0.95)
Convergence: Reaches the global optimum with probability approaching 1 if T decreases slowly enough
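A minimal simulated-annealing sketch, with a geometric cooling schedule as above. The landscape, neighbor function, and best-state tracking are illustrative choices, not from the slides:

```python
import math
import random

def simulated_annealing(start, neighbor, value, T0=10.0, cooling=0.95, steps=2000):
    """Accept uphill moves always; accept downhill moves with prob. exp(delta / T)."""
    random.seed(0)                          # fixed seed so the run is repeatable
    current, best, T = start, start, T0
    for _ in range(steps):
        if T < 1e-9:
            break                           # effectively frozen: pure hill climbing now
        nxt = neighbor(current)
        delta = value(nxt) - value(current)
        if delta > 0 or random.random() < math.exp(delta / T):
            current = nxt
        if value(current) > value(best):
            best = current                  # track the best state seen (a common practical variant)
        T *= cooling                        # geometric schedule: T <- 0.95 T
    return best

# Toy problem: maximize f(x) = -(x - 7)^2 over the integers 0..20,
# proposing random +/-1 moves.
f = lambda x: -(x - 7) ** 2
step = lambda x: min(20, max(0, x + random.choice([-1, 1])))
result = simulated_annealing(0, step, f)
```

High T early on lets the walk wander (explore); as T shrinks, downhill acceptances become rare and the search settles into a peak (exploit).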
Local Beam Search
Keep k states (not just one)
Generate all successors, keep best k
Stochastic beam search: Probabilistic selection
Shares information across parallel searches
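The "shared pool of successors" idea can be sketched as follows (toy objective and neighbor function assumed for illustration):

```python
def local_beam_search(starts, neighbors, value, k=3, max_steps=100):
    """Keep the k best states; expand all of them, keep the best k of the pool."""
    beam = sorted(starts, key=value, reverse=True)[:k]
    for _ in range(max_steps):
        pool = set(beam)
        for s in beam:
            pool.update(neighbors(s))       # successors of ALL beam states compete
        new_beam = sorted(pool, key=value, reverse=True)[:k]
        if set(new_beam) == set(beam):
            break                           # no improvement across the whole beam
        beam = new_beam
    return beam[0]

# Three parallel starts; the ones in poor regions are quickly abandoned
# because the pool is ranked jointly. f(x) = -(x - 7)^2 peaks at x = 7.
f = lambda x: -(x - 7) ** 2
best = local_beam_search([0, 20, 40], lambda x: [x - 1, x + 1], f)  # 7
```

This joint ranking is what distinguishes beam search from k independent hill climbers: states starting at 20 and 40 are dropped once successors near 7 dominate the pool.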
Evolutionary Algorithms
Population: Set of individuals (states)
Fitness: Objective function
Selection: Fitter individuals more likely to reproduce
Crossover: Combine two parents
Mutation: Random change
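The pieces above fit together as in this tiny genetic algorithm. Everything problem-specific here is assumed for illustration: the OneMax fitness (count of 1 bits), tournament selection, and the parameter values:

```python
import random

def genetic_algorithm(pop_size=20, length=12, generations=60, p_mut=0.1):
    """Tiny GA for OneMax: maximize the number of 1 bits in a bitstring."""
    random.seed(1)                         # repeatable run (illustrative choice)
    fitness = sum
    pop = [[random.randint(0, 1) for _ in range(length)] for _ in range(pop_size)]
    for _ in range(generations):
        def select():                      # selection: tournament of size 2
            a, b = random.sample(pop, 2)
            return a if fitness(a) >= fitness(b) else b
        elite = max(pop, key=fitness)      # elitism: carry the best individual over
        children = [elite]
        while len(children) < pop_size:
            p1, p2 = select(), select()
            cut = random.randrange(1, length)                # one-point crossover
            child = p1[:cut] + p2[cut:]
            child = [bit ^ (random.random() < p_mut) for bit in child]  # mutation
            children.append(child)
        pop = children
    return max(pop, key=fitness)

best = genetic_algorithm()
```

Selection pressure plus crossover drives the population toward the all-ones string; mutation keeps supplying the bits that crossover alone cannot create.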
Local Search in Continuous Spaces
Gradient ascent/descent: Step along ∇f to maximize (or along −∇f to minimize)
Discretization: Grid or random sampling
Constraint satisfaction: Lagrange multipliers, penalty methods
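For a differentiable objective, the gradient step is one line. A minimal sketch on an assumed one-dimensional example:

```python
def gradient_ascent(x0, grad, alpha=0.1, steps=200):
    """Follow the gradient to a local maximum: x <- x + alpha * grad(x)."""
    x = x0
    for _ in range(steps):
        x = x + alpha * grad(x)   # alpha is the step size (learning rate)
    return x

# f(x) = -(x - 3)^2 has gradient f'(x) = -2(x - 3) and its maximum at x = 3.
x_star = gradient_ascent(0.0, lambda x: -2 * (x - 3))
```

With this step size the error shrinks by a constant factor each iteration; too large an `alpha` would overshoot and diverge, which is why step-size schedules matter in practice.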
Nondeterministic Actions
Erratic vacuum: Suck on a dirty square may also clean the adjacent square; on a clean square it may deposit dirt
AND-OR search trees: OR nodes (agent chooses an action), AND nodes (nature chooses the outcome; the plan must cover all of them)
Contingency planning: Plan for possible outcomes
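AND-OR search can be sketched as mutually recursive OR and AND steps. The problem encoding (a dict of functions) and the toy transition model are assumptions for illustration:

```python
def or_search(state, problem, path):
    """OR node: the agent needs ONE action whose plan covers all outcomes."""
    if problem['goal'](state):
        return []                       # empty plan: already at a goal state
    if state in path:
        return None                     # cycle on this path: fail
    for action in problem['actions'](state):
        subplans = and_search(problem['results'](state, action), problem, [state] + path)
        if subplans is not None:
            return [action, subplans]   # contingent plan: [action, {outcome: subplan}]
    return None

def and_search(states, problem, path):
    """AND node: nature picks the outcome, so EVERY outcome needs a subplan."""
    plans = {}
    for s in states:
        plan = or_search(s, problem, path)
        if plan is None:
            return None
        plans[s] = plan
    return plans

# Toy nondeterministic problem: from state 1, 'go' erratically lands in
# state 2 or 3; from state 2, 'go' reliably reaches the goal state 3.
problem = {
    'goal': lambda s: s == 3,
    'actions': lambda s: ['go'],
    'results': lambda s, a: {1: {2, 3}, 2: {3}}[s],
}
plan = or_search(1, problem, [])
```

The returned nested structure is exactly a contingency plan: "do `go`; if you land in 3, stop; if you land in 2, do `go` again".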
Partially Observable Environments
Belief state: Set of possible actual states
Belief-state space: Often exponentially large
Sensorless (conformant) planning: No observations
Contingent planning: Use observations when available
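In the sensorless case, the prediction step is just "apply the action in every state we might be in". A minimal sketch on an assumed two-cell world:

```python
def belief_result(belief, action, result):
    """Sensorless (conformant) prediction: the new belief state is the set
    of results of applying the action in every possible current state."""
    return frozenset(result(s, action) for s in belief)

# Two-cell world, locations only (toy model): Right always ends in B,
# Left always ends in A, regardless of where the agent started.
def move(loc, action):
    return 'B' if action == 'Right' else 'A'

b0 = frozenset({'A', 'B'})              # initial ignorance: could be in A or B
b1 = belief_result(b0, 'Right', move)   # the agent now knows it is in B
```

This shows why sensorless plans can still work: an action can shrink the belief state (here from two states to one) even though nothing is ever observed.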
Online Search
Offline: Know full problem before acting
Online: Discover state while acting
Competitive ratio: Total cost of the online path / cost of the optimal offline solution; can be unbounded (e.g., with irreversible actions or dead ends)
Online Search Agents
LRTA* (Learning Real-Time A*): Update h as we go
Exploration vs. exploitation: Try new states vs. follow known path
Learning: Remember visited states and costs
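A stripped-down LRTA*-style agent, assuming unit step costs and the toy corridor world defined below for illustration:

```python
def lrta_star(start, goal, neighbors, h, max_steps=1000):
    """Act online, learning better heuristic estimates H(s) for states we leave."""
    H, s, path = {}, start, [start]
    for _ in range(max_steps):
        if s == goal:
            return path
        cost = lambda n: 1 + H.get(n, h(n))   # step cost + estimated cost-to-go
        best = min(neighbors(s), key=cost)
        H[s] = cost(best)                     # learning update before moving on
        s = best
        path.append(s)
    return path

# Unknown corridor 0..5 with goal 5 and an uninformed heuristic h = 0.
nbrs = lambda n: [m for m in (n - 1, n + 1) if 0 <= m <= 5]
trip = lrta_star(0, 5, nbrs, lambda n: 0)   # walks 0, 1, 2, 3, 4, 5
```

The `H` updates are what make backtracking out of dead ends finite: a state revisited many times accumulates a higher estimate, so the agent eventually prefers unexplored neighbors (exploration) over retreading known ground (exploitation).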
Summary
Local search: Hill-climbing, simulated annealing, genetic algorithms
Continuous: Gradient descent, discretization
Nondeterministic: AND-OR trees, contingency plans
Partial observability: Belief states
Online: LRTA*, exploration
References
Russell & Norvig, AIMA 4e, Ch. 4
Chapter PDF: chapters/chapter-04.pdf
aima-python: search4e.ipynb