Lecture 20: Learning Probabilistic Models
Learning Objectives¶
Learn Bayesian network parameters (ML, Bayesian)
Apply EM for hidden variables
Learn HMM parameters
Learn network structure
Maximum Likelihood¶
Data: D = {x¹,...,xᵐ}, assumed i.i.d.
Likelihood: L(θ) = P(D|θ) = ∏ⱼ P(xʲ|θ)
ML estimate: θ* = argmax_θ L(θ); in practice maximize log L(θ)
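As a minimal sketch, consider ML estimation of a single Bernoulli parameter, in the spirit of AIMA's cherry/lime candy example; the data values below are assumed for illustration:

```python
import numpy as np

# Hypothetical i.i.d. Bernoulli data: 1 = cherry, 0 = lime.
data = np.array([1, 1, 0, 1, 0, 1, 1, 1, 0, 1])

# log L(theta) = n1*log(theta) + n0*log(1 - theta);
# setting the derivative to zero gives the closed-form MLE n1 / m.
theta_ml = data.sum() / len(data)
print(f"theta* = {theta_ml:.2f}")   # 0.70
```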
Bayesian Parameter Learning¶
Prior: P(θ)
Posterior: P(θ|D) ∝ P(D|θ) P(θ)
Predict: P(x|D) = ∫ P(x|θ) P(θ|D) dθ
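For a Bernoulli likelihood with a conjugate Beta prior, both the posterior and the predictive integral have closed forms; a sketch with assumed hyperparameters and the counts from the example above:

```python
# Beta(a, b) prior over the Bernoulli parameter theta (a, b assumed).
a, b = 2.0, 2.0          # weak prior centered at 0.5
n1, n0 = 7, 3            # observed counts of 1s and 0s

# Posterior: P(theta | D) = Beta(a + n1, b + n0).
# Predictive: P(x=1 | D) = integral of theta * posterior = posterior mean.
post_a, post_b = a + n1, b + n0
p_next = post_a / (post_a + post_b)
print(f"P(x=1 | D) = {p_next:.3f}")   # 0.643
```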
EM Algorithm¶
Hidden variables: Z unobserved
E-step: compute P(Z|X,θ) under the current parameters θ
M-step: θ ← argmax_θ′ E_{Z|X,θ}[log P(X,Z|θ′)]
Convergence: likelihood never decreases, so EM reaches a local optimum
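The E/M alternation can be written as a small generic loop; this is an illustrative skeleton (the names e_step, m_step and the convergence test are assumptions, not a library API):

```python
import numpy as np

def em(x, theta0, e_step, m_step, max_iter=100, tol=1e-8):
    """Generic EM skeleton.

    e_step(x, theta) -> responsibilities P(Z | X, theta)
    m_step(x, resp)  -> theta maximizing E_Z[log P(X, Z | theta)]
    """
    theta = np.asarray(theta0, dtype=float)
    for _ in range(max_iter):
        resp = e_step(x, theta)          # E-step: posterior over hidden Z
        new_theta = m_step(x, resp)      # M-step: expected complete-data ML
        if np.max(np.abs(new_theta - theta)) < tol:
            break                        # parameters settled: local optimum
        theta = new_theta
    return theta
```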
EM: Mixture of Gaussians¶
Components: K Gaussians
Hidden: which component generated each point
E: soft assignment of each point to components (responsibilities)
M: update mixing weights, means, and covariances from the responsibilities
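A compact 1-D, two-component instance of these updates; the synthetic data and initialization are assumptions for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic 1-D data drawn from two Gaussians.
x = np.concatenate([rng.normal(-2.0, 1.0, 200), rng.normal(3.0, 1.0, 200)])

K = 2
w = np.full(K, 1.0 / K)                 # mixing weights
mu = rng.choice(x, K, replace=False)    # initial means from the data
var = np.full(K, x.var())               # initial variances

def normal_pdf(x, mu, var):
    return np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

for _ in range(100):
    # E-step: responsibilities r[i, k] = P(z_i = k | x_i, theta)
    r = w * normal_pdf(x[:, None], mu, var)
    r /= r.sum(axis=1, keepdims=True)

    # M-step: closed-form updates from the weighted sufficient statistics
    nk = r.sum(axis=0)
    w = nk / len(x)
    mu = (r * x[:, None]).sum(axis=0) / nk
    var = (r * (x[:, None] - mu) ** 2).sum(axis=0) / nk

print("means:", mu, "variances:", var, "weights:", w)
```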
Learning HMMs¶
Baum-Welch: EM for HMM
Parameters: transition model A, emission (sensor) model B, initial distribution π
E: forward-backward yields expected state occupancies and transitions
M: re-estimate A, B, π from the expected counts
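A minimal sketch of Baum-Welch for a single discrete observation sequence, using a scaled forward-backward pass to avoid underflow; the random initialization and the function signature are illustrative, not a standard API:

```python
import numpy as np

def baum_welch(obs, n_states, n_symbols, n_iter=50, seed=0):
    """Estimate pi, A, B from one discrete observation sequence.

    A[i, j] = P(state j at t+1 | state i at t)
    B[i, k] = P(symbol k | state i)
    pi[i]   = P(initial state i)
    """
    obs = np.asarray(obs)
    rng = np.random.default_rng(seed)
    T = len(obs)
    A = rng.random((n_states, n_states)); A /= A.sum(axis=1, keepdims=True)
    B = rng.random((n_states, n_symbols)); B /= B.sum(axis=1, keepdims=True)
    pi = np.full(n_states, 1.0 / n_states)

    for _ in range(n_iter):
        # E-step: scaled forward pass
        alpha = np.zeros((T, n_states))
        c = np.zeros(T)                    # per-step scaling factors
        alpha[0] = pi * B[:, obs[0]]
        c[0] = alpha[0].sum(); alpha[0] /= c[0]
        for t in range(1, T):
            alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
            c[t] = alpha[t].sum(); alpha[t] /= c[t]

        # E-step: scaled backward pass
        beta = np.zeros((T, n_states))
        beta[-1] = 1.0
        for t in range(T - 2, -1, -1):
            beta[t] = (A @ (B[:, obs[t + 1]] * beta[t + 1])) / c[t + 1]

        # Expected state occupancies (gamma) and transitions (xi)
        gamma = alpha * beta
        gamma /= gamma.sum(axis=1, keepdims=True)
        xi = (alpha[:-1, :, None] * A[None, :, :] *
              (B[:, obs[1:]].T * beta[1:])[:, None, :])
        xi /= xi.sum(axis=(1, 2), keepdims=True)

        # M-step: re-estimate parameters from expected counts
        pi = gamma[0]
        A = xi.sum(axis=0) / gamma[:-1].sum(axis=0)[:, None]
        for k in range(n_symbols):
            B[:, k] = gamma[obs == k].sum(axis=0)
        B /= gamma.sum(axis=0)[:, None]
    return pi, A, B
```

For example, `baum_welch([0, 1, 0, 2, 1, 0, 0, 2], n_states=2, n_symbols=3)` returns the re-estimated (π, A, B).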
Summary¶
ML: Maximize likelihood
Bayesian: Posterior over parameters
EM: Hidden variables
HMM: Baum-Welch
References¶
Russell, S. & Norvig, P., Artificial Intelligence: A Modern Approach (AIMA), 4th ed., Ch. 20
Chapter PDF: chapters/chapter-20.pdf