BD Brain Drip
đź§ 
Module 03 7 concepts

Function Approximation & Deep RL

DQN, experience replay, and deep reinforcement learning.

01

Deep Q-Networks

Neural network Q-function with experience replay and target networks – the breakthrough that launched deep RL.

02

Double DQN

Decoupling action selection from evaluation to correct DQN’s systematic overestimation of Q-values.

03

Dueling DQN

Separate network streams for state value and action advantage – learning “how good is this state” independently from “how good is this action.”

04

Experience Replay

Storing and randomly sampling past transitions to break temporal correlations and improve sample efficiency.

05

Function Approximation

Replacing lookup tables with parameterized functions to generalize across the vast state spaces of real-world problems.

06

Rainbow DQN

Combining six orthogonal DQN improvements into one agent – the definitive value-based deep RL algorithm.

07

Target Networks

A frozen copy of the Q-network providing stable regression targets – preventing the “moving target” instability.