Photo by Marco Assmann

RL — DQN Deep Q-network

Atari game

Q-learning

source
Source
Source

Solutions

Source

Implementation details

Green is the Huber loss and blue is the quadratic loss (Wikipedia)
Q-learning: modified from source
Modified from source

Architecture

Improvements to DQN

Source
Source

More thoughts

Credit and references

Deep Learning