These notes are from David Silver's Reinforcement Learning course.
I've tried to write down most of the explanations he goes through and expanded on certain sections I personally felt needed simplification. I'll be linking them below as I progress through the lectures.
There might be notational inconsistencies, in which case the source material supersedes everything. If there's a conceptual flaw in my writing, feel free to reach out.