layout: post
title: "Reinforcement Learning Week 6 Course Notes"
date: "2015-09-26 04:17:30"
categories: Computer Science
excerpt: "Advanced Algorithmic Analysis Value iteration 1 tells us that VI converg..."

auth: conge

Advanced Algorithmic Analysis

Value iteration

Linear Programming

Primal

The Dual

Policy Iteration

The Concept of Domination

Why Does Policy Iteration Work

B<sub>2</sub> is Monotonic

Quiz 1

Quiz 1 answers

Wrap-up

2015-09-23 first draft
2015-09-26 completed
2015-12-04 reviewed and revised.