仓库源文站点原文


layout: post title: "Reinforcement Learning 第十二周课程笔记" date: "2015-11-04 11:01:52" categories: 计算机科学 excerpt: "This week watch Game Theory. The readings are Littman (1994), Littman an..."

auth: conge

This week

Game Theory III

Definition of Game Theory

Example

Quiz 1: simple game example

Quiz 2: Represent the tree using a matix

Minimax

MiniMax

Von Neumann theorem

This is important so I am writing it down:

Now, to make the problem a bit more complex, we change the game to be non-deterministic:

quiz 3: strategy matrix for non-deterministic game

Minipoker

Mini-Poker: description

Mini-Poker: Tree and Matrix

Mixed Strategy

Quiz 5: Given B's strategy, we can figure out A's expected profit

Quiz 6: A's expected value is dependent on B

Snitch

Prisonders' Dilemma

A Beautiful Equilibrium

Nash Equilibrium

Theorem

Play the game multiple times: won't change NE

Recap

Recap

Andrew Moore's slides on Zero-Sum Games Andrew Moore's slides on Non-Zero-Sum Games

2015-11-03 初稿
2016-04-26 复习并添加部分内容