Introduction
Last lecture:
- Model-free prediction
- Estimate the value function of an unknown MDP
This lecture:
- Model-free control
- Optimise the value function of an unknown MDP
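To make "model-free control" concrete, here is a minimal sketch of one such method, tabular Q-learning with an epsilon-greedy behaviour policy, on an invented toy chain MDP (the environment, constants, and episode count are all assumptions for illustration, not taken from the lecture):

```python
import random

random.seed(0)

# Hypothetical toy MDP: states 0..3 on a chain; reaching state 3 ends the
# episode with reward 1. Actions: 0 = left, 1 = right. This is a sketch of
# tabular Q-learning, one model-free control method among several.
N_STATES, ACTIONS = 4, (0, 1)
GAMMA, ALPHA, EPS = 0.9, 0.5, 0.1

def step(s, a):
    """Deterministic transitions; terminal at the right end of the chain."""
    s2 = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
    return s2, (1.0 if s2 == N_STATES - 1 else 0.0), s2 == N_STATES - 1

Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

for _ in range(500):  # episodes
    s, done = 0, False
    while not done:
        # epsilon-greedy behaviour policy over the current Q estimates
        if random.random() < EPS:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s2, r, done = step(s, a)
        # Q-learning update: bootstrap from the greedy action in s2
        target = r + (0.0 if done else GAMMA * max(Q[(s2, a2)] for a2 in ACTIONS))
        Q[(s, a)] += ALPHA * (target - Q[(s, a)])
        s = s2

greedy = [max(ACTIONS, key=lambda act: Q[(s, act)]) for s in range(N_STATES)]
print(greedy)  # greedy policy per state; it should move right toward the reward
```

Note that the agent never sees the transition function `step` directly; it only samples from it, which is exactly what "model-free" means.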
The ordinary days we live through may, in fact, be a succession of miracles.
Last lecture, David taught us how to solve a known MDP, i.e. planning by dynamic programming. In this lecture, we learn how to estimate the value function of an unknown MDP, i.e. model-free prediction. In the next lecture, we will optimise the value function of an unknown MDP.
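As a taste of model-free prediction, the following sketch evaluates a fixed policy by every-visit Monte Carlo on an invented three-state random walk (the environment, discount factor, and episode count are my own assumptions, not the lecture's example):

```python
import random

random.seed(1)

# Hypothetical toy example: a fair-coin random walk on states 0..2, where
# state 2 is terminal and entering it yields reward 1. We estimate the value
# function of the random policy by every-visit Monte Carlo, i.e. by averaging
# sampled returns -- no knowledge of the transition probabilities is used.
GAMMA = 0.9

def episode():
    """Sample a (state, reward) trajectory under the random policy."""
    s, traj = 0, []
    while s != 2:
        s2 = max(0, s - 1) if random.random() < 0.5 else s + 1
        traj.append((s, 1.0 if s2 == 2 else 0.0))
        s = s2
    return traj

returns = {0: [], 1: []}
for _ in range(2000):
    G = 0.0
    for s, r in reversed(episode()):   # accumulate the return backwards
        G = r + GAMMA * G
        returns[s].append(G)           # every-visit Monte Carlo

V = {s: sum(g) / len(g) for s, g in returns.items()}
print(V)  # state 1, being closer to the reward, should have the higher value
```

The estimate converges to the true value function as the number of sampled episodes grows, by the law of large numbers.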
Markov decision processes formally describe an environment for reinforcement learning in which the environment is fully observable, meaning that the current state completely characterises the process.
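Because a known MDP exposes its transition probabilities and rewards explicitly, its value function can be computed by dynamic programming without any sampling. A minimal sketch, with a two-state MDP whose numbers are invented purely for illustration:

```python
# A fully observable, known MDP given as explicit dynamics:
# P[s][a] is a list of (probability, next_state, reward) triples.
# Both the MDP and its numbers are made up for this example.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {"stay": [(1.0, 1, 2.0)], "go": [(1.0, 0, 0.0)]},
}
GAMMA = 0.9

# Iterative policy evaluation (dynamic programming) for the policy that
# always chooses "stay": repeatedly apply the Bellman expectation backup.
V = {0: 0.0, 1: 0.0}
for _ in range(200):
    V = {s: sum(p * (r + GAMMA * v2) for p, s2, r in P[s]["stay"]
                for v2 in [V[s2]])
         for s in P}

print(V)  # "stay" in state 1 earns reward 2 forever: V[1] -> 2 / (1 - 0.9) = 20
```

This is planning: the backup sums over `P` directly. Model-free methods must instead estimate the same quantities from sampled experience.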
RL, especially DRL (Deep Reinforcement Learning), has been a very active research area in recent years. One of the most famous RL systems is AlphaGo, which beat Lee Sedol, one of the world's best Go players, in 2016, and this year (2017) won three games against Ke Jie, the world's No. 1 ranked player. Beyond Go, AI has defeated the best human players in many games, which illustrates the power of combining Deep Learning with Reinforcement Learning. However, although AI now plays some games better than humans, it takes far more time, data, and energy to train, so it can hardly be called truly intelligent. There remain numerous unexplored and unsolved problems in RL research, which is also why we want to learn RL.
This is the first note of David Silver's RL course.
Question answering (QA) is a computer science discipline within the fields of information retrieval and natural language processing (NLP), which is concerned with building systems that automatically answer questions posed by humans in a natural language.
Speech recognition (SR) is an interdisciplinary subfield of computational linguistics that develops methodologies and technologies enabling the recognition and translation of spoken language into text by computers.