NIUHE

日々私たちが过ごしている日常というのは、実は奇迹の连続なのかもしれんな

归档

2018-11-22

RL - Trust Region Policy Optimization (TRPO)

2018-11-16

RL - DQN & A3C & GAE

2018-11-07

中国象棋Zero技术详解

2018-11-06

Hexo 搭建博客踩坑记录

2018-11-03

Microeconomics - Interdependence and the Gains from Trade

2018-11-01

Unity学习笔记

2018-10-03

Microeconomics - Thinking Like an Economist

2018-09-16

Microeconomics - Ten Principles of Economics

2018-05-15

AlphaGo, AlphaGo Zero and AlphaZero

2018-03-10

论文翻译：在没有人类知识的情况下掌握围棋

2018-01-09

RL - Integrating Learning and Planning

2018-01-06

RL - Policy Gradient

2018-01-03

RL - Value Function Approximation

2017-12-21

RL - Model-Free Control

2017-12-16

RL - Model-Free Prediction

2017-12-07

RL - Planning by Dynamic Programming

2017-08-18

RL - Markov Decision Processes

2017-08-15

RL - Introduction to Reinforcement Learning

2017-08-15

Paper Reading - Stacked Attention Networks for Image QA

2017-08-14

Paper Reading - Neural Machine Translation In Linear Time (ByteNet)