
Reinforcement Learning Seminar Ⅰ


Text editor: TeXmacs

Email: yli340@uic.edu

From Physics to Control Theory

State of Newton’s System (deterministic):

State-space form of Newton’s 2nd Law:
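
One possible way to write the state-space form, as a sketch only. It assumes a point mass $m$ with position $x$, velocity $v$, and applied force $F$; these symbols are illustrative and do not come from the original notes. Stacking position and velocity into a single state $s$ turns the second-order law $m\ddot x = F$ into a first-order system:

$s(t) = \begin{pmatrix} x(t) \\ v(t) \end{pmatrix}$

$\dot s(t) = \begin{pmatrix} v(t) \\ F(x(t), v(t))/m \end{pmatrix} = f(s(t))$

Given $s(0)$, the whole trajectory is determined, which is what "deterministic" refers to here.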

What’s a linear transformation?

Group, Flow, Lie Group

General form of state transform:

From Deterministic to Stochastic

Discrete System Only

Without Controller:

$\dot s(t)=f(s(t)) \rightarrow s_{t+1} = f(s_t)$
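
A minimal sketch of the step from continuous to discrete time. It assumes a forward-Euler discretization with step size `dt`, which is one choice among many and is not stated in the notes. The unit-mass spring and the names `f_continuous` and `euler_step` are illustrative. Note that the discrete map is generally a different function from the continuous $f$, even though the notes reuse the symbol.

```python
def f_continuous(s):
    """Continuous-time dynamics ds/dt = f(s) for a unit-mass spring, s = (x, v)."""
    x, v = s
    return (v, -x)


def euler_step(s, dt=0.01):
    """Discrete-time map s_{t+1} = s_t + dt * f(s_t) (forward Euler, an assumption)."""
    dx, dv = f_continuous(s)
    return (s[0] + dt * dx, s[1] + dt * dv)


# Roll the discrete system forward for 100 steps (one unit of time).
s = (1.0, 0.0)
trajectory = [s]
for _ in range(100):
    s = euler_step(s)
    trajectory.append(s)
```

With the same initial state the loop always produces the same trajectory, so the discrete system is still deterministic.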

Stochastic dynamic system:

$s_{t+1} = f(s_t) \rightarrow s_{t+1} \sim p(s_t)$

Markov System (memoryless):

$s_{t+1} \sim p(s_0,\cdots,s_t)=p(s_t)$
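
The memoryless property can be illustrated with a small sampling sketch. Here $p(s_t)$ is read as the distribution of the next state given the current one. The two-state weather table `P` and its probabilities are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical transition table: P[s][s_next] = probability of moving s -> s_next.
P = {
    "sunny": {"sunny": 0.9, "rainy": 0.1},
    "rainy": {"sunny": 0.5, "rainy": 0.5},
}


def step(s, rng):
    """Sample s_{t+1} ~ p(s_t); only the current state is consulted."""
    next_states = list(P[s].keys())
    weights = list(P[s].values())
    return rng.choices(next_states, weights=weights, k=1)[0]


def rollout(s0, horizon, seed=0):
    """Generate s_0, ..., s_horizon by repeated sampling."""
    rng = random.Random(seed)
    states = [s0]
    for _ in range(horizon):
        # The history states[:-1] is never passed to step: that is the Markov property.
        states.append(step(states[-1], rng))
    return states


states = rollout("sunny", horizon=10)
```

Unlike the deterministic case, two rollouts from the same initial state can differ unless the random seed is fixed.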

With Controller:

$x_{t+1}=f(x_t,u_t)\rightarrow s_{t+1}\sim p(s_t,a_t)$
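
A sketch of a controlled stochastic transition, where the control input $u_t$ of control theory plays the role of the action $a_t$. The three-cell corridor, the action names, and the `slip` probability are assumptions made up for this example.

```python
import random


def transition(s, a, rng, slip=0.2):
    """Sample s_{t+1} ~ p(s_t, a_t) on states {0, 1, 2}.

    The agent moves as commanded, except that with probability `slip`
    it stays where it is (a hypothetical noise model).
    """
    if rng.random() < slip:
        return s
    move = 1 if a == "right" else -1
    # Clamp to the corridor boundaries.
    return min(2, max(0, s + move))


rng = random.Random(0)
s = 0
visited = [s]
for a in ["right", "right", "left", "right"]:
    s = transition(s, a, rng)
    visited.append(s)
```

Setting `slip=0.0` recovers a deterministic controlled system of the form $x_{t+1}=f(x_t,u_t)$.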

Introduction to RL

Interact with environment

Logic -> Supervised -> RL

Safe RL


Next Seminar: MDP

Objective: maximize the expected cumulative reward

$\pi: S\rightarrow\Delta(A)$

$r: S\times A\rightarrow[0,1]$
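
To make the two signatures concrete, here is a toy sketch in which $\pi$ maps each state to a probability distribution over actions and $r$ returns a value in $[0,1]$. The two-state system, the action names, the probabilities, and the deterministic toy transition are all hypothetical.

```python
import random

STATES = [0, 1]
ACTIONS = ["stay", "switch"]

# pi: S -> Delta(A). Each state maps to a distribution over actions.
policy = {
    0: {"stay": 0.3, "switch": 0.7},
    1: {"stay": 0.8, "switch": 0.2},
}


def reward(s, a):
    """r: S x A -> [0, 1]. Staying in state 1 is rewarded (an arbitrary choice)."""
    return 1.0 if (s == 1 and a == "stay") else 0.0


def sample_action(s, rng):
    """Draw a_t ~ pi(s_t)."""
    dist = policy[s]
    return rng.choices(list(dist.keys()), weights=list(dist.values()), k=1)[0]


def total_reward(s0, horizon, seed=0):
    """Sum of rewards collected along one sampled trajectory."""
    rng = random.Random(seed)
    s, total = s0, 0.0
    for _ in range(horizon):
        a = sample_action(s, rng)
        total += reward(s, a)
        # Toy transition (assumption): "switch" flips the state, "stay" keeps it.
        s = 1 - s if a == "switch" else s
    return total


episode_return = total_reward(s0=0, horizon=20)
```

Averaging `total_reward` over many seeds gives a Monte Carlo estimate of how good the policy is, which is the quantity an RL algorithm would try to increase.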