Skip to content

Commit 5dac8ea

Browse files
committed
Updates dl/reinforcement/reinforcement.md
Auto commit by GitBook Editor
1 parent fc3ef73 commit 5dac8ea

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

assets/reinforcementlearning4.png

141 KB
Loading

dl/reinforcement/reinforcement.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -21,8 +21,6 @@ Log-Likelihood:计算每一个动作的概率,$$log\pi_\theta(a|s) = log[P_\
2121

2222
Log-Likelihood: $$log\pi_\theta(a|s) = -\frac{1}{2}( \sum_{i=1}^{k}(\frac{(a_i-\mu_i)^2)}{\delta_i^2}))+klog2\pi)$$
2323

24-
25-
2624
下一步的表示:
2725

2826
$$s_{t+1} = f(s_t,a_t)$$
@@ -40,6 +38,8 @@ $$s_{t+1} \sim P(\odot|s_t, a_t)$$
4038

4139
![](/assets/reinforcementlearning1.png)
4240

41+
![](/assets/reinforcementlearning4.png)
42+
4343
强化学习因其注重agent在与环境的直接交互中进行学习而有别于其他学习方
4444

4545
---

0 commit comments

Comments
 (0)