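In the proof of Lemma 7.2,

$$Q_{\pi}(s,a)=\mathbb{E}_{S' \sim p(\cdot \mid s;\, \theta)}\left[R(s,a,S')+\gamma \cdot V_{\pi}(s')\right]$$

Here $V_{\pi}(S')$ should still be a function of the random variable $S'$; mathematically, it is not describing one specific state, so I think this may be an error. It should be:

$$Q_{\pi}(s,a)=\mathbb{E}_{S' \sim p(\cdot \mid s;\, \theta)}\left[R(s,a,S')+\gamma \cdot V_{\pi}(S')\right]$$

I'm not very familiar with Markdown on GitHub, so please bear with me.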
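To illustrate why the argument inside the expectation has to be the random variable $S'$, here is a minimal sketch for a tabular case. It assumes a finite set of next states and a transition distribution conditioned on a fixed state-action pair; the function name `q_value` and the toy numbers are hypothetical and do not come from the book. The value function is evaluated at every possible next state and weighted by its probability, rather than at a single fixed state.

```python
def q_value(p_next, reward, v, gamma):
    """Expectation over the next state S' for one fixed (s, a) pair.

    p_next: dict mapping each next state s' to its probability (assumed to sum to 1)
    reward: dict mapping each next state s' to R(s, a, s')
    v:      dict mapping each next state s' to V_pi(s')
    gamma:  discount factor
    """
    # The loop variable s_next ranges over all outcomes of S',
    # so V_pi is averaged over S' instead of being read at one state.
    return sum(
        prob * (reward[s_next] + gamma * v[s_next])
        for s_next, prob in p_next.items()
    )


# Hypothetical toy numbers: two possible next states.
p_next = {"x": 0.5, "y": 0.5}
reward = {"x": 1.0, "y": 1.0}
v = {"x": 0.0, "y": 10.0}
q = q_value(p_next, reward, v, gamma=0.9)
```

Once the expectation is expanded into a sum, the lowercase `s_next` appears only as the bound summation variable, which is the distinction the correction above is making.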
Lemma 7.4 may have the same problem.
That was a typo on my part. You caught it just before publication, thank you so much!!!