Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

难道没人发现其实就是两个一摸一样的模型,一个训练好的去训练另一个没训练的嘛... #13

Open
46319943 opened this issue Apr 12, 2022 · 4 comments

Comments

@46319943
Copy link

而这个训练好的模型是根据图片输出操作列表...即一个多分类器

@46319943 46319943 changed the title 难道没人发现其实就是用一个训练好的模型去训练另一个模型嘛... 难道没人发现其实就是两个一摸一样的模型,一个训练好的模型去训练另一个没训练好的嘛... Apr 12, 2022
@46319943 46319943 changed the title 难道没人发现其实就是两个一摸一样的模型,一个训练好的模型去训练另一个没训练好的嘛... 难道没人发现其实就是两个一摸一样的模型,一个训练好的去训练另一个没训练的嘛... Apr 12, 2022
@46319943
Copy link
Author

也算是知识蒸馏了😏

@f200ten
Copy link

f200ten commented Sep 7, 2023

多分类器吗...我还以为是强化学习T_T

@wwwsctvcom
Copy link

一个模型是计算reward的值,一个模型是用于分类,PPO算法本身就是这样的

@46319943
Copy link
Author

一个模型是计算reward的值,一个模型是用于分类,PPO算法本身就是这样的

PPO算法确实是这样的。但是这个代码并不是。不建议阅读并参考这份代码,纯属浪费时间。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants