Understanding the Significance of Loss Change in Conservative Q-Learning Training #70

constellation39 · 2024-05-15T04:26:20Z

constellation39
May 15, 2024

Hello Cryolite,

Is the change in loss during Conservative Q-Learning training of any reference value?
In my attempts at training, the loss in CQL is always increasing. It remains very stable until 1 million training steps, after which it starts to increase linearly.

Thank you.

constellation39 · 2024-05-25T13:20:39Z

constellation39
May 25, 2024
Author

I understand that there is an error in the reward function I wrote.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Understanding the Significance of Loss Change in Conservative Q-Learning Training #70

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Understanding the Significance of Loss Change in Conservative Q-Learning Training #70

constellation39 May 15, 2024

Replies: 1 comment

constellation39 May 25, 2024 Author

constellation39
May 15, 2024

constellation39
May 25, 2024
Author