You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for your great work. After train and test with test.py, I got results.npz file. However, the mean of train_returns and valid returns seems to be very similar, the difference is always below 1. I've tried different fast-lr rate 0.1 0.01 0.001 even 10, increased number of batches to 1000, and also different num-step in the test.py. Nevertheless, there is no significant difference between train_returns and valid returns. I tried on halfcheetah-vel and dir environments and the problem occurred in both env. Do you have any suggestions?
The text was updated successfully, but these errors were encountered:
Hi,
Thank you for your great work. After train and test with test.py, I got results.npz file. However, the mean of train_returns and valid returns seems to be very similar, the difference is always below 1. I've tried different fast-lr rate 0.1 0.01 0.001 even 10, increased number of batches to 1000, and also different num-step in the test.py. Nevertheless, there is no significant difference between train_returns and valid returns. I tried on halfcheetah-vel and dir environments and the problem occurred in both env. Do you have any suggestions?
The text was updated successfully, but these errors were encountered: