Gymnasium api #1

Open
wants to merge 76 commits into
base: main

korneelf1

This pull request adds a file that extends the simulator with:

  • Reward calculation
  • Episode management
  • Reset management
  • Gymnasium integration

Offers a step function that behaves as expected: one call advances all N_drones drones by a single step.
With conventional RL frameworks, stick to N_drones = 1.
With N_drones > 1, apply a post-processor that adds each drone's transition to the buffer separately (the details depend on how your framework operates).
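
The post-processing step for N_drones > 1 could look roughly like the sketch below. It is not code from this PR: the `Transition` tuple, the `split_transitions` helper, the `deque` buffer, and the batch layout (one entry per drone along the first axis) are all assumptions made for illustration.

```python
from collections import deque
from typing import Any, Deque, List, NamedTuple, Sequence


class Transition(NamedTuple):
    """One single-drone transition (hypothetical layout)."""
    obs: Any
    action: Any
    reward: float
    next_obs: Any
    done: bool


def split_transitions(
    obs: Sequence[Any],
    actions: Sequence[Any],
    rewards: Sequence[float],
    next_obs: Sequence[Any],
    dones: Sequence[bool],
) -> List[Transition]:
    """Turn one batched step over N drones into N single-drone transitions."""
    n_drones = len(rewards)
    return [
        Transition(obs[i], actions[i], float(rewards[i]), next_obs[i], bool(dones[i]))
        for i in range(n_drones)
    ]


# Hypothetical replay buffer; a real RL framework would provide its own.
replay_buffer: Deque[Transition] = deque(maxlen=100_000)

# Made-up batch for N_drones = 3; the shapes are assumptions.
obs = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
actions = [[0.1], [0.2], [0.3]]
rewards = [1.0, -0.5, 0.0]
next_obs = [[0.1, 0.0], [1.2, 0.0], [2.3, 0.0]]
dones = [False, True, False]

# Each drone contributes its own entry to the buffer.
replay_buffer.extend(split_transitions(obs, actions, rewards, next_obs, dones))
```

Storing one entry per drone keeps the buffer compatible with frameworks that sample single-agent transitions.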

Offers a step_rollout function that acts similarly to the step function but accepts a controller as input to accelerate rollout collection. While this method allows a 50x increase in speed, take care with, e.g., exploration vs. exploitation (no noise is currently injected) and with how resets are handled for each independent sub-environment (see reset_subenvs).
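
To make the two caveats concrete, here is a toy sketch of a controller-driven rollout. It does not reproduce step_rollout or reset_subenvs from this PR: the `rollout` function, the 1-D dynamics, the `limit` threshold, and the `with_gaussian_noise` wrapper are hypothetical stand-ins. It shows (a) wrapping a deterministic controller so that exploration noise is added before it is handed to the rollout, and (b) resetting only the sub-environments that finished.

```python
import random
from typing import Callable, List, Tuple

# A controller maps an observation to a scalar action (assumed signature).
Controller = Callable[[List[float]], float]


def with_gaussian_noise(controller: Controller, sigma: float,
                        rng: random.Random) -> Controller:
    """Wrap a deterministic controller so each action gets Gaussian noise."""
    def noisy(obs: List[float]) -> float:
        return controller(obs) + rng.gauss(0.0, sigma)
    return noisy


def rollout(states: List[float], controller: Controller, n_steps: int,
            limit: float = 1.0) -> List[Tuple[int, float, float, float, bool]]:
    """Toy rollout over several independent 1-D sub-environments.

    Dynamics: next_state = state + action. A sub-environment is done when
    |next_state| > limit, and only that sub-environment is reset to 0.0.
    """
    transitions = []
    for _ in range(n_steps):
        for i, state in enumerate(states):
            action = controller([state])
            next_state = state + action
            done = abs(next_state) > limit
            transitions.append((i, state, action, next_state, done))
            # Per-sub-environment reset: the others keep their state.
            states[i] = 0.0 if done else next_state
    return transitions


def constant_controller(obs: List[float]) -> float:
    """Stand-in for a trained policy: always pushes by 0.4."""
    return 0.4


# Deterministic rollout: pure exploitation, no noise.
states = [0.0, 0.5]
transitions = rollout(states, constant_controller, n_steps=3)

# Same rollout with exploration noise added by the wrapper.
noisy_controller = with_gaussian_noise(constant_controller, sigma=0.1,
                                       rng=random.Random(0))
noisy_transitions = rollout([0.0, 0.5], noisy_controller, n_steps=3)
```

Keeping the noise in a wrapper means the rollout itself stays unchanged; whether that fits the real step_rollout depends on how the controller is compiled and called there.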

TODO:
Reset sub-environments using Numba (easier and faster)

@korneelf1
Author

The Kerneller and Jitter are currently created in gym_sim.
Should they instead be created in init, so that they are available to all code in the repo?

@korneelf1
Author

Adding basic unit tests to ensure the wrapper produces the same output as the original sim.
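
One possible shape for such a test is sketched below: drive both simulators with the same action sequence and compare the outputs step by step. `ReferenceSim` and `WrappedSim` are hypothetical stand-ins with made-up point-mass dynamics, not the classes in this repo.

```python
import math
import unittest


class ReferenceSim:
    """Hypothetical stand-in for the original simulator: a 1-D point mass."""

    def __init__(self) -> None:
        self.pos = 0.0
        self.vel = 0.0

    def step(self, thrust: float, dt: float = 0.01):
        self.vel += thrust * dt
        self.pos += self.vel * dt
        return self.pos, self.vel


class WrappedSim:
    """Hypothetical stand-in for the Gymnasium-style wrapper."""

    def __init__(self) -> None:
        self._sim = ReferenceSim()

    def step(self, action: float):
        pos, vel = self._sim.step(action)
        return [pos, vel]  # observation in the wrapper's own layout


class TestSameOutput(unittest.TestCase):
    def test_trajectories_match(self) -> None:
        ref, wrapped = ReferenceSim(), WrappedSim()
        for k in range(100):
            thrust = math.sin(0.1 * k)  # identical inputs for both sims
            pos, vel = ref.step(thrust)
            obs = wrapped.step(thrust)
            self.assertTrue(math.isclose(obs[0], pos, rel_tol=1e-9, abs_tol=1e-12))
            self.assertTrue(math.isclose(obs[1], vel, rel_tol=1e-9, abs_tol=1e-12))


# Run the test case programmatically so the script does not exit early.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestSameOutput)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

A tolerance-based comparison is used here because a JIT-compiled wrapper may differ from the original in the last floating-point bits; an exact comparison would be stricter.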
