Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REQUEST] Add DroQ? #340

Open
asmith26 opened this issue Sep 23, 2023 · 0 comments
Open

[REQUEST] Add DroQ? #340

asmith26 opened this issue Sep 23, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@asmith26
Copy link
Contributor

asmith26 commented Sep 23, 2023

I've been recently learning about Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning (paper, poster) which sounds like a great RL algorithm that is "doubly (sample and computationally) efficient". It's already been used by a number of interesting projects (e.g. "A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning, which contains code for training a simulated or real A1 quadrupedal robot to walk" and "DroQ is used in RoboPianist. It can learn to play the Super Mario Bros. theme song and The Legend of Zelda songs (Main theme and Song of storms) in 400k samples!!!:musical_keyboard::musical_keyboard:". Hence I thought it might be beneficial to add it to this library.

There are quite a few code implementations available, and I have found robopianist to be the easiest to follow (I'm no expert in this though), and it looks like SAC is heavily used too.

Not sure if DroQ is of interest to the d3rlpy community?

I'm loving d3rlpy (massive thanks!!) so would be interested in learning how to implement DroQ in d3rlpy, but I'm still quite new to the lib. Thus I very much welcome if anyone is interested in implementing this (and I could learn from any PRs etc.) or if anyone is able to provide me with any guidance regarding how I would go about creating a PR to integrate DroQ into d3rlpy (only if there is interest in integrating these of course).

Many thanks for any help!

@asmith26 asmith26 added the enhancement New feature or request label Sep 23, 2023
@asmith26 asmith26 changed the title [REQUEST] Implement DroQ [REQUEST] Add DroQ? Sep 26, 2023
@asmith26 asmith26 mentioned this issue Sep 30, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant