Reward model inference #23

shahbuland · 2023-08-30T18:47:29Z

Need to add reward model (RM) inference for cases where the RM is a sizable model. The current code attempts to place a copy of the RM on each GPU. This is problematic because in many cases the RM is too big to fit alongside the denoiser model. In the LLM case, the usual solution is to serve the RM from a Triton Inference Server, or to put the RM on one GPU while the main model uses the remaining GPUs. This should be explored further.
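As a rough illustration of the second option (RM on one GPU, main model on the rest), here is a minimal sketch of a device-assignment helper. The names `DevicePlan` and `plan_devices` are hypothetical and not part of this repository, and the choice to reserve the last GPU for the RM is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DevicePlan:
    """Which devices train the denoiser and which one hosts the reward model."""
    denoiser_devices: List[str]
    reward_device: str


def plan_devices(num_gpus: int) -> DevicePlan:
    """Reserve the last GPU for the reward model; give the rest to the denoiser.

    Hypothetical helper for illustration only.
    """
    if num_gpus < 1:
        # No GPU available: everything falls back to CPU.
        return DevicePlan(["cpu"], "cpu")
    if num_gpus == 1:
        # A single GPU cannot be split, so both models share it.
        return DevicePlan(["cuda:0"], "cuda:0")
    # GPUs 0..n-2 go to the denoiser, GPU n-1 is dedicated to the RM.
    denoiser = [f"cuda:{i}" for i in range(num_gpus - 1)]
    return DevicePlan(denoiser, f"cuda:{num_gpus - 1}")


if __name__ == "__main__":
    plan = plan_devices(4)
    print(plan.denoiser_devices, plan.reward_device)
```

With PyTorch, the RM could then be moved with `.to(plan.reward_device)`, and generated samples would need to be transferred to that device before scoring. This trades extra device-to-device copies for freeing RM memory on the training GPUs.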
