
Scale decoding architectures to lower parameter counts and to fit on smaller GPUs #6

Open
reesekneeland opened this issue Apr 9, 2024 · 0 comments
Labels
limited-gpu Issues that can be addressed with access to limited GPUs (less than an A100)

Comments

@reesekneeland (Collaborator)

MindEye 1 and MindEye 2, in their default training and inference configurations, require an A100 GPU. Recent work has explored reducing parameter counts, and implementing those reductions here would serve our tertiary goal of making these decoding algorithms more scalable and easier to use. This is also a good item for contributors with limited compute (no A100s) to work on.

Lite-Mind paper: https://arxiv.org/html/2312.03781v1
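
A minimal sketch of why shrinking the backbone width helps, assuming a residual-MLP backbone similar in spirit to MindEye's (the `hidden_dim`, `n_blocks`, and input-size values below are illustrative, not MindEye's or Lite-Mind's actual configuration; 15724 is roughly the voxel count for one NSD subject):

```python
import torch
import torch.nn as nn

def build_backbone(in_dim: int, hidden_dim: int, n_blocks: int) -> nn.Module:
    """Stack of MLP blocks, loosely in the spirit of MindEye's backbone."""
    blocks = []
    for _ in range(n_blocks):
        blocks.append(nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim),
            nn.GELU(),
        ))
    return nn.Sequential(nn.Linear(in_dim, hidden_dim), *blocks)

def count_params(model: nn.Module) -> int:
    return sum(p.numel() for p in model.parameters())

# Hypothetical "full" vs. "lite" widths for comparison only.
full = build_backbone(in_dim=15724, hidden_dim=4096, n_blocks=4)
lite = build_backbone(in_dim=15724, hidden_dim=1024, n_blocks=2)
print(f"full: {count_params(full)/1e6:.1f}M params")  # ~131M
print(f"lite: {count_params(lite)/1e6:.1f}M params")  # ~18M
```

Because `hidden_dim` enters the hidden blocks quadratically, halving the width roughly quarters their parameter count, which is where most of the savings come from.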

Other easy memory savings (see the sketch after this list):

  • Don't load the full image set into CPU memory up front; read images lazily from disk
  • Use smaller batch sizes
  • Disable unnecessary modules (e.g., the captioning module)
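
A hedged sketch of the items above, assuming a standard PyTorch training setup (the image directory path and the `use_captioning` flag are illustrative names, not MindEye's actual config):

```python
from pathlib import Path

import torch
from PIL import Image
from torch.utils.data import DataLoader, Dataset
from torchvision import transforms

class LazyImageDataset(Dataset):
    """Reads one image from disk per __getitem__ instead of preloading
    the whole stimulus set into CPU RAM."""
    def __init__(self, image_dir: str):
        self.paths = sorted(Path(image_dir).glob("*.png"))
        self.to_tensor = transforms.ToTensor()

    def __len__(self) -> int:
        return len(self.paths)

    def __getitem__(self, idx: int) -> torch.Tensor:
        return self.to_tensor(Image.open(self.paths[idx]).convert("RGB"))

# Smaller batches trade throughput for lower peak GPU memory.
loader = DataLoader(LazyImageDataset("data/nsd_images"),
                    batch_size=8, num_workers=2)

# Only instantiate optional heads when they're needed, so their weights
# never occupy GPU memory.
use_captioning = False  # hypothetical flag
captioner = None
if use_captioning:
    captioner = build_captioner()  # hypothetical constructor
```

If memory is still tight at batch size 8, gradient accumulation over several small batches recovers the effective batch size without the peak-memory cost.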
reesekneeland added the limited-gpu label on Apr 16, 2024