Investigate GPU offload in scheduler #137

antoniupop · 2024-11-25T11:30:11Z

No description provided.

antoniupop · 2025-01-20T15:15:11Z

The use of Multi Bit PBS parameters alongside compression is not well supported due to an inconsistency in the HL API.

Issues linked to this:

Work to fix this is still ongoing.

antoniupop · 2025-01-20T15:51:08Z

It is currently not possible to target specific GPUs for execution. This restricts the scheduling options and forces multi-GPU usage to parallelize within individual operations rather than across operations.
This has been requested as a feature from tfhe-rs and is now a Q1 OKR: https://github.com/zama-ai/tfhe-rs-internal/issues/889

Work ongoing.
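
For illustration, here is a minimal sketch of what scheduling across operations could look like once a specific GPU can be targeted. The names `Assignment` and `assign_across_operations` are hypothetical and are not part of the tfhe-rs API. The sketch only computes a round-robin mapping from independent operations to GPU indices and does not launch any GPU work.

```rust
/// Hypothetical illustration only: none of these names come from tfhe-rs.
/// Records which GPU an independent operation would be sent to.
#[derive(Debug, Clone, PartialEq)]
struct Assignment {
    op_index: usize,
    gpu_index: usize,
}

/// Assign independent operations to GPUs round-robin, so that different
/// operations can run on different devices (parallelism across operations)
/// instead of every operation being split over all devices.
fn assign_across_operations(num_ops: usize, num_gpus: usize) -> Vec<Assignment> {
    assert!(num_gpus > 0, "at least one GPU is required");
    (0..num_ops)
        .map(|op_index| Assignment {
            op_index,
            gpu_index: op_index % num_gpus,
        })
        .collect()
}
```

Calling `assign_across_operations(5, 2)` alternates the five operations between GPU 0 and GPU 1. A real scheduler would likely also account for data dependencies and where the ciphertexts already reside, which this sketch ignores.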

antoniupop · 2025-01-20T15:54:46Z

Performance is inconsistent in the ERC20 benchmarks, in particular between GPU types (on H100, with NVLink or SXM).
Ongoing work: https://github.com/zama-ai/tfhe-rs-internal/issues/851

antoniupop self-assigned this Nov 25, 2024
