Sharing two extension libraries for TorchSharp - cross-platform module loading/saving and flash-attention! #1231
shaltielshmid started this conversation in Show and tell
I'm thrilled to introduce two new extension libraries to our community:

- **TorchSharp.PyBridge** - This library enables loading PyTorch model weights saved with the standard `torch.save(model.state_dict(), path)` call, as well as weights saved in HuggingFace's `safetensors` format, including sharded models split across several files. For usage examples and further instructions, please refer to the README on the project page (a loading sketch follows below).
- **TorchSharp.FlashAttention** - With the rise of attention mechanisms in the latest generation of language models, the demand for computational resources grows with sequence length. Enter Flash Attention, a fast, memory-efficient, IO-aware implementation of exact attention. This package is a wrapper that brings flash-attention's benefits to the TorchSharp ecosystem (a reference computation is sketched below).
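To give a flavor of the loading flow, here is a minimal C# sketch. The `TorchSharp.PyBridge` namespace and the `load_py`/`load_safetensors` extension-method names follow my reading of the project README, so treat this as illustrative and check the project page for the authoritative usage:

```csharp
using TorchSharp;
using TorchSharp.PyBridge;   // assumed namespace for the load_py / load_safetensors extensions
using static TorchSharp.torch;

// Build the same architecture the weights were trained with in Python.
var model = nn.Sequential(
    ("lin1", nn.Linear(100, 10)),
    ("relu", nn.ReLU()),
    ("lin2", nn.Linear(10, 5)));

// Load weights saved in Python via: torch.save(model.state_dict(), "model.pth")
model.load_py("model.pth");

// Or load HuggingFace-style safetensors weights instead.
model.load_safetensors("model.safetensors");
```

Note that only the weights are stored in these files, so the C# model definition must match the Python architecture that produced the state_dict.

For context on what the flash-attention package accelerates, here is a naive TorchSharp version of the exact attention computation, softmax(QKᵀ/√d)V. This is a sketch of the reference math only, not the package's API; the naive form materializes the full seqLen × seqLen score matrix, which is exactly the quadratic memory cost flash attention avoids through tiling:

```csharp
using System;
using TorchSharp;
using static TorchSharp.torch;

// Shapes: (batch, heads, seqLen, headDim)
var q = randn(1, 8, 1024, 64);
var k = randn(1, 8, 1024, 64);
var v = randn(1, 8, 1024, 64);

// Naive exact attention: the (seqLen x seqLen) score matrix is what
// makes memory grow quadratically with sequence length.
var scores = matmul(q, k.transpose(-2, -1)) / Math.Sqrt(64);
var weights = nn.functional.softmax(scores, -1);
var output = matmul(weights, v);
```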
Hope you all enjoy, and any comments/bug reports/contributions are more than welcome!
Replies: 2 comments

- I thought Torch now ships with an (even more improved) implementation of flash attention.
- With these two extension libraries, it would be great to have a summary of what we can and still cannot do when dealing with trained PyTorch binary models available from HuggingFace - those models that do not require additional preprocessing, e.g. BERT.