flash_attention + sample packing for stablelm 3b #671
+429
−1
Merged
The logs for this run have expired and are no longer available.
Loading