Fix flash attention GQA bug to use the dynamic size of the key/value tensors - used for eval/inference #3109

Triggered via pull request on November 21, 2023 at 23:12
Status: Success
Total duration: 6m 55s
Artifacts: none

Workflow: code-quality.yaml (on: pull_request)
Matrix: code-quality
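
For context on the fix the PR title describes: in grouped-query attention (GQA), the key/value tensors have fewer heads than the queries, and during eval/inference a KV cache makes their sequence length longer than the query's. Taking that length from a static config (or from the query tensor) instead of from the key/value tensors themselves is the class of bug named in the title. Below is a minimal sketch of the corrected pattern, assuming a PyTorch-style attention module; the function name repeat_kv, the n_rep parameter, and the tensor layout are illustrative assumptions, not the repository's actual API.

```python
# A minimal sketch of reading the key/value size dynamically in GQA.
# Names and tensor layout are illustrative, not the repository's code.
import torch

def repeat_kv(kv: torch.Tensor, n_rep: int) -> torch.Tensor:
    """Expand key/value heads so grouped-query attention can run through
    a standard multi-head attention kernel.

    kv: (batch, n_kv_heads, kv_seq_len, head_dim)
    """
    # The fix: take kv_seq_len from the key/value tensor's own (dynamic)
    # shape. During eval/inference the KV cache makes this longer than
    # the query's sequence length, so a statically configured size (or a
    # size derived from the query tensor) would be wrong here.
    batch, n_kv_heads, kv_seq_len, head_dim = kv.shape
    if n_rep == 1:
        return kv
    kv = kv[:, :, None, :, :].expand(batch, n_kv_heads, n_rep, kv_seq_len, head_dim)
    return kv.reshape(batch, n_kv_heads * n_rep, kv_seq_len, head_dim)
```

For example, at a single decode step the query has sequence length 1 while the cached keys/values might already span 512 positions; only the key/value tensors' runtime shape can supply that 512.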