Enable PagedAttention through Pallas · pytorch/xla@5778694

Triggered via pull request April 25, 2024 00:53

wonjoolee95

synchronize #6912

wonjoo/pallas-pagedattention

Status Cancelled

Total duration 9m 26s

Artifacts 1

build_and_test.yml

on: pull_request

Build PyTorch/XLA (GPU) / build

9m 2s

Build PyTorch/XLA (TPU) / build

8m 50s

Build XLA CUDA plugin / build

4m 41s

Build & publish docs / push-docs

Matrix: CPU C++ tests / test

Waiting for pending jobs

Matrix: GPU C++ tests / test

Waiting for pending jobs

TPU tests / tpu-test

Matrix: CPU Python tests / test

Waiting for pending jobs

Matrix: GPU Python tests / test

Waiting for pending jobs

Annotations

4 errors

Build PyTorch/XLA (TPU) / build

Canceling since a higher priority waiting request for 'Build and test-6912-false-false' exists

Build PyTorch/XLA (TPU) / build

The operation was canceled.

Build PyTorch/XLA (GPU) / build

Canceling since a higher priority waiting request for 'Build and test-6912-false-false' exists

Build PyTorch/XLA (GPU) / build

The operation was canceled.

Artifacts

Produced during runtime

Name	Size
cuda-plugin Expired	95 MB

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable PagedAttention through Pallas #7766

Summary

Enable PagedAttention through Pallas #7766

Jobs

Run details

build_and_test.yml

Annotations

Artifacts