Enable PagedAttention through Pallas #7766
build_and_test.yml
on: pull_request
Build PyTorch/XLA (GPU)
/
build
9m 2s
Build PyTorch/XLA (TPU)
/
build
8m 50s
Build XLA CUDA plugin
/
build
4m 41s
Build & publish docs
/
push-docs
Matrix: CPU C++ tests / test
Waiting for pending jobs
Matrix: GPU C++ tests / test
Waiting for pending jobs
TPU tests
/
tpu-test
Matrix: CPU Python tests / test
Waiting for pending jobs
Matrix: GPU Python tests / test
Waiting for pending jobs
Annotations
4 errors
Build PyTorch/XLA (TPU) / build
Canceling since a higher priority waiting request for 'Build and test-6912-false-false' exists
|
Build PyTorch/XLA (TPU) / build
The operation was canceled.
|
Build PyTorch/XLA (GPU) / build
Canceling since a higher priority waiting request for 'Build and test-6912-false-false' exists
|
Build PyTorch/XLA (GPU) / build
The operation was canceled.
|
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
cuda-plugin
Expired
|
95 MB |
|