Skip to content

(TG/TGG) Choose your pipeline #45

(TG/TGG) Choose your pipeline

(TG/TGG) Choose your pipeline #45

Manually triggered December 24, 2024 15:51
Status Failure
Total duration 3h 58m 35s
Artifacts 3

pipeline-select-galaxy.yaml

on: workflow_dispatch
build-artifact  /  ...  /  build-docker-image
1m 45s
build-artifact / build-docker-image / build-docker-image
Matrix: build-artifact / build-artifact
Matrix: tg-frequent-tests / tg-frequent-tests
Matrix: tg-model-perf-tests / tg-model-perf-tests
Waiting for pending jobs
Matrix: tg-unit-tests / TG-tests
Matrix: tg-unit-tests / TG-UMD-tests
Matrix: tgg-frequent-tests / tgg-frequent-tests
Matrix: tgg-model-perf-tests / tgg-model-perf-tests
Matrix: tgg-unit-tests / TGG-tests
Fit to window
Zoom out
Zoom in

Annotations

21 errors and 20 notices
tgg-unit-tests / TGG unit tests
Process completed with exit code 1.
pcie-cards-are-being-used-cleanup
Tenstorrent cards seem to be in use. Killing PIDs and exiting unsuccessfully. This can happen if a test hung and is normally an issue with the test, rather than infra.
tgg-frequent-tests / TGG frequent tests
Process completed with exit code 1.
tgg-frequent-tests / TGG frequent tests
The action 'Run frequent regression tests' has timed out after 90 minutes.
pcie-cards-are-being-used-cleanup
Tenstorrent cards seem to be in use. Killing PIDs and exiting unsuccessfully. This can happen if a test hung and is normally an issue with the test, rather than infra.
tg-frequent-tests / tg-frequent-tests (TG Llama3-70B (old) frequent tests, wormhole_b0, llama3-70b-old, 90, U03FJB5TM5Y)
The action 'Run frequent regression tests' has timed out after 90 minutes.
tg-unit-tests / TG Llama3-70b unit tests
Process completed with exit code 1.
tgg-model-perf-tests / TGG CNN model perf tests
The action 'Run model perf regression tests' has timed out after 60 minutes.
tgg-model-perf-tests / TGG CNN model perf tests
Process completed with exit code 2.
pcie-cards-are-being-used-cleanup
Tenstorrent cards seem to be in use. Killing PIDs and exiting unsuccessfully. This can happen if a test hung and is normally an issue with the test, rather than infra.
tgg-model-perf-tests / TGG CNN model perf tests
Process completed with exit code 1.
tg-frequent-tests / tg-frequent-tests (TG unit/distributed frequent tests, wormhole_b0, unit, 90, XXXXX)
The action 'Run frequent regression tests' has timed out after 90 minutes.
pcie-cards-are-being-used-cleanup
Tenstorrent cards seem to be in use. Killing PIDs and exiting unsuccessfully. This can happen if a test hung and is normally an issue with the test, rather than infra.
tg-unit-tests / TG Llama3-small unit tests
Process completed with exit code 1.
tg-unit-tests / TG unit tests
The action 'Run unit regression tests' has timed out after 30 minutes.
pcie-cards-are-being-used-cleanup
Tenstorrent cards seem to be in use. Killing PIDs and exiting unsuccessfully. This can happen if a test hung and is normally an issue with the test, rather than infra.
tg-unit-tests / TG unit tests
Process completed with exit code 1.
disk-usage-before-startup
Disk usage is 39 %
disk-usage-after-startup
Disk usage is 39 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 39 %
disk-usage-after-startup
Disk usage is 39 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info
printing-out-smi-info-cleanup
Touching and printing out SMI info

Artifacts

Produced during runtime
Name Size
TTMetal_build_grayskull
313 MB
TTMetal_build_wormhole_b0
313 MB
perf-report-csv-LLM-wormhole_b0-
346 Bytes