(Single-card) Model perf tests #6483

Manually triggered November 26, 2024 13:12
Status Failure
Total duration 13h 50m 28s
Artifacts 6

perf-models.yaml

on: workflow_dispatch
build-artifact / build-docker-image / build-docker-image (20s)
Matrix: build-artifact / build-artifact
Matrix: models-perf / models-perf

Annotations

5 errors, 6 warnings, and 46 notices
models-perf / other GS
The runner has received a shutdown signal. This can happen when the runner service is stopped, or a manually started runner is canceled.
models-perf / other GS
The operation was canceled.
models-perf / cnn_javelin N300 WH B0
request to https://productionresultssa7.blob.core.windows.net/actions-results/5cf9a98b-f94b-4bf7-bd43-0c1fe6b69a5e/workflow-job-run-fff14941-96e9-543e-fa43-c7da19435d4a/artifacts/f42e325622d36002fdf7c72c93e1daa4e6f24de74412cb9d88e834d615306319.zip?se=2024-11-27T02%3A50%3A47Z&sig=6xO48knmk5%2BoNxfkoIjUSvI9YV0X0oTuKP1M9NX7iC8%3D&ske=2024-11-27T12%3A21%3A01Z&skoid=ca7593d4-ee42-46cd-af88-8b886a2f84eb&sks=b&skt=2024-11-27T00%3A21%3A01Z&sktid=398a6654-997b-47e9-b12b-9515b896b4de&skv=2024-11-04&sp=cw&spr=https&sr=b&st=2024-11-27T01%3A50%3A42Z&sv=2024-11-04&comp=block&blockid=NDEwYWVmOTctNGJhYS00N2NjLWI0OTYtMmNmYjJlNDkyNjAwMDAwMDAwMDAwMDAw failed, reason: getaddrinfo EAI_AGAIN productionresultssa7.blob.core.windows.net
models-perf / other N300 WH B0
Process completed with exit code 1.
models-perf / llm_javelin N300 WH B0
unable to access 'https://github.com/tenstorrent/tt-metal/': Could not resolve host: github.com
build-artifact / build-docker-image / build-docker-image
Unable to find merge base between e82667eb9bdd42bcd9a0fe256e0081563624772a and 2ba5a2973915aa1c5afbf6f1524e87e0c43bc058
build-artifact / build-docker-image / build-docker-image
Set 'fetch_additional_submodule_history: true' to fetch additional submodule history for: tt_metal/third_party/lfs
models-perf / cnn_javelin N300 WH B0
Failed to restore: getCacheEntry failed: Request timeout: /kE5pH1GYM3Yhxzhzfofu0B5IIUz8dzneBRSYuWcoacd9fWqJEP/_apis/artifactcache/cache?keys=setup-venv-Linux-py-3.8.18-%2Fhome%2Fubuntu%2Factions-runner%2F_work%2F_tool%2FPython%2F3.8.18%2Fx64%2Fbin%2Fpython-6e53e915dc6cae7bc216bca21416e65c2c37d74d62bc7e916a52ccd90b584ee7-.%2Fcreate_venv.sh&version=0f2a4d78a25b8dc6a98c7870cee2871c84b54ade7e9a0c38e3b80906041e7a71
models-perf / other N300 WH B0
Failed to restore: downloadCacheMetadata failed: getaddrinfo EAI_AGAIN vth0acprodeus2file0.blob.core.windows.net
models-perf / llm_javelin N300 WH B0
Failed to download action 'https://api.github.com/repos/getsentry/action-setup-venv/tarball/a133e6fd5fa6abd3f590a1c106abda344f5df69f'. Error: Resource temporarily unavailable (codeload.github.com:443)
models-perf / llm_javelin N300 WH B0
Back off 25.169 seconds before retry.
printing-out-smi-info-cleanup
Touching and printing out SMI info
successful-reset-cleanup
tt-smi reset was successful
reset-successful-cleanup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 69 %
disk-usage-after-startup
Disk usage is 69 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 34 %
disk-usage-after-startup
Disk usage is 34 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
printing-out-smi-info-cleanup
Touching and printing out SMI info
successful-reset-cleanup
tt-smi reset was successful
reset-successful-cleanup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 34 %
disk-usage-after-startup
Disk usage is 34 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
printing-out-smi-info-cleanup
Touching and printing out SMI info
successful-reset-cleanup
tt-smi reset was successful
reset-successful-cleanup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 34 %
disk-usage-after-startup
Disk usage is 34 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
printing-out-smi-info-cleanup
Touching and printing out SMI info
successful-reset-cleanup
tt-smi reset was successful
reset-successful-cleanup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 91 %
disk-usage-after-startup
Disk usage is 91 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 59 %
disk-usage-after-startup
Disk usage is 59 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful
printing-out-smi-info-cleanup
Touching and printing out SMI info
successful-reset-cleanup
tt-smi reset was successful
reset-successful-cleanup
tt-smi reset was successful
printing-out-smi-info-cleanup
Touching and printing out SMI info
successful-reset-cleanup
tt-smi reset was successful
reset-successful-cleanup
tt-smi reset was successful
disk-usage-before-startup
Disk usage is 90 %
disk-usage-after-startup
Disk usage is 90 %
printing-smi-info-startup
Touching and printing out SMI info
reset-successful-startup
tt-smi reset was successful

Artifacts

Produced during runtime
Name                                                 Size
TTMetal_build_grayskull                              300 MB
TTMetal_build_wormhole_b0                            300 MB
perf-report-csv-cnn_javelin-grayskull-bare_metal     359 Bytes
perf-report-csv-llm_javelin-grayskull-bare_metal     772 Bytes
perf-report-csv-llm_javelin-wormhole_b0-bare_metal   1.4 KB
perf-report-csv-other-wormhole_b0-bare_metal         734 Bytes