Refactor Superset model benchmarking tools to use Pydantic classes and save one json #16790

skhorasganiTT · 2025-01-15T23:18:28Z

Ticket

#15435

Problem description

The current benchmark infra for uploading data to Superset uses the deprecated method of generating three separate CSV files, one each for run, measurement, and environment data.
The Llama3 T3K benchmark data was not being uploaded by the T3K demo tests.

What's changed

Refactored the model benchmarking tools to generate a single JSON file, which is partially filled with run/measurement data during model tests and then completed with environment data by the CI test infra.
Refactored the benchmark data to be stored in the Pydantic classes implemented by the data science team, so that data types are validated before the JSON is created.
Updated the benchmark environment job in the "Produce data for external analysis" workflow to test completing a partial benchmark JSON.
Minor variable cleanup in the demo tests, renaming 'csv' to 'json'.
Enabled uploading of benchmark data for the Llama3 T3K demo tests.
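
The two-stage flow described above (a partial JSON written by the model test, later completed with environment data by CI) can be sketched roughly as follows. This is a minimal illustration assuming Pydantic v2, not the code in this PR: the class names, field names, and the helpers `save_partial_run` and `complete_run` are all hypothetical stand-ins for the data science team's actual models.

```python
import tempfile
from pathlib import Path
from typing import List

from pydantic import BaseModel, ValidationError


# All classes and fields below are hypothetical stand-ins, not the real schema.
class Measurement(BaseModel):
    step_name: str
    name: str
    value: float


class PartialBenchmarkRun(BaseModel):
    """Run/measurement data that the model test itself can provide."""
    run_type: str
    ml_model_name: str
    measurements: List[Measurement]


class CompleteBenchmarkRun(PartialBenchmarkRun):
    """Partial run plus environment data that only the CI infra knows."""
    device_hostname: str
    git_commit_hash: str


def save_partial_run(run: PartialBenchmarkRun, path: Path) -> None:
    # Stage 1 (model test): types were already validated when `run` was built.
    path.write_text(run.model_dump_json(indent=2))


def complete_run(partial_path: Path, complete_path: Path, environment: dict) -> CompleteBenchmarkRun:
    # Stage 2 (CI infra): re-validate the partial JSON, then add environment data.
    partial = PartialBenchmarkRun.model_validate_json(partial_path.read_text())
    complete = CompleteBenchmarkRun(**partial.model_dump(), **environment)
    complete_path.write_text(complete.model_dump_json(indent=2))
    return complete


with tempfile.TemporaryDirectory() as tmp:
    partial_path = Path(tmp) / "partial_run.json"
    complete_path = Path(tmp) / "complete_run.json"
    run = PartialBenchmarkRun(
        run_type="demo",
        ml_model_name="example-model",
        measurements=[Measurement(step_name="inference_decode", name="tokens_per_s", value=12.5)],
    )
    save_partial_run(run, partial_path)
    complete_run(partial_path, complete_path, {"device_hostname": "host-0", "git_commit_hash": "abc123"})

# A wrongly typed value is rejected before any JSON is created.
try:
    Measurement(step_name="inference_decode", name="tokens_per_s", value="fast")
except ValidationError as err:
    print(f"rejected with {len(err.errors())} validation error(s)")
```

Making the complete model a subclass of the partial one is only one possible design; it keeps a single definition of the shared fields, so the two stages cannot drift apart.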

Checklist

Post commit CI passes
Blackhole Post commit (if applicable)
Model regression CI testing passes (if applicable)
Device performance regression CI testing passes (if applicable)
(For models and ops writers) Full new models tests passes
New/Existing tests provide coverage for changes

skhorasganiTT · 2025-01-15T23:19:07Z

Produce data for external analysis test: https://github.com/tenstorrent/tt-metal/actions/runs/12818671201
Single-card demo tests: https://github.com/tenstorrent/tt-metal/actions/runs/12815226849
T3K demo tests: https://github.com/tenstorrent/tt-metal/actions/runs/12815926839
All post-commit tests: https://github.com/tenstorrent/tt-metal/actions/runs/12815232660

mtairum

Looks good from the llama3 demo side.

I'll be updating my local changes to use the new pydantic as well 👍

tt-rkim

I propose a decently big change in benchmarking_utils.py

Rest of the code is ok

.github/scripts/data_analysis/create_dummy_partial_benchmark_json.py

models/perf/benchmarking_utils.py

skhorasganiTT added 6 commits January 15, 2025 19:56

Refactor Superset model benchmarking tools to use Pydantic classes and save one json WIP

a153b66

Signed-off-by: Salar Hosseini <[email protected]>

Modify environment csv creation to generate complete json from partial json and add data uploading for t3k llama tests

535fbaa

Signed-off-by: Salar Hosseini <[email protected]>

Add test for producing complete benchmark json with environment data

f25b2d1

Signed-off-by: Salar Hosseini <[email protected]>

minor cleanup

b30993f

Signed-off-by: Salar Hosseini <[email protected]>

minor fixes to partial json test

690c2a9

Signed-off-by: Salar Hosseini <[email protected]>

Minor fix to t3k demo tests

faa4909

Signed-off-by: Salar Hosseini <[email protected]>

skhorasganiTT requested review from djordje-tt, uaydonat, yieldthought, mtairum, esmalTT, kpaigwar, cglagovichTT and a team as code owners January 15, 2025 23:18

mtairum approved these changes Jan 16, 2025

esmalTT approved these changes Jan 16, 2025

tt-rkim requested changes Jan 16, 2025

.github/scripts/data_analysis/create_dummy_partial_benchmark_json.py (resolved)

models/perf/benchmarking_utils.py (outdated, resolved)

skhorasganiTT added 3 commits January 16, 2025 17:59

Delete partial run jsons after producing complete jsons to avoid uploading them

81961a5

Signed-off-by: Salar Hosseini <[email protected]>

Move pydantic models to infra/data_collection

313ae0b

Signed-off-by: Salar Hosseini <[email protected]>

Change benchmarking_utils functions to no-ops unless running in CI env

b02a731

Signed-off-by: Salar Hosseini <[email protected]>

djordje-tt approved these changes Jan 16, 2025

tt-rkim reviewed Jan 17, 2025

models/perf/benchmarking_utils.py (resolved)

tt-rkim approved these changes Jan 17, 2025

models/perf/benchmarking_utils.py (resolved)

Add logger warning when skipping import of pydantic models

0480364

Signed-off-by: Salar Hosseini <[email protected]>
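
The behaviour named in commits b02a731 and 0480364 (benchmarking functions become no-ops outside CI, with a logged warning when the Pydantic models are not imported) could look roughly like the sketch below. It is not the code in models/perf/benchmarking_utils.py: detecting CI through the `CI` environment variable (which GitHub Actions sets to `true`), the `ci_only` decorator, and the functions `import_models` and `save_partial_run_json` are all assumptions made for illustration, and the `pydantic` package stands in for the project's own models module.

```python
import functools
import importlib
import json
import logging
import os
from pathlib import Path

logger = logging.getLogger("benchmarking_sketch")


def is_running_in_ci() -> bool:
    # Assumption: CI is detected via the CI environment variable.
    return os.environ.get("CI", "").lower() == "true"


def import_models():
    """Import the validation models only in CI; warn and return None otherwise."""
    if not is_running_in_ci():
        logger.warning("Not running in CI: skipping import of pydantic models")
        return None
    # Stand-in for importing the project's own Pydantic models module.
    return importlib.import_module("pydantic")


def ci_only(func):
    """Turn func into a no-op that returns None outside CI."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        if not is_running_in_ci():
            return None
        return func(*args, **kwargs)
    return wrapper


@ci_only
def save_partial_run_json(measurements: list, path: Path) -> Path:
    # Hypothetical writer; plain dicts keep this sketch free of extra dependencies.
    path.write_text(json.dumps({"measurements": measurements}, indent=2))
    return path
```

With this shape, a developer running a demo test locally needs neither the infra package nor its dependencies, and no stray benchmark files are written to the working tree.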

skhorasganiTT merged commit af79262 into main Jan 17, 2025
17 of 23 checks passed

skhorasganiTT deleted the skhorasgani/superset_json branch January 17, 2025 17:14
