cmr "run-mlperf inference _find-performance _all-scenarios" \
--model=rnnt --implementation=reference --device=cpu --backend=pytorch \
--category=edge --division=open --quiet
- Use `--device=cuda` to run the inference on an Nvidia GPU
- Use `--division=closed` to run all scenarios for the closed division (compliance tests are skipped in `_find-performance` mode)
- Use `--category=datacenter` to run the datacenter scenarios

A combined variant is sketched below.
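For instance, the same performance exploration on an Nvidia GPU for the closed division combines the flags above like this (a sketch; adjust the backend and category to your setup):

```bash
# Hypothetical variant of the command above: GPU device, closed division
cmr "run-mlperf inference _find-performance _all-scenarios" \
    --model=rnnt --implementation=reference --device=cuda --backend=pytorch \
    --category=edge --division=closed --quiet
```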
cmr "run-mlperf inference _submission _all-scenarios" --model=rnnt \
--device=cpu --implementation=reference --backend=pytorch \
--execution-mode=valid --category=edge --division=open --quiet
- Use `--power=yes` to measure power. It is ignored for accuracy and compliance runs
- `--offline_target_qps`, `--server_target_qps` and `--singlestream_target_latency` can be used to override the automatically determined performance numbers (see the sketch after this list)
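For instance, a full submission run with power measurement and explicit performance targets might look like this (a sketch; the target values are placeholders, so substitute the numbers found during the `_find-performance` stage):

```bash
# Sketch of a valid submission run with power measurement;
# the target QPS/latency values below are placeholders
cmr "run-mlperf inference _submission _all-scenarios" --model=rnnt \
    --device=cpu --implementation=reference --backend=pytorch \
    --execution-mode=valid --category=edge --division=open \
    --power=yes --offline_target_qps=100 --singlestream_target_latency=50 \
    --quiet
```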
Follow this guide to generate the submission tree and upload your results.
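If you are using the same CM automation, the submission tree can usually be generated with the companion "generate inference submission" script; the flags below are an assumption, so defer to the linked guide for the requirements of your submission round:

```bash
# Assumed invocation of the CM submission-generation script;
# check the guide for the exact flags your submission requires
cmr "generate inference submission" --division=open --category=edge --quiet
```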
Check the MLCommons Task Force on Automation and Reproducibility and get in touch via its public Discord server.