Skip to content

v0.53.1-rc8

Pre-release
Pre-release
Compare
Choose a tag to compare
@github-actions github-actions released this 03 Dec 01:59
· 663 commits to main since this release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/12131243392

🚀 Features

📦 Uncategorized

  • Fix the test for whether to install the wheel, and also exit the script on the first error
  • Use the post-merge commit of UMD
  • [skip ci] Disable clang-format in pre-commit and move version ahead
  • Support for new matmul1d op with gather_in0
  • Remove dead includes that break without proper include paths
  • More formatting of ttnn
  • Make MEM_LOCAL_BASE accessible behind Hal
  • Add support for rank-n tensors to tilize and untilize
  • Add reduce scatter perf to tg
  • Fix Circular Buffer Allocation in untilize_with_halo
  • [skip ci] use git-clang-format in CI
  • #13676: i1 op kernel implementation and improve i0_bw pcc
  • Increase perf margin on unet
  • #12558: TTNN implementation of MNIST model
  • Remove unsupported shapes to make pipeline green
  • #13401: Add data parallel support for Bert-Tiny model
  • #15297: Allow MeshDevice to be initialized for chips without eth coordinates
  • #0: Disable clang-format precommit check once again due to errors
  • #15337: Fix incorrectly sized cb in remote cb microbenchmark
  • [skip ci] Update CONTRIBUTING.md with pre-commit info
  • Remove ClusterDescriptor path from constructor
  • Add performance and accuracy configurations to Llama 3
  • Disable upblock 3 and 4 unet unit tests
  • Re-enable git-clang-format for pre-commit again
  • Fix race condition in DRAM sharded MM
  • Fix to concat support for tensors with tile padding
  • Adjust perf test targets for Falcon7b-t3k-decode-noasync to account for CI instability
  • Format Subset of tests/ directory
  • [skip ci] Update git-blame-ignore-revs
  • #14982: Update Unary examples docs
  • fix graph_processor buffer_alloc_id calculus
  • #13415: Update left shift
  • #14982: Update Unary examples
  • #14982: Update Unary example and docs
  • #5893: Fix ops with low PCC
  • #13127: Clean up layout conversion
  • Sharded sweep tests
  • #0: Fix typos in ttnn docs
  • #0: Make sub-device bank ids for cores the same as global bank ids
  • #12151: Generalize max_pool2d code into generic pool op
  • Update INSTALLING.md
  • #13944: Prohibit copying memory objects
  • #15267: Merge erisc kernel data and bss sections
  • Update perf and latest features for llm models (Dec 2)
  • Update llms.md
  • #11795: Update pgm dispatch golden file
  • [tt-train] Add kahan summation in AdamW
  • Remove core_config.h from host code inclusion in dev_msgs.h
  • Add new Hal API "valid_reg_addr"
  • #0: Fix DPRINT_DATA macro bug