-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove set_driver_*_params and set_device_*_params #395
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work!!! LGTM!
My only minor concern is that tt-metal uses: https://github.com/tenstorrent/tt-metal/blob/main/tt_metal/hw/inc/wormhole/dev_mem_map.h for L1 memory map.
While UMD is using: https://github.com/tenstorrent/tt-umd/blob/main/src/firmware/riscv/wormhole/l1_address_map.h
I actually like UMD's method of defining the address map... MUCH BETTER.
However, I don't like that we have two roots of truth.
What happens if these diverge?
Asked on tt_metal PR. I'm also not sure why are these membar values different, and whether that can be chosen arbitrarily or not. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice
@broskoTT can you check which constants from our own memory map are metal/buda specific and remove it? You can open issue as well if you don't want to this in this or some follow up PR right now |
From looking at earlier UMD code, I never understood whether these constants were passed into UMD because they were architecture-specific (and the structure of the code at the time favored handling GS vs WH in this manner) or because the application/firmware developers want to move them around (this certainly seems to be possible for the FW_VERSION_ADDR, not sure about the barrier addresses). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is probably worth understanding what determines the barrier addresses: hardware or firmware? I.e. can they move?
Moving the FW_VERSION_ADDR is asking for trouble, so probably safe to assume that is fixed.
3fc4e7c
to
6e2d0bb
Compare
Yes For barrier parameters, they can be moved around.
Memory map MIGHT consists of some constants which are truly defined by hardware, I can't find any at the moment, but maybe L1_MEM_SIZE, etc. Although most of these hardware specific are in noc_params I think. |
barrier params some changes minor modification fix grayskull barrier tests
e14c7fa
to
969afb9
Compare
### Ticket Related to #13948 ### Problem description Related to UMD change: tenstorrent/tt-umd#395 These parameters are defined by hardware. The only parameters to be defined by the client are membars. This change adds set_barrier_address_params accordingly. ### What's changed Removed set_device_dram_address_params and set_device_l1_address_params, and related constants. Replace with call to set_barrier_address_params. ### Checklist - [ ] All post-commit tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361649190 - main generally unhealthy, I haven't seen new fails related to this change. - [x] Blackhole post-commit tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361650912 - [ ] (Single-card) Model perf tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361652594 - N300 job failing on main - [x] (Single-card) Device perf regressions : https://github.com/tenstorrent/tt-metal/actions/runs/12361654455 - [x] (T3K) T3000 unit tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361656223 - [ ] (T3K) T3000 demo tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361657485 - falcon job failing on main - [ ] (TG) TG unit tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361658889 - unit test failing on main - [ ] (TG) TG demo tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361660547 - failing on main - [ ] (TGG) TGG unit tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361662343 - failing on main for a long time due to hugepages - [x] (TGG) TGG demo tests : https://github.com/tenstorrent/tt-metal/actions/runs/12361664160
Issue
Fixes #106
Description
Part of this was done previously in #260 #261 and #234
List of the changes
Testing
CI tests, but more importantly CI tests on the related tt_metal PR.
API Changes
This PR has API changes: