First-order CUDA followup fix: explicit CC list for Jetson and Tegra platforms #163

griwodz · 2024-08-12T09:30:13Z

Description

Using CUDA as a language in CMake projects requires selecting the target compute capabilities (CCs) in the variable CMAKE_CUDA_ARCHITECTURES.

The default is the oldest CC supported by the installed nvcc. Setting CMAKE_CUDA_ARCHITECTURES to "all-major" gives a better, though still incomplete, CC list for discrete GPUs. Both choices fail on Tegras and Jetsons because CMake guesses that these are ARM platforms with discrete GPUs.

Therefore, we test whether the file /etc/nv_tegra_release exists. If it does, we assume that the intention is to compile for Tegras or Jetsons and set CMAKE_CUDA_ARCHITECTURES="53;62;72;87".
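The check described above can be sketched roughly as follows. This is an illustration rather than the exact diff in CMakeLists.txt: the `if(NOT DEFINED ...)` guard, the "all-major" fallback branch, the minimum CMake version, and the project name `PopSiftSketch` are assumptions made for the example. The variable is set before `project()` because that is where CMake picks it up when enabling the CUDA language.

```cmake
# "all-major" is understood by CMake 3.23 and newer (assumed minimum for this sketch).
cmake_minimum_required(VERSION 3.23)

# Assumption: respect an architecture list passed by the user, e.g.
# -DCMAKE_CUDA_ARCHITECTURES=87, and only choose a default otherwise.
if(NOT DEFINED CMAKE_CUDA_ARCHITECTURES)
  if(EXISTS "/etc/nv_tegra_release")
    # The build host is a Tegra/Jetson: use the explicit CC list from this PR.
    set(CMAKE_CUDA_ARCHITECTURES "53;62;72;87")
  else()
    # Assumed fallback for hosts with discrete GPUs.
    set(CMAKE_CUDA_ARCHITECTURES "all-major")
  endif()
endif()

# Hypothetical project name; CUDA is enabled here, after the variable is set.
project(PopSiftSketch LANGUAGES CXX CUDA)

message(STATUS "CUDA architectures: ${CMAKE_CUDA_ARCHITECTURES}")
```

Because the test looks at the build host's file system, a sketch like this only helps when configuring on the Tegra/Jetson itself; a cross-compiling setup would still need to pass CMAKE_CUDA_ARCHITECTURES explicitly.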

Features list

Select sensible CCs when PopSift is configured on a Tegra/Jetson to compile for a Tegra/Jetson.

Implementation remarks

We are not adding CC 32 to the list because it is deprecated. The current CUDA code may still work on CC 32 platforms.

griwodz · 2024-08-12T09:49:05Z

Waiting for hhackbarth to confirm that it works on their Jetson version as well.
I also learned that CMAKE_CUDA_ARCHITECTURES must be set before project(), not after.

griwodz · 2024-08-13T07:23:51Z

Hi @simogasp ,
the fix is confirmed in the bug report #160. I don't know if I'm able to follow up on the Python wrapper, but I'll try. I think that this fix is only beneficial for the develop branch in any case.

simogasp

Besides a small comment, it seems OK to me (but I cannot test it at the moment).

CMakeLists.txt

explicit CC list for Jetson and Tegra platforms

494408e

griwodz self-assigned this Aug 12, 2024

griwodz added the type:bug, ready, and cuda labels Aug 12, 2024

griwodz linked an issue Aug 12, 2024 that may be closed by this pull request

runtime error: cudaMemcpyToSymbol failed for Gauss kernel initialization #160

Closed

griwodz changed the title ~~First-order CUDA followup fit: explicit CC list for Jetson and Tegra platforms~~ First-order CUDA followup fix: explicit CC list for Jetson and Tegra platforms Aug 12, 2024

griwodz requested a review from simogasp August 12, 2024 09:46

griwodz mentioned this pull request Aug 12, 2024

runtime error: cudaMemcpyToSymbol failed for Gauss kernel initialization #160

Closed

griwodz added the scope:build label and removed the type:bug and cuda labels Aug 12, 2024

simogasp approved these changes Aug 13, 2024

griwodz merged commit 66f1ed6 into develop Aug 13, 2024
6 checks passed

griwodz deleted the dev/cmake-native-cuda-jetson branch August 13, 2024 09:43

hhackbarth mentioned this pull request Aug 14, 2024

Pypopsift crashes. Investigate. #164

Open
