Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Architecture and OS identification macros #3237

Open
wants to merge 12 commits into
base: main
Choose a base branch
from

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented Jan 2, 2025

Fixes #2505

Description

Add internal architecture and OS identification macros for CUDA supported platforms

@fbusato fbusato self-assigned this Jan 2, 2025
@fbusato fbusato requested review from a team as code owners January 2, 2025 23:52
Copy link
Contributor

github-actions bot commented Jan 3, 2025

🟩 CI finished in 1h 52m: Pass: 100%/170 | Total: 1d 14h | Avg: 13m 42s | Max: 1h 24m | Hits: 9%/22530
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 10h 16m | Avg: 12m 50s | Max: 46m 33s | Hits: 3%/9826

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total: 10h 09m | Avg: 13m 15s | Max: 46m 33s | Hits:   3%/9826  
      🟩 arm64              Pass: 100%/2   | Total:  6m 58s | Avg:  3m 29s | Max:  3m 37s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 53m 45s | Avg:  7m 40s | Max: 36m 16s | Hits:   3%/2242  
      🟩 12.5               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 08s | Max: 38m 30s
      🟩 12.6               Pass: 100%/39  | Total:  8h 12m | Avg: 12m 37s | Max: 46m 33s | Hits:   3%/7584  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 18s | Max: 22m 37s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 53m 45s | Avg:  7m 40s | Max: 36m 16s | Hits:   3%/2242  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 10m | Avg: 35m 08s | Max: 38m 30s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  7h 03m | Avg: 12m 05s | Max: 46m 33s | Hits:   3%/7584  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 18s | Max: 22m 37s
      🟩 nvcc               Pass: 100%/44  | Total:  9h 07m | Avg: 12m 26s | Max: 46m 33s | Hits:   3%/9826  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 14m 53s | Avg:  3m 43s | Max:  4m 20s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 57s | Avg:  3m 57s | Max:  3m 57s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang16            Pass: 100%/1   | Total: 13m 45s | Avg: 13m 45s | Max: 13m 45s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 45m | Avg: 13m 09s | Max: 24m 03s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 15s | Avg:  2m 37s | Max:  2m 39s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 30s | Avg:  3m 15s | Max:  3m 30s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 GCC9               Pass: 100%/3   | Total:  9m 24s | Avg:  3m 08s | Max:  3m 41s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 46m | Avg: 16m 40s | Max: 43m 37s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 24m 28s | Avg: 24m 28s | Max: 24m 28s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 36m 16s | Avg: 36m 16s | Max: 36m 16s | Hits:   3%/2242  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 35m 17s | Avg: 35m 17s | Max: 35m 17s | Hits:   3%/2479  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 16s | Max: 46m 33s | Hits:   3%/5105  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 10m | Avg: 35m 08s | Max: 38m 30s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  2h 43m | Avg:  8m 09s | Max: 24m 03s
      🟩 GCC                Pass: 100%/21  | Total:  3h 22m | Avg:  9m 38s | Max: 43m 37s
      🟩 Intel              Pass: 100%/1   | Total: 24m 28s | Avg: 24m 28s | Max: 24m 28s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 36m | Avg: 39m 01s | Max: 46m 33s | Hits:   3%/9826  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 08s | Max: 38m 30s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total: 10h 16m | Avg: 12m 50s | Max: 46m 33s | Hits:   3%/9826  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  7h 21m | Avg: 10m 45s | Max: 46m 33s | Hits:   3%/9826  
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 05m | Avg: 31m 20s | Max: 43m 37s
      🟩 Test               Pass: 100%/2   | Total: 47m 59s | Avg: 23m 59s | Max: 24m 03s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 53s | Avg: 13m 53s | Max: 13m 53s
      🟩 90a                Pass: 100%/2   | Total: 17m 54s | Avg:  8m 57s | Max: 13m 56s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total: 39m 21s | Avg:  6m 33s | Max: 24m 09s
      🟩 14                 Pass: 100%/5   | Total:  1h 30m | Avg: 18m 04s | Max: 43m 37s | Hits:   3%/2242  
      🟩 17                 Pass: 100%/13  | Total:  3h 24m | Avg: 15m 42s | Max: 38m 00s | Hits:   3%/4958  
      🟩 20                 Pass: 100%/23  | Total:  4h 40m | Avg: 12m 12s | Max: 46m 33s | Hits:   3%/2626  
    
  • 🟩 cub: Pass: 100%/47 | Total: 13h 21m | Avg: 17m 03s | Max: 1h 08m | Hits: 0%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 13h 11m | Avg: 17m 35s | Max:  1h 08m | Hits:   0%/3132  
      🟩 arm64              Pass: 100%/2   | Total:  9m 36s | Avg:  4m 48s | Max:  4m 55s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 25m | Avg: 12m 12s | Max: 59m 34s | Hits:   0%/783   
      🟩 12.5               Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
      🟩 12.6               Pass: 100%/38  | Total:  9h 44m | Avg: 15m 22s | Max:  1h 08m | Hits:   0%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 26s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 25m | Avg: 12m 12s | Max: 59m 34s | Hits:   0%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
      🟩 nvcc12.6           Pass: 100%/36  | Total:  9h 35m | Avg: 15m 59s | Max:  1h 08m | Hits:   0%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 26s
      🟩 nvcc               Pass: 100%/45  | Total: 13h 12m | Avg: 17m 37s | Max:  1h 08m | Hits:   0%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 06s | Avg:  5m 16s | Max:  6m 22s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 13s | Avg:  6m 13s | Max:  6m 13s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 15m | Avg: 10m 44s | Max: 30m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 26s | Avg:  4m 13s | Max:  4m 24s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 22s | Avg:  5m 11s | Max:  5m 21s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 05s | Avg:  5m 05s | Max:  5m 05s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 09s | Avg:  4m 43s | Max:  5m 14s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 29s | Avg:  5m 29s | Max:  5m 29s
      🟩 GCC12              Pass: 100%/3   | Total: 28m 24s | Avg:  9m 28s | Max: 18m 31s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 00m | Avg: 15m 02s | Max: 40m 00s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 34s | Avg: 59m 34s | Max: 59m 34s | Hits:   0%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:   0%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 08m | Hits:   0%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 21m | Avg:  7m 26s | Max: 30m 34s
      🟩 GCC                Pass: 100%/21  | Total:  3h 17m | Avg:  9m 24s | Max: 40m 00s
      🟩 Intel              Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 23m | Avg:  1h 05m | Max:  1h 08m | Hits:   0%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 08m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 49s | Avg: 11m 24s | Max: 18m 31s
      🟩 v100               Pass: 100%/45  | Total: 12h 58m | Avg: 17m 18s | Max:  1h 08m | Hits:   0%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 10h 32m | Avg: 15m 48s | Max:  1h 08m | Hits:   0%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 38s | Avg: 22m 38s | Max: 22m 38s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 27s | Avg: 16m 27s | Max: 16m 27s
      🟩 HostLaunch         Pass: 100%/3   | Total: 59m 08s | Avg: 19m 42s | Max: 20m 27s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 10m | Avg: 35m 17s | Max: 40m 00s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 22m 49s | Avg: 11m 24s | Max: 18m 31s
      🟩 90a                Pass: 100%/1   | Total:  4m 34s | Avg:  4m 34s | Max:  4m 34s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 57s | Avg:  4m 47s | Max:  6m 10s
      🟩 14                 Pass: 100%/4   | Total:  1h 15m | Avg: 18m 50s | Max: 59m 34s | Hits:   0%/783   
      🟩 17                 Pass: 100%/12  | Total:  5h 09m | Avg: 25m 47s | Max:  1h 08m | Hits:   0%/1566  
      🟩 20                 Pass: 100%/26  | Total:  6h 32m | Avg: 15m 06s | Max:  1h 08m | Hits:   0%/783   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 12h 15m | Avg: 15m 59s | Max: 1h 24m | Hits: 20%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 44s | Avg:  9m 22s | Max: 12m 15s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total: 12h 06m | Avg: 16m 30s | Max:  1h 24m | Hits:  20%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 33s | Avg:  4m 46s | Max:  5m 00s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 30m | Avg: 12m 58s | Max:  1h 05m | Hits:   0%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m
      🟩 12.6               Pass: 100%/37  | Total:  8h 12m | Avg: 13m 18s | Max:  1h 24m | Hits:  25%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 21s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 30m | Avg: 12m 58s | Max:  1h 05m | Hits:   0%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m
      🟩 nvcc12.6           Pass: 100%/35  | Total:  8h 02m | Avg: 13m 46s | Max:  1h 24m | Hits:  25%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 21s
      🟩 nvcc               Pass: 100%/44  | Total: 12h 05m | Avg: 16m 29s | Max:  1h 24m | Hits:  20%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 11s | Avg:  5m 02s | Max:  5m 51s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 25s | Avg:  6m 25s | Max:  6m 25s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 00s | Avg:  5m 00s | Max:  5m 00s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 13s | Avg:  5m 13s | Max:  5m 13s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang18            Pass: 100%/7   | Total: 44m 27s | Avg:  6m 21s | Max: 11m 37s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 17s | Avg:  4m 08s | Max:  4m 24s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  5m 01s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 33s | Avg:  5m 33s | Max:  5m 33s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 03s | Avg:  4m 41s | Max:  5m 18s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 56s | Avg:  5m 56s | Max:  5m 56s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 49s | Avg:  5m 49s | Max:  5m 49s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 01m | Avg:  7m 41s | Max: 13m 23s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 51m 48s | Avg: 51m 48s | Max: 51m 48s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:   0%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:   0%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 56m | Avg: 58m 52s | Max:  1h 24m | Hits:  33%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 47m | Avg:  5m 40s | Max: 11m 37s
      🟩 GCC                Pass: 100%/19  | Total:  1h 56m | Avg:  6m 06s | Max: 13m 23s
      🟩 Intel              Pass: 100%/1   | Total: 51m 48s | Avg: 51m 48s | Max: 51m 48s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 24m | Hits:  20%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total: 12h 15m | Avg: 15m 59s | Max:  1h 24m | Hits:  20%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 11h 00m | Avg: 16m 30s | Max:  1h 24m | Hits:   0%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 38m 28s | Avg: 12m 49s | Max: 22m 49s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 15s | Avg: 12m 25s | Max: 13m 23s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 22m 04s | Avg:  4m 24s | Max:  5m 31s
      🟩 14                 Pass: 100%/4   | Total:  1h 20m | Avg: 20m 04s | Max:  1h 05m | Hits:   0%/1852  
      🟩 17                 Pass: 100%/12  | Total:  5h 06m | Avg: 25m 31s | Max:  1h 16m | Hits:   0%/3704  
      🟩 20                 Pass: 100%/23  | Total:  5h 08m | Avg: 13m 24s | Max:  1h 24m | Hits:  49%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 17m | Avg: 5m 17s | Max: 17m 57s | Hits: 1%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 07m | Avg:  5m 46s | Max: 17m 57s | Hits:   1%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 09s | Avg:  2m 32s | Max:  2m 37s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 17m 27s | Avg:  5m 49s | Max: 11m 27s | Hits:   1%/156   
      🟩 12.5               Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  9m 17s
      🟩 12.6               Pass: 100%/21  | Total:  1h 42m | Avg:  4m 51s | Max: 17m 57s | Hits:   1%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 17m 27s | Avg:  5m 49s | Max: 11m 27s | Hits:   1%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  9m 17s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 42m | Avg:  4m 51s | Max: 17m 57s | Hits:   1%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 17m | Avg:  5m 17s | Max: 17m 57s | Hits:   1%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
      🟩 Clang10            Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 15s | Avg:  6m 33s | Max: 17m 57s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 58s | Avg:  2m 58s | Max:  2m 58s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 GCC12              Pass: 100%/2   | Total: 20m 26s | Avg: 10m 13s | Max: 17m 14s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 44s | Avg:  2m 41s | Max:  2m 48s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 27s | Avg: 11m 27s | Max: 11m 27s | Hits:   1%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 03s | Avg: 12m 03s | Max: 12m 03s | Hits:   1%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  9m 17s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total: 55m 35s | Avg:  4m 16s | Max: 17m 57s
      🟩 GCC                Pass: 100%/9   | Total: 40m 22s | Avg:  4m 29s | Max: 17m 14s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 30s | Avg: 11m 45s | Max: 12m 03s | Hits:   1%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  9m 17s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 17m | Avg:  5m 17s | Max: 17m 57s | Hits:   1%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 42m | Avg:  4m 15s | Max: 12m 03s | Hits:   1%/312   
      🟩 Test               Pass: 100%/2   | Total: 35m 11s | Avg: 17m 35s | Max: 17m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 48s | Avg:  2m 48s | Max:  2m 48s
      🟩 90a                Pass: 100%/1   | Total:  2m 45s | Avg:  2m 45s | Max:  2m 45s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 22m 30s | Avg:  3m 45s | Max:  8m 38s
      🟩 20                 Pass: 100%/20  | Total:  1h 54m | Avg:  5m 44s | Max: 17m 57s | Hits:   1%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 17s | Avg: 5m 08s | Max: 8m 09s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  8m 09s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s
      🟩 Test               Pass: 100%/1   | Total:  8m 09s | Avg:  8m 09s | Max:  8m 09s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 48s | Avg: 27m 48s | Max: 27m 48s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@pciolkosz
Copy link
Contributor

Would it make sense to add some minimal testing? Maybe confirm one of the arch and one of the os macro is always present? Would it make sense to replace the current usage of the underlying macros or would it make more sense as a separate change?

@fbusato
Copy link
Contributor Author

fbusato commented Jan 3, 2025

thanks @pciolkosz. I will add minimal testing. I would also love to add stronger tests, but I'm not sure how to implement them. Let me see if I can do something in this direction

Copy link
Contributor

@pciolkosz pciolkosz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like the testing, thanks @fbusato

@fbusato
Copy link
Contributor Author

fbusato commented Jan 3, 2025

indeed, I found that amazing things happen with nvrtc

#if _CCCL_ARCH(32BIT)
  static_assert(sizeof(void*) == 4, ""); // FAIL!!
#endif

@leofang
Copy link
Member

leofang commented Jan 3, 2025

Right, NVRTC follows the host compiler behavior on the respective platforms, though it does not compile host code and therefore has no such concept of host compiler unlike in NVCC.

Comment on lines +39 to +44
// X86 32-bit
#if defined(_M_IX86)
# define _CCCL_ARCH_X86_32_() 1
#else
# define _CCCL_ARCH_X86_32_() 0
#endif
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Q: Why is this needed? I vaguely remember that we don't support any 32-bit systems.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Windows 32bit support has been removed in CUDA 12 (not 11) https://docs.nvidia.com/cuda/cuda-installation-guide-microsoft-windows/index.html#system-requirements. Probably, we can remove it

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Those are most likely remnants from our libc++ fork

Copy link
Contributor

github-actions bot commented Jan 4, 2025

🟩 CI finished in 1h 25m: Pass: 100%/170 | Total: 1d 02h | Avg: 9m 15s | Max: 42m 11s | Hits: 67%/22538
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 8h 58m | Avg: 11m 13s | Max: 39m 39s | Hits: 31%/9834

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  8h 51m | Avg: 11m 33s | Max: 39m 39s | Hits:  31%/9834  
      🟩 arm64              Pass: 100%/2   | Total:  6m 58s | Avg:  3m 29s | Max:  3m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 47m 04s | Avg:  6m 43s | Max: 29m 20s | Hits:  34%/2244  
      🟩 12.5               Pass: 100%/2   | Total: 38m 32s | Avg: 19m 16s | Max: 29m 49s
      🟩 12.6               Pass: 100%/39  | Total:  7h 32m | Avg: 11m 36s | Max: 39m 39s | Hits:  31%/7590  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 03m | Avg: 15m 55s | Max: 19m 11s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 47m 04s | Avg:  6m 43s | Max: 29m 20s | Hits:  34%/2244  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 38m 32s | Avg: 19m 16s | Max: 29m 49s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 29m | Avg: 11m 07s | Max: 39m 39s | Hits:  31%/7590  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 03m | Avg: 15m 55s | Max: 19m 11s
      🟩 nvcc               Pass: 100%/44  | Total:  7h 54m | Avg: 10m 47s | Max: 39m 39s | Hits:  31%/9834  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 15m 09s | Avg:  3m 47s | Max:  4m 37s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 04s | Avg:  4m 04s | Max:  4m 04s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 13s | Avg:  4m 13s | Max:  4m 13s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 58s | Avg:  3m 58s | Max:  3m 58s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 17s | Avg:  4m 17s | Max:  4m 17s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 44m | Avg: 13m 04s | Max: 28m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 51s | Avg:  2m 55s | Max:  3m 06s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 30s | Avg:  3m 15s | Max:  3m 17s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s
      🟩 GCC9               Pass: 100%/3   | Total:  9m 01s | Avg:  3m 00s | Max:  3m 36s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 GCC13              Pass: 100%/10  | Total:  2h 35m | Avg: 15m 34s | Max: 31m 49s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 22m 16s | Avg: 22m 16s | Max: 22m 16s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 29m 20s | Avg: 29m 20s | Max: 29m 20s | Hits:  34%/2244  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 31m 32s | Avg: 31m 32s | Max: 31m 32s | Hits:  31%/2481  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 36s | Max: 39m 39s | Hits:  30%/5109  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 38m 32s | Avg: 19m 16s | Max: 29m 49s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/20  | Total:  2h 33m | Avg:  7m 40s | Max: 28m 49s
      🟩 GCC                Pass: 100%/21  | Total:  3h 12m | Avg:  9m 08s | Max: 31m 49s
      🟩 Intel              Pass: 100%/1   | Total: 22m 16s | Avg: 22m 16s | Max: 22m 16s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 12m | Avg: 33m 01s | Max: 39m 39s | Hits:  31%/9834  
      🟩 NVHPC              Pass: 100%/2   | Total: 38m 32s | Avg: 19m 16s | Max: 29m 49s
    🟩 gpu
      🟩 v100               Pass: 100%/48  | Total:  8h 58m | Avg: 11m 13s | Max: 39m 39s | Hits:  31%/9834  
    🟩 jobs
      🟩 Build              Pass: 100%/41  | Total:  6h 09m | Avg:  9m 00s | Max: 39m 39s | Hits:  31%/9834  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 46m | Avg: 26m 40s | Max: 27m 46s
      🟩 Test               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 19s | Max: 31m 49s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 18s | Avg: 12m 18s | Max: 12m 18s
      🟩 90a                Pass: 100%/2   | Total: 17m 32s | Avg:  8m 46s | Max: 13m 35s
    🟩 std
      🟩 11                 Pass: 100%/6   | Total: 40m 14s | Avg:  6m 42s | Max: 24m 30s
      🟩 14                 Pass: 100%/5   | Total:  1h 08m | Avg: 13m 36s | Max: 29m 20s | Hits:  34%/2244  
      🟩 17                 Pass: 100%/13  | Total:  3h 06m | Avg: 14m 23s | Max: 31m 33s | Hits:  31%/4962  
      🟩 20                 Pass: 100%/23  | Total:  4h 01m | Avg: 10m 29s | Max: 39m 39s | Hits:  30%/2628  
    
  • 🟩 cub: Pass: 100%/47 | Total: 7h 19m | Avg: 9m 21s | Max: 42m 11s | Hits: 99%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  7h 10m | Avg:  9m 33s | Max: 42m 11s | Hits:  99%/3132  
      🟩 arm64              Pass: 100%/2   | Total:  9m 27s | Avg:  4m 43s | Max:  4m 52s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total: 41m 35s | Avg:  5m 56s | Max: 15m 26s | Hits:  99%/783   
      🟩 12.5               Pass: 100%/2   | Total: 18m 40s | Avg:  9m 20s | Max:  9m 44s
      🟩 12.6               Pass: 100%/38  | Total:  6h 19m | Avg:  9m 58s | Max: 42m 11s | Hits:  99%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  4m 28s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 41m 35s | Avg:  5m 56s | Max: 15m 26s | Hits:  99%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 40s | Avg:  9m 20s | Max:  9m 44s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  6h 10m | Avg: 10m 17s | Max: 42m 11s | Hits:  99%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 47s | Avg:  4m 23s | Max:  4m 28s
      🟩 nvcc               Pass: 100%/45  | Total:  7h 10m | Avg:  9m 34s | Max: 42m 11s | Hits:  99%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 21m 11s | Avg:  5m 17s | Max:  5m 58s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 29s | Avg:  6m 29s | Max:  6m 29s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 14s | Avg:  5m 14s | Max:  5m 14s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 27m | Avg: 12m 30s | Max: 42m 11s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 12s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 19s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 GCC9               Pass: 100%/3   | Total: 13m 50s | Avg:  4m 36s | Max:  5m 23s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 20s | Avg:  5m 20s | Max:  5m 20s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 15s | Avg:  5m 15s | Max:  5m 15s
      🟩 GCC12              Pass: 100%/3   | Total: 25m 30s | Avg:  8m 30s | Max: 15m 45s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 11m | Avg: 16m 25s | Max: 36m 49s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 25s | Avg:  6m 25s | Max:  6m 25s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 26s | Avg: 15m 26s | Max: 15m 26s | Hits:  99%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 14m 25s | Avg: 14m 25s | Max: 14m 25s | Hits:  99%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total: 25m 46s | Avg: 12m 53s | Max: 13m 08s | Hits:  99%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 40s | Avg:  9m 20s | Max:  9m 44s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 33m | Avg:  8m 04s | Max: 42m 11s
      🟩 GCC                Pass: 100%/21  | Total:  3h 25m | Avg:  9m 46s | Max: 36m 49s
      🟩 Intel              Pass: 100%/1   | Total:  6m 25s | Avg:  6m 25s | Max:  6m 25s
      🟩 MSVC               Pass: 100%/4   | Total: 55m 37s | Avg: 13m 54s | Max: 15m 26s | Hits:  99%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 40s | Avg:  9m 20s | Max:  9m 44s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 19m 45s | Avg:  9m 52s | Max: 15m 45s
      🟩 v100               Pass: 100%/45  | Total:  6h 59m | Avg:  9m 19s | Max: 42m 11s | Hits:  99%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  4h 10m | Avg:  6m 16s | Max: 15m 26s | Hits:  99%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 19m 42s | Avg: 19m 42s | Max: 19m 42s
      🟩 GraphCapture       Pass: 100%/1   | Total: 36m 49s | Avg: 36m 49s | Max: 36m 49s
      🟩 HostLaunch         Pass: 100%/3   | Total: 54m 47s | Avg: 18m 15s | Max: 20m 42s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 17m | Avg: 38m 41s | Max: 42m 11s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 19m 45s | Avg:  9m 52s | Max: 15m 45s
      🟩 90a                Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 23m 30s | Avg:  4m 42s | Max:  5m 51s
      🟩 14                 Pass: 100%/4   | Total: 30m 51s | Avg:  7m 42s | Max: 15m 26s | Hits:  99%/783   
      🟩 17                 Pass: 100%/12  | Total:  1h 24m | Avg:  7m 03s | Max: 14m 25s | Hits:  99%/1566  
      🟩 20                 Pass: 100%/26  | Total:  5h 00m | Avg: 11m 33s | Max: 42m 11s | Hits:  99%/783   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 6h 48m | Avg: 8m 52s | Max: 26m 45s | Hits: 93%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 19m 05s | Avg:  9m 32s | Max: 13m 08s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  6h 38m | Avg:  9m 03s | Max: 26m 45s | Hits:  93%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 57s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 12m | Avg: 10m 17s | Max: 26m 45s | Hits:  88%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 34m 23s | Avg: 17m 11s | Max: 19m 36s
      🟩 12.6               Pass: 100%/37  | Total:  5h 01m | Avg:  8m 09s | Max: 24m 27s | Hits:  94%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  4m 51s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 12m | Avg: 10m 17s | Max: 26m 45s | Hits:  88%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 34m 23s | Avg: 17m 11s | Max: 19m 36s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  4h 52m | Avg:  8m 20s | Max: 24m 27s | Hits:  94%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  4m 51s
      🟩 nvcc               Pass: 100%/44  | Total:  6h 38m | Avg:  9m 03s | Max: 26m 45s | Hits:  93%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  6m 26s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 59s | Avg:  6m 59s | Max:  6m 59s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 15s | Avg:  5m 15s | Max:  5m 15s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 15s | Avg:  5m 15s | Max:  5m 15s
      🟩 Clang18            Pass: 100%/7   | Total: 45m 40s | Avg:  6m 31s | Max: 13m 21s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 15s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 50s | Avg:  5m 25s | Max:  5m 39s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 02s | Avg:  5m 02s | Max:  5m 02s
      🟩 GCC9               Pass: 100%/3   | Total: 37m 30s | Avg: 12m 30s | Max: 26m 45s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 42s | Avg:  5m 42s | Max:  5m 42s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 13m | Avg:  9m 12s | Max: 24m 27s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  7m 24s | Avg:  7m 24s | Max:  7m 24s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 24m 03s | Avg: 24m 03s | Max: 24m 03s | Hits:  88%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 17m 18s | Avg: 17m 18s | Max: 17m 18s | Hits:  99%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 02m | Avg: 20m 55s | Max: 22m 36s | Hits:  93%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 34m 23s | Avg: 17m 11s | Max: 19m 36s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 50m | Avg:  5m 49s | Max: 13m 21s
      🟩 GCC                Pass: 100%/19  | Total:  2h 31m | Avg:  7m 58s | Max: 26m 45s
      🟩 Intel              Pass: 100%/1   | Total:  7m 24s | Avg:  7m 24s | Max:  7m 24s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 44m | Avg: 20m 49s | Max: 24m 03s | Hits:  93%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 34m 23s | Avg: 17m 11s | Max: 19m 36s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  6h 48m | Avg:  8m 52s | Max: 26m 45s | Hits:  93%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  5h 19m | Avg:  7m 58s | Max: 26m 45s | Hits:  92%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 38m 14s | Avg: 12m 44s | Max: 22m 36s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 50m 56s | Avg: 16m 58s | Max: 24m 27s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 40s | Avg:  4m 40s | Max:  4m 40s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total: 45m 13s | Avg:  9m 02s | Max: 26m 45s
      🟩 14                 Pass: 100%/4   | Total: 40m 23s | Avg: 10m 05s | Max: 24m 03s | Hits:  88%/1852  
      🟩 17                 Pass: 100%/12  | Total:  1h 43m | Avg:  8m 35s | Max: 19m 41s | Hits:  95%/3704  
      🟩 20                 Pass: 100%/23  | Total:  3h 20m | Avg:  8m 43s | Max: 24m 27s | Hits:  94%/3704  
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 28m | Avg: 5m 41s | Max: 30m 12s | Hits: 92%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 17m | Avg:  6m 15s | Max: 30m 12s | Hits:  92%/312   
      🟩 arm64              Pass: 100%/4   | Total: 10m 32s | Avg:  2m 38s | Max:  2m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 14m 34s | Avg:  4m 51s | Max:  8m 51s | Hits:  92%/156   
      🟩 12.5               Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 19s
      🟩 12.6               Pass: 100%/21  | Total:  2h 03m | Avg:  5m 52s | Max: 30m 12s | Hits:  92%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 14m 34s | Avg:  4m 51s | Max:  8m 51s | Hits:  92%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 19s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  2h 03m | Avg:  5m 52s | Max: 30m 12s | Hits:  92%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 28m | Avg:  5m 41s | Max: 30m 12s | Hits:  92%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
      🟩 Clang10            Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 Clang11            Pass: 100%/1   | Total:  2m 59s | Avg:  2m 59s | Max:  2m 59s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
      🟩 Clang14            Pass: 100%/1   | Total:  2m 59s | Avg:  2m 59s | Max:  2m 59s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s
      🟩 Clang18            Pass: 100%/4   | Total: 38m 41s | Avg:  9m 40s | Max: 30m 12s
      🟩 GCC9               Pass: 100%/1   | Total:  2m 41s | Avg:  2m 41s | Max:  2m 41s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 46s | Avg:  3m 46s | Max:  3m 46s
      🟩 GCC11              Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 GCC12              Pass: 100%/2   | Total: 32m 46s | Avg: 16m 23s | Max: 29m 35s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 36s | Avg:  2m 39s | Max:  2m 44s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 51s | Avg:  8m 51s | Max:  8m 51s | Hits:  92%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 35s | Avg:  8m 35s | Max:  8m 35s | Hits:  92%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 19s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 07m | Avg:  5m 12s | Max: 30m 12s
      🟩 GCC                Pass: 100%/9   | Total: 52m 45s | Avg:  5m 51s | Max: 29m 35s
      🟩 MSVC               Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 51s | Hits:  92%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 28m | Avg:  5m 41s | Max: 30m 12s | Hits:  92%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 28m | Avg:  3m 40s | Max:  8m 51s | Hits:  92%/312   
      🟩 Test               Pass: 100%/2   | Total: 59m 47s | Avg: 29m 53s | Max: 30m 12s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 44s | Avg:  2m 44s | Max:  2m 44s
      🟩 90a                Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 18m 57s | Avg:  3m 09s | Max:  5m 19s
      🟩 20                 Pass: 100%/20  | Total:  2h 09m | Avg:  6m 27s | Max: 30m 12s | Hits:  92%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 55s | Avg: 4m 57s | Max: 7m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
      🟩 Test               Pass: 100%/1   | Total:  7m 51s | Avg:  7m 51s | Max:  7m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 28m 55s | Avg: 28m 55s | Max: 28m 55s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@davebayer
Copy link
Contributor

Q: is it necessary to define the values as function-like macros? Unless we need to delay the macro expansion, I think you we can use ordinary defines and omit the else branch defining the macro to 0. I used the same approach when I implemented the compiler checks

@miscco
Copy link
Collaborator

miscco commented Jan 5, 2025

Q: is it necessary to define the values as function-like macros? Unless we need to delay the macro expansion, I think you we can use ordinary defines and omit the else branch defining the macro to 0. I used the same approach when I implemented the compiler checks

If possible we really want to have the function like macros because in that case it is not possible to silently get invalid checks in like using just ARM instead of ARM64

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

[FEA]: Provide Architecture and OS identification macros
6 participants