What happened?
sm_86 GPUs such as the RTX 3070 are given an incorrect workgroup memory configuration of 163 KB, when only 100 KB is available.
Reference:
"For devices of compute capability 8.0 (i.e., A100 GPUs) the maximum shared memory per thread block is 163 KB. For GPUs with compute capability 8.6 maximum shared memory per thread block is 99 KB."
https://docs.nvidia.com/cuda/ampere-tuning-guide/index.html
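To make the expected behaviour concrete, here is a minimal sketch of a per-target limit lookup. The function names `maxSharedMemPerBlockKB` and `fitsInSharedMem` are hypothetical and are not taken from the compiler; the two values come only from the tuning guide quoted above, and any other target is treated as unknown.

```cpp
#include <cstdint>

// Hypothetical helper (an assumption for illustration, not the compiler's code).
// Returns the maximum shared (workgroup) memory per thread block in KB for the
// two compute capabilities named in the quoted tuning guide, or 0 if unknown.
inline std::uint32_t maxSharedMemPerBlockKB(int major, int minor) {
    if (major == 8 && minor == 0) return 163;  // sm_80 (A100)
    if (major == 8 && minor == 6) return 99;   // sm_86 (e.g. RTX 3070)
    return 0;  // not covered by this sketch; a real tool should query the device
}

// True if a requested workgroup memory size (in KB) fits on the given target.
inline bool fitsInSharedMem(int major, int minor, std::uint32_t requestedKB) {
    const std::uint32_t limitKB = maxSharedMemPerBlockKB(major, minor);
    return limitKB != 0 && requestedKB <= limitKB;
}
```

With this sketch, `fitsInSharedMem(8, 6, 163)` is false while `fitsInSharedMem(8, 0, 163)` is true, which is the distinction the current configuration appears to miss.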
Steps to reproduce your issue
What component(s) does this issue relate to?
Compiler
Version information
No response
Additional context
No response