We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why is there still a bank conflict when using swizzle to process data? 类似于如此,由于我的代码太乱了就没放了,作者您是否在避免bank conflict时遇到过类似的问题呢? Bank Conflicts on Ampere Shared memory bank conflicts and nsight metric Allocating more share memory than needed resulted in bank conflict
The text was updated successfully, but these errors were encountered:
我ncu了您的代码,中间确实存在一些差别,您是没用使用到L1 cache的,可能扩展了L1 cache作为smem,确实上面的论坛也提到过这个问题,但是好像没有给出解决的方法。
Sorry, something went wrong.
No branches or pull requests
Why is there still a bank conflict when using swizzle to process data?
类似于如此,由于我的代码太乱了就没放了,作者您是否在避免bank conflict时遇到过类似的问题呢?
Bank Conflicts on Ampere
Shared memory bank conflicts and nsight metric
Allocating more share memory than needed resulted in bank conflict
The text was updated successfully, but these errors were encountered: