Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

为什么permuted没有bank conflict,而我改用swizzle存在bank conflict? #13

Open
tangpanyu opened this issue Dec 10, 2024 · 1 comment

Comments

@tangpanyu
Copy link

Why is there still a bank conflict when using swizzle to process data?
类似于如此,由于我的代码太乱了就没放了,作者您是否在避免bank conflict时遇到过类似的问题呢?
Bank Conflicts on Ampere
Shared memory bank conflicts and nsight metric
Allocating more share memory than needed resulted in bank conflict

@tangpanyu
Copy link
Author

我ncu了您的代码,中间确实存在一些差别,您是没用使用到L1 cache的,可能扩展了L1 cache作为smem,确实上面的论坛也提到过这个问题,但是好像没有给出解决的方法。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant