Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA]: Provide optimized <bit> functions for device #2239

Open
1 task done
fbusato opened this issue Aug 14, 2024 · 0 comments · May be fixed by #3226 or #3228
Open
1 task done

[FEA]: Provide optimized <bit> functions for device #2239

fbusato opened this issue Aug 14, 2024 · 0 comments · May be fixed by #3226 or #3228
Labels
feature request New feature or request.

Comments

@fbusato
Copy link
Contributor

fbusato commented Aug 14, 2024

Is this a duplicate?

Area

libcu++

Is your feature request related to a problem? Please describe.

The following functions

require to be constexpr which prevents potential optimizations such as intrinsics and asm.

Describe the solution you'd like

Such functions should be mapped to GPU ISA via PTX and/or intrinsics

  • Countl_zero → FLO + IADD3 (__clz) / FLO.SH (PTX: bfind_shift)
  • Countr_zeroCountl_zero(__brev(x))
  • Countl_oneCountl_zero (~x)
  • Countr_oneCountl_one(__brev(x))
  • popcount__popc
  • rotl/rotr → PTX: funnel shift
  • bit_floor, bit_ceil → FLO + shift
  • has_single_bit__popc
  • byteswap → PRMT

Describe alternatives you've considered

No response

Additional context

No response

@fbusato fbusato added the feature request New feature or request. label Aug 14, 2024
@fbusato fbusato added this to CCCL Aug 14, 2024
@github-project-automation github-project-automation bot moved this to Todo in CCCL Aug 14, 2024
@fbusato fbusato linked a pull request Dec 31, 2024 that will close this issue
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Dec 31, 2024
@cccl-authenticator-app cccl-authenticator-app bot moved this from In Review to In Progress in CCCL Jan 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request.
Projects
Status: In Progress
1 participant