Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

src: cpu: aarch64: Re-enable matmul static quantisation through ACL. #2308

Merged
merged 1 commit into from
Jan 15, 2025

Conversation

renato-arantes
Copy link
Contributor

@renato-arantes renato-arantes commented Dec 22, 2024

Description

This is to re-enable matmul static quantization operations through ACL. Currently, the supported data type combinations are s8:s8:s8 and u8:s8:u8. Neoverse-N1 is skipped until failing tests are fixed.

Previous PR: #2198

General

  • Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
  • Have you formatted the code using clang-format?

@renato-arantes renato-arantes requested review from a team as code owners December 22, 2024 17:57
@renato-arantes renato-arantes force-pushed the static_quant2 branch 2 times, most recently from 88cf5e0 to 5cec04d Compare December 22, 2024 20:47
@renato-arantes
Copy link
Contributor Author

Hi @mgouicem, it's the same as before but more restrictive with Neoverse-N1 until the failing tests are fixed.

src/cpu/aarch64/acl_gemm_convolution.cpp Outdated Show resolved Hide resolved
@github-actions github-actions bot added the platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64 label Dec 23, 2024
@theComputeKid theComputeKid added this to the v3.7 milestone Jan 8, 2025
@Radu2k
Copy link
Contributor

Radu2k commented Jan 8, 2025

@mgouicem @dzarukin Could we have someone review this please?

src/common/convolution_pd.hpp Outdated Show resolved Hide resolved
src/common/convolution_pd.hpp Outdated Show resolved Hide resolved
src/cpu/aarch64/acl_convolution_utils.hpp Outdated Show resolved Hide resolved
src/cpu/aarch64/acl_convolution_utils.hpp Outdated Show resolved Hide resolved
src/cpu/aarch64/acl_gemm_convolution.cpp Outdated Show resolved Hide resolved
src/cpu/aarch64/matmul/acl_lowp_matmul_sq.cpp Outdated Show resolved Hide resolved
src/cpu/aarch64/matmul/acl_lowp_matmul_sq.hpp Outdated Show resolved Hide resolved
src/cpu/aarch64/matmul/acl_lowp_matmul_sq.hpp Outdated Show resolved Hide resolved
src/cpu/aarch64/matmul/acl_lowp_matmul_sq.hpp Outdated Show resolved Hide resolved
src/cpu/cpu_convolution_list.cpp Outdated Show resolved Hide resolved
@mgouicem
Copy link
Contributor

mgouicem commented Jan 9, 2025

Hi @renato-arantes I see that you added a merge commit, please rebase your branch instead as we try to keep a linear history.

@renato-arantes renato-arantes force-pushed the static_quant2 branch 4 times, most recently from ce7f10b to 66d9ea4 Compare January 9, 2025 12:47
@vpirogov
Copy link
Member

vpirogov commented Jan 9, 2025

@renato-arantes, please resolve conflicts.

@renato-arantes
Copy link
Contributor Author

Hi @vpirogov,

The conflicts are there because a commit was reverted, and my work for conv is based on it. I will drop my work related to convolution from this PR and keep only the matmul one. When the issue that caused the reverse is solved, I will create a new PR for the conv work.

@renato-arantes renato-arantes changed the title src: cpu: aarch64: Re-enable matmul and convolution static quantisation through ACL. src: cpu: aarch64: Re-enable matmul static quantisation through ACL. Jan 13, 2025
@renato-arantes
Copy link
Contributor Author

Hi @mgouicem, Can we please merge this patch? It is the same as before, but this time without the conv work, as described here.

@theComputeKid theComputeKid merged commit 4052b9b into oneapi-src:main Jan 15, 2025
16 checks passed
arm_compute::QuantizationInfo(*src_scale, -src_zero_point, true));
acl_obj.wei_tensor.info()->set_quantization_info(
arm_compute::QuantizationInfo(*wei_scale, -wei_zero_point, true));
// for efficiency reasons, OneDNN saves the inverse of the destination
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's oneDNN, not OneDNN. Fixup is here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants