This needs a bit of a re-think. The current microbenchmark infrastructure uses the onednn kernel name to benchmark only the kernel time (ignoring the pytorch overhead around the kernel). However, pytorch cannot fuse many matmul operations (e.g. dot w/ add) into a single kernel, so if we use the existing infra for those benchmarks we would be comparing the onednn matmul kernel alone against the fused triton matmul + add kernel. I think we instead want to compare total pytorch execution time with total triton (fused) execution time.
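For illustration, here is a minimal sketch of timing the whole op end to end rather than a single kernel, using `triton.testing.do_bench` and `torch.compile`. The shapes, dtype, and device are placeholder assumptions (on Intel GPUs the device would be `xpu`), not part of the existing infra:

```python
import torch
import triton.testing

device = "cuda"  # assumption for the sketch; "xpu" on Intel GPUs
M, K, N = 1024, 1024, 1024
a = torch.randn(M, K, device=device, dtype=torch.float16)
b = torch.randn(K, N, device=device, dtype=torch.float16)
bias = torch.randn(M, N, device=device, dtype=torch.float16)

def matmul_add(a, b, bias):
    # Eager pytorch runs this as two kernels: a matmul and a separate add.
    return torch.matmul(a, b) + bias

# torch.compile can fuse the add into the generated (triton) matmul epilogue.
compiled = torch.compile(matmul_add)
compiled(a, b, bias)  # warm up / trigger compilation

# do_bench times the whole callable (all launched kernels), so both sides
# are measured as total execution time, not just one kernel.
eager_ms = triton.testing.do_bench(lambda: matmul_add(a, b, bias))
fused_ms = triton.testing.do_bench(lambda: compiled(a, b, bias))
print(f"eager (unfused): {eager_ms:.3f} ms, compiled (fused): {fused_ms:.3f} ms")
```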
Currently we compare xetla and triton, but we could compare against onednn as well.