Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make Pulley pass simd_f32x4_arith.wast #9897

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

tyoeer
Copy link

@tyoeer tyoeer commented Dec 23, 2024

Helping towards issue #9783.

This PR makes Pulley pass spec_testsuite/simd_f32x4_arith.wast, by adding f32x4 instructions for subtraction, negation, multiplication, and division.

Random notes:

  • Added instructions placement and doc comments in pulley/src/lib.rs are based on what is done with vtrunc32x4.
  • Added instructions implementation is based on vmuli32x4.
  • How instructions are generally grouped together/ordered appears to vary, e.g. ftrunc32, vtrunc32x4, and vtrunc64x2 are together, but ftrunc64 isn't near there.
  • It also appears inconsistent if a SIMD float instruction has an f at the end, e.g. vtrunc32x4 vs. vabsf32x4.

Adds float SIMD instructions on 4 lanes (f32x4) for subtraction, multiplication, division, and negation.
`vtrunc32x4` and `vmuli32x4` were used as basis on how to organize things.
To be exact: `spec_testsuite/simd_f32x4_arith.wast` has been removed from the should fail list for Pulley.
@tyoeer tyoeer requested review from a team as code owners December 23, 2024 20:23
@tyoeer tyoeer requested review from fitzgen and removed request for a team December 23, 2024 20:23
@github-actions github-actions bot added cranelift Issues related to the Cranelift code generator pulley Issues related to the Pulley interpreter labels Dec 23, 2024
Copy link

Subscribe to Label Action

cc @fitzgen

This issue or pull request has been labeled: "cranelift", "pulley"

Thus the following users have been cc'd because of the following labels:

  • fitzgen: pulley

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

Copy link
Member

@alexcrichton alexcrichton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

Comment on lines 814 to 815
/// `low128(dst) = low128(src1) - low128(src2)`
vsub32x4 = Vsub32x4 { operands: BinaryOperands<VReg> };
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mind using f32x4 in the opcodes (e.g. vsubf32x4) to clearly indicate that it's for floats and not integers?

Copy link
Author

@tyoeer tyoeer Jan 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should be done

Specifically:
- "vsub32x4" -> "vsubf32x4"
- "vmul32x4" -> "vmulf32x4"
- "vdiv32x4" -> "vdivf32x4"
@tyoeer tyoeer requested a review from alexcrichton January 2, 2025 14:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cranelift Issues related to the Cranelift code generator pulley Issues related to the Pulley interpreter
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants