How to Lower to VNNI Instructions? #8337
Unanswered
FabianSchuetze
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am trying to use Intels VNNI instructions. Eventually, I would like to use them for convolutions but start with a simple example based on PR #5807.
The following code compiles but does not produce the desired assembly.
I compile it with the target option
target=host-avx512_sapphirerapids
. Unfortunately, the code doesn't seem to be using VNNI instructions. The generated IR is:And the relevant assembly:
Instead, when I try to follow the example of the PR the compilation aborts. The code:
leads to a termination of the compilation because
saturating_sum
cannot matched.I know that there's also the unit-test in
simd_op_check_x86
:But I unfortunately can't figure out to which generator and schedule this maps to.
What should I do to produce VNNI instructions? And, are there any suggestions for generating VNNI instructions for a convolution?
Beta Was this translation helpful? Give feedback.
All reactions