Memory spill with bfloat16 multiplication #69
Replies: 5 comments 3 replies
-
In this case adding 0 solved the issue, but I cannot understand why and neither reproduce the fix.
|
Beta Was this translation helpful? Give feedback.
-
I do not understand why are you doing this cast |
Beta Was this translation helpful? Give feedback.
-
Can you please answer in two different threads with the full code for both case (when it builds and when it does not)? |
Beta Was this translation helpful? Give feedback.
-
This is the code that builds and runs correctly
|
Beta Was this translation helpful? Give feedback.
-
This code fails to build
|
Beta Was this translation helpful? Give feedback.
-
Hi everyone!
I'm having trouble with a memory spill caused by the multiplication of two v16bfloat16 or two v32bfloat16.
My goal is to have a function that allows to do
v16bfloat16 x , v16bfloat16 y -> v16bfloat16 z = x*y
and the implementation that I wrote is the followingto_v16bfloat16(aie::mul(x, y))
.I can use this function to produce a polynomial approximation as follows:
And it works correctly. But if I modify the kernel as follows:
I get this error:
Does anyone know how to solve this issue?
Beta Was this translation helpful? Give feedback.
All reactions