
Fuse pointwise operations into matmul / convolution operations #371

Open
robertknight opened this issue Sep 21, 2024 · 0 comments
Labels
performance Issues that affect model inference or loading performance

Comments

robertknight commented Sep 21, 2024

One of the most common optimizations that ML frameworks perform is to fuse pointwise operations such as Relu, Gelu, and Silu into the preceding convolution / matmul operation. This reduces overhead by applying the pointwise operation while the output data is still in cache, rather than making a separate pass over the output buffer. This is not yet implemented in RTen.
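To illustrate the idea (this is a hypothetical sketch, not RTen's actual API — `matmul_fused` and its signature are invented for this example), the activation can be applied to each output element while it is still in a register, instead of traversing the output buffer a second time:

```rust
// Hypothetical sketch of pointwise fusion into a naive matmul.
// `a` is an m x k matrix, `b` is k x n, both row-major. `act` is the
// pointwise operation (e.g. Relu) being fused into the matmul.
fn matmul_fused(
    a: &[f32],
    b: &[f32],
    m: usize,
    k: usize,
    n: usize,
    act: impl Fn(f32) -> f32,
) -> Vec<f32> {
    let mut out = vec![0.0f32; m * n];
    for i in 0..m {
        for j in 0..n {
            let mut acc = 0.0f32;
            for p in 0..k {
                acc += a[i * k + p] * b[p * n + j];
            }
            // Fusion point: the activation runs on `acc` while it is
            // still in a register, so no second pass over `out` is needed.
            out[i * n + j] = act(acc);
        }
    }
    out
}

fn main() {
    let a = [1.0, -2.0, 3.0, 4.0]; // 2x2 matrix [[1, -2], [3, 4]]
    let b = [1.0, 0.0, 0.0, 1.0]; // 2x2 identity
    let relu = |x: f32| x.max(0.0);
    let out = matmul_fused(&a, &b, 2, 2, 2, relu);
    println!("{:?}", out); // Relu(A * I) = [1.0, 0.0, 3.0, 4.0]
}
```

In a real implementation the fused activation would be applied inside the optimized matmul kernel's output tile, but the cache-locality argument is the same.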

@robertknight robertknight added the performance Issues that affect model inference or loading performance label Sep 21, 2024