Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[OptRed] Extend
-tritonintelgpu-optimize-reduction-locality
to supp…
…ort `repCluster[0] > 2` Support `repCluster[0] > 2` by using 7-D tensors and adding a `convert_layout` operation before the final `reshape`. See code for implementation details. Signed-off-by: victor-eds <[email protected]>
- Loading branch information