Preserve blocked pointers used by tt.load operation with DPAS layout #2400

Merged
etiotto merged 3 commits into main from etiotto/fix_2378 on Oct 2, 2024

@etiotto etiotto commented Oct 1, 2024

Fixes #2378

@etiotto etiotto self-assigned this Oct 1, 2024
@etiotto etiotto marked this pull request as draft October 1, 2024 20:32

etiotto commented Oct 2, 2024

Performance before this PR:
image

Performance with this PR improves significantly (e.g. 8Kx8Kx8K goes from ~66 TFLOPS to ~215 TFLOPS, roughly 3.3x):
image
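
For readers who want to put the two quoted figures in context, here is a minimal sketch of the arithmetic. It assumes that "8K" means 8192 and that throughput is counted with the usual 2*M*N*K FLOPs per GEMM; neither assumption is confirmed in this PR, and the helper names are hypothetical.

```python
# Back-of-the-envelope check of the quoted GEMM numbers.
# Assumptions (not stated in the PR): "8K" == 8192, and one GEMM costs
# 2*M*N*K floating-point operations (one multiply plus one add per term).

def gemm_flops(m: int, n: int, k: int) -> int:
    """FLOP count for an (m x k) @ (k x n) matrix multiply."""
    return 2 * m * n * k

def kernel_time_ms(m: int, n: int, k: int, tflops: float) -> float:
    """Time in milliseconds implied by a sustained throughput in TFLOPS."""
    return gemm_flops(m, n, k) / (tflops * 1e12) * 1e3

M = N = K = 8192                 # assumed meaning of "8Kx8Kx8K"
BEFORE_TFLOPS = 66.0             # approximate figure quoted before the PR
AFTER_TFLOPS = 215.0             # approximate figure quoted with the PR

speedup = AFTER_TFLOPS / BEFORE_TFLOPS
time_before_ms = kernel_time_ms(M, N, K, BEFORE_TFLOPS)
time_after_ms = kernel_time_ms(M, N, K, AFTER_TFLOPS)

if __name__ == "__main__":
    print(f"speedup: {speedup:.2f}x")
    print(f"implied time before: {time_before_ms:.2f} ms")
    print(f"implied time after:  {time_after_ms:.2f} ms")
```

Because both throughput figures are approximate ("~"), the ratio and the implied kernel times should be read as rough estimates only.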

@etiotto etiotto marked this pull request as ready for review October 2, 2024 13:49
Signed-off-by: Tiotto, Ettore <[email protected]>
@etiotto etiotto enabled auto-merge (squash) October 2, 2024 18:06
@etiotto etiotto merged commit 62c07a7 into main Oct 2, 2024
4 checks passed
@etiotto etiotto deleted the etiotto/fix_2378 branch October 2, 2024 18:29

Successfully merging this pull request may close these issues.

Failure to compile gemm_postop_addmatrix_benchmark.py with
3 participants