Update 4bit gemm kernel for warpsize 64 #49

pnunna93 · 2024-10-16T23:02:41Z

This PR introduces a warp_size variable and updates the kgemm_4bit_inference_naive kernel to use it in place of the hard-coded warp size of 32. The change is needed for the kernel to work correctly on both CDNA and RDNA architectures.
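To illustrate why a hard-coded 32 breaks on hardware with 64-wide wavefronts (as on CDNA), here is a minimal host-side sketch of the kind of index arithmetic a warp-per-row kernel typically performs. It is not the code from this PR: `WarpCoord`, `warp_coord`, and `warps_per_block` are hypothetical names chosen for illustration, and the thread counts are assumed values.

```cpp
// Hypothetical sketch, not the PR's kernel code: models how a thread's warp
// index and lane depend on the warp size that is assumed.
struct WarpCoord {
    int warp_idx;   // which warp within the block the thread belongs to
    int warp_lane;  // position of the thread within that warp
};

// Derive warp index and lane from the thread index, with warp_size passed in
// instead of being hard-coded to 32.
constexpr WarpCoord warp_coord(int thread_idx, int warp_size) {
    return WarpCoord{thread_idx / warp_size, thread_idx % warp_size};
}

// Number of warps (and, in a warp-per-row scheme, rows) handled by one block.
constexpr int warps_per_block(int threads_per_block, int warp_size) {
    return threads_per_block / warp_size;
}

// With warp size 32, thread 40 is lane 8 of warp 1.
static_assert(warp_coord(40, 32).warp_idx == 1, "warp index at warp size 32");
static_assert(warp_coord(40, 32).warp_lane == 8, "lane at warp size 32");

// With warp size 64, the same thread is lane 40 of warp 0.
static_assert(warp_coord(40, 64).warp_idx == 0, "warp index at warp size 64");
static_assert(warp_coord(40, 64).warp_lane == 40, "lane at warp size 64");

// An assumed block of 128 threads holds 4 warps at size 32 but only 2 at 64.
static_assert(warps_per_block(128, 32) == 4, "warps per block at size 32");
static_assert(warps_per_block(128, 64) == 2, "warps per block at size 64");
```

If the kernel divided by 32 on a 64-wide device, threads that execute together in one hardware wavefront would be treated as two separate warps, so any per-warp reduction or row assignment built on those indices would no longer match what the hardware actually runs in lockstep.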

Update 4bit gemm kernel for warpsize 64

6a8c711

pnunna93 requested review from lcskrishna and Lzy17 October 16, 2024 23:02

Fix spacing

44f6602

lcskrishna approved these changes Oct 21, 2024

pnunna93 merged commit 4aad810 into rocm_enabled_multi_backend Oct 23, 2024
10 of 26 checks passed

pnunna93 deleted the gemv_4bit_warpsize_64 branch October 23, 2024 18:40
