FP6 Speed on A100 80g #1181

shihaobai · 2024-10-28T07:05:52Z

ENV:
cuda: 12.1
torch: 2.5.0+cu121
python benchmark_fp6.py

Hello, have you tested the performance of the FP6 kernel on the A100? I found that the speed is much slower compared to FP16.
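For readers who want to reproduce this kind of comparison, here is a minimal timing sketch. It is not the contents of `benchmark_fp6.py`, which the thread does not show. The helper name `time_fn` and its parameters are hypothetical, and the workload is a CPU stand-in so the snippet runs without a GPU.

```python
import time
from typing import Callable, Optional


def time_fn(
    fn: Callable[[], object],
    warmup: int = 3,
    iters: int = 10,
    sync: Optional[Callable[[], None]] = None,
) -> float:
    """Return the mean wall-clock seconds per call of `fn`.

    `sync` is an optional callable that blocks until queued work is done.
    It is called once after warmup and once after the timed loop.
    """
    # Warmup runs are excluded from the measurement.
    for _ in range(warmup):
        fn()
    if sync is not None:
        sync()

    start = time.perf_counter()
    for _ in range(iters):
        fn()
    if sync is not None:
        sync()
    return (time.perf_counter() - start) / iters


if __name__ == "__main__":
    # Stand-in workload (assumption): replace with the FP16 and FP6 calls to compare.
    mean_s = time_fn(lambda: sum(range(10_000)))
    print(f"mean time per call: {mean_s * 1e6:.1f} us")
```

When timing CUDA kernels, pass `sync=torch.cuda.synchronize`. Kernel launches are asynchronous, so without the synchronization the timer mostly measures launch overhead.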

gau-nernst · 2024-10-28T07:33:00Z

This looks related to #1092 (the speedup numbers are similar). What is your torchao version? Can you try updating torchao, or installing the nightly build or building from source? It should be fixed in 0.6.1.
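One way to answer the version question is sketched below. It uses only the standard library and assumes the installed distributions are named `torch` and `torchao`. The helper `installed_versions` is hypothetical and is not part of torchao.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Dict, Iterable, Optional


def installed_versions(
    packages: Iterable[str] = ("torch", "torchao"),
) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent."""
    result: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            result[name] = None
    return result


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        print(f"{name}: {ver or 'not installed'}")
```

The reported string includes any local suffix, such as `+cu121` or a git hash, which helps tell a nightly wheel from a source build.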

shihaobai · 2024-10-28T08:28:10Z

Thanks for your help! My torchao version is torchao-0.7.0.dev20241028+cu121. I tried 0.6.1 and got the expected performance.

gau-nernst · 2024-10-29T01:15:52Z

I think torchao-0.7.0.dev20241028+cu121 should include the fix as well. Can you double-check that torchao-0.7.0.dev20241028+cu121 is also working correctly?

If everything works as expected, let me know so I can close the issue.

cc @tobiasvanderwerff

shihaobai · 2024-10-29T04:00:14Z

Thanks for your help. I tried the latest torchao==0.7.0+gitcbd90e38 and it worked correctly. But when I installed torchao-0.7.0.dev20241028+cu121 again, I encountered the bug:

ENV:
cuda: 12.1
torch: 2.5.0+cu121
pip install torchao==0.7.0.dev20241028 --index-url https://download.pytorch.org/whl/nightly/cu121

tobiasvanderwerff · 2024-10-29T07:23:13Z

@shihaobai Have you tried recompiling the C++/CUDA code by running pip install . in the base directory of ao? This might help resolve the error.

shihaobai · 2024-10-29T07:27:44Z

@tobiasvanderwerff I tried based on the latest commit and it worked correctly.
