What are supported low-bit (int8/fp8/int4) data types in MLP and Attention layers? #91

Triggered via issue January 7, 2025 14:29
Status Failure
Total duration 52s
Artifacts

auto-assign.yml

on: issues
assign_issue
38s

Annotations

1 error and 1 warning

Error (assign_issue): Process completed with exit code 1.
Warning (assign_issue): ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636