Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #13033

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #13033

Annotations

1 warning

unit-tests

succeeded Oct 25, 2024 in 5m 56s