Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11908
Triggered via pull request
October 25, 2024 16:51
Status
Success
Total duration
1h 26m 54s
Artifacts
–
nv-torch-latest-v100.yml
on: pull_request
unit-tests
1h 26m