Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11908

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11908

Triggered via pull request October 25, 2024 16:51
Status Success
Total duration 1h 26m 54s
Artifacts

nv-torch-latest-v100.yml

on: pull_request
Fit to window
Zoom out
Zoom in