Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #1520

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #1520

Re-run triggered January 21, 2025 03:34
Status Success
Total duration 1h 48m 52s
Artifacts

hpu-gaudi2.yml

on: pull_request
Fit to window
Zoom out
Zoom in