vLLM supports GPU HBM + host memory prefix kv caching #10387

Triggered via pull request January 14, 2025 04:30
Status Success
Total duration 38s

ci.yaml

on: pull_request
notebook format and lint
29s

Annotations

1 warning
notebook format and lint
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636