Add endpoint for emptying the model cache #7602

RyanJDick · 2025-01-28T19:43:37Z

Summary

This PR adds a backend endpoint that can be used to empty the model cache (to free RAM / VRAM). 'Locked' models that are actively being used will not be dropped.

Locking was added to ModelCache to make it thread-safe. However, I suspect it may still be possible to trigger #7513 with an ill-timed request to empty the cache in the middle of graph execution. If that happens, the current graph execution would fail, but subsequent graph executions should recover smoothly.
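For readers unfamiliar with the pattern, here is a minimal sketch of the behavior described above: a cache guarded by a threading lock whose "empty" operation skips entries that are currently locked. `ToyModelCache` and all of its methods are hypothetical names made up for illustration; this is not the actual ModelCache code from this PR.

```python
import threading


class ToyModelCache:
    """Hypothetical stand-in for a model cache (NOT InvokeAI's ModelCache)."""

    def __init__(self):
        # One re-entrant lock guards every read and write of the cache state.
        self._lock = threading.RLock()
        # key -> [model, lock_count]; lock_count > 0 means "in use".
        self._entries = {}

    def put(self, key, model):
        with self._lock:
            self._entries[key] = [model, 0]

    def lock(self, key):
        """Mark an entry as in use and return the cached model."""
        with self._lock:
            entry = self._entries[key]
            entry[1] += 1
            return entry[0]

    def unlock(self, key):
        """Release one use of an entry."""
        with self._lock:
            self._entries[key][1] -= 1

    def empty(self):
        """Drop every entry that is not locked; return the dropped keys."""
        with self._lock:
            droppable = [k for k, (_, count) in self._entries.items() if count == 0]
            for k in droppable:
                del self._entries[k]
            return droppable

    def keys(self):
        with self._lock:
            return sorted(self._entries)


cache = ToyModelCache()
cache.put("unet", "fake-unet-weights")
cache.put("vae", "fake-vae-weights")
cache.lock("unet")            # simulate a model that is actively being used
dropped = cache.empty()       # only the unlocked entry is dropped
remaining = cache.keys()
```

In this sketch the lock only protects the cache's own bookkeeping. It does not, by itself, prevent a model from being dropped between two steps of a graph execution that unlocks and re-locks it, which is the kind of ill-timed interaction the paragraph above is concerned about.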

QA Instructions

I tested that the new endpoint successfully empties the model cache with curl -X POST 127.0.0.1:9090/api/v2/models/empty_model_cache.
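The same request can be issued from Python with only the standard library. This is a sketch that assumes the server is reachable over plain HTTP at the host and port used in the curl command; `build_empty_cache_request` is a hypothetical helper name, and the request is only sent if you uncomment the last line.

```python
import urllib.request


def build_empty_cache_request(base_url="http://127.0.0.1:9090"):
    """Build (but do not send) a POST to the empty_model_cache endpoint.

    The base URL is an assumption taken from the curl command above.
    """
    url = f"{base_url}/api/v2/models/empty_model_cache"
    return urllib.request.Request(url, method="POST")


request = build_empty_cache_request()

# Uncomment to actually hit a running server:
# with urllib.request.urlopen(request) as response:
#     print(response.status)
```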

I tried to break things by hitting the endpoint throughout graph execution. Nothing broke, which is a good sign. Of course, this is far from testing the thread interactions thoroughly.

Merge Plan

No special instructions.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
Documentation added / updated (if applicable)
Updated What's New copy (if doing a release after this PR)

github-actions bot added the api, python, backend, and frontend labels Jan 28, 2025

RyanJDick force-pushed the ryan/empty-model-cache branch from 3c2a65a to 3613ee3 January 28, 2025 19:44

RyanJDick marked this pull request as ready for review January 28, 2025 19:46

RyanJDick requested review from psychedelicious, blessedcoolant, maryhipp, hipsterusername, lstein and brandonrising as code owners January 28, 2025 19:46

hipsterusername approved these changes Jan 30, 2025

RyanJDick and others added 2 commits January 30, 2025 08:47

Add endpoint for emptying the model cache. Also, adds a threading lock to the ModelCache to make it thread-safe.

309ae09

feat(ui): add button to clear model cache

24a078b

hipsterusername force-pushed the ryan/empty-model-cache branch from eb5bac9 to 24a078b January 30, 2025 13:47

hipsterusername enabled auto-merge (rebase) January 30, 2025 14:16

hipsterusername merged commit 64475b8 into main Jan 30, 2025
15 checks passed

hipsterusername deleted the ryan/empty-model-cache branch January 30, 2025 14:18
