
feat: Textual Inversion (embeddings) #129

Closed
wants to merge 6 commits into from

Conversation

FSSRepo
Contributor

@FSSRepo FSSRepo commented Dec 29, 2023

I made a quick implementation to support embeddings. It's not the best way to do it, but once the code is refactored, the handling of embeddings could be improved. In some cases there are very large embeddings that could consume more than the available context, leading to overflow or errors.

You can specify the embedding to use by adding the embedding's filename to the prompt (usually the negative prompt), separated by commas. Like this:

sd -p "a lovely cat" -n "EasyNegative, bad image" --embd-dir models/embeddings

NOTE: EasyNegative alone uses 75 tokens, so there are only 2 tokens left.
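As a rough illustration of the budget issue above (a minimal sketch with made-up names, not the actual stable-diffusion.cpp API), the check amounts to:

```cpp
#include <cassert>

// CLIP's context length, the default this PR works within.
const int MAX_POSITION_EMBEDDINGS = 77;

// Returns true if an embedding of `embd_tokens` token vectors still fits
// next to the `used_tokens` already consumed by the rest of the prompt.
bool embedding_fits(int used_tokens, int embd_tokens) {
    return used_tokens + embd_tokens <= MAX_POSITION_EMBEDDINGS;
}
```

With EasyNegative occupying 75 tokens, `embedding_fits(75, n)` only holds for `n <= 2`.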

@leejet Any suggestions to improve this?

@FSSRepo
Contributor Author

FSSRepo commented Dec 29, 2023

@Green-Sky Try to see if the errors are fixed with the last commit I made. You don't necessarily have to use the feature addressed in this PR.

@diimdeep

diimdeep commented Dec 29, 2023

This is great.
But is max_position_embeddings = 77 a sane default? I'm not sure how the PyTorch-based implementations handle it, but AFAIK they have no problem with very long prompts, even when using multiple embeddings at the same time.
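For reference, my understanding (an assumption, not verified against their code) is that the PyTorch-based UIs get around the 77-token limit by splitting the token stream into 75-token chunks, encoding each chunk separately with BOS/EOS padding, and concatenating the resulting embeddings. A minimal sketch of just the splitting step:

```cpp
#include <algorithm>
#include <vector>

const int CHUNK_SIZE = 75; // 77 minus BOS and EOS

// Splits a tokenized prompt into chunks of at most CHUNK_SIZE tokens;
// each chunk would then be encoded independently by the text encoder.
std::vector<std::vector<int>> split_prompt(const std::vector<int>& tokens) {
    std::vector<std::vector<int>> chunks;
    for (size_t i = 0; i < tokens.size(); i += CHUNK_SIZE) {
        size_t end = std::min(tokens.size(), i + CHUNK_SIZE);
        chunks.emplace_back(tokens.begin() + i, tokens.begin() + end);
    }
    return chunks;
}
```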

@Green-Sky
Contributor

@FSSRepo finally, the funky neon looking vae artifacts are gone! 🎉

I noticed that vae tiling still has the low-res artifacts.

e.g. (attached image: output_5)

@diimdeep

diimdeep commented Dec 29, 2023

from https://civitai.com/models/72437/baddream-unrealisticdream-negative-embeddings

```
[DEBUG] stable-diffusion.cpp:1529 - parse 'ugly embd:unrealisticdream' to [['ugly embd:unrealisticdream', 1], ]
[INFO]  model.cpp:641  - load models/embeddings/unrealisticdream.pt using checkpoint format
[DEBUG] model.cpp:1142 - init from 'models/embeddings/unrealisticdream.pt'
[DEBUG] model.cpp:1224 - loading tensors from models/embeddings/unrealisticdream.pt
ggml_new_object: not enough space in the context's memory pool (needed 57216, available 32768)
[DEBUG] stable-diffusion.cpp:1529 - parse 'blurry embd:baddream' to [['blurry embd:baddream', 1], ]
[INFO]  model.cpp:641  - load models/embeddings/baddream.pt using checkpoint format
[DEBUG] model.cpp:1142 - init from 'models/embeddings/baddream.pt'
[DEBUG] model.cpp:1224 - loading tensors from models/embeddings/baddream.pt
ggml_new_object: not enough space in the context's memory pool (needed 106368, available 32768)
```

@FSSRepo
Contributor Author

FSSRepo commented Dec 29, 2023

@diimdeep As mentioned in the description, excessively large embeddings are not supported. I don't know how A1111 or ComfyUI manage very long prompts. They could be truncated, something like splitting the embedding in half and only including the first half, IDK?

@leejet
Owner

leejet commented Dec 30, 2023

Any suggestions to improve this?

I think we can take a cue from how sd-webui operates. Upon detecting that a token in the prompt corresponds to a local embedding file, we can replace the token with its embedding rather than explicitly specifying it using embd:. By the way, I'm currently refactoring the project, and once it's done, it should support long prompts.
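A minimal sketch of that idea (hypothetical helper names; a real implementation would scan the files in --embd-dir rather than take a prebuilt set of names):

```cpp
#include <set>
#include <sstream>
#include <string>
#include <vector>

// Returns the comma-separated prompt fragments that match a known
// embedding name, so they can be swapped for the embedding's token
// vectors instead of being tokenized as plain text.
std::vector<std::string> find_embedding_refs(
        const std::string& prompt,
        const std::set<std::string>& known_embeddings) {
    std::vector<std::string> refs;
    std::stringstream ss(prompt);
    std::string part;
    while (std::getline(ss, part, ',')) {
        // trim surrounding whitespace
        size_t b = part.find_first_not_of(" \t");
        size_t e = part.find_last_not_of(" \t");
        if (b == std::string::npos) continue; // skip empty fragments
        part = part.substr(b, e - b + 1);
        if (known_embeddings.count(part)) refs.push_back(part);
    }
    return refs;
}
```

With this, a prompt like "EasyNegative, bad image" would pick up the EasyNegative embedding automatically, with no `embd:` prefix needed.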

@FSSRepo
Contributor Author

FSSRepo commented Jan 2, 2024

@leejet I am thinking of merging this pull request with #131 to incorporate the new refactoring changes all at once.

@Green-Sky
Contributor

Green-Sky commented Jan 2, 2024

@FSSRepo @slaren Since adding the sync AFTER the set fixes the issue, I'm pretty sure slow-ish PCIe speeds or similar break it. From what I can find online, the sync needs to happen after the memcpy, since a different stream(?) might already be accessing the data before it is ready.

edit: feels like this sync should happen after instead of before https://github.com/ggerganov/ggml/blob/fca1caafea7de9fbd7efc733b9818f9cf2da3050/src/ggml-cuda.cu#L9684C41-L9684C41
Currently it only protects against "writing while still copying to it".

@slaren

slaren commented Jan 2, 2024

That's probably the cause. I expected cudaMemcpy to be synchronous in all cases, but that's not what happens. The documentation says this:

For transfers from pageable host memory to device memory, a stream sync is performed before the copy is initiated. The function will return once the pageable buffer has been copied to the staging memory for DMA transfer to device memory, but the DMA to final destination may not have completed.

So cudaMemcpy may indeed return before the copy is completed, and a slow PCIe bus could cause the DMA transfer to not complete before the kernel is launched.
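In other words (a hedged code fragment illustrating the pattern under discussion, not the actual ggml-cuda.cu code; `dst`, `src`, `size` are placeholders), the safe ordering is to synchronize after the copy rather than before it:

```cpp
// Pageable host -> device: cudaMemcpy may return once the data reaches
// the staging buffer, while the DMA into `dst` is still in flight.
cudaMemcpy(dst, src, size, cudaMemcpyHostToDevice);
// Synchronizing here guarantees the DMA has finished before a kernel,
// possibly on another stream, reads `dst`.
cudaDeviceSynchronize();
```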

@leejet
Owner

leejet commented Jan 2, 2024

@leejet I am thinking of merging this pull request with #131 to incorporate the new refactoring changes all at once.

Great, I think this will reduce some workload.

@FSSRepo
Contributor Author

FSSRepo commented Jan 2, 2024

These changes were merged into #131.
