Embed Llama.cpp into Marsha for local usage #164

dfellis · 2023-08-07T16:03:59Z

Currently getting to the third stage with Llama 2 13B but failing there.


dfellis · 2023-08-07T19:17:54Z

70B took a lot longer (since only about a quarter of the layers could fit on my GPU), required different params to llama.cpp that weren't predictable in any way, and, unexpectedly, did a worse job than 13B in my test run.

Going back to 13B for the faster iteration speed (it's on par with OpenAI response times, if not the quality).


dfellis added 2 commits August 7, 2023 00:14

Embed llama.cpp into Marsha for local use

41aec5e

Running more than one llama.cpp call at a time tanks performance, so don't do that. Also some tweaks to the prompt and arg handling

6ae8090

dfellis self-assigned this Aug 7, 2023

dfellis requested review from depombo and aguillenv August 7, 2023 16:04

dfellis added 2 commits August 7, 2023 11:06

Run autopep8 on current code

79ef7f1

Get setup.py working

fe75ef3

dfellis added 3 commits August 7, 2023 16:00

A bit more reliable, I think

eb95bb2

I noticed a note that Llama v2 should use a special EPS value, and when I set it as suggested it generated the desired output for the first time!

f30cddc

Fix a minor formatting bug I found while analyzing the generated prompts and also lower the temperature because Llama v2 is a bit 'wilder' than ChatGPT

8222235
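As general background on why a lower temperature tames "wilder" output, the sketch below shows the standard temperature-scaled softmax over a set of logits. It illustrates the concept only; it is not llama.cpp's sampling code, and the function name and the logit values are made up for the example.

```python
import math
from typing import List


def softmax_with_temperature(logits: List[float], temperature: float) -> List[float]:
    """Turn logits into probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up logits for three candidate tokens
example_logits = [2.0, 1.0, 0.5]
default_probs = softmax_with_temperature(example_logits, 1.0)
cooler_probs = softmax_with_temperature(example_logits, 0.5)
```

With the lower temperature, more probability mass lands on the top-scoring token, so sampling strays less often into unlikely continuations.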
