
Add support for stop sequences to HF models #1188

Merged 5 commits into UKGovernmentBEIS:main on Jan 25, 2025

Conversation

MikhailTerekhov
Contributor

This PR contains:

  • New features
  • Changes to dev-tools e.g. CI config / github tooling
  • Docs
  • Bug fixes
  • Code refactor

What is the current behavior? (You can also link to an open issue here)

Currently, HF models do not support stop sequences, even though the underlying functionality is present in HuggingFace Transformers' model.generate. I was not sure whether this is a bug or a missing feature, so I marked both.

What is the new behavior?

This PR adds support for stop_seqs to HF models, along with a test that checks that stop sequences now work. One can also confirm that the test fails before the commit that fixes the issue.
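For context, the post-processing that stop-sequence support typically requires can be sketched as a small helper that truncates generated text at the earliest stop sequence. This is a hypothetical illustration, not the PR's actual code; in HF Transformers itself, stopping can also be enforced during decoding via `StoppingCriteria` passed to `model.generate`.

```python
def truncate_at_stop_seqs(text: str, stop_seqs: list[str], include_stop: bool = False) -> str:
    """Cut `text` at the earliest occurrence of any stop sequence.

    Hypothetical sketch: generation may overshoot a stop sequence by a few
    tokens (criteria are checked per decoding step), so the output is
    trimmed afterwards. `include_stop` keeps the matched sequence itself.
    """
    cut = len(text)
    for seq in stop_seqs:
        idx = text.find(seq)
        if idx != -1:
            cut = min(cut, idx + (len(seq) if include_stop else 0))
    return text[:cut]
```

For example, `truncate_at_stop_seqs("Hello\nWorld", ["\n"])` returns `"Hello"`, and text containing none of the stop sequences is returned unchanged.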

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

No

Other information:

I also discovered that when a chat template is provided to get_model with an HF model, it is not propagated to the tokenizer. I therefore had to patch the tokenizer separately when creating the test fixture model, to avoid the default template applied here:

for message in hf_messages:

If this is not the desired behavior, I can add a PR to fix that too. It would be useful for me to be able to provide custom chat templates to HF models.
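To illustrate why template propagation matters: the template controls how a list of role/content messages is rendered into the prompt string. HF tokenizers store a Jinja template on `tokenizer.chat_template` and render it with `apply_chat_template`; the stdlib-only stand-in below (a hypothetical helper, not the library's code) mimics that rendering for a trivial template.

```python
def render_chat(messages: list[dict], template: str = "{role}: {content}\n") -> str:
    """Render chat messages with a simple format-string template.

    Hypothetical stand-in for HF's apply_chat_template: each message is a
    dict with "role" and "content" keys, and the template decides how the
    final prompt string looks. A custom template therefore changes model
    inputs, which is why it must reach the tokenizer.
    """
    return "".join(template.format(**m) for m in messages)
```

With the real library, the fix being suggested would amount to assigning the user-supplied template to `tokenizer.chat_template` before `apply_chat_template` is called, so the custom template takes precedence over the default.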

@jjallaire
Collaborator

@MikhailTerekhov thanks for this! I noticed a MyPy error in CI. Could you take a look?

@jjallaire jjallaire merged commit 8863e1f into UKGovernmentBEIS:main Jan 25, 2025
9 checks passed
@MikhailTerekhov
Contributor Author

@jjallaire I'm not sure if the chat template behavior I mentioned in the PR is the desired one. Could you please let me know if a fix is needed there?
