Guided decoding integration for autolabel #898

DhruvaBansal00 · 2024-09-11T08:52:08Z

Pull Review Summary

Description

A summary of the change. Please also include relevant motivation and context. This could include links to any docs, Slack threads, GitHub issues, or other artifacts.

Type of change

Bug fix (change which fixes an issue)
New feature (change which adds functionality)
This change requires a documentation update

Tests

Please describe the tests that you ran to verify your changes. This could include a test plan you executed locally, unit tests/integration tests that were run to verify the change works as expected.

Make sure to include screenshots, API responses, log statements, etc. that show the test succeeding.

Put closes #XXXX in your comment to auto-close the issue that your PR addresses.

DhruvaBansal00 · 2024-09-30T20:27:26Z

@nihit @tuxracer heads up here: we have to remove all instances of additionalProperties from the supplied JSON Schema, for refuel models only, due to a restriction in lm-format-enforcer (the backend we use for guided decoding). However, since OpenAI requires us to send this parameter (https://platform.openai.com/docs/guides/structured-outputs/supported-schemas), we can't just drop it everywhere. I added methods that remove it after schema generation. Flagging this mostly because this small thing breaks inference, and we should push lm-format-enforcer to fix it so that client-side code is cleaner.

Issue in lm-format-enforcer being tracked here: noamgat/lm-format-enforcer#129
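
For context, here is a minimal sketch of the kind of post-generation cleanup described above. It is not the code merged in this PR: the function name `strip_additional_properties` and the example schema are hypothetical, and it simply assumes the schema is a plain nested dict/list structure. It returns a copy with every `additionalProperties` key removed, so the original schema can still be sent unchanged to OpenAI.

```python
from typing import Any


def strip_additional_properties(schema: Any) -> Any:
    """Return a copy of `schema` with every "additionalProperties" key removed.

    Hypothetical helper for illustration; walks dicts and lists recursively
    and leaves the input object untouched.
    """
    if isinstance(schema, dict):
        return {
            key: strip_additional_properties(value)
            for key, value in schema.items()
            if key != "additionalProperties"
        }
    if isinstance(schema, list):
        return [strip_additional_properties(item) for item in schema]
    return schema


# Example schema (made up for this sketch) in the shape OpenAI expects,
# with additionalProperties set on every object.
openai_schema = {
    "type": "object",
    "properties": {
        "label": {"type": "string"},
        "address": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
            "additionalProperties": False,
        },
    },
    "required": ["label", "address"],
    "additionalProperties": False,
}

# Cleaned copy for the refuel / lm-format-enforcer path.
refuel_schema = strip_additional_properties(openai_schema)
```

One caveat with this naive approach: a schema that defines a property literally named `additionalProperties` would lose that property too, so a production version may need to skip the keys directly under `properties`.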

DhruvaBansal00 added 15 commits September 11, 2024 01:47

WIP guided decoding integration for autolabel

70dc345

Removing JSON mode

c360daa

Removing multilabel confidence (regression) and label selector (currently unusable)

6ec9d37

Cleaning up imports

2397b04

rm logit bias

c55dc0a

Latest reqs

22c233f

Anthropic and google reqs

2118484

latest reqs

5689dc7

rm query params from openai

a2794c3

logprobs and top logprobs as primary keys

0839f77

Sending json schema directly

3ff29c0

Use provided schema

8cee242

merge conf

6435300

Merge conflicts

ff10e7b

fmt

f2e85a7

DhruvaBansal00 marked this pull request as ready for review September 27, 2024 19:10

DhruvaBansal00 added 5 commits September 27, 2024 14:06

tests

fe52394

Passing tests

656a4fe

fmt

7c4f478

rm error log

b147e0b

smaller test file

7161a35

DhruvaBansal00 requested review from rajasbansal, yadavsahil197 and nihit and removed request for yadavsahil197 September 27, 2024 21:58

nihit approved these changes Sep 29, 2024

Remove additionalProperties recursively

00d448e

fmt

762d5f5

DhruvaBansal00 changed the title ~~WIP guided decoding integration for autolabel~~ Guided decoding integration for autolabel Sep 30, 2024

DhruvaBansal00 merged commit d12ccc0 into main Sep 30, 2024
2 checks passed

DhruvaBansal00 deleted the guided-generation branch September 30, 2024 22:56
