Collect unpaired preference feedback #395

Open
Tracked by #392
RobotSail opened this issue Dec 7, 2024 · 0 comments
Labels
enhancement New feature or request
Milestone

Comments

@RobotSail
Member

RobotSail commented Dec 7, 2024

To further fine-tune language models and align them with human preferences, we need to collect preference feedback on model responses. There are two types of preference data to collect: unpaired and paired.

Unpaired preference data is used for unpaired preference optimization, as described in the paper "UPO: Unpaired Preference Optimization for Large Language Models".

To collect data for this form of fine-tuning, we want to add thumbs up/down buttons to each assistant response. When the user presses one of these buttons, we want to record the following information:

  • immediate model response
  • previous user message
  • conversation ID
  • model ID
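
As a rough illustration of the record described above, here is a minimal Python sketch. Everything in it is hypothetical and not taken from the existing codebase: the `Message` and `FeedbackRecord` types, the `build_feedback_record` helper, the field names, and the sample IDs. The `rating` field is also an assumption; it is not in the list above, but the thumbs up/down value presumably needs to be stored alongside the other fields. The sketch assumes the conversation is available as an ordered list of messages with `user` and `assistant` roles.

```python
from dataclasses import dataclass
from typing import Literal, Sequence

# Assumption: the feedback value is stored as a simple "up"/"down" label.
Rating = Literal["up", "down"]


@dataclass(frozen=True)
class Message:
    role: str  # "user" or "assistant"
    content: str


@dataclass(frozen=True)
class FeedbackRecord:
    response: str         # immediate model response that was rated
    prompt: str           # previous user message
    conversation_id: str
    model_id: str
    rating: Rating        # assumption: not listed in the issue, but implied by the buttons


def build_feedback_record(
    messages: Sequence[Message],
    response_index: int,
    conversation_id: str,
    model_id: str,
    rating: Rating,
) -> FeedbackRecord:
    """Build one unpaired feedback record for the assistant message at response_index."""
    target = messages[response_index]
    if target.role != "assistant":
        raise ValueError("feedback can only be attached to an assistant response")

    # Walk backwards from the rated response to the nearest preceding user message.
    for msg in reversed(messages[:response_index]):
        if msg.role == "user":
            return FeedbackRecord(
                response=target.content,
                prompt=msg.content,
                conversation_id=conversation_id,
                model_id=model_id,
                rating=rating,
            )
    raise ValueError("no user message precedes the rated response")


# Usage: the user gives a thumbs up to the second assistant response.
conversation = [
    Message("user", "What is the capital of France?"),
    Message("assistant", "Paris."),
    Message("user", "And of Italy?"),
    Message("assistant", "Rome."),
]
record = build_feedback_record(conversation, 3, "conv-123", "example-model", "up")
```

Each record stands on its own: it pairs a single response with a single label, so no second response is needed for comparison, which is what makes the feedback unpaired.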

For instance, consider how the UI appears in the following popular chat assistant:

Example of a response with thumbs up/down buttons:

image

Example of the thumbs up/down buttons:
image

This issue depends on #13 and #394.

@vishnoianil vishnoianil added the enhancement New feature or request label Dec 17, 2024
@vishnoianil vishnoianil added this to UI Dec 17, 2024
@vishnoianil vishnoianil added this to the release-1.2 milestone Dec 17, 2024
@vishnoianil vishnoianil moved this to Backlog in UI Dec 17, 2024
Projects
Status: Backlog
Development

No branches or pull requests

2 participants