fullscreen/realtime interface for samples #865
Merged
…t_ai into feature/realtime
HUGE!
jjallaire added a commit that referenced this pull request on Nov 21, 2024
This reverts commit 047de72.
jjallaire added a commit that referenced this pull request on Nov 21, 2024
This PR introduces a new fullscreen display UI that includes the traditional Inspect task view plus additional panes showing running samples and log/console output. Running samples have a live transcript along with the ability to cancel and score (or cancel raising an error for evals with `fail-on-error` policies):

The new UI is not currently enabled by default, but we'd like people to experiment with it and provide feedback before we make it the default (in ~ 2 weeks). You can enable fullscreen mode using the `--display` option or the `INSPECT_DISPLAY` environment variable:

The available values for the `display` option are:

- `full` - Fullscreen UI as shown above
- `rich` - Classic progress UI (currently the default)
- `plain` - No progress UI, but print a task summary at the end without using ANSI colors/formatting
- `none` - No display at all

We'll be working on several enhancements to fullscreen mode in the near future:

- `input_screen()` but async and with richer UI constructs available via textual
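As a rough illustration of the `--display` option and `INSPECT_DISPLAY` environment variable described above, here is a minimal Python sketch. The helpers `resolve_display` and `eval_command` and the task file name `task.py` are hypothetical and not part of Inspect. The sketch also assumes that an explicit flag value takes precedence over the environment variable, and that the option is passed to `inspect eval`; neither is confirmed by this PR description.

```python
import os
from typing import List, Mapping, Optional

# Display modes listed in this PR description.
DISPLAY_MODES = ("full", "rich", "plain", "none")


def resolve_display(
    cli_value: Optional[str] = None,
    environ: Optional[Mapping[str, str]] = None,
) -> str:
    """Hypothetical helper: choose a display mode from a flag value or the environment.

    Assumptions (not confirmed by the PR): an explicit --display value wins
    over INSPECT_DISPLAY, and "rich" is used when neither is set, since the
    PR describes "rich" as the current default.
    """
    env = os.environ if environ is None else environ
    value = cli_value or env.get("INSPECT_DISPLAY") or "rich"
    if value not in DISPLAY_MODES:
        raise ValueError(
            f"unknown display mode {value!r}; expected one of {DISPLAY_MODES}"
        )
    return value


def eval_command(task: str, display: str) -> List[str]:
    """Hypothetical helper: build an `inspect eval` command requesting a display mode."""
    if display not in DISPLAY_MODES:
        raise ValueError(
            f"unknown display mode {display!r}; expected one of {DISPLAY_MODES}"
        )
    return ["inspect", "eval", task, "--display", display]


if __name__ == "__main__":
    # Request the fullscreen UI for a hypothetical task file.
    mode = resolve_display("full")
    print(" ".join(eval_command("task.py", mode)))
```

The resulting argument list could be handed to `subprocess.run` to launch an eval. Setting `INSPECT_DISPLAY=full` in the environment and omitting the flag would be the other route mentioned in the description.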