generate synthetic examples #395

Merged
merged 2 commits into from
Aug 15, 2024
@@ -74,6 +74,19 @@ Select a name and description for the dataset, and then confirm that the inferre

![Confirm Columns](../static/select_columns.png)

## Generate synthetic examples

For a dataset with a specified schema, you can generate synthetic examples to expand your dataset.

1. **Select few-shot examples**: Choose a set of examples to guide the LLM's generation. You can manually select these examples from your dataset or use the automatic selection option.
2. **Specify the number of examples**: Enter the number of synthetic examples you want to generate.
3. **Configure API Key**: Enter your OpenAI API key using the "API Key" link.
![Generate Synthetic Examples](../static/generate_synthetic_examples_create.png)

After you click "Generate", the examples appear on the page. You can choose which examples to add to your dataset and edit them before adding them.
Each example is validated against your specified dataset schema and tagged as "synthetic" in its source metadata.
![Generate Synthetic Examples](../static/generate_synthetic_examples_pane.png)
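
The validate-then-tag step described above can be pictured with a minimal Python sketch. This is not LangSmith's implementation: the simplified field-to-type schemas, the helper names `matches_schema` and `prepare_synthetic_example`, and the `{"source": "synthetic"}` metadata layout are all assumptions made for illustration.

```python
from typing import Any, Optional

# Hypothetical, simplified schemas: field name -> expected Python type.
# Real dataset schemas are richer; this only illustrates the idea.
INPUT_SCHEMA: dict[str, type] = {"question": str}
OUTPUT_SCHEMA: dict[str, type] = {"answer": str}


def matches_schema(record: dict[str, Any], schema: dict[str, type]) -> bool:
    """Return True if the record has exactly the schema's fields with the expected types."""
    if set(record) != set(schema):
        return False
    return all(isinstance(record[name], expected) for name, expected in schema.items())


def prepare_synthetic_example(
    inputs: dict[str, Any], outputs: dict[str, Any]
) -> Optional[dict[str, Any]]:
    """Validate a generated example and tag it as synthetic; return None if it is invalid."""
    if not (matches_schema(inputs, INPUT_SCHEMA) and matches_schema(outputs, OUTPUT_SCHEMA)):
        return None
    # The metadata key and value are assumptions modeled on the description above.
    return {"inputs": inputs, "outputs": outputs, "metadata": {"source": "synthetic"}}
```

In this sketch, a generated example that does not match the schema is dropped rather than repaired, which keeps the dataset consistent at the cost of discarding some generations.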

## Export a dataset

You can export your LangSmith dataset to CSV or OpenAI evals format directly from the web application.
1 change: 1 addition & 0 deletions versioned_docs/version-2.0/how_to_guides/index.md
@@ -92,6 +92,7 @@ Manage datasets in LangSmith to evaluate and improve your LLM applications.
- [Dataset schema validation](./how_to_guides/datasets/manage_datasets_in_application#dataset-schema-validation)
- [Add inputs and outputs from traces to datasets](./how_to_guides/datasets/manage_datasets_in_application#add-inputs-and-outputs-from-traces-to-datasets)
- [Upload a CSV file to create a dataset](./how_to_guides/datasets/manage_datasets_in_application#upload-a-csv-file-to-create-a-dataset)
- [Generate synthetic examples](./how_to_guides/datasets/manage_datasets_in_application#generate-synthetic-examples)
- [Export a dataset](./how_to_guides/datasets/manage_datasets_in_application#export-a-dataset)
- [Create and manage dataset splits](./how_to_guides/datasets/manage_datasets_in_application#create-and-manage-dataset-splits)
- [Manage datasets programmatically](./how_to_guides/datasets/manage_datasets_programmatically)