-
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
update default custom_script parameter
- Loading branch information
Showing
2 changed files
with
34 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,33 @@ | ||
kaggle script runner | ||
### Kaggle Script Runner | ||
|
||
**Kaggle Script Runner** is a GitHub Action designed for automating the execution of custom scripts on Kaggle kernels, making it ideal for continuous integration (CI) and deployment (CD) of machine learning workflows directly within your GitHub repository. | ||
<br> | ||
This action pulls the repository on the branch it is executed from, installs dependencies from the `requirements.txt` file in the root directory by default, and supports executing Python scripts on Kaggle's infrastructure with configurable options, including GPU/TPU support, internet access, and custom dataset or kernel dependencies. | ||
|
||
#### Key Features | ||
- **Automated Kernel Execution**: Run Python scripts on Kaggle kernels from GitHub Actions, with options for enabling GPU, TPU, and internet. | ||
- **Flexible Configuration**: Specify kernel parameters like title, custom script, data sources, and sleep intervals for status checks. | ||
- **Dynamic Script Injection**: Accepts a custom script input that is added directly to the Kaggle notebook, making it versatile for various tests and jobs. | ||
- **CI/CD Integration**: Incorporate machine learning workflows seamlessly into your CI/CD pipeline, with GitHub Actions monitoring the kernel execution status and fetching output logs. | ||
- **Detailed Logging**: Captures and logs the complete output, including kernel execution logs and errors, and displays `data` fields for streamlined debugging. | ||
|
||
#### Inputs | ||
- **`username`** (required): Kaggle username. | ||
- **`key`** (required): Kaggle API token. | ||
- **`title`** (required): Title of execution run. | ||
- **`custom_script`** (required): Custom script content to execute within the notebook. Default: `print('Success')`. | ||
- **`enable_gpu`** (optional): Enable GPU on the Kaggle kernel. Default: `false`. | ||
- **`enable_tpu`** (optional): Enable TPU on the Kaggle kernel. Default: `false`. | ||
- **`enable_internet`** (optional): Enable internet access for the Kaggle kernel. Default: `true`. | ||
- **`dataset_sources`** (optional): List of dataset sources formatted as `{username}/{dataset-slug}`. | ||
- **`competition_sources`** (optional): List of competition data sources. | ||
- **`kernel_sources`** (optional): List of other kernel sources formatted as `{username}/{kernel-slug}`. | ||
- **`sleep_time`** (optional): Time interval (in seconds) to wait between kernel status checks. Default: `15`. | ||
|
||
#### Example Usage | ||
This GitHub Action fetches logs and kernel output after execution, making it ideal for: | ||
- Running model training or inference scripts in an automated fashion on Kaggle’s infrastructure. | ||
- Integrating Kaggle-based tests for datasets, models, or research pipelines as part of CI/CD workflows. | ||
- Leveraging Kaggle’s GPUs/TPUs for testing model performance and handling large data without local resource constraints. | ||
|
||
**Note**: To use this action, you must have a Kaggle API token with the appropriate permissions for kernels and datasets. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters