Skip to content

JulesGM/Marg-Li-CoT

Repository files navigation

Using Reinforcement Learning to Guide Chains of Thought

Sets of TRL and/or SFT jobs:

Launch jobs with

./job_sets/launch_sets.py <job_set_name>   

Check the status with:

./job_sets/check_status.py

With TRL:

Where the reinforcement learning is located.

./with_trl/launch.py <experiment_name>

Approach SFT:

./approach_sft/launch.py <experiment_name>

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published