POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning
This project provides the open source implementation of the POCE method introduced in the paper: "POCE: Primal Policy Optimization with Conservative Estimation for\ Multi-constraint Offline Reinforcement Learning" .
cd poce/
conda env create --file conda-recipe.yaml
conda activate poce
pip install -e .
cd poce/envs/safety-gymnasium-poce/
pip install -e .
Please download the dataset from the Google Drive address provided in the TXT file and place it in the current directory.
cd poce/dataset/
cd poce/
python script/train_policy.py