Skip to content

v0.5.0

Compare
Choose a tag to compare
@PaParaZz1 PaParaZz1 released this 05 Dec 05:04
· 89 commits to main since this release

Env

  1. add tabmwp env (#667)
  2. polish anytrading env issues (#731)

Algorithm

  1. add PromptPG algorithm (#667)
  2. add Plan Diffuser algorithm (#700) (#749)
  3. add new pipeline implementation of IMPALA algorithm (#713)
  4. add dropout layers to DQN-style algorithms (#712)

Enhancement

  1. add new pipeline agent for sac/ddpg/a2c/ppo and Hugging Face support (#637) (#730) (#737)
  2. add more unittest cases for model (#728)
  3. add collector logging in new pipeline (#735)

Fix

  1. fix logger middleware problems (#715)
  2. fix ppo parallel bug (#709)
  3. fix typo in optimizer_helper.py (#726)
  4. fix mlp dropout if condition bug
  5. fix drex collecting data unittest bugs

Style

  1. polish env manager/wrapper comments and API doc (#742)
  2. polish model comments and API doc (#722) (#729) (#734) (#736) (#741)
  3. polish policy comments and API doc (#732)
  4. polish rl_utils comments and API doc (#724)
  5. polish torch_utils comments and API doc (#738)
  6. update README.md and Colab demo (#733)
  7. update metaworld docker image

News

  1. NeurIPS 2023 Spotlight: LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
  2. OpenDILab + Hugging Face DRL Model Zoo link

Full Changelog: v0.4.9...v0.5.0

Contributors: @PaParaZz1 @zjowowen @AltmanD @puyuan1996 @kxzxvbk @Super1ce @nighood @Cloud-Pku @zhangpaipai @ruoyuGao @eltociear