Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Implement self-play for two-player zero-sum games #99

Open
5 tasks
RPegoud opened this issue Jul 8, 2024 · 0 comments
Open
5 tasks

[FEATURE] Implement self-play for two-player zero-sum games #99

RPegoud opened this issue Jul 8, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@RPegoud
Copy link
Contributor

RPegoud commented Jul 8, 2024

Description:
Add self-play versions of DQN and PPO for two-player zero-sum games in PGX environments.

Checklist:

  • Determine how to keep the value estimation consistent (e.g. flip the board or reverse the discount for opponent values)
  • Add PGX environment configs
  • Implement self-play for DQN
  • And for PPO
  • (optional) If possible, for AlphaZero
@RPegoud RPegoud added the enhancement New feature or request label Jul 8, 2024
@RPegoud RPegoud changed the title [FEATURE] Implement self-play for DQN and PPO [FEATURE] Implement self-play for two-player zero-sum games Jul 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant