[TD3] 5. Addressing Variance #5

Open
HyeAnn-Lee opened this issue Aug 8, 2023 · 0 comments

Comments

@HyeAnn-Lee

As Sutton & Barto's earlier paper showed, slowing down learning can reduce variance, but it can also affect performance in practice. The TD3 paper re-emphasizes **the importance of minimizing estimation error** and aims to present **an improved actor-critic training procedure that reduces variance by linking the target network to estimation error**.

  • Sutton & Barto, 1998
    • It's actually a book, not a paper :) It is often called the textbook of reinforcement learning. The second edition came out in 2018 (https://a.co/d/fv54CBn)
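
To make the link between the target network and estimation error a bit more concrete, here is a minimal sketch of a slow-moving (Polyak-averaged) target update. It assumes parameters stored as plain floats in a dict; the names `soft_update`, `online`, `target`, and the `tau` values are illustrative and not taken from this issue. The idea is only that the target copy trails the online network, so the learning target changes slowly between updates.

```python
def soft_update(target, online, tau=0.005):
    """Move each target parameter a fraction `tau` toward its online counterpart.

    Hypothetical helper for illustration: both arguments are dicts that map
    parameter names to floats and share the same keys.
    """
    return {
        name: (1.0 - tau) * target[name] + tau * online[name]
        for name in target
    }


# Toy parameters (assumed values, not real network weights).
online = {"w": 1.0, "b": -2.0}
target = {"w": 0.0, "b": 0.0}

# One update moves the target only 10% of the way to the online values.
first_step = soft_update(target, online, tau=0.1)

# Repeated updates let the target converge smoothly toward the online network.
for _ in range(100):
    target = soft_update(target, online, tau=0.1)
```

A smaller `tau` makes the target change more slowly, which is the knob this kind of update exposes for trading stability against how quickly the target tracks the online network.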