[TD3] 5. Addressing Variance #5

Open

HyeAnn-Lee opened this issue Aug 8, 2023 · 0 comments

HyeAnn-Lee commented Aug 8, 2023

rl-paper-review/reviews/TD3.md

Line 191 in e838336

    
           As shown in the earlier paper by Sutton & Barto, reducing the learning speed can reduce variance, but it can also affect actual performance. This paper re-emphasizes **the importance of minimizing estimation error** and aims to **present an improved actor-critic training procedure that reduces variance by relating the target network to estimation error**.

Sutton & Barto, 1998
- This is a book, not a paper :) It is often called the textbook of reinforcement learning. The second edition came out in 2018 (https://a.co/d/fv54CBn)
