Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DQN的代码中,计算q_target时未考虑done为true的情况 #200

Open
ananasfl opened this issue Apr 20, 2022 · 1 comment
Open

DQN的代码中,计算q_target时未考虑done为true的情况 #200

ananasfl opened this issue Apr 20, 2022 · 1 comment

Comments

@ananasfl
Copy link

请问Morvan, DQN的代码中,计算q_target时,是否未考虑done为True的情况,即q_target = Reward?
存储在Replay memory中的经验也未包含done。请问为什么呢?

@ccconquer
Copy link

请问有想到怎么考虑done=True的情况吗,如果在memory里存储经验包含done,那怎么解决随机取batch_size得到两个及以上done的情况?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants