We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问Morvan, DQN的代码中,计算q_target时,是否未考虑done为True的情况,即q_target = Reward? 存储在Replay memory中的经验也未包含done。请问为什么呢?
The text was updated successfully, but these errors were encountered:
请问有想到怎么考虑done=True的情况吗,如果在memory里存储经验包含done,那怎么解决随机取batch_size得到两个及以上done的情况?
Sorry, something went wrong.
No branches or pull requests
请问Morvan, DQN的代码中,计算q_target时,是否未考虑done为True的情况,即q_target = Reward?
存储在Replay memory中的经验也未包含done。请问为什么呢?
The text was updated successfully, but these errors were encountered: