Replies: 1 comment
-
Using |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello 👋
I'm using BindsNET to train an SNN to solve the CartPole problem. Since I want a finer control of the encoding and rewards, I've opted out of using the pipeline.
Here's the structure of my network:
I noted that whenever I do
the
reward
is applied indiscriminately to all the weights ofhidden_out
(all increase/decrease), not taking into account which output neuron caused thatreward
.How can I specify which connection weights to update? Do I have to break the output layer into two separate ones? Bonus: How can I propagate this causal update through a deep network?
Beta Was this translation helpful? Give feedback.
All reactions