Why is the length of the reward limited to 3 or more? #355
Replies: 3 comments
-
I assume this is here (could you please use the permalink feature like I did - easier to follow) Do you mean that you have cases where for every module in a particular corpus sample, there are at most 2 decisions? If there are hangs, I believe the right fix would be to handle that graciously, because there could well be modules where there are no decisions, so we should skip over and retry (i.e. resample). Probably in The value "2" there IIRC was an optimization - i.e. there was little to gain from such short trajectories. |
Beta Was this translation helpful? Give feedback.
-
Thank you! As you say, this happens when there are at most two decisions on each object. |
Beta Was this translation helpful? Give feedback.
-
Sounds good, and patches are very welcome! |
Beta Was this translation helpful? Give feedback.
-
The following filtering appears to truncate data when more than two observations are not available for each object.
In our experiments, in cases where one or two optimizations are performed on each object, all data may be filtered, causing hangs in subsequent processing. Do you know why this filter is set? Is it OK to set the filter condition arbitrary?
Beta Was this translation helpful? Give feedback.
All reactions