Fix PrioritizedReplayBuffer filtering #232

alexandrelarouche · 2025-01-22T16:26:08Z

Fix error where add() crashes if no trajectory is better than buffer trajectories.

Specifically, the following lines gave results which couldn't be handled because the Trajectories object was empty:

                batch = training_objects.last_states.tensor.float()
                batch_dim = training_objects.last_states.batch_shape[0]
                try:
                    batch_batch_dist = torch.cdist(
                        batch.view(batch_dim, -1).unsqueeze(0),
                        batch.view(batch_dim, -1).unsqueeze(0),
                        p=self.p_norm_distance,
                    ).squeeze(0)

Fix is trivial: flow control out of the function if no trajectories should be added to the buffer.

Fix error where add crashes if no trajectory is better than buffer trajectories.

josephdviviano

awesome - thanks for the fix!

Fix PrioritizedReplayBuffer filtering

1649660

Fix error where add crashes if no trajectory is better than buffer trajectories.

josephdviviano approved these changes Jan 24, 2025

View reviewed changes

josephdviviano merged commit 59a1efa into GFNOrg:master Jan 24, 2025
4 checks passed

alexandrelarouche deleted the buffer_filter_fix branch January 27, 2025 16:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix PrioritizedReplayBuffer filtering #232

Fix PrioritizedReplayBuffer filtering #232

alexandrelarouche commented Jan 22, 2025

josephdviviano left a comment

Fix PrioritizedReplayBuffer filtering #232

Fix PrioritizedReplayBuffer filtering #232

Conversation

alexandrelarouche commented Jan 22, 2025

josephdviviano left a comment

Choose a reason for hiding this comment