Add local search sampler #208

Merged · 19 commits from hyeok9855/local-search into master · Jan 13, 2025

Conversation

@hyeok9855 (Collaborator) commented Oct 29, 2024

This PR adds LocalSearchSampler.
The local search is based on the works [1] and [2].

Test in hypergrid env:

python tutorials/examples/train_hypergrid_simple_ls.py
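
Roughly, the local search move is: backtrack K steps from a terminal state with the backward policy, reconstruct a candidate with the forward policy, and accept or reject the candidate (greedily by reward, or with a Metropolis-Hastings correction). Below is a minimal, self-contained sketch of one such round; the `backtrack`/`reconstruct` callables and the exact log-probability bookkeeping are illustrative assumptions, not the actual LocalSearchSampler API:

    import torch

    def local_search_step(x, log_reward, backtrack, reconstruct, use_mh=True):
        # Backtrack K steps from each terminal state x to a partial state y.
        # log_pb_old = log p_B(y | x); log_pf_old = log p_F(x | y), i.e. the
        # reversed backward path scored under the forward policy.
        y, log_pb_old, log_pf_old = backtrack(x)
        # Rebuild a candidate terminal state x' from y with the forward policy.
        x_new, log_pf_new, log_pb_new = reconstruct(y)
        if use_mh:
            # Metropolis-Hastings acceptance in log space:
            # A = min(1, R(x') p_B(y|x') p_F(x|y) / (R(x) p_B(y|x) p_F(x'|y)))
            log_accept = (log_reward(x_new) + log_pb_new + log_pf_old
                          - log_reward(x) - log_pb_old - log_pf_new)
            accept = torch.rand_like(log_accept).log() < log_accept
        else:
            # Greedy filtering: keep the candidate only if it improves the reward.
            accept = log_reward(x_new) > log_reward(x)
        return torch.where(accept.unsqueeze(-1), x_new, x), accept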

@hyeok9855 hyeok9855 self-assigned this Oct 29, 2024
@hyeok9855 hyeok9855 marked this pull request as draft October 29, 2024 11:03
@hyeok9855 hyeok9855 force-pushed the hyeok9855/local-search branch 2 times, most recently from c2d59b3 to c6b9f64 Compare October 29, 2024 17:15
@hyeok9855 hyeok9855 force-pushed the hyeok9855/local-search branch from c6b9f64 to 1ccf16c Compare October 29, 2024 18:21
@josephdviviano (Collaborator)

@hyeok9855 can you fix the merge conflicts?

@josephdviviano (Collaborator)

I noticed you are using force-push -- be careful with this: it can put the code in a state that is hard to reconcile with the rest of the work.

https://www.gitkraken.com/learn/git/problems/git-push-force#:~:text=The%20Risks%20of%20Git%20Push%20Force&text=Because%20you%20have%20failed%20to,deleting%20your%20team%20member's%20work.

@hyeok9855 (Author)

I fixed an issue in the backward mask!

Is there anything else that needs to be done next?

@hyeok9855 hyeok9855 changed the title [Draft] Add local search sampler Add local search sampler Nov 29, 2024
@hyeok9855 hyeok9855 requested a review from younik November 29, 2024 18:57
@hyeok9855 hyeok9855 added the enhancement New feature or request label Nov 29, 2024
@hyeok9855 hyeok9855 marked this pull request as ready for review November 29, 2024 18:58
@saleml saleml self-requested a review December 3, 2024 16:05
@josephdviviano (Collaborator) left a comment

Hey @hyeok9855 - please see my comments. This is a really awesome PR; I like the changes you made to trajectories, and with some tweaks the local search sampler looks very clean.

Let me know if you want to schedule a pair programming session.

@staticmethod
def reverse_backward_trajectories(trajectories: Trajectories) -> Trajectories:
"""Reverses a backward trajectory"""
# FIXME: This method is not compatible with continuous GFN.
Collaborator

What's the major blocker here? Anyone know?

hyeok9855 (Author)

I'm not sure either... This was from here.

hyeok9855 (Author)

One guess is this:

In lines 436-443:

        new_actions = torch.full(
            (
                trajectories.max_length + 1,
                len(trajectories),
                *trajectories.actions.action_shape,
            ),
            -1,
        )

Also, in lines 461-463:

            new_actions[trajectories.when_is_done[i], i] = (
                trajectories.env.n_actions - 1
            )

These assume that the action is an integer, which is not true in the continuous case, right?
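
For illustration, a minimal sketch of the difference (the continuous-action convention shown here, a dedicated float fill value, is an assumption for the example, not the library's actual handling; presumably this is why the env carries dedicated self.dummy_action / self.exit_action tensors):

    import torch

    # Discrete case: actions are integer indices, so -1 is a safe dummy fill
    # and `n_actions - 1` conventionally encodes the exit action.
    discrete_actions = torch.full((10, 4), -1, dtype=torch.long)

    # Continuous case: actions are float vectors of shape (..., action_dim);
    # there is no integer index to write, so dummy/exit markers must be
    # dedicated tensors (e.g. filled with -inf), not `n_actions - 1`.
    action_dim = 2
    continuous_actions = torch.full((10, 4, action_dim), float("-inf"))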

saleml (Collaborator)

The blocker: see my response to line 462 of this file in the PR.

saleml (Collaborator)

this function will not work on non-discrete environments!


# FIXME: Can we vectorize this?
# FIXME: Also, loop over batch or sequence?
for i in range(len(trajectories)):
Collaborator

Can we flip the full trajectory tensor in one call, and then use indexing to resolve the padding instead? It should be much faster.

hyeok9855 (Author)

Of course, yes. I'll check and let you know if I need your help.
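
One way to do this (a sketch with hypothetical names, not the code this PR converged on): compute, for every position of the reversed tensor, the source index `lengths - 1 - t`, and resolve the whole batch with a single gather:

    import torch

    def reverse_padded(actions: torch.Tensor, lengths: torch.Tensor) -> torch.Tensor:
        # Reverse each trajectory in a (max_len, batch) tensor padded with -1,
        # where lengths[i] is the number of valid steps in column i.
        max_len = actions.shape[0]
        steps = torch.arange(max_len, device=actions.device).unsqueeze(1)  # (max_len, 1)
        idx = (lengths.unsqueeze(0) - 1) - steps                           # (max_len, batch)
        valid = idx >= 0
        rev = torch.full_like(actions, -1)
        # gather picks actions[lengths[i] - 1 - t, i]; clamp keeps indices legal,
        # and the mask discards the out-of-range (padding) positions.
        rev[valid] = actions.gather(0, idx.clamp(min=0))[valid]
        return rev

    # Example: three trajectories of lengths 3, 2, 1.
    actions = torch.tensor([[0, 0, 0], [1, 1, -1], [2, -1, -1]])
    lengths = torch.tensor([3, 2, 1])
    print(reverse_padded(actions, lengths))
    # tensor([[ 2,  1,  0],
    #         [ 1,  0, -1],
    #         [ 0, -1, -1]])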

@@ -61,6 +61,8 @@ def __init__(
self.dummy_action = dummy_action
self.exit_action = exit_action

# Warning: don't use self.States or self.Actions to initialize an instance of the class.
Collaborator

Who is this warning intended for?

hyeok9855 (Author)

Maybe us?? Regarding this, what about making them into private variables (e.g., self.__States and self.__Actions)??

saleml (Collaborator)

It seems like we should not initialize a States object using self.States, but rather self.states_from_tensor, as in line 251 of src/gfn/gym/discrete_ebm.py.
I agree with the general sentiment here. Should we actually raise a warning when self.States is used? Or is there a way to prevent it?
I agree with @hyeok9855's comment as well.
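
One way to raise such a warning (a sketch, not something this PR implements): store the class under a private name and expose `States` as a property that warns on access:

    import warnings

    class Env:
        def __init__(self, states_class):
            self._States = states_class  # stored under a private name

        @property
        def States(self):
            warnings.warn(
                "Prefer env.states_from_tensor(...) over env.States(...) "
                "for instantiating states.",
                stacklevel=2,
            )
            return self._States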

)

# Calculate the forward probability if needed (Metropolis-Hastings).
prev_trajectories = Trajectories.reverse_backward_trajectories(
Collaborator

Maybe, for clarity, prev_forward_trajectories or reversed_backward_trajectories, since you also operate on trajectories below.

hyeok9855 (Author)

I wanted to make this correspond to the new_trajectories. (Please check my comment below.)

I think adding a short explanation of why this is called prev_trajectories would be fine, e.g.,

        # By reversing the backward trajectories, obtain the forward trajectories.
        # This is called `prev_trajectories` since they are the trajectories before
        # the local search. The `new_trajectories` will be obtained by performing local
        # search on them.
        prev_trajectories = Trajectories.reverse_backward_trajectories(
            backward_trajectories
        )

What do you think about this??

prev_trajectories = Trajectories.reverse_backward_trajectories(
backward_trajectories
)
prev_trajectories_log_rewards = trajectories.log_rewards
Collaborator

Should this be prev_trajectories? I actually think it does not matter, since you're just looking at the reward at the end of the trajectory, but a comment here would be clarifying, because above prev_trajectories refers to the reverse of a trajectory sampled from pb, and here you're grabbing log_rewards directly from the forward trajectories.

hyeok9855 (Author)

For why this is prev_trajectories, check lines 450-456. I wanted that part to be new... +/- prev....
To alleviate the confusion, we can simply change this line to

prev_trajectories_log_rewards = prev_trajectories.log_rewards

n_back = backward_trajectories.when_is_done[i] - K[i]

# Sanity check
assert (
Collaborator

I think it would be good to move this into a test, to prevent the assertion from running so often in production.

hyeok9855 (Author)

Is it possible to test with the local variables of a function?
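
One pattern that would allow that (a sketch; the helper name is hypothetical, and only the n_back computation above is from the PR): factor the checked quantity out of the loop into a small pure function, assert on it in the test suite, and keep the hot path assertion-free:

    import torch

    def compute_n_back(when_is_done: torch.Tensor, K: torch.Tensor) -> torch.Tensor:
        # Number of backward steps to replay for each trajectory.
        return when_is_done - K

    def test_n_back_is_non_negative():
        when_is_done = torch.tensor([5, 3, 7])
        K = torch.tensor([2, 3, 1])
        assert torch.all(compute_n_back(when_is_done, K) >= 0)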

device=device, dtype=torch.float
)

for i in range(bs): # FIXME: Can we vectorize this?
Collaborator

I think this should be vectorized -- if you need help, no problem, let's schedule a pair programming session (maybe in the evening EST instead of my early morning ;) ).

hyeok9855 (Author)

Let me check first! I will let you know :)

@younik (Collaborator) left a comment

Good job, the code is very well written :)

@saleml (Collaborator) left a comment

great work


# FIXME: Can we vectorize this?
# FIXME: Also, loop over batch or sequence?
for i in range(len(trajectories)):
new_actions[trajectories.when_is_done[i], i] = (
saleml (Collaborator)

The problem is here. Actions are not always integers.


@@ -103,23 +104,34 @@ def get_trajectory_pfs(
valid_actions.tensor
) # Using the actions sampled off-policy.

log_pf_trajectories = torch.full_like(
trajectories.actions.tensor[..., 0],
saleml (Collaborator)

what's the rationale behind removing this?

hyeok9855 (Author)

This was just moved to line 78 to address the edge case in line 84!

@@ -145,13 +160,13 @@ def get_trajectory_pbs(
valid_states, estimator_outputs
).log_prob(valid_actions.tensor)

log_pb_trajectories = torch.full_like(
trajectories.actions.tensor[..., 0],
fill_value=fill_value,
saleml (Collaborator)

same question here

hyeok9855 (Author)

Same here :)

0, 1
) # shape (max_len + 2, n_trajectories, *state_dim)

# TODO: Add below into the test suite to ensure correctness
saleml (Collaborator)

thank you for handling the vectorization. did you test this?

hyeok9855 (Author)

Yes, I tested this by uncommenting it.
@josephdviviano, could you give any advice on how to design a test to check whether the vectorization works appropriately?

Collaborator

What about copying the pre-vectorization code into a test file and comparing the outputs of your new function and that code on a few HyperGrid + other env trajectories?
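
In that spirit, a self-contained sketch of such a comparison test (all names here are hypothetical; the PR's actual parametrized test is shown below): keep the pre-vectorization loop as a reference implementation and check the vectorized version against it on random padded batches:

    import torch

    def reverse_loop(actions, lengths):
        # Pre-vectorization reference: reverse each padded column in a Python loop.
        rev = torch.full_like(actions, -1)
        for i, n in enumerate(lengths.tolist()):
            rev[:n, i] = actions[:n, i].flip(0)
        return rev

    def reverse_vectorized(actions, lengths):
        # Single-gather version (same trick as sketched earlier in this thread).
        steps = torch.arange(actions.shape[0]).unsqueeze(1)
        idx = (lengths.unsqueeze(0) - 1) - steps
        valid = idx >= 0
        rev = torch.full_like(actions, -1)
        rev[valid] = actions.gather(0, idx.clamp(min=0))[valid]
        return rev

    def test_vectorized_matches_loop_reference():
        torch.manual_seed(0)
        max_len, bs = 12, 64
        lengths = torch.randint(1, max_len + 1, (bs,))
        actions = torch.full((max_len, bs), -1)
        for i, n in enumerate(lengths.tolist()):
            actions[:n, i] = torch.randint(0, 5, (n,))
        assert torch.equal(reverse_loop(actions, lengths),
                           reverse_vectorized(actions, lengths))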



@pytest.mark.parametrize("env_name", ["HyperGrid", "DiscreteEBM"])
def test_reverse_backward_trajectories(env_name: str):
saleml (Collaborator)

great

@hyeok9855 (Author)

FYI: In terms of L1 distance (empirical distribution vs. true distribution, as done in the original GFN paper), both LS-GFN (TB) with and without the Metropolis-Hastings correction outperform vanilla TB on the 16x16x16x16 HyperGrid (using the default hyperparameters):

  • TB: 2.571 × 10⁻⁵
  • TB + LS: 2.513 × 10⁻⁵
  • TB + LS + MH: 2.507 × 10⁻⁵

@saleml (Collaborator) commented Jan 11, 2025

LGTM.
@josephdviviano, please merge if you're satisfied with the changes.

@josephdviviano (Collaborator) left a comment

Sorry for the lag reviewing these changes - Salem and I think these contributions are excellent.

@josephdviviano josephdviviano merged commit 7f03681 into master Jan 13, 2025
4 checks passed