syncing notebook states

GFNOrg · Nov 27, 2023 · cd35cb8 · cd35cb8
1 parent 12eab45
commit cd35cb8
Show file tree

Hide file tree

Showing 3 changed files with 5 additions and 5 deletions.
diff --git a/tutorials/notebooks/intro_gfn_continuous_line.ipynb b/tutorials/notebooks/intro_gfn_continuous_line.ipynb
@@ -612,7 +612,7 @@
         "gflownet = TBGFlowNet(\n",
         "    pf=pf_estimator,\n",
         "    pb=pb_estimator,\n",
-        "    on_policy=False,\n",
+        "    off_policy=True,\n",
         "    init_logZ=0.0,\n",
         ")\n",
         "\n",
@@ -766,7 +766,7 @@
         "gflownet = TBGFlowNet(\n",
         "    pf=pf_estimator,\n",
         "    pb=pb_estimator,\n",
-        "    on_policy=False,  # No replay buffer.\n",
+        "    off_policy=True,  # No replay buffer.\n",
         "    init_logZ=0.0,\n",
         ")\n",
         "\n",

diff --git a/tutorials/notebooks/intro_gfn_continuous_line_simple.ipynb b/tutorials/notebooks/intro_gfn_continuous_line_simple.ipynb
@@ -859,7 +859,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.10.12"
+   "version": "3.10.13"
   }
  },
  "nbformat": 4,

diff --git a/tutorials/notebooks/intro_gfn_smiley.ipynb b/tutorials/notebooks/intro_gfn_smiley.ipynb
@@ -1932,7 +1932,7 @@
       "source": [
         "# Trajectory Balance with `torchgfn`\n",
         "\n",
-        "Similarly, we can train a gflownet using Trajectory Balance using the `TBGFlowNet` class. Unlike before, we separately parameterize the forward and backward policies are two different `estimators`, which are passed to the `TBGFlowNet`. In this example we don't use a replay buffer, so we set `on_policy=True`.\n",
+        "Similarly, we can train a gflownet using Trajectory Balance using the `TBGFlowNet` class. Unlike before, we separately parameterize the forward and backward policies are two different `estimators`, which are passed to the `TBGFlowNet`. In this example we don't use a replay buffer, so we set `off_policy=False`.\n",
         "\n",
         "One common trick with trajectory balance is to learn the `logZ` parameter with a higher learning rate than the rest of the network."
       ]
@@ -1987,7 +1987,7 @@
         "gflownet = TBGFlowNet(\n",
         "    pf=pf_estimator,\n",
         "    pb=pb_estimator,\n",
-        "    on_policy=True,  # No replay buffer.\n",
+        "    off_policy=False,  # No replay buffer.\n",
         ")\n",
         "\n",
         "# Policy parameters recieve one LR, and LogZ gets a dedicated, typically higher LR.\n",