Skip to content

Commit

Permalink
syncing notebook states
Browse files Browse the repository at this point in the history
  • Loading branch information
josephdviviano committed Nov 27, 2023
1 parent 12eab45 commit cd35cb8
Show file tree
Hide file tree
Showing 3 changed files with 5 additions and 5 deletions.
4 changes: 2 additions & 2 deletions tutorials/notebooks/intro_gfn_continuous_line.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -612,7 +612,7 @@
"gflownet = TBGFlowNet(\n",
" pf=pf_estimator,\n",
" pb=pb_estimator,\n",
" on_policy=False,\n",
" off_policy=True,\n",
" init_logZ=0.0,\n",
")\n",
"\n",
Expand Down Expand Up @@ -766,7 +766,7 @@
"gflownet = TBGFlowNet(\n",
" pf=pf_estimator,\n",
" pb=pb_estimator,\n",
" on_policy=False, # No replay buffer.\n",
" off_policy=True, # No replay buffer.\n",
" init_logZ=0.0,\n",
")\n",
"\n",
Expand Down
2 changes: 1 addition & 1 deletion tutorials/notebooks/intro_gfn_continuous_line_simple.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -859,7 +859,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
"version": "3.10.13"
}
},
"nbformat": 4,
Expand Down
4 changes: 2 additions & 2 deletions tutorials/notebooks/intro_gfn_smiley.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -1932,7 +1932,7 @@
"source": [
"# Trajectory Balance with `torchgfn`\n",
"\n",
"Similarly, we can train a gflownet using Trajectory Balance using the `TBGFlowNet` class. Unlike before, we separately parameterize the forward and backward policies are two different `estimators`, which are passed to the `TBGFlowNet`. In this example we don't use a replay buffer, so we set `on_policy=True`.\n",
"Similarly, we can train a gflownet using Trajectory Balance using the `TBGFlowNet` class. Unlike before, we separately parameterize the forward and backward policies are two different `estimators`, which are passed to the `TBGFlowNet`. In this example we don't use a replay buffer, so we set `off_policy=False`.\n",
"\n",
"One common trick with trajectory balance is to learn the `logZ` parameter with a higher learning rate than the rest of the network."
]
Expand Down Expand Up @@ -1987,7 +1987,7 @@
"gflownet = TBGFlowNet(\n",
" pf=pf_estimator,\n",
" pb=pb_estimator,\n",
" on_policy=True, # No replay buffer.\n",
" off_policy=False, # No replay buffer.\n",
")\n",
"\n",
"# Policy parameters recieve one LR, and LogZ gets a dedicated, typically higher LR.\n",
Expand Down

0 comments on commit cd35cb8

Please sign in to comment.