Recent Budget Simulation is incorrect #35

gopikrishnajha · 2024-06-13T06:03:33Z

The simulation code for recent budget is incorrect here. Consider a recent budget 0. Applying the mask with recent budget leaves the main diagonal as it is which should not be the case.

Here is minimal code for replication.

import torch
recent_budget = 0
attn_weights =  torch.rand(4, 4)
ones = torch.ones_like(attn_weights, dtype=torch.bool)
ones = torch.tril(ones, diagonal=recent_budget)
print(ones)
ones = torch.triu(ones, diagonal=recent_budget)
print(ones)
attn_weights[~ones] = 0
print(attn_weights)

The output is this code is this:

tensor(
[[ True, False, False, False],
[ True, True, False, False],
[ True, True, True, False],
[ True, True, True, True]])
tensor(
[[ True, False, False, False],
[False, True, False, False],
[False, False, True, False],
[False, False, False, True]])
tensor(
[[0.4726, 0.0000, 0.0000, 0.0000],
[0.0000, 0.6668, 0.0000, 0.0000],
[0.0000, 0.0000, 0.4941, 0.0000],
[0.0000, 0.0000, 0.0000, 0.5652]])

With recent_budget=0, we should expect all zeros here.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recent Budget Simulation is incorrect #35

Recent Budget Simulation is incorrect #35

gopikrishnajha commented Jun 13, 2024

Recent Budget Simulation is incorrect #35

Recent Budget Simulation is incorrect #35

Comments

gopikrishnajha commented Jun 13, 2024