Sequential.eval() does not put model into eval mode #1426

brianberns · 2024-12-19T05:31:55Z

Calling eval() should make training false. However, this does not work for Sequential modules.

Example F# program:

open type TorchSharp.torch.nn

let linear = Linear(10, 10)
linear.eval()
assert(not linear.training)       // succeeds

let sequential = Sequential(Linear(10, 10))
sequential.eval()
assert(not sequential.training)   // fails

I think the problem is that Sequential.train() should call base.train() in addition to calling train() for each submodule:

public override void train(bool on = true)
{
    foreach (var m in _modules) { ((torch.nn.Module)m).train(on); }
    base.train(on);
}
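
To show why the base call matters, here is a minimal, self-contained sketch that does not use TorchSharp. `ToyModule` and `ToySequential` are hypothetical stand-ins for `torch.nn.Module` and `Sequential`, and the `callBase` switch exists only for the comparison. It assumes, as suggested above, that the container's override forwards the mode to its children without updating its own flag:

```csharp
using System;
using System.Collections.Generic;

// Top-level statements must come before the type declarations below.
// First argument: whether the container's train() also calls base.train().
var unpatched = new ToySequential(false, new ToyModule());
unpatched.eval();
Console.WriteLine($"without base.train(on): training = {unpatched.training}"); // True

var patched = new ToySequential(true, new ToyModule());
patched.eval();
Console.WriteLine($"with base.train(on):    training = {patched.training}");    // False

// Hypothetical stand-in for torch.nn.Module: tracks only the training flag.
class ToyModule
{
    public bool training { get; protected set; } = true;

    public virtual void train(bool on = true) { training = on; }

    // eval() is simply train(false), so it goes through the virtual override.
    public void eval() => train(false);
}

// Hypothetical stand-in for Sequential.
class ToySequential : ToyModule
{
    private readonly List<ToyModule> _modules;
    private readonly bool _callBase;

    public ToySequential(bool callBase, params ToyModule[] modules)
    {
        _callBase = callBase;
        _modules = new List<ToyModule>(modules);
    }

    public override void train(bool on = true)
    {
        // Children are always updated...
        foreach (var m in _modules) { m.train(on); }
        // ...but the container's own flag changes only if the base is called.
        if (_callBase) { base.train(on); }
    }
}
```

In this toy model the children switch to eval mode in both cases. Only the container's own flag is left stale when the base call is missing, which matches the failing assertion in the F# program.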

yueyinqiu · 2024-12-19T11:47:50Z

This may not be directly related to this issue, but I think we should reconsider how submodules are handled, especially the way we register them. We have discussed this before in #1272 (comment). I think the best approach would be to use source generators, but that would add too much complexity, so we previously treated it as a last resort and put it on hold.

alinpahontu2912 · 2025-01-20T09:38:54Z

Hey @brianberns, thanks for the issue. I tested it myself and you are right, there seems to be a problem with the Sequential module. For the moment, could you try creating and using your own custom module that contains the Sequential module, as described in the wiki? Something like this:

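using System.Collections.Generic;
using TorchSharp;
using static TorchSharp.torch;
using static TorchSharp.torch.nn;

public class CustomModel : Module<Tensor, Tensor>
{
    // The Sequential is held as a field so that RegisterComponents() can find it.
    private readonly Module<Tensor, Tensor> layers;

    public CustomModel()
        : base("CustomModel")
    {
        var modules = new List<(string, Module<Tensor, Tensor>)>();
        modules.Add(("lin1", Linear(10, 10)));

        layers = Sequential(modules);

        // Registers the module fields of this class as submodules.
        RegisterComponents();
    }

    public override Tensor forward(Tensor input)
    {
        return layers.forward(input);
    }

    protected override void Dispose(bool disposing)
    {
        if (disposing)
        {
            layers.Dispose();
        }
        base.Dispose(disposing);
    }
}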

alinpahontu2912 self-assigned this Jan 20, 2025
