ResNet implementation: set bias=False for downsample-B #5477

thatgeeman · 2022-11-05T16:02:53Z

Signed-off-by: Geevarghese George [email protected]

Description

This is a simple fix following #5465.
The downsampling layer is not expected to have a bias term. The previous implementation did not explicitly set bias=False, so it fell back to the default of PyTorch's Conv3d/Conv2d, where bias=True. With this change, the correct number of parameter tensors (62 for resnet18) is returned:

from torchvision import models
from monai.networks import nets

d2net_torch = models.resnet18()
d2net_monai = nets.resnet18(spatial_dims=2)
d3net_monai = nets.resnet18(spatial_dims=3)

# number of parameter tensors in each network
len(list(d2net_torch.parameters())), len(list(d2net_monai.parameters())), len(list(d3net_monai.parameters()))
# 62, 62, 62
# before: 62 65 65
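
The drop from 65 to 62 is consistent with one bias tensor per downsampling shortcut. A minimal sketch in plain PyTorch, assuming a shortcut made of a 1x1 convolution followed by batch norm (the channel sizes and the helpers `count_param_tensors` and `make_shortcut` are illustrative, not taken from the MONAI source):

```python
import torch.nn as nn


def count_param_tensors(module: nn.Module) -> int:
    """Number of tensors yielded by module.parameters()."""
    return len(list(module.parameters()))


def make_shortcut(bias: bool) -> nn.Sequential:
    # Assumed layout of a type-B shortcut: 1x1 conv, then batch norm.
    return nn.Sequential(
        nn.Conv2d(64, 128, kernel_size=1, stride=2, bias=bias),
        nn.BatchNorm2d(128),
    )


# conv weight + conv bias + BN weight + BN bias
with_bias = count_param_tensors(make_shortcut(bias=True))
# conv weight + BN weight + BN bias
without_bias = count_param_tensors(make_shortcut(bias=False))

extra_per_shortcut = with_bias - without_bias
print(with_bias, without_bias, extra_per_shortcut)
```

If resnet18 has three such shortcuts (at the start of layer2, layer3 and layer4, as in torchvision), removing the bias accounts for the difference of 3 between 65 and 62.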

Other deeper 2D ResNet architectures are also comparable to the PyTorch implementation, so the pretrained/weights parameter could be allowed for these networks. Currently, a NotImplementedError is raised with pretrained=True, even for 2D ResNets.

Types of changes

- [x] Non-breaking change (fix or new feature that would not break existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing functionality to change).
- [ ] New tests added to cover the changes.
- [x] Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
- [x] Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
- [ ] In-line docstrings updated.
- [ ] Documentation updated, tested make html command in the docs/ folder.

Signed-off-by: Geevarghese George <[email protected]>

ericspod · 2022-11-06T15:44:11Z

We haven't implemented downloading pretrained weights, so that part isn't affected by this, but other saved instances of this network, like here, won't load correctly. To maintain backwards compatibility, we should add a bias argument at the end of the constructor's arguments, defaulting to True, which sets the bias argument of the downsampling layer. When a standard ResNet compatible with other pretrained weights is requested, it would be set to False.
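
The compatibility concern can be reproduced on a single layer. A minimal sketch, assuming a checkpoint saved from a convolution with a bias is loaded into one built with bias=False (the layer sizes are arbitrary and not taken from MONAI):

```python
import torch.nn as nn

# "Old" layer: bias=True is the PyTorch default.
saved = nn.Conv3d(4, 8, kernel_size=1, stride=2)
state = saved.state_dict()  # contains "weight" and "bias"

# "New" layer: same shape, but without a bias term.
current = nn.Conv3d(4, 8, kernel_size=1, stride=2, bias=False)

# With the default strict=True, the extra "bias" key raises a RuntimeError.
try:
    current.load_state_dict(state)
    strict_load_failed = False
except RuntimeError:
    strict_load_failed = True

# With strict=False, the mismatch is reported instead of raised.
result = current.load_state_dict(state, strict=False)
unexpected = list(result.unexpected_keys)
missing = list(result.missing_keys)
print(strict_load_failed, unexpected, missing)
```

Keeping the default at True means state dicts saved before the change still load without reporting unexpected keys.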

thatgeeman · 2022-11-06T16:43:10Z

Yes, that makes sense, I'll make the additions in the coming days. Making this PR a draft for now.

thatgeeman · 2022-11-11T20:56:42Z

Made the requisite changes to accept an additional kwarg as discussed. How does this look @ericspod?

ericspod · 2022-11-14T14:01:06Z

Looks good to me, we should add a test case to test_resnet.py to cover this change as well.

thatgeeman · 2022-11-14T15:03:31Z

Agreed! What would the test case look like: would it be more of a sanity check to see if the expected shapes are returned with pretrained=True?

ericspod · 2022-11-14T15:18:25Z

Yes I don't think much more is needed than that, if we had a version of an existing unit test with bias_downsample=False that would be enough to show that the network still is correct.

thatgeeman · 2022-11-14T18:33:38Z

Just added to the test_resnet case as discussed. ^

wyli

Thanks, it looks good to me.

wyli · 2022-11-14T18:50:02Z

/build

wyli · 2022-11-15T07:02:26Z

/build

wyli · 2022-11-15T08:09:58Z

/build

…#5477)

Signed-off-by: Geevarghese George <[email protected]>

### Description

This is a simple fix following Project-MONAI#5465.
The downsampling layer is not expected to have a bias term. The previous implementation did not explicitly set `bias=False` and defaulted to PyTorch Conv3D/2D where `bias=True`. With this change, the correct number (62 for resnet18) of parameter groups are returned:

```python
from torchvision import models
from monai.networks import nets
d2net_torch = models.resnet18()
d2net_monai = nets.resnet18(spatial_dims=2)
d3net_monai = nets.resnet18(spatial_dims=3)
len(list(d2net_torch.parameters())), len(list(d2net_monai.parameters())), len(list(d3net_monai.parameters()))
# 62, 62, 62
# before: 62 65 65
```

Other deeper 2D ResNet architectures are also comparable to the PyTorch implementation; the `pretrained`/`weights` parameter can be allowed for these networks. Currently, it raises a `NotImplementedError` with `pretrained=True` even for 2D ResNets.

### Types of changes

- [x] Non-breaking change (fix or new feature that would not break existing functionality).
- [ ] Breaking change (fix or new feature that would cause existing functionality to change).
- [ ] New tests added to cover the changes.
- [x] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`.
- [x] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`.
- [ ] In-line docstrings updated.
- [ ] Documentation updated, tested `make html` command in the `docs/` folder.

Signed-off-by: Geevarghese George <[email protected]>
Signed-off-by: Behrooz <[email protected]>

acerdur · 2023-08-01T10:37:18Z

Currently, the bias_downsample=False argument conflicts with pretraining, because it is hard-coded to `not pretrained` in the ResNet constructor:

    model: ResNet = ResNet(block, layers, block_inplanes, bias_downsample=not pretrained, **kwargs)
    if pretrained:
        # Author of paper zipped the state_dict on googledrive,
        # so would need to download, unzip and read (2.8gb file for a ~150mb state dict).
        # Would like to load dict from url but need somewhere to save the state dicts.
        raise NotImplementedError(
            "Currently not implemented. You need to manually download weights provided by the paper's author"
            " and load then to the model with `state_dict`. See https://github.com/Tencent/MedicalNet"
        )
    return model

When manually loading MedicalNet weights, the downsample bias terms raise errors because they are not present in the loaded weights. It is also not possible to remove the downsample bias by setting pretrained=True, since this raises NotImplementedError.
So, could you please remove the hard-coding from the model constructor in the source code?
Thanks

thatgeeman · 2023-08-07T10:32:11Z

Hi @acerdur @wyli
Since there were no error logs provided in the above comments, I'm assuming the issue comes only from the shortcut_type used.

Details:
The sole purpose of passing bias_downsample=False is to match the MedicalNet and official PyTorch implementations of ResNet, which set bias=False in the downsampling layer. For the downsampling layer, there are two variants in MONAI:
shortcut_type='B' # uses a conv1x1 as the downsampling layer
shortcut_type='A' # uses an avgpool1x1 as the downsampling layer
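
For comparison, a type 'A' shortcut has no learnable parameters, so a downsample bias setting has nothing to act on there. A rough sketch of such a shortcut, assuming it is an average pool with kernel size 1 followed by zero-padding of the channel dimension (the function name `shortcut_a` and the shapes are illustrative; this is not copied from the MONAI source):

```python
import torch
import torch.nn.functional as F


def shortcut_a(x: torch.Tensor, planes: int, stride: int) -> torch.Tensor:
    """Parameter-free shortcut: subsample spatially, then zero-pad channels."""
    # A kernel of size 1 with stride > 1 simply picks every stride-th voxel.
    out = F.avg_pool3d(x, kernel_size=1, stride=stride)
    # Fill the missing channels with zeros so the shapes match the main branch.
    pad = torch.zeros(
        out.size(0), planes - out.size(1), *out.shape[2:],
        dtype=out.dtype, device=out.device,
    )
    return torch.cat([out, pad], dim=1)


x = torch.randn(1, 64, 8, 8, 8)
y = shortcut_a(x, planes=128, stride=2)
print(tuple(y.shape))
```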

AFAIU, the error you are facing comes from the shortcut layer. To correctly load the pretrained weights of MedicalNet, you should initialize the model with the correct architecture, i.e. with shortcut_type='A':

import torch
from collections import OrderedDict
from monai.networks import nets

device = torch.device('cpu')  # assumption: load on CPU; change as needed

# MONAI ResNet18
net = nets.resnet18(pretrained=False, spatial_dims=3, n_input_channels=1, num_classes=2, shortcut_type='A')
wt_path = 'resnet_18.pth'  # path to weights from Google Drive of Tencent
pretrained_weights = torch.load(f=wt_path, map_location=device)

# match the keys: drop the 'module.' prefix from the checkpoint keys
weights = OrderedDict()
for k, v in pretrained_weights['state_dict'].items():
    weights[k.replace('module.', '')] = v

# strict=False tolerates the missing final linear layer
net.load_state_dict(weights, strict=False)  # _IncompatibleKeys(missing_keys=['fc.weight', 'fc.bias'], unexpected_keys=[])

The only incompatible keys belong to the last linear layer. This is expected for fine-tuning, since that layer is neither provided by nor inferable from the MedicalNet weights.
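
The key-matching step can be isolated in a small helper. A sketch, assuming the checkpoint keys carry a leading 'module.' prefix (as is typical when a model wrapped in nn.DataParallel is saved); `strip_prefix` is a hypothetical name, and unlike `str.replace` it only removes the prefix at the start of a key:

```python
from collections import OrderedDict


def strip_prefix(state_dict, prefix="module."):
    """Return a copy of state_dict with a leading prefix removed from each key."""
    cleaned = OrderedDict()
    for key, value in state_dict.items():
        new_key = key[len(prefix):] if key.startswith(prefix) else key
        cleaned[new_key] = value
    return cleaned


# Placeholder values stand in for the weight tensors.
example = OrderedDict(
    [("module.conv1.weight", 0), ("module.layer1.0.bn1.bias", 1), ("fc.weight", 2)]
)
cleaned = strip_prefix(example)
print(list(cleaned))
```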

Related: #6811

set bias=False for downsample-B

7fa2dcf

Signed-off-by: Geevarghese George <[email protected]>

thatgeeman changed the title ~~ResNet implementaion: set bias=False for downsample-B~~ ResNet implementation: set bias=False for downsample-B Nov 5, 2022

thatgeeman marked this pull request as draft November 6, 2022 16:38

add bias_downsample kwarg to ResNet

b27a6a3

Signed-off-by: Geevarghese George <[email protected]>

thatgeeman marked this pull request as ready for review November 12, 2022 12:01

thatgeeman added 2 commits November 14, 2022 19:00

add to test_resnet: TEST_CASE_7

a203493

Signed-off-by: Geevarghese George <[email protected]>

Merge branch 'Project-MONAI:dev' into dev

e5dafbe

thatgeeman force-pushed the 5465-resnet-downsample branch from 745b5af to e5dafbe Compare November 14, 2022 18:12

wyli approved these changes Nov 14, 2022

wyli enabled auto-merge (squash) November 14, 2022 18:50

Merge branch 'dev' into 5465-resnet-downsample

8ef2a7c

wyli merged commit 5e6f105 into Project-MONAI:dev Nov 15, 2022

thatgeeman deleted the 5465-resnet-downsample branch November 15, 2022 11:52

wyli mentioned this pull request Aug 1, 2023

bias_downsample=False ResNet constructor #6811

Closed
