-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Restructure of blocks + adding ConvLSTM #3
base: dlesm
Are you sure you want to change the base?
Conversation
anna-dodson
commented
Feb 12, 2024
- Divided blocks file into BasicConvBlock, ConvNeXt block, SymmetricConvNext, DoubleConvNext block
- ConvGRUBlock, and ConvLSTM block now have their own files as well
- pool, interpolate split apart too
… building the LSTM block for testing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Initial look over the changes. I like the reorganization of our blocks but added a couple comments about some of the other changes. We can also talk about this when we meet tomorrow.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this supposed to be in this PR? If it is please add a descriptive filename and some documentation about its purpose.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure what the best practice is for this type of thing. Whoever uses the module needs to have write permissions to the "prefix" directory defined here. Can we use a relative path?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps we should leave the generic "run_training.sh" as a template script that trains our most recently published version of the model. Specific job scripts for different experiments probably don't need to be pushed to the repo. Open to you thoughts on this though...