Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why is there a Embedding layer in each stage of the model framework #25

Open
WuliSmart opened this issue Oct 15, 2024 · 1 comment
Open
Assignees
Labels
question Further information is requested

Comments

@WuliSmart
Copy link

No description provided.

@Lupin1998
Copy link
Member

Hi, @WuliSmart! Thanks for your interesting question, and sorry for the late reply. We apply the embedding layer (Conv2d 2x2 with a norm layer) at the start of each stage to stabilize the training and introduce more nonlinearity for the lightweight and small-size variants. A similar design can be found in the Metaformer baseline (https://arxiv.org/abs/2210.13452).

@Lupin1998 Lupin1998 self-assigned this Jan 16, 2025
@Lupin1998 Lupin1998 added the question Further information is requested label Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants