You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, @WuliSmart! Thanks for your interesting question, and sorry for the late reply. We apply the embedding layer (Conv2d 2x2 with a norm layer) at the start of each stage to stabilize the training and introduce more nonlinearity for the lightweight and small-size variants. A similar design can be found in the Metaformer baseline (https://arxiv.org/abs/2210.13452).
No description provided.
The text was updated successfully, but these errors were encountered: