Grams: Gradient Descent with Adaptive Momentum Scaling Install pip install grams-pytorch How to use Import: from grams import Grams Instantiate: optimizer = Grams(lr=1e-3, weight_decay=0.0)