Skip to content

Latest commit

 

History

History
10 lines (4 loc) · 397 Bytes

README.md

File metadata and controls

10 lines (4 loc) · 397 Bytes

VisionDeltaNet is a lightweight vision backbone inspired by Parallelizing Linear Transformers with the Delta Rule over Sequence Length. It combines transformers with convolutions for efficient feature extraction and achieves 70%+ accuracy on CIFAR-10 within 10 epochs.

  1. Install dependencies: pip install torch torchvision.

  2. To train and evaluate on CIFAR-10, run:

python vdelta.py