- By Ruizhi Liao, Junhai Zhai.
- This repo is the PyTorch implementation of [Optimization model based on attention for Few-shot Learning].
- Make sure Mini-Imagenet is split properly. For example:
  - data/
    - miniImagenet/
      - train/
        - n01532829/
          - n0153282900000005.jpg
          - ...
        - n01558993/
          - ...
      - val/
        - n01855672/
          - ...
      - test/
        - ...
  - main.py
  - ...
- The layout is already set up this way if you download and extract Mini-ImageNet from the link above (a quick sanity check is sketched below).
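A minimal layout-check sketch, assuming the relative path `data/miniImagenet` from the example above; adjust it if your `--data-root` differs:

```python
import os

# Path assumed from the example layout above; use your own --data-root if it differs.
data_root = "data/miniImagenet"

for split in ("train", "val", "test"):
    split_dir = os.path.join(data_root, split)
    assert os.path.isdir(split_dir), f"missing split directory: {split_dir}"
    # Each class is a WordNet-id folder (e.g. n01532829) holding its images.
    classes = [d for d in os.listdir(split_dir)
               if os.path.isdir(os.path.join(split_dir, d))]
    print(f"{split}: {len(classes)} classes")
```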
- Check out `scripts/train_5s_5c.sh` and make sure `--data-root` is set properly.
- For 5-shot, 5-class training, run `bash scripts/train_5s_5c.sh`. Hyper-parameters follow the author's repo.
- For 5-shot, 5-class evaluation, run `bash scripts/eval_5s_5c.sh` (remember to change the `--resume` and `--seed` arguments).
- Training with the default settings takes ~2.5 hours on a single Titan Xp while occupying ~2GB GPU memory.
- The implementation replicates two learners, similar to the author's repo (a minimal sketch follows this list):
  - `learner_w_grad` functions as a regular model; its gradients and loss are the inputs to the meta-learner.
  - `learner_wo_grad` constructs the graph for the meta-learner:
    - All the parameters in `learner_wo_grad` are replaced by `cI`, the output of the meta-learner.
    - `nn.Parameters` in this model are cast to `torch.Tensor` to connect the graph to the meta-learner.
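A hedged sketch of that cast from `nn.Parameter` to plain tensors; the function name `replace_params_with_cI` is made up for illustration and may not match the code in this repo:

```python
import torch
import torch.nn as nn

def replace_params_with_cI(learner_wo_grad: nn.Module, cI: torch.Tensor) -> None:
    """Sketch: swap every nn.Parameter for a slice of cI (a plain torch.Tensor).

    Assumes cI is a flat tensor whose entries follow the same order as
    learner_wo_grad.parameters(). Because each slice keeps cI's grad_fn, the
    learner's forward pass becomes part of the meta-learner's graph, so the
    meta-loss backpropagates into the meta-learner that produced cI.
    """
    offset = 0
    for module in learner_wo_grad.modules():
        for name, p in list(module.named_parameters(recurse=False)):
            n = p.numel()
            delattr(module, name)  # drop the nn.Parameter registration
            # Re-attach a plain tensor view of cI under the same attribute name.
            setattr(module, name, cI[offset:offset + n].view_as(p))
            offset += n
```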
- There are several ways to copy parameters from the meta-learner to the learner, depending on the scenario (see the sketch after this list):
  - `copy_flat_params`: we only need the parameter values and keep the original `grad_fn`.
  - `transfer_params`: we want the values as well as the `grad_fn` (from `cI` to `learner_wo_grad`).
    - `.data.copy_` vs. `clone()`: the latter retains all the properties of a tensor, including `grad_fn`.
  - To maintain the batch statistics, `load_state_dict` is used (from `learner_w_grad` to `learner_wo_grad`).
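A hedged sketch of the value-only copy and the batch-statistics transfer described above; the function names are illustrative and may not match this repo's signatures (`transfer_params` would instead re-attach slices of `cI` as plain tensors, as sketched earlier):

```python
import torch
import torch.nn as nn

def copy_flat_params_sketch(learner: nn.Module, cI: torch.Tensor) -> None:
    """Value-only copy: .data.copy_ writes in place, so each parameter keeps its
    identity and whatever grad_fn it already had (unlike clone(), which would
    also carry cI's grad_fn along)."""
    offset = 0
    for p in learner.parameters():
        n = p.numel()
        p.data.copy_(cI[offset:offset + n].view_as(p))
        offset += n

def sync_batch_stats_sketch(learner_w_grad: nn.Module,
                            learner_wo_grad: nn.Module) -> None:
    """Carry BatchNorm buffers (running_mean / running_var) from learner_w_grad
    to learner_wo_grad via load_state_dict. strict=False because the parameter
    entries of learner_wo_grad may have been replaced by plain tensors."""
    learner_wo_grad.load_state_dict(learner_w_grad.state_dict(), strict=False)
```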
- This code borrows heavily from the meta-learning-lstm framework.