
Running experiments on Google Cloud

Setup

  1. Open a Cloud Shell

  2. Create a TPU node (from our experiments, n2-standard-2 has enough memory and storage, but you might need more). If you're using the TPU Research Cloud credit, you probably won't have access to anything beefier than v3-8 TPUs.

    gcloud compute tpus execution-groups create \
    --name=what-you-want-to-name-your-tpu \
    --zone=whatever-zone-you-get-free-tpus-in \
    --tf-version=2.4.1 \
    --machine-type=n2-standard-2 \
    --accelerator-type="v3-8"
  3. It should automatically connect to your TPU. If not:

    gcloud compute ssh whatever-you-named-your-tpu --zone whatever-zone-you-chose
  4. Start a tmux session so your commands aren't interrupted by disconnects

    tmux
  5. Set some environment variables. Note: be sure to create a Cloud Storage bucket first (there is an example after this list).

    export PROJECT=your-google-cloud-project-name
    export ZONE=whatever-zone-you-chose
    export BUCKET=gs://your-bucket
    export TPU=whatever-you-named-your-tpu
    export TOPOLOGY=v3-8
  6. Download the Indirect Response code

    git clone https://github.com/frederiknolte/indirect-response
    cd indirect-response
    git submodule init
    git submodule update
  7. Download and install packages

    ./gcp_setup.sh
    source bin/activate
    cd google-research/wt5
  8. Pick one of the T5 models pretrained by Google and note its latest checkpoint step; that is the value you need to provide for PRETRAINED_STEPS (see the listing example after this list).

    export PRETRAINED_DIR=gs://t5-data/pretrained_models/large
    export PRETRAINED_STEPS=1000700
  9. Select the number of steps you would like to finetune for

    export FINETUNE_STEPS=20000
  10. Pick one of the random seeds used to split the Circa dataset. Currently, those are: 13, 948, or 2756.

    export RANDOM_SEED=13
  11. Select the Circa dataset type you would like to use (matched or unmatched):

    export CIRCA_TYPE=matched
  12. Pick the .gin sequence length file for whichever dataset in your mixture has the longest sequences. This file controls the maximum sequence length; by default we use {'inputs': 512, 'targets': 256} for everything (see the inspection example after this list).

    export SEQ_LENGTH_FILE=wt5/gin/sequence_lengths/esnli_v002.gin
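
If you still need a bucket for step 5, a minimal sketch of creating one (this assumes gsutil is available, as it is in Cloud Shell; substitute a location close to your TPU zone, and note that bucket names are globally unique):

    gsutil mb -l us-central1 -p "${PROJECT}" "${BUCKET}"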
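
To find the latest checkpoint step for step 8, you can list the pretrained model directory; checkpoints are named model.ckpt-<step>, so the highest step number is the value to export as PRETRAINED_STEPS:

    gsutil ls "${PRETRAINED_DIR}" | grep ckpt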
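
For step 12, the available sequence-length files ship with the WT5 code, and each one only pins the maximum input/target lengths; from the google-research/wt5 directory you can browse the options and double-check your pick:

    ls wt5/gin/sequence_lengths/
    cat "${SEQ_LENGTH_FILE}"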

Running Experiments

The following commands hold for all experiments; you just need to set the appropriate environment variables first. We describe the procedure below. The exact variables we used for our experiments are listed under Reproducing our Experiments at the end.

Training

First set the task or mixture from the WT5 registry as well as the directory in your bucket where you would like to save generated outputs:

    export TASK=... (e.g., "esnli_and_cos_e_to_circa_nli_${CIRCA_TYPE}${RANDOM_SEED}")
    export MODEL_DIR="${BUCKET}/${TASK}"

Then you can start training. The t5_mesh_transformer script can recover from the last checkpoint if training is disrupted for any reason; simply re-run the command (see the checkpoint check after it).

    PYTHONPATH=$PYTHONPATH:/home/$USER/indirect-response/google-research/ t5_mesh_transformer \
    --tpu="${TPU}" \
    --gcp_project="${PROJECT}" \
    --tpu_zone="${ZONE}" \
    --model_dir="${MODEL_DIR}" \
    --gin_file="dataset.gin" \
    --gin_file="${PRETRAINED_DIR}/operative_config.gin" \
    --gin_file="${SEQ_LENGTH_FILE}" \
    --gin_param="utils.tpu_mesh_shape.tpu_topology = '${TOPOLOGY}'" \
    --gin_param="MIXTURE_NAME = '${TASK}'" \
    --gin_param="mesh_train_dataset_fn.use_cached=False" \
    --gin_param="utils.run.save_checkpoints_steps=1000" \
    --gin_param="utils.run.batch_size=('tokens_per_batch', 65536)" \
    --gin_param="utils.run.train_steps=$((PRETRAINED_STEPS+FINETUNE_STEPS))" \
    --gin_param="utils.run.init_checkpoint='${PRETRAINED_DIR}/model.ckpt-${PRETRAINED_STEPS}'" \
    --gin_param="utils.run.learning_rate_schedule=@learning_rate_schedules.constant_learning_rate" \
    --gin_param="constant_learning_rate.learning_rate=1e-3" \
    --gin_param="mesh_train_dataset_fn.seed=${RANDOM_SEED}" \
    --t5_tfds_data_dir="${BUCKET}/t5-tfds" \
    --module_import="wt5.tasks" \
    --module_import="wt5.mixtures" \
    --module_import="circa.circa_splits.circa_${CIRCA_TYPE}${RANDOM_SEED}" \
    --gin_location_prefix="wt5/wt5/gin/"
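
If a run does get interrupted, you can confirm which checkpoint training will resume from: TensorFlow records the most recent one in a small checkpoint file inside MODEL_DIR (this assumes the standard TensorFlow checkpoint layout):

    gsutil cat "${MODEL_DIR}/checkpoint"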

Validation

To validate the model you need to set the VAL_TASK variable. The value depends on what you want to validate. For example, if you want to validate every task in your training mixture, you can set VAL_TASK to the same value as TASK.

    export VAL_TASK=... (e.g., "circa_eval_v100_nli_relaxed_${CIRCA_TYPE}${RANDOM_SEED}")

Then you can start validating your model:

    PYTHONPATH=$PYTHONPATH:/home/$USER/indirect-response/google-research/ t5_mesh_transformer \
        --tpu="${TPU}" \
        --gcp_project="${PROJECT}" \
        --tpu_zone="${ZONE}" \
        --model_dir="${MODEL_DIR}" \
        --gin_file="dataset.gin" \
        --gin_file="${MODEL_DIR}/operative_config.gin" \
        --gin_file="${SEQ_LENGTH_FILE}" \
        --gin_file="eval.gin" \
        --gin_param="utils.tpu_mesh_shape.tpu_topology = '${TOPOLOGY}'" \
        --gin_param="MIXTURE_NAME = '${VAL_TASK}'" \
        --gin_param="mesh_eval_dataset_fn.use_cached=False" \
        --gin_param="utils.run.dataset_split = 'validation'" \
        --gin_param="utils.run.batch_size=('tokens_per_batch', 65536)" \
        --gin_param="utils.run.eval_checkpoint_step='all'" \
        --gin_param="mesh_eval_dataset_fn.seed=${RANDOM_SEED}" \
        --t5_tfds_data_dir="${BUCKET}/t5-tfds" \
        --module_import="wt5.tasks" \
        --module_import="wt5.mixtures" \
        --module_import="circa.circa_splits.circa_${CIRCA_TYPE}${RANDOM_SEED}" \
        --gin_location_prefix="wt5/wt5/gin/" \
        --gin_param="utils.run.eval_summary_dir='${MODEL_DIR}/validation_eval'"

Test Set Evaluation

Once you've run the validation task, it's up to you to find the best checkpoint for the final test set evaluation; for example, we used TensorBoard to monitor accuracy during validation (see the example above). Once you know the step number of your best checkpoint, set that variable first:

    export BEST_VAL_CHECKPOINT=... (e.g., 1020700)

Then set the name of the task/mixture you would like to use.

    export FINAL_EVAL_TASK=... (e.g., "circa_eval_v100_nli_relaxed_${CIRCA_TYPE}${RANDOM_SEED}")

Finally, run the evaluation:

    PYTHONPATH=$PYTHONPATH:/home/$USER/indirect-response/google-research/ t5_mesh_transformer \
    --tpu="${TPU}" \
    --gcp_project="${PROJECT}" \
    --tpu_zone="${ZONE}" \
    --model_dir="${MODEL_DIR}" \
    --gin_file="dataset.gin" \
    --gin_file="${MODEL_DIR}/operative_config.gin" \
    --gin_file="${SEQ_LENGTH_FILE}" \
    --gin_file="eval.gin" \
    --gin_param="utils.tpu_mesh_shape.tpu_topology = '${TOPOLOGY}'" \
    --gin_param="MIXTURE_NAME = '${FINAL_EVAL_TASK}'" \
    --gin_param="mesh_eval_dataset_fn.use_cached=False" \
    --gin_param="utils.run.dataset_split = 'test'" \
    --gin_param="utils.run.batch_size=('tokens_per_batch', 65536)" \
    --gin_param="utils.run.eval_checkpoint_step=${BEST_VAL_CHECKPOINT}" \
    --gin_param="mesh_eval_dataset_fn.seed=${RANDOM_SEED}" \
    --t5_tfds_data_dir="${BUCKET}/t5-tfds" \
    --module_import="wt5.tasks" \
    --module_import="wt5.mixtures" \
    --module_import="circa.circa_splits.circa_${CIRCA_TYPE}${RANDOM_SEED}" \
    --gin_location_prefix="wt5/wt5/gin/" \
    --gin_param="utils.run.eval_summary_dir='${MODEL_DIR}/test_eval'"

Cleanup

All your artifacts (checkpoints, inputs, predictions, targets, etc.) should now be available in the MODEL_DIR folder in your bucket. To clean up, exit tmux, then exit the SSH session on the TPU node, and run:

    gcloud compute tpus execution-groups delete whatever-you-named-your-tpu --zone=whatever-zone-you-chose

To verify no more TPU nodes are active, run:

    gcloud compute tpus execution-groups list --zone=whatever-zone-you-chose

Reproducing our Experiments

Consult the following table to see the environment variables we set for our experiments.

| Experiment | TASK | VAL_TASK | FINAL_EVAL_TASK |
| --- | --- | --- | --- |
| Circa baseline | TASK="circa_v100_0_expln_nli_relaxed_${CIRCA_TYPE}${RANDOM_SEED}" | VAL_TASK="${TASK}" | FINAL_EVAL_TASK="${TASK}" |
| Circa baseline - premise only | TASK="circa_nli_baseline_premise_only_relaxed_${CIRCA_TYPE}${RANDOM_SEED}" | VAL_TASK="${TASK}" | FINAL_EVAL_TASK="${TASK}" |
| Circa baseline - hypothesis only | TASK="circa_nli_baseline_hypothesis_only_relaxed_${CIRCA_TYPE}${RANDOM_SEED}" | VAL_TASK="${TASK}" | FINAL_EVAL_TASK="${TASK}" |
| Zero Shot - e-SNLI and Cos-E to Circa | TASK="esnli_and_cos_e_to_circa_zero_shot" | VAL_TASK="circa_eval_v100_nli_relaxed_${CIRCA_TYPE}${RANDOM_SEED}" | FINAL_EVAL_TASK="${VAL_TASK}" |
| Full mixture (the main experiments) | TASK="esnli_and_cos_e_to_circa_nli_${CIRCA_TYPE}${RANDOM_SEED}" | VAL_TASK="circa_eval_v100_nli_relaxed_${CIRCA_TYPE}${RANDOM_SEED}" | FINAL_EVAL_TASK="${VAL_TASK}" |