
Incorrect Starting State representation, and Valid Ids after every evaluation is run? #5

Open
hitzkrieg opened this issue Nov 23, 2022 · 3 comments

Comments

@hitzkrieg

Shouldn't the state representation and valid_ids be rebuilt when we reset all environments with envs.reset() after every evaluation?

Shouldn't line 232 be replaced by the following code:
obs, infos = envs.reset()
states = agent.build_state(obs, infos)
valid_ids = [agent.encode(info['valid']) for info in infos]

@MarcCote
Collaborator

You are right, that will mess up the next action. That said, I'm questioning the purpose of that reset in the first place. @wsxzwps @PeterAJansen any thoughts?

@PeterAJansen
Contributor

Yes, I think that's a bug, and it should mirror lines 142-145 on reset. It might either be legacy code or my mistake. The evaluation frequency is generally quite low in our runs (e.g. every 1k-5k steps), so I don't think this would have negatively affected our evaluation performance much, if at all.

IIRC, the ScienceWorld DRRN keeps two instances of the environments: one for training, and one for evaluation (e.g. on the dev or test set). I think the call to reset() is intended as a safety call, to reset the training environments to a new variation/start of the game just in case evaluation does anything to the model states that might need to be reset. If you determine that it's not needed, then we can always remove it and it should continue training in each episode from where it left off when it started evaluation.
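Editor's note: a minimal sketch (not the repository's exact code) of the loop structure described above, assuming the names used in this thread (envs, agent.build_state, agent.encode, info['valid']) and a hypothetical evaluate() helper running on the separate evaluation environments. It shows states and valid_ids being rebuilt right after the post-evaluation safety reset, mirroring the initial reset.

def run_training(agent, envs, eval_envs, max_steps, eval_freq):
    # Initial reset of the training environments (mirrors lines 142-145).
    obs, infos = envs.reset()
    states = agent.build_state(obs, infos)
    valid_ids = [agent.encode(info['valid']) for info in infos]

    for step in range(max_steps):
        # ... choose actions from (states, valid_ids), step envs, update the agent ...

        if (step + 1) % eval_freq == 0:
            # Evaluation uses its own environment instances (dev/test variations).
            evaluate(agent, eval_envs)

            # Safety reset of the *training* environments (the call discussed above).
            # The fix: rebuild states and valid_ids from the fresh observations so
            # the next action is not chosen from stale, pre-reset inputs.
            obs, infos = envs.reset()
            states = agent.build_state(obs, infos)
            valid_ids = [agent.encode(info['valid']) for info in infos]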

@hitzkrieg
Author

Thank you for the clarifications! :)
