improved retry behavior #1277

jjallaire · 2025-02-10T13:19:32Z

Various suggestions from @craigwalton-dsit:

Sometimes when running evals against models which have flaky APIs which trigger their is_rate_limit() override to return True, users get confused as to why their eval seems to be "stuck" indefinitely. A few observations:

Some provider implementations like google treat numerous errors as is_rate_limit (e.g. 500, 503 and 504) (source). Maybe this override would be better named is_retryable_error.
AFAIK it is not that obvious to users that Inspect is busy retrying/waiting to retry failed HTTP requests. The inspect trace anomalies is helpful. It might be even more user-friendly if the "Running samples" UI somehow indicated that a sample was in a retry loop (a bit like how we show "Generating ...").
The "rate limits" counter in the UI only updates for actual HTTP 429 errors (not other errors we treat as rate limits/retryable). This might well be the right behaviour, but just wanted to highlight this was one source of confusion. May be resolved by tackling 1.\

tadamcz · 2025-02-12T14:03:56Z

Previous discussion: #1174

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improved retry behavior #1277

improved retry behavior #1277

jjallaire commented Feb 10, 2025

tadamcz commented Feb 12, 2025

improved retry behavior #1277

improved retry behavior #1277

Comments

jjallaire commented Feb 10, 2025

tadamcz commented Feb 12, 2025