You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The main goal of this task is to end up with alignment on how we want to implement this new evaluation (e.g. as a new workflow which calls inference + LLM judge inference + evaluation vs having evaluation call LLM judge inference).
The deliverable should be a diagram explaining how we will run this new feature in Lumigator
The text was updated successfully, but these errors were encountered:
aittalam
changed the title
align on architecture for running both the evaluation with LLM as judge and the evaluation of judges themselves
Align on architecture for running both the evaluation with LLM as judge and the evaluation of judges themselves
Feb 13, 2025
aittalam
changed the title
Align on architecture for running both the evaluation with LLM as judge and the evaluation of judges themselves
Align on architecture for running LLM-as-judge evaluation
Feb 13, 2025
See rationale here.
The main goal of this task is to end up with alignment on how we want to implement this new evaluation (e.g. as a new workflow which calls inference + LLM judge inference + evaluation vs having evaluation call LLM judge inference).
The deliverable should be a diagram explaining how we will run this new feature in Lumigator
The text was updated successfully, but these errors were encountered: