Once your evaluation job has finished, go to Evaluations and click on your evaluation to view the results. Each evaluator’s score indicates how well the model performed against the defined criteria. Clicking on Explore Results gives access to individual sample-level results.
INTERPRETING RESULTS
You can use your evaluation results to:
- Compare baseline models to fine-tuned models
- Identify regressions or improvements after model changes
- Decide whether to retrain, adjust data, or refine evaluators
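As a rough illustration of the comparison described above, the sketch below takes per-evaluator scores for a baseline and a fine-tuned model and labels each evaluator as improved, regressed, or unchanged. It is not part of the platform's API: the function name `compare_scores`, the evaluator names, the `tolerance` threshold, and the scores themselves are all hypothetical placeholders, and it assumes that a higher score is better for every evaluator.

```python
def compare_scores(baseline, candidate, tolerance=0.01):
    """Label each evaluator present in both runs as improved, regressed, or unchanged.

    Assumes higher scores are better. `tolerance` is a hypothetical threshold
    below which a difference is treated as noise.
    """
    report = {}
    # Only compare evaluators that were run on both models.
    for name in sorted(baseline.keys() & candidate.keys()):
        delta = candidate[name] - baseline[name]
        if delta > tolerance:
            status = "improved"
        elif delta < -tolerance:
            status = "regressed"
        else:
            status = "unchanged"
        report[name] = (round(delta, 4), status)
    return report


# Placeholder scores for illustration only, not real evaluation results.
baseline_scores = {"helpfulness": 0.72, "safety": 0.95, "groundedness": 0.80}
finetuned_scores = {"helpfulness": 0.81, "safety": 0.90, "groundedness": 0.80}

report = compare_scores(baseline_scores, finetuned_scores)
for name, (delta, status) in report.items():
    print(f"{name}: {delta:+.4f} ({status})")
```

A regression on one evaluator alongside an improvement on another is a common signal to look at the sample-level results for the regressed evaluator before deciding whether to retrain, adjust data, or refine that evaluator.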