Oumi streamlines model fine-tuning and performance iteration by providing multiple training methods and flexible configuration options. This allows you to experiment efficiently while retaining full control over your setup.

DOCUMENTATION INDEX
Fetch the complete documentation index at: https://docs.oumi.ai/llms.txt
Use this file to discover all available pages before exploring further.
HOW TO RUN TRAINING JOBS
You can either select Supervised Fine-Tuning to train a model using labeled examples, or On-Policy Distillation to train a student model using a teacher model for knowledge distillation.
SUPERVISED FINE-TUNING (SFT)
To start an SFT job, initiate a training run from the Models page:
- Click on Train New Model.
- In the Builder, select Supervised Fine-Tuning.
- Select the base model to fine-tune. Oumi offers a broad range of commonly used models.
- Choose your training dataset, and optionally select validation and test datasets. You can use uploaded datasets, synthesized data, or merged datasets.
- Select a training method. Oumi supports full fine-tuning (FFT) and parameter-efficient fine-tuning (PEFT), including LoRA.
- Adjust advanced hyperparameters (e.g., maximum steps, learning rate) if needed.
- Review your configuration, (optionally) save it as a reusable recipe, and launch the training job.
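The steps above correspond to a declarative training recipe. As a rough sketch, the open-source Oumi CLI accepts an equivalent YAML recipe; the model and dataset names below are placeholders, and the field names are assumptions based on the open-source Oumi config schema rather than a description of this UI:

```yaml
# Illustrative SFT recipe for the open-source Oumi CLI.
# Model and dataset names are placeholders; field names are assumed
# from the open-source Oumi config schema.
model:
  model_name: "meta-llama/Llama-3.2-1B-Instruct"  # base model to fine-tune

data:
  train:
    datasets:
      - dataset_name: "yahma/alpaca-cleaned"      # labeled SFT examples

training:
  trainer_type: "TRL_SFT"  # supervised fine-tuning
  use_peft: true           # parameter-efficient fine-tuning (LoRA)
  max_steps: 100           # the "maximum steps" advanced hyperparameter
  learning_rate: 2.0e-5    # the "learning rate" advanced hyperparameter
```

Assuming a recipe file like this, a run would be launched with something along the lines of `oumi train -c sft_recipe.yaml`; setting `use_peft: false` would select full fine-tuning (FFT) instead.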
ON-POLICY DISTILLATION
To start an on-policy distillation job, initiate a training run from the Models page:
- Click on Train New Model.
- In the Builder, select On-Policy Distillation.
- Leave Training Method on On-Policy Distillation.
- Choose your Base Model and Teacher Model.
- Select your Training Dataset.
- Configure advanced settings (e.g., Training Settings, Distillation Settings, Parameter-Efficient Settings) if needed.
Please see On-Policy Distillation for more information regarding configuration options and settings.
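Conceptually, the distillation setup differs from SFT only in that two models are involved: the student (base) model being trained and the frozen teacher model it learns from. A minimal sketch of such a recipe might look like the following; the `distillation` section and its field names are hypothetical illustrations, not a documented Oumi schema, and the model and dataset names are placeholders:

```yaml
# Hypothetical on-policy distillation recipe sketch.
# The `distillation` section and its field names are illustrative only.
model:
  model_name: "my-org/student-model"      # placeholder: student (base) model

data:
  train:
    datasets:
      - dataset_name: "my-org/prompts"    # placeholder: training dataset

distillation:
  teacher_model: "my-org/teacher-model"   # placeholder: frozen teacher model
```

The key design point is that the student generates its own (on-policy) responses during training and is updated to match the teacher's behavior on them, rather than imitating a fixed labeled dataset as in SFT.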
CHECKING JOB STATUS
After a training job launches, it will appear on the Activity log page with a status of Running.
When training completes, you can access your model from the Models page.