The following is a list of models available on the Oumi platform, organized by use case. More supported models are added continuously; for the most up-to-date list, ask the Oumi Agent.
TRAINING (FINE-TUNING BASE MODELS)
Oumi supports a broad set of models that can be fine-tuned on your own data. Each model offers different training approaches and requirements depending on its architecture and capabilities.

| Model | Training methods |
|---|---|
| Qwen/Qwen3.5-9B | SFT (FFT) |
| Qwen/Qwen3.5-9B-Base | SFT (FFT) |
| Qwen/Qwen3.5-4B | SFT (FFT) |
| Qwen/Qwen3.5-4B-Base | SFT (FFT) |
| Qwen/Qwen3.5-2B | SFT (FFT) |
| Qwen/Qwen3.5-0.8B | SFT (FFT) |
| Qwen/Qwen3-32B | SFT (LoRA) |
| Qwen/Qwen3-8B | SFT (LoRA, FFT), OPD |
| Qwen/Qwen3-4B-Instruct-2507 | SFT (LoRA, FFT), OPD |
| Qwen/Qwen3-0.6B | SFT (LoRA, FFT), OPD |
| Qwen/Qwen2.5-7B-Instruct | SFT (LoRA, FFT), OPD |
| Qwen/Qwen2.5-7B | SFT (LoRA, FFT), OPD |
| Qwen/Qwen2.5-3B-Instruct | SFT (LoRA, FFT), OPD |
| Qwen/Qwen2.5-1.5B-Instruct | SFT (LoRA, FFT), OPD |
| meta-llama/Llama-4-Scout-17B-16E-Instruct | SFT (LoRA) |
| meta-llama/Llama-3.1-8B-Instruct | SFT (LoRA, FFT), OPD |
| meta-llama/Llama-3.2-3B-Instruct | SFT (LoRA, FFT), OPD |
| meta-llama/Llama-3.2-1B-Instruct | SFT (LoRA, FFT), OPD |
| google/gemma-3-4b-it | SFT (LoRA, FFT) |
| microsoft/Phi-3.5-mini-instruct | SFT (LoRA, FFT) |
| microsoft/Phi-3.5-MoE-instruct | SFT (LoRA) |
| openai/gpt-oss-20b | SFT (FFT) |
| HuggingFaceTB/SmolLM2-1.7B-Instruct | SFT (LoRA, FFT) |
| HuggingFaceTB/SmolLM2-360M-Instruct | SFT (LoRA, FFT), OPD |
| HuggingFaceTB/SmolLM2-135M-Instruct | SFT (LoRA, FFT), OPD |
| HuggingFaceTB/SmolLM-1.7B-Instruct | SFT (LoRA, FFT) |
| HuggingFaceTB/SmolLM-360M-Instruct | SFT (LoRA, FFT) |
| HuggingFaceTB/SmolLM-135M-Instruct | SFT (LoRA, FFT) |
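
When choosing a base model, it can help to filter the table above by the training methods you need. The snippet below is a minimal sketch of that lookup, not part of any Oumi API: `TRAINING_METHODS` and `models_supporting` are hypothetical names, and the dictionary holds only a hand-copied subset of the rows above, with the SFT variants (LoRA, FFT) and OPD flattened into one set per model.

```python
# Illustrative subset of the training table above, transcribed by hand.
# Hypothetical helper code; this is not an Oumi API.
# LoRA and FFT are the SFT variants listed in the table; OPD is listed separately.
TRAINING_METHODS = {
    "Qwen/Qwen3.5-9B": {"FFT"},
    "Qwen/Qwen3-32B": {"LoRA"},
    "Qwen/Qwen3-8B": {"LoRA", "FFT", "OPD"},
    "meta-llama/Llama-4-Scout-17B-16E-Instruct": {"LoRA"},
    "google/gemma-3-4b-it": {"LoRA", "FFT"},
    "openai/gpt-oss-20b": {"FFT"},
    "HuggingFaceTB/SmolLM2-135M-Instruct": {"LoRA", "FFT", "OPD"},
}


def models_supporting(*methods: str) -> list[str]:
    """Return the models (sorted by name) that support every requested method."""
    wanted = set(methods)
    return sorted(
        model for model, available in TRAINING_METHODS.items() if wanted <= available
    )


if __name__ == "__main__":
    # Models in this subset that allow both LoRA fine-tuning and OPD.
    print(models_supporting("LoRA", "OPD"))
```

Extending the dictionary with the remaining rows would make the lookup cover the full table; the entries should be refreshed whenever the list changes.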
INFERENCE (EVALUATION, SYNTHESIS, AND COMPLETIONS)
The following models are available for evaluation judges, data synthesis, and generating completions. You can unlock access to frontier models for inference by adding API keys for major providers.

| Model | Provider |
|---|---|
| Qwen/Qwen3.5-397B-A17B | oumi |
| Qwen/Qwen3.5-9B | oumi |
| Qwen/Qwen3-235B-A22B-Instruct-2507 | oumi |
| deepseek-ai/DeepSeek-V3.1 | oumi |
| moonshotai/Kimi-K2.5 | oumi |
| openai/gpt-oss-120b | oumi |
| zai-org/GLM-5.1 | oumi |
| zai-org/GLM-5 | oumi |
| zai-org/GLM-4.6 | oumi |
| meta-llama/Llama-3.3-70B-Instruct | oumi |
| Qwen/Qwen2.5-7B-Instruct | oumi |
| mistralai/Mixtral-8x7B-Instruct-v0.1 | oumi |
All models available as a training base can also be used for evaluation.