Lambda

May 8, 2026 ยท View on GitHub

Oumi Logo

Documentation Blog Twitter Discord PyPI version License Tests GPU Tests GitHub Repo stars Code style: black pre-commit About

Everything you need to build state-of-the-art foundation models, end-to-end

GitHub trending

๐Ÿ”ฅ News

Older updates

๐Ÿ”Ž About

Oumi is a fully open-source platform that streamlines the entire lifecycle of foundation models - from data preparation and training to evaluation and deployment. Whether you're developing on a laptop, launching large scale experiments on a cluster, or deploying models in production, Oumi provides the tools and workflows you need.

With Oumi, you can:

  • ๐Ÿš€ Train and fine-tune models from 10M to 405B parameters using state-of-the-art techniques (SFT, LoRA, QLoRA, GRPO, and more)
  • ๐Ÿค– Work with both text and multimodal models (Llama, DeepSeek, Qwen, Phi, and others)
  • ๐Ÿ”„ Synthesize and curate training data with LLM judges
  • โšก๏ธ Deploy models efficiently with popular inference engines (vLLM, SGLang)
  • ๐Ÿ“Š Evaluate models comprehensively across standard benchmarks
  • ๐ŸŒŽ Run anywhere - from laptops to clusters to clouds (AWS, Azure, GCP, Lambda, and more)
  • ๐Ÿ”Œ Integrate with both open models and commercial APIs (OpenAI, Anthropic, Vertex AI, Together, Parasail, ...)

All with one consistent API, production-grade reliability, and all the flexibility you need for research.

Learn more at oumi.ai, or jump right in with the quickstart guide.

๐Ÿš€ Getting Started

NotebookTry in ColabGoal
๐ŸŽฏ Getting Started: A TourOpen In ColabQuick tour of core features: training, evaluation, inference, and job management
๐Ÿ”ง Model Finetuning GuideOpen In ColabEnd-to-end guide to LoRA tuning with data prep, training, and evaluation
๐Ÿ“š Model DistillationOpen In ColabGuide to distilling large models into smaller, efficient ones
๐Ÿ“‹ Model EvaluationOpen In ColabComprehensive model evaluation using Oumi's evaluation framework
โ˜๏ธ Remote TrainingOpen In ColabLaunch and monitor training jobs on cloud (AWS, Azure, GCP, Lambda, etc.) platforms
๐Ÿ“ˆ LLM-as-a-JudgeOpen In ColabFilter and curate training data with built-in judges

๐Ÿ”ง Usage

Installation

Choose the installation method that works best for you:

Using pip (Recommended)
# Basic installation
uv pip install oumi

# With GPU support
uv pip install 'oumi[gpu]'

# Latest development version
uv pip install git+https://github.com/oumi-ai/oumi.git

Don't have uv? Install it or use pip instead.

Using Docker
# Pull the latest image
docker pull ghcr.io/oumi-ai/oumi:latest

# Run oumi commands
docker run --gpus all -it ghcr.io/oumi-ai/oumi:latest oumi --help

# Train with a mounted config
docker run --gpus all -v $(pwd):/workspace -it ghcr.io/oumi-ai/oumi:latest \
    oumi train --config /workspace/my_config.yaml
Quick Install Script (Experimental)

Try Oumi without setting up a Python environment. This installs Oumi in an isolated environment:

curl -LsSf https://oumi.ai/install.sh | bash

For more advanced installation options, see the installation guide.

Oumi CLI

You can quickly use the oumi command to train, evaluate, and infer models using one of the existing recipes:

# Training
oumi train -c configs/recipes/smollm/sft/135m/quickstart_train.yaml

# Evaluation
oumi evaluate -c configs/recipes/smollm/evaluation/135m/quickstart_eval.yaml

# Inference
oumi infer -c configs/recipes/smollm/inference/135m_infer.yaml --interactive

For more advanced options, see the training, evaluation, inference, and llm-as-a-judge guides.

Running Jobs Remotely

You can run jobs remotely on cloud platforms (AWS, Azure, GCP, Lambda, etc.) using the oumi launch command:

# GCP
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml

# AWS
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud aws

# Azure
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud azure

# Lambda
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud lambda

Note: Oumi is in beta and under active development. The core features are stable, but some advanced features might change as the platform improves.

๐Ÿ’ป Why use Oumi?

If you need a comprehensive platform for training, evaluating, or deploying models, Oumi is a great choice.

Here are some of the key features that make Oumi stand out:

  • ๐Ÿ”ง Zero Boilerplate: Get started in minutes with ready-to-use recipes for popular models and workflows. No need to write training loops or data pipelines.
  • ๐Ÿข Enterprise-Grade: Built and validated by teams training models at scale
  • ๐ŸŽฏ Research Ready: Perfect for ML research with easily reproducible experiments, and flexible interfaces for customizing each component.
  • ๐ŸŒ Broad Model Support: Works with most popular model architectures - from tiny models to the largest ones, text-only to multimodal.
  • ๐Ÿš€ SOTA Performance: Native support for distributed training techniques (FSDP, DeepSpeed, DDP) and optimized inference engines (vLLM, SGLang).
  • ๐Ÿค Community First: 100% open source with an active community. No vendor lock-in, no strings attached.

๐Ÿ“š Examples & Recipes

Explore the growing collection of ready-to-use configurations for state-of-the-art models and training workflows:

Note: These configurations are not an exhaustive list of what's supported, simply examples to get you started. You can find a more exhaustive list of supported models, and datasets (supervised fine-tuning, pre-training, preference tuning, and vision-language finetuning) in the oumi documentation.

Qwen Family

ModelExample Configurations
Qwen3-Next 80B A3BLoRA โ€ข Inference โ€ข Inference (Instruct) โ€ข Evaluation
Qwen3 30B A3BLoRA โ€ข Inference โ€ข Evaluation
Qwen3 32BLoRA โ€ข Inference โ€ข Evaluation
Qwen3 14BLoRA โ€ข Inference โ€ข Evaluation
Qwen3 8BFFT โ€ข Inference โ€ข Evaluation
Qwen3 4BFFT โ€ข Inference โ€ข Evaluation
Qwen3 1.7BFFT โ€ข Inference โ€ข Evaluation
Qwen3 0.6BFFT โ€ข Inference โ€ข Evaluation
QwQ 32BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference โ€ข Evaluation
Qwen2.5-VL 3BSFT โ€ข LoRAโ€ข Inference (vLLM) โ€ข Inference
Qwen2-VL 2BSFT โ€ข LoRA โ€ข Inference (vLLM) โ€ข Inference (SGLang) โ€ข Inference โ€ข Evaluation

๐Ÿ‹ DeepSeek R1 Family

ModelExample Configurations
DeepSeek R1 671BInference (Together AI)
Distilled Llama 8BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference โ€ข Evaluation
Distilled Llama 70BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference โ€ข Evaluation
Distilled Qwen 1.5BFFT โ€ข LoRA โ€ข Inference โ€ข Evaluation
Distilled Qwen 32BLoRA โ€ข Inference โ€ข Evaluation

๐Ÿฆ™ Llama Family

ModelExample Configurations
Llama 4 Scout Instruct 17BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference (vLLM) โ€ข Inference โ€ข Inference (Together.ai)
Llama 4 Scout 17BFFT
Llama 3.1 8BFFT โ€ข LoRA โ€ข QLoRA โ€ข Pre-training โ€ข Inference (vLLM) โ€ข Inference โ€ข Evaluation
Llama 3.1 70BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference โ€ข Evaluation
Llama 3.1 405BFFT โ€ข LoRA โ€ข QLoRA
Llama 3.2 1BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference (vLLM) โ€ข Inference (SGLang) โ€ข Inference โ€ข Evaluation
Llama 3.2 3BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference (vLLM) โ€ข Inference (SGLang) โ€ข Inference โ€ข Evaluation
Llama 3.3 70BFFT โ€ข LoRA โ€ข QLoRA โ€ข Inference (vLLM) โ€ข Inference โ€ข Evaluation
Llama 3.2 Vision 11BSFT โ€ข Inference (vLLM) โ€ข Inference (SGLang) โ€ข Evaluation

๐Ÿฆ… Falcon family

ModelExample Configurations
Falcon-H1FFT โ€ข Inference โ€ข Evaluation
Falcon-E (BitNet)FFT โ€ข DPO โ€ข Evaluation

๐Ÿ’Ž Gemma 3 Family

ModelExample Configurations
Gemma 3 4B InstructFFT โ€ข Inference โ€ข Evaluation
Gemma 3 12B InstructLoRA โ€ข Inference โ€ข Evaluation
Gemma 3 27B InstructLoRA โ€ข Inference โ€ข Evaluation

๐Ÿฆ‰ OLMo 3 Family

ModelExample Configurations
OLMo 3 7B InstructFFT โ€ข Inference โ€ข Evaluation
OLMo 3 32B InstructLoRA โ€ข Inference โ€ข Evaluation

๐ŸŽจ Vision Models

ModelExample Configurations
Llama 3.2 Vision 11BSFT โ€ข LoRA โ€ข Inference (vLLM) โ€ข Inference (SGLang) โ€ข Evaluation
LLaVA 7BSFT โ€ข Inference (vLLM) โ€ข Inference
Phi3 Vision 4.2BSFT โ€ข LoRA โ€ข Inference (vLLM)
Phi4 Vision 5.6BSFT โ€ข LoRA โ€ข Inference (vLLM) โ€ข Inference
Qwen2-VL 2BSFT โ€ข LoRA โ€ข Inference (vLLM) โ€ข Inference (SGLang) โ€ข Inference โ€ข Evaluation
Qwen3-VL 2BInference
Qwen3-VL 4BInference
Qwen3-VL 8BInference
Qwen2.5-VL 3BSFT โ€ข LoRAโ€ข Inference (vLLM) โ€ข Inference
SmolVLM-Instruct 2BSFT โ€ข LoRA

๐Ÿ” Even more options

This section lists all the language models that can be used with Oumi. Thanks to the integration with the ๐Ÿค— Transformers library, you can easily use any of these models for training, evaluation, or inference.

Models prefixed with a checkmark (โœ…) have been thoroughly tested and validated by the Oumi community, with ready-to-use recipes available in the configs/recipes directory.

๐Ÿ“‹ Click to see more supported models

Instruct Models

ModelSizePaperHF HubLicenseOpen 1
โœ… SmolLM-Instruct135M/360M/1.7BBlogHubApache 2.0โœ…
โœ… DeepSeek R1 Family1.5B/8B/32B/70B/671BBlogHubMITโŒ
โœ… Llama 3.1 Instruct8B/70B/405BPaperHubLicenseโŒ
โœ… Llama 3.2 Instruct1B/3BPaperHubLicenseโŒ
โœ… Llama 3.3 Instruct70BPaperHubLicenseโŒ
โœ… Phi-3.5-Instruct4B/14BPaperHubLicenseโŒ
โœ… Qwen30.6B-32BPaperHubLicenseโŒ
Qwen2.5-Instruct0.5B-70BPaperHubLicenseโŒ
OLMo 2 Instruct7BPaperHubApache 2.0โœ…
โœ… OLMo 3 Instruct7B/32BPaperHubApache 2.0โœ…
MPT-Instruct7BBlogHubApache 2.0โœ…
Command R35B/104BBlogHubLicenseโŒ
Granite-3.1-Instruct2B/8BPaperHubApache 2.0โŒ
Gemma 2 Instruct2B/9BBlogHubLicenseโŒ
โœ… Gemma 3 Instruct4B/12B/27BBlogHubLicenseโŒ
DBRX-Instruct130B MoEBlogHubApache 2.0โŒ
Falcon-Instruct7B/40BPaperHubApache 2.0โŒ
โœ… Llama 4 Scout Instruct17B (Activated) 109B (Total)PaperHubLicenseโŒ
โœ… Llama 4 Maverick Instruct17B (Activated) 400B (Total)PaperHubLicenseโŒ

Vision-Language Models

ModelSizePaperHF HubLicenseOpen
โœ… Llama 3.2 Vision11BPaperHubLicenseโŒ
โœ… LLaVA-1.57BPaperHubLicenseโŒ
โœ… Phi-3 Vision4.2BPaperHubLicenseโŒ
โœ… BLIP-23.6BPaperHubMITโŒ
โœ… Qwen2-VL2BBlogHubLicenseโŒ
โœ… Qwen3-VL2B/4B/8BBlogHubLicenseโŒ
โœ… SmolVLM-Instruct2BBlogHubApache 2.0โœ…

Base Models

ModelSizePaperHF HubLicenseOpen
โœ… SmolLM2135M/360M/1.7BBlogHubApache 2.0โœ…
โœ… Llama 3.21B/3BPaperHubLicenseโŒ
โœ… Llama 3.18B/70B/405BPaperHubLicenseโŒ
โœ… GPT-2124M-1.5BPaperHubMITโœ…
DeepSeek V27B/13BBlogHubLicenseโŒ
Gemma22B/9BBlogHubLicenseโŒ
GPT-J6BBlogHubApache 2.0โœ…
GPT-NeoX20BPaperHubApache 2.0โœ…
Mistral7BPaperHubApache 2.0โŒ
Mixtral8x7B/8x22BBlogHubApache 2.0โŒ
MPT7BBlogHubApache 2.0โœ…
OLMo1B/7BPaperHubApache 2.0โœ…
โœ… Llama 4 Scout17B (Activated) 109B (Total)PaperHubLicenseโŒ

Reasoning Models

ModelSizePaperHF HubLicenseOpen
โœ… gpt-oss20B/120BPaperHubApache 2.0โŒ
โœ… Qwen30.6B-32BPaperHubLicenseโŒ
โœ… Qwen3-Next80B-A3BBlogHubLicenseโŒ
Qwen QwQ32BBlogHubLicenseโŒ

Code Models

ModelSizePaperHF HubLicenseOpen
โœ… Qwen2.5 Coder0.5B-32BBlogHubLicenseโŒ
DeepSeek Coder1.3B-33BPaperHubLicenseโŒ
StarCoder 23B/7B/15BPaperHubLicenseโœ…

Math Models

ModelSizePaperHF HubLicenseOpen
DeepSeek Math7BPaperHubLicenseโŒ

๐Ÿ“– Documentation

To learn more about all the platform's capabilities, see the Oumi documentation.

๐Ÿค Join the Community

Oumi is a community-first effort. Whether you are a developer, a researcher, or a non-technical user, all contributions are very welcome!

  • To contribute to the oumi repository, please check the CONTRIBUTING.md for guidance on how to contribute to send your first Pull Request.
  • Make sure to join our Discord community to get help, share your experiences, and contribute to the project!
  • If you are interested in joining one of the community's open-science efforts, check out our open collaboration page.

๐Ÿ™ Acknowledgements

Oumi makes use of several libraries and tools from the open-source community. We would like to acknowledge and deeply thank the contributors of these projects! โœจ ๐ŸŒŸ ๐Ÿ’ซ

๐Ÿ“ Citation

If you find Oumi useful in your research, please consider citing it:

@software{oumi2025,
  author = {Oumi Community},
  title = {Oumi: an Open, End-to-end Platform for Building Large Foundation Models},
  month = {January},
  year = {2025},
  url = {https://github.com/oumi-ai/oumi}
}

๐Ÿ“œ License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

Footnotes

  1. Open models are defined as models with fully open weights, training code, and data, and a permissive license. See Open Source Definitions for more information. โ†ฉ