TRL
<p>
Fine-tune Llama 3.1 8B with SFT and QLoRA, single-node or distributed across multiple nodes.
</p>
</a>
<a href="/docs/examples/training/axolotl"
class="feature-cell">
<h3>
Axolotl
</h3>
<p>
Fine-tune Llama models with FSDP and QLoRA, single-node or distributed across multiple nodes.
</p>
</a>
<a href="/docs/examples/training/ray-ragen"
class="feature-cell">
<h3>
Ray+RAGEN
</h3>
<p>
Fine-tune an agent on multiple nodes with RAGEN, verl, and Ray.
</p>
</a>
<a href="/docs/examples/training/miles"
class="feature-cell">
<h3>
Miles
</h3>
<p>
RL-fine-tune Qwen2.5-32B with Miles.
</p>
</a>