Puzzletron Pruning

April 30, 2026 · View on GitHub

Distillation results for models compressed with Puzzletron MIP-based heterogeneous pruning, followed by Megatron-Bridge knowledge distillation.

Results

ModelFile
Llama-3.1-8B-Instruct and Qwen3-8BLlama-3.1-8B-Instruct.md

Related

  • Puzzletron pruning example
  • Megatron-Bridge distillation instructions
  • Megatron dataset tokenization
  • Minitron pruning instructions

Contents

  1. 1Results
  2. 2Related