llama.cpp Builder

April 28, 2026

Prebuilt binaries for CUDA and Vulkan on Linux

This repo builds prebuilt llama.cpp libraries and executables for platform configurations that the upstream llama.cpp release builds do not already cover, such as Linux with CUDA or Vulkan support, and Linux arm64 with CPU or Vulkan support.

New releases are automatically built for the latest release version of llama.cpp. The latest release is checked once per hour.

Used by the yzma installer. yzma lets you write Go applications that directly integrate the latest llama.cpp libraries.

CUDA

Currently supported CUDA build configurations:

| CPU arch | OS | CUDA | Nvidia Compute arch |
| --- | --- | --- | --- |
| amd64 | Ubuntu 24.04 | 12.9 | 86, 89 |
| amd64 | Ubuntu 24.04 | 13.0.88 | 86, 89 |
| arm64 | Ubuntu 22.04 | 12.9 | 87 |
| arm64 | Ubuntu 22.04 | 13.0.88 | 87 |

Compute architectures 86 (Ampere, e.g. GeForce RTX 30 series) and 89 (Ada Lovelace, e.g. GeForce RTX 40 series) are those used by consumer video cards.

Compute architecture 87 is used by the Jetson Orin family, including Jetson AGX Orin.
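To see whether a given machine matches one of the configurations in the table above, the compute capability reported by the driver can be compared against the listed values. The following is a minimal sketch, not part of this repo: the function name `cuda_build_supported` is hypothetical, and the commented usage assumes a recent NVIDIA driver whose `nvidia-smi` supports the `compute_cap` query field.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): returns 0 if the
# (CPU arch, compute capability) pair appears in the CUDA table above,
# and 1 otherwise. It only checks for an exact match with the table.
cuda_build_supported() {
    arch="$1"                              # "amd64" or "arm64"
    cap=$(printf '%s' "$2" | tr -d '.')    # "8.6" -> "86"
    case "$arch:$cap" in
        amd64:86|amd64:89) return 0 ;;
        arm64:87)          return 0 ;;
        *)                 return 1 ;;
    esac
}

# Example usage (requires an NVIDIA driver, so it is left commented out):
#   cap=$(nvidia-smi --query-gpu=compute_cap --format=csv,noheader | head -n 1)
#   cuda_build_supported amd64 "$cap" && echo "matching prebuilt CUDA build listed"
```

A GPU with a compute capability that is not listed may still work with one of these builds, but this sketch makes no claim about that.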

Vulkan

Currently supported Vulkan build configurations:

| CPU arch | OS | Vulkan |
| --- | --- | --- |
| arm64 | Ubuntu 22.04/Debian Bookworm | 1.4.328.1 |
| arm64 | Ubuntu 24.04/Debian Trixie | 1.4.328.1 |

The prebuilt Vulkan SDK for ARM64 used for our builds comes from https://github.com/jakoch/vulkan-sdk-arm

Thank you!

CPU

Currently supported CPU build configurations:

| CPU arch | OS |
| --- | --- |
| arm64 | Ubuntu 22.04/Debian Bookworm |
| arm64 | Ubuntu 24.04/Debian Trixie |
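The tables in this document use the names `amd64` and `arm64`, while `uname -m` on Linux typically reports `x86_64` and `aarch64`. A small sketch of mapping one to the other when picking a build; the function name `normalize_arch` is hypothetical and not something this repo provides.

```shell
#!/bin/sh
# Hypothetical helper: map a machine name as reported by `uname -m`
# to the CPU arch label used in the tables of this document.
normalize_arch() {
    case "$1" in
        x86_64|amd64)  echo amd64 ;;
        aarch64|arm64) echo arm64 ;;
        *)             echo "unsupported architecture: $1" >&2; return 1 ;;
    esac
}

# Example usage:
#   arch=$(normalize_arch "$(uname -m)") || exit 1
```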

How to check the latest version

VERSION=$(curl -s https://hybridgroup.github.io/llama-cpp-builder/version.json | jq -r '.tag_name')
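In a script, it can help to guard against a failed download or a missing `tag_name` before acting on the result. The sketch below wraps the command above under those assumptions; `latest_version` and `needs_update` are hypothetical helper names, and `b1234` is only a placeholder for whatever version is installed locally.

```shell
#!/bin/sh
VERSION_URL="https://hybridgroup.github.io/llama-cpp-builder/version.json"

# Fetch the latest tag. -f makes curl fail on HTTP errors; -sS keeps it
# quiet while still printing error messages. Needs curl and jq.
latest_version() {
    curl -fsS "$VERSION_URL" | jq -r '.tag_name'
}

# Hypothetical helper: returns 0 only when a usable latest tag was found
# and it differs from the installed one. An empty string or the literal
# "null" (what jq -r prints for a missing field) counts as "no answer".
needs_update() {
    installed="$1"
    latest="$2"
    [ -n "$latest" ] && [ "$latest" != "null" ] && [ "$installed" != "$latest" ]
}

# Example usage (needs network access, so it is left commented out):
#   latest=$(latest_version)
#   if needs_update "b1234" "$latest"; then echo "new build available: $latest"; fi
```

Because the exit status of the pipeline is that of `jq`, a failed `curl` shows up as an empty result, which is why `needs_update` checks for it.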