Open Source

llama.cpp b9822 Release Expands Platform Support

llama.cpp ReleasesJune 28, 2026high confidence

Why it matters

→Expands platform support, particularly for AMD GPU users with ROCm 7.2.
→Ensures broad compatibility across macOS, Windows, and Linux.
→Reinforces llama.cpp's versatility as an inference runtime.

The b9822 release of llama.cpp has been announced, focusing on expanding platform support rather than introducing new features. This update includes support for Ubuntu x64 with ROCm 7.2, enhancing options for AMD GPU users. The release also covers a wide array of platforms, including macOS, Windows, and Linux, ensuring broad compatibility. While there are no new models or quantization methods, the release strengthens llama.cpp's role as a versatile tool for developers across various systems.

Read original

More from llama.cpp Releases

Models & Labsmodels

llama.cpp b9817 release enhances OpenVINO support

The latest b9817 release of llama.cpp brings significant updates to its OpenVINO backend, including an upgrade to OV 2026.2.1 and the introduction of self-contained release packages. These changes streamline the deployment process and improve operator handling, making it easier for developers to integrate and utilize OpenVINO in their projects. Additionally, the update removes hardcoded compute operation types, enhancing flexibility and adaptability. This release marks a step forward in making llama.cpp a more versatile and developer-friendly platform, particularly for those leveraging OpenVINO's capabilities.

llama.cpp ReleasesJun 28, 2026

Models & Labsmodels

llama.cpp b9820 Release Enhances CUDA Performance

The b9820 release of llama.cpp brings notable improvements to CUDA performance by cutting down on unnecessary synchronizations, which can streamline token processing. This update introduces asynchronous copy capabilities between CPU and CUDA, facilitating smoother data transfers and potentially speeding up computations. Backend detection has been refined to avoid linking conflicts, and synchronization adjustments have been made more general, allowing other backends like Vulkan to benefit. These enhancements aim to optimize performance across different hardware setups, making llama.cpp a more adaptable tool for developers working with diverse configurations.

llama.cpp ReleasesJun 28, 2026

Open Sourcemodels

llama.cpp b9821 Release Expands Platform Support

The latest b9821 release of llama.cpp enhances user interaction with new command-line options like --version, --licenses, and --help. This update significantly broadens platform compatibility, adding support for Vulkan and ROCm 7.2 on Ubuntu, and CUDA 12 and 13 on Windows. Although KleidiAI support is currently disabled for macOS Apple Silicon, the release still caters to numerous operating systems and architectures. This update underscores llama.cpp's commitment to making its tools more accessible and functional for developers across different computing environments.

llama.cpp ReleasesJun 28, 2026

More in Open Source

Open Sourcemodels

Krea 2 Releases Open Weights

Krea 2 has made its model weights open, allowing broader access.

Matt WolfeJun 26, 2026

Open Sourcecoding

Hugging Face Automates Weekly Releases with AI

Hugging Face has streamlined its release process for the huggingface_hub Python client, moving from a 4-6 week cycle to weekly releases. This shift is powered by a combination of open-source tools and AI, which drafts release notes and automates mechanical tasks, while humans oversee critical judgment areas. The process is designed to be replicable by other maintainers, emphasizing transparency and adaptability. This change not only accelerates the release cycle but also ensures that updates are consistently delivered without the need for proprietary tools.

Hugging Face BlogJun 23, 2026

Open Sourcemodels

PewDiePie Builds Private AI Workspace

PewDiePie has invested $41,000 in creating a private, self-hosted AI workspace using open-source tools.

Matt WolfeJun 22, 2026