Open Source

llama.cpp b9596 Release Expands Platform Support

llama.cpp Releases · June 12, 2026 · high confidence

Why it matters

→ Expands platform support, enhancing accessibility for AMD GPU users.
→ Reduces performance disparity between AMD and NVIDIA GPUs.
→ Increases llama.cpp's versatility across diverse hardware configurations.

The b9596 release of llama.cpp expands platform support, including ROCm 7.2 support for Ubuntu x64, which improves usability for AMD GPU users. The update aims to narrow the performance gap between AMD and NVIDIA GPUs and to make llama.cpp accessible on more systems. Some features, such as KleidiAI on macOS, remain disabled, but the release is still a step forward in platform compatibility and lets developers explore performance on a wider range of hardware configurations.

More from llama.cpp Releases

Models & Labs · models

llama.cpp adds GLM-5.2 speculative decoding support

The latest llama.cpp update adds speculative decoding support for GLM-5.2 through its NextN/MTP features. The addition allows more efficient tensor loading and context management, particularly for models using the GLM_DSA architecture. Models can also be exported with or without the MTP feature, which gives developers flexibility. The release is a step forward in model performance and adaptability, especially for those working with GLM-5.2.

llama.cpp Releases · Jul 30, 2026

Open Source · models

llama.cpp b10175 Release Expands Platform Support

The latest b10175 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile tool for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for Windows, macOS, and Linux, ensuring that developers can leverage llama.cpp's capabilities regardless of their hardware setup. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime option.

llama.cpp Releases · Jul 30, 2026

Open Source · models

llama.cpp b10176 Release Expands Platform Support

The b10176 release of llama.cpp enhances its platform reach, notably adding ROCm 7.2 support on Ubuntu x64, which is a significant boost for AMD GPU users. This update continues to cater to a wide array of systems, from macOS to Windows and Linux, ensuring developers can deploy llama.cpp across various hardware setups. While there are no groundbreaking new features, the release solidifies llama.cpp's role as a flexible tool for AI inference. By improving compatibility and functionality, this update makes llama.cpp more accessible and practical for developers working with different systems.

llama.cpp Releases · Jul 30, 2026

More in Open Source

Open Source · models

Alibaba Announces Qwen3.8 Open-Weight Release

Alibaba plans to release open weights for its Qwen3.8 model.

Matt Wolfe · Jul 24, 2026

Open Source · agents

Grabette: Open System for Robot Data Collection

Grabette is a new open-source system designed to simplify the collection of robot manipulation data. Using a handheld gripper equipped with cameras, it lets users record tasks without a robot or lab setup. This democratizes data collection, enabling anyone to contribute to a large, collaborative dataset. The system is built on standard, readily available components, which makes widespread use practical. The release aims to address the data bottleneck in robot learning by encouraging community participation in building diverse datasets.

Hugging Face Blog · Jul 21, 2026

Open Source · research

Nous Research Secures $75 Million Funding

Nous Research, an open-source AI lab, has raised $75 million at a $1.5 billion valuation.

Lev Selector · Jul 17, 2026