Open Source

llama.cpp b9505 release expands platform support

llama.cpp ReleasesJune 5, 2026high confidence

Why it matters

→Expands GPU support with CUDA and ROCm, enhancing performance options.
→Continues to broaden platform compatibility, making it more versatile.
→Some features remain disabled, indicating areas for future development.

The latest b9505 release of llama.cpp introduces expanded platform support, particularly for Windows and Ubuntu users. Notably, Windows now supports both CUDA 12 and 13, enhancing its GPU capabilities, while Ubuntu users benefit from ROCm 7.2 integration. However, some features like KleidiAI on macOS Apple Silicon remain disabled, suggesting areas still under development. This release highlights llama.cpp's ongoing efforts to cater to a diverse range of hardware configurations, though certain limitations persist.

Read original

More from llama.cpp Releases

Models & Labsmodels

Llama.cpp adds GLM-5.2 speculative decoding support

Llama.cpp's latest update introduces speculative decoding support for GLM-5.2, enhancing its capabilities with NextN/MTP features. This addition allows for more efficient tensor loading and context management, particularly benefiting models using the GLM_DSA architecture. The update also includes options for exporting models with or without the MTP feature, providing flexibility for developers. This release marks a step forward in optimizing model performance and adaptability, especially for those leveraging the GLM-5.2 framework.

llama.cpp ReleasesJul 30, 2026

Open Sourcemodels

llama.cpp b10175 Release Expands Platform Support

The latest b10175 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile tool for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for Windows, macOS, and Linux, ensuring that developers can leverage llama.cpp's capabilities regardless of their hardware setup. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime option.

llama.cpp ReleasesJul 30, 2026

Open Sourcemodels

llama.cpp b10176 Release Expands Platform Support

The b10176 release of llama.cpp enhances its platform reach, notably adding ROCm 7.2 support on Ubuntu x64, which is a significant boost for AMD GPU users. This update continues to cater to a wide array of systems, from macOS to Windows and Linux, ensuring developers can deploy llama.cpp across various hardware setups. While there are no groundbreaking new features, the release solidifies llama.cpp's role as a flexible tool for AI inference. By improving compatibility and functionality, this update makes llama.cpp more accessible and practical for developers working with different systems.

llama.cpp ReleasesJul 30, 2026

More in Open Source

Open Sourcemodels

Alibaba Announces Qwen3.8 Open-Weight Release

Alibaba plans to release open weights for its Qwen3.8 model.

Matt WolfeJul 24, 2026

Open Sourceagents

Grabette: Open System for Robot Data Collection

Grabette is a new open-source system designed to simplify the collection of robot manipulation data. By using a handheld gripper equipped with cameras, it allows users to record tasks without needing a robot or lab setup. This democratizes data collection, enabling anyone to contribute to a large, collaborative dataset. The system is built on standard, easily accessible components, making it accessible for widespread use. This release aims to address the data bottleneck in robot learning by encouraging community participation in building diverse datasets.

Hugging Face BlogJul 21, 2026

Nous Research Secures $75 Million Funding

Investment · $75 Mn

Open Sourceresearch

Nous Research Secures $75 Million Funding

Nous Research, an open-source AI lab, has raised $75 million at a $1.5 billion valuation.

Lev SelectorJul 17, 2026