Open Source

llama.cpp b9140 Release Expands Platform Support

llama.cpp ReleasesMay 14, 2026high confidence

Why it matters

→Expands llama.cpp's usability across diverse hardware platforms.
→Enhances accessibility for developers working with different systems.
→Focuses on platform compatibility rather than new model introductions.

The b9140 release of llama.cpp has been announced, expanding its support across multiple platforms. This update includes compatibility with macOS Apple Silicon, Ubuntu with ROCm 7.2, and Windows with CUDA 12 and 13, among others. The release aims to enhance accessibility for developers by supporting a variety of hardware configurations, including Vulkan and SYCL. While no new models are introduced, the focus is on broadening the tool's usability for AI inference tasks.

Read original

More from llama.cpp Releases

Models & Labsmodels

Llama.cpp adds GLM-5.2 speculative decoding support

Llama.cpp's latest update introduces speculative decoding support for GLM-5.2, enhancing its capabilities with NextN/MTP features. This addition allows for more efficient tensor loading and context management, particularly benefiting models using the GLM_DSA architecture. The update also includes options for exporting models with or without the MTP feature, providing flexibility for developers. This release marks a step forward in optimizing model performance and adaptability, especially for those leveraging the GLM-5.2 framework.

llama.cpp ReleasesJul 30, 2026

Open Sourcemodels

llama.cpp b10175 Release Expands Platform Support

The latest b10175 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile tool for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for Windows, macOS, and Linux, ensuring that developers can leverage llama.cpp's capabilities regardless of their hardware setup. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime option.

llama.cpp ReleasesJul 30, 2026

Open Sourcemodels

llama.cpp b10176 Release Expands Platform Support

The b10176 release of llama.cpp enhances its platform reach, notably adding ROCm 7.2 support on Ubuntu x64, which is a significant boost for AMD GPU users. This update continues to cater to a wide array of systems, from macOS to Windows and Linux, ensuring developers can deploy llama.cpp across various hardware setups. While there are no groundbreaking new features, the release solidifies llama.cpp's role as a flexible tool for AI inference. By improving compatibility and functionality, this update makes llama.cpp more accessible and practical for developers working with different systems.

llama.cpp ReleasesJul 30, 2026

More in Open Source

Open Sourcemodels

Alibaba Announces Qwen3.8 Open-Weight Release

Alibaba plans to release open weights for its Qwen3.8 model.

Matt WolfeJul 24, 2026

Open Sourceagents

Grabette: Open System for Robot Data Collection

Grabette is a new open-source system designed to simplify the collection of robot manipulation data. By using a handheld gripper equipped with cameras, it allows users to record tasks without needing a robot or lab setup. This democratizes data collection, enabling anyone to contribute to a large, collaborative dataset. The system is built on standard, easily accessible components, making it accessible for widespread use. This release aims to address the data bottleneck in robot learning by encouraging community participation in building diverse datasets.

Hugging Face BlogJul 21, 2026

Nous Research Secures $75 Million Funding

Investment · $75 Mn

Open Sourceresearch

Nous Research Secures $75 Million Funding

Nous Research, an open-source AI lab, has raised $75 million at a $1.5 billion valuation.

Lev SelectorJul 17, 2026