Open Source

llama.cpp b9144 Release Expands Platform Support

llama.cpp ReleasesMay 14, 2026high confidence

Why it matters

→Expands platform support, making llama.cpp more versatile for developers.
→Optimizes performance for specific hardware configurations, enhancing efficiency.
→Strengthens llama.cpp's position as a tool for diverse hardware environments.

The b9144 release of llama.cpp introduces targeted optimizations for hardware configurations, particularly in the ggml-webgpu component. This update ensures that subgroup-matrix paths are used efficiently, only when head dimensions are divisible by specific parameters. The release also expands platform support, including macOS, Linux, Windows, and Android, with notable enhancements for Apple Silicon, Vulkan, and CUDA environments. These improvements make llama.cpp a more versatile tool for developers working with various hardware setups.

Read original

More from llama.cpp Releases

Models & Labsmodels

Llama.cpp adds GLM-5.2 speculative decoding support

Llama.cpp's latest update introduces speculative decoding support for GLM-5.2, enhancing its capabilities with NextN/MTP features. This addition allows for more efficient tensor loading and context management, particularly benefiting models using the GLM_DSA architecture. The update also includes options for exporting models with or without the MTP feature, providing flexibility for developers. This release marks a step forward in optimizing model performance and adaptability, especially for those leveraging the GLM-5.2 framework.

llama.cpp ReleasesJul 30, 2026

Open Sourcemodels

llama.cpp b10175 Release Expands Platform Support

The latest b10175 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile tool for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for Windows, macOS, and Linux, ensuring that developers can leverage llama.cpp's capabilities regardless of their hardware setup. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime option.

llama.cpp ReleasesJul 30, 2026

Open Sourcemodels

llama.cpp b10176 Release Expands Platform Support

The b10176 release of llama.cpp enhances its platform reach, notably adding ROCm 7.2 support on Ubuntu x64, which is a significant boost for AMD GPU users. This update continues to cater to a wide array of systems, from macOS to Windows and Linux, ensuring developers can deploy llama.cpp across various hardware setups. While there are no groundbreaking new features, the release solidifies llama.cpp's role as a flexible tool for AI inference. By improving compatibility and functionality, this update makes llama.cpp more accessible and practical for developers working with different systems.

llama.cpp ReleasesJul 30, 2026

More in Open Source

Open Sourcemodels

Alibaba Announces Qwen3.8 Open-Weight Release

Alibaba plans to release open weights for its Qwen3.8 model.

Matt WolfeJul 24, 2026

Open Sourceagents

Grabette: Open System for Robot Data Collection

Grabette is a new open-source system designed to simplify the collection of robot manipulation data. By using a handheld gripper equipped with cameras, it allows users to record tasks without needing a robot or lab setup. This democratizes data collection, enabling anyone to contribute to a large, collaborative dataset. The system is built on standard, easily accessible components, making it accessible for widespread use. This release aims to address the data bottleneck in robot learning by encouraging community participation in building diverse datasets.

Hugging Face BlogJul 21, 2026

Nous Research Secures $75 Million Funding

Investment · $75 Mn

Open Sourceresearch

Nous Research Secures $75 Million Funding

Nous Research, an open-source AI lab, has raised $75 million at a $1.5 billion valuation.

Lev SelectorJul 17, 2026