Open Source

llama.cpp b9503 release focuses on Gemma 4 fix

llama.cpp Releases · June 5, 2026 · high confidence

Why it matters

→ Fixes the embedding size used for the Gemma 4 audio projector.
→ Covers macOS, Linux, and Windows.
→ Reflects ongoing work to refine and stabilize the software.

The b9503 release of llama.cpp fixes an issue with the embedding size of the Gemma 4 audio projector. The change removes projection_dim from clip_n_mmproj_embd, a targeted correction to how the projector's embedding size is determined rather than a new capability. The release also covers multiple operating systems, including macOS, Linux, and Windows. It introduces no new features and is part of the project's continuing stability work.

More from llama.cpp Releases

Models & Labs · models

llama.cpp adds GLM-5.2 speculative decoding support

The latest llama.cpp update adds speculative decoding support for GLM-5.2 through NextN/MTP (multi-token prediction). The addition allows for more efficient tensor loading and context management, particularly for models using the GLM_DSA architecture. Models can be exported with or without MTP, giving developers flexibility. The release is a step toward better model performance and adaptability, especially for those running GLM-5.2.

llama.cpp Releases · Jul 30, 2026

Open Source · models

llama.cpp b10175 Release Expands Platform Support

The latest b10175 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile tool for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for Windows, macOS, and Linux, ensuring that developers can leverage llama.cpp's capabilities regardless of their hardware setup. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime option.

llama.cpp Releases · Jul 30, 2026

Open Source · models

llama.cpp b10176 Release Expands Platform Support

The b10176 release of llama.cpp enhances its platform reach, notably adding ROCm 7.2 support on Ubuntu x64, which is a significant boost for AMD GPU users. This update continues to cater to a wide array of systems, from macOS to Windows and Linux, ensuring developers can deploy llama.cpp across various hardware setups. While there are no groundbreaking new features, the release solidifies llama.cpp's role as a flexible tool for AI inference. By improving compatibility and functionality, this update makes llama.cpp more accessible and practical for developers working with different systems.

llama.cpp Releases · Jul 30, 2026

More in Open Source

Open Source · models

Alibaba Announces Qwen3.8 Open-Weight Release

Alibaba plans to release open weights for its Qwen3.8 model.

Matt Wolfe · Jul 24, 2026

Open Source · agents

Grabette: Open System for Robot Data Collection

Grabette is a new open-source system designed to simplify the collection of robot manipulation data. Using a handheld gripper equipped with cameras, it lets users record tasks without a robot or lab setup, so anyone can contribute to a large, collaborative dataset. The system is built from standard, readily available components, which keeps it within reach of a wide range of users. The release aims to address the data bottleneck in robot learning by encouraging community participation in building diverse datasets.

Hugging Face Blog · Jul 21, 2026

Open Source · research

Nous Research Secures $75 Million Funding

Nous Research, an open-source AI lab, has raised $75 million at a $1.5 billion valuation.

Lev Selector · Jul 17, 2026