The b9128 release of llama.cpp introduces optimizations for the Hexagon backend, focusing on eliminating scalar VTCM loads through HVX splat helpers. The update also improves support for macOS, including Apple Silicon builds with KleidiAI enabled, and extends compatibility across platforms such as Windows and Linux. Other improvements include optimized per-group scale handling and slope loads from VTCM. Together, these changes aim to boost performance and efficiency, making llama.cpp more adaptable for developers working across varied hardware setups.
The b9116 release of llama.cpp introduces MiMo v2.5, enhancing vision support with a fused QKV projection for improved performance. The update fixes an earlier f16 vision overflow issue and includes assorted cleanups for easier code maintenance. With builds covering macOS, Linux, and Windows, the release broadens accessibility for developers on diverse systems. The focus on vision capabilities marks a meaningful step toward making llama.cpp a more versatile tool for AI developers, particularly those integrating vision functionality.
The latest b9118 release of llama.cpp continues its trend of broadening platform compatibility, now including support for a wide array of systems such as macOS, Linux, Windows, and Android. Notably, this update introduces Vulkan support on Ubuntu and Windows, alongside ROCm 7.2 for AMD GPUs, which is a significant step for users seeking alternatives to NVIDIA's CUDA. The inclusion of KleidiAI on Apple Silicon further enhances performance for M-series Macs. While there are no new model architectures, this release solidifies llama.cpp's position as a versatile inference runtime across diverse hardware configurations.