The latest b9119 release of llama.cpp fixes a performance regression on Windows for Intel GPU BF16 workloads, particularly affecting Xe2 and newer architectures. This update is important for users relying on the Vulkan backend, as it restores expected performance levels. The release also includes a refactor so that l_warptile is used only when coopmat is available for BF16. It underscores llama.cpp's ongoing effort to improve performance across hardware platforms.
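The gating described above, using the large warptile only when cooperative-matrix support is present for BF16, can be sketched roughly as follows. The names (`DeviceCaps`, `select_warptile`) are illustrative assumptions, not llama.cpp's actual Vulkan backend code:

```python
# Hedged sketch: pick a Vulkan warptile layout based on whether the device
# exposes cooperative-matrix (coopmat) support for BF16. Hypothetical names.
from dataclasses import dataclass

@dataclass
class DeviceCaps:
    coopmat_bf16: bool  # device supports cooperative-matrix ops for BF16

def select_warptile(caps: DeviceCaps, dtype: str) -> str:
    # Use l_warptile only when coopmat is available for BF16; otherwise
    # fall back to the default tile, avoiding the regression path.
    if dtype == "bf16" and not caps.coopmat_bf16:
        return "default_tile"
    return "l_warptile"

print(select_warptile(DeviceCaps(coopmat_bf16=False), "bf16"))  # default_tile
print(select_warptile(DeviceCaps(coopmat_bf16=True), "bf16"))   # l_warptile
```

The point of the design is that a tile shape tuned for coopmat hardware can be slower than the default path when the extension is absent, so the choice must be capability-gated rather than unconditional.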
The latest b9116 release of llama.cpp introduces MiMo v2.5, enhancing vision support with fused qkv for improved performance. This update fixes an earlier f16 vision overflow issue and includes various cleanups for better code maintenance. With builds for macOS, Linux, and Windows, the release broadens accessibility for developers working on diverse systems. The focus on vision capabilities marks a significant step toward making llama.cpp a more versatile tool for AI developers, particularly those integrating vision functionality.
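A fused qkv projection, as mentioned above, computes the query, key, and value projections with one matrix multiply over a stacked weight matrix instead of three separate ones, which lets the backend issue a single larger GEMM. A minimal pure-Python sketch (the helper names and tiny matrices are illustrative, not llama.cpp's implementation):

```python
# Hedged sketch of a fused QKV projection: one combined weight matrix
# produces Q, K, V in a single matmul, then the result is split.

def matmul(a, b):
    # a: m x k, b: k x n -> m x n (naive triple loop for illustration)
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def fused_qkv(x, w_qkv, d):
    # w_qkv stacks W_q, W_k, W_v column-wise: k x (3*d)
    out = matmul(x, w_qkv)
    q = [row[:d] for row in out]
    k = [row[d:2 * d] for row in out]
    v = [row[2 * d:] for row in out]
    return q, k, v

# Tiny example: 1 token, hidden size 2, head dim 1
x = [[1.0, 2.0]]
w_qkv = [[1.0, 0.0, 1.0],
         [0.0, 1.0, 1.0]]
q, k, v = fused_qkv(x, w_qkv, 1)
print(q, k, v)  # [[1.0]] [[2.0]] [[3.0]]
```

The fused result is identical to running the three projections separately; the benefit is purely in kernel launch and memory-traffic efficiency, which matters most for the many small matmuls in a vision encoder.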
The latest b9118 release of llama.cpp continues to broaden platform compatibility, with builds for macOS, Linux, Windows, and Android. Notably, this update adds Vulkan builds for Ubuntu and Windows, alongside ROCm 7.2 support for AMD GPUs, a significant step for users seeking alternatives to NVIDIA's CUDA. KleidiAI support on Apple Silicon further improves performance on M-series Macs. While there are no new model architectures, this release solidifies llama.cpp's position as a versatile inference runtime across diverse hardware configurations.