The b9145 release of llama.cpp fixes memory allocation issues in the SYCL backend on multi-GPU systems. By switching from sycl::malloc_device to Level Zero's zeMemAllocDevice, the update significantly reduces system RAM usage, preventing out-of-memory crashes on setups such as a dual Intel Arc Pro B70 machine, without sacrificing performance. The release also bundles assorted improvements and bug fixes that further stabilize the SYCL backend.
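As a rough illustration of what such a switch looks like, here is a minimal sketch of allocating device memory through the Level Zero API from a SYCL queue instead of calling sycl::malloc_device. This is not the actual llama.cpp patch: it assumes a DPC++ toolchain with Level Zero interop headers available, and the alloc_device_l0 helper name is hypothetical.

```cpp
// Illustrative sketch, not the actual llama.cpp change: allocate device
// memory directly through Level Zero rather than via sycl::malloc_device.
#include <sycl/sycl.hpp>
#include <sycl/ext/oneapi/backend/level_zero.hpp>
#include <level_zero/ze_api.h>
#include <cstdio>

void *alloc_device_l0(sycl::queue &q, size_t size) {
    // Unwrap the native Level Zero handles behind the SYCL queue.
    auto ze_ctx = sycl::get_native<sycl::backend::ext_oneapi_level_zero>(q.get_context());
    auto ze_dev = sycl::get_native<sycl::backend::ext_oneapi_level_zero>(q.get_device());

    ze_device_mem_alloc_desc_t desc{};
    desc.stype   = ZE_STRUCTURE_TYPE_DEVICE_MEM_ALLOC_DESC;
    desc.ordinal = 0; // default device memory ordinal

    void *ptr = nullptr;
    // Device-only allocation; per the release notes, routing the allocation
    // through Level Zero is what avoids the system-RAM growth seen with the
    // previous sycl::malloc_device path on multi-GPU machines.
    ze_result_t res = zeMemAllocDevice(ze_ctx, &desc, size, /*alignment=*/64, ze_dev, &ptr);
    if (res != ZE_RESULT_SUCCESS) {
        fprintf(stderr, "zeMemAllocDevice failed: 0x%x\n", res);
        return nullptr;
    }
    // The matching free is zeMemFree(ze_ctx, ptr).
    return ptr;
}
```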
Llama.cpp's latest release adds a non-backtracking tokenizer handler designed for Qwen3.5. The update significantly improves Unicode tokenization, fixing stack overflow crashes that occurred on long inputs. By adapting the earlier Qwen2 fix to Qwen3.5's regex requirements, including support for accent marks, it makes text processing more reliable. Developers can expect more stable behavior on complex Unicode inputs, with robust tokenization across operating systems and hardware configurations, including macOS with KleidiAI, Ubuntu with ROCm 7.2, and Windows with CUDA 12 and 13.
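To see why a non-backtracking handler avoids the overflow, consider this sketch of a single-pass pre-tokenizer loop. It is an illustrative approximation, not llama.cpp's actual implementation: the real handler works over full Unicode categories and the Qwen3.5 regex, while the split_no_backtrack and is_combining_mark names here are hypothetical and the character classes deliberately simplified.

```cpp
// Illustrative sketch of a non-backtracking pre-tokenizer split.
#include <string>
#include <vector>
#include <cctype>

// Hypothetical helper: treat U+0300..U+036F (combining accents) as part of
// the preceding word, mirroring the accent-mark support described above.
static bool is_combining_mark(char32_t cp) {
    return cp >= 0x0300 && cp <= 0x036F;
}

std::vector<std::u32string> split_no_backtrack(const std::u32string &text) {
    std::vector<std::u32string> out;
    size_t i = 0, n = text.size();
    while (i < n) {
        size_t start = i;
        char32_t c = text[i];
        if (c < 0x80 && std::isalpha((int)c)) {
            // Word run: ASCII letters plus trailing combining marks.
            while (i < n && ((text[i] < 0x80 && std::isalpha((int)text[i]))
                             || is_combining_mark(text[i]))) i++;
        } else if (c < 0x80 && std::isdigit((int)c)) {
            while (i < n && text[i] < 0x80 && std::isdigit((int)text[i])) i++;
        } else if (c < 0x80 && std::isspace((int)c)) {
            while (i < n && text[i] < 0x80 && std::isspace((int)text[i])) i++;
        } else {
            i++; // any other codepoint becomes its own piece
        }
        out.emplace_back(text.substr(start, i - start));
    }
    return out;
}
```

Because every position is consumed exactly once and nothing recurses, stack depth stays constant and running time stays linear regardless of input length, which is the property that a backtracking regex engine loses on pathological long inputs.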
The b9150 release of llama.cpp continues its trend of broadening platform compatibility, now including support for macOS Apple Silicon builds with KleidiAI enabled and a variety of Linux configurations such as Ubuntu with ROCm 7.2 and Vulkan. Windows support is also extended with CUDA 12 and 13 DLLs. While there are no groundbreaking new features, the release solidifies llama.cpp's position as a flexible inference runtime for diverse hardware setups, letting developers optimize performance across a wider range of systems.