16 × AIAI signal, amplified
AI newsAboutSources
TelegramFollow on Telegram
AI newsAboutSources
16 × AIAI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

Follow on Telegram →

Subscribe

  • Telegram
  • RSS
  • All channels

Legal

  • Privacy
  • Imprint
© 2026 16 × AI. All rights reserved.Curated by Claude. Posts every 6 hours. No newsletter, no funnel.
Home/Open Source
Open Source

llama.cpp b9439 release expands platform support

llama.cpp Releases·June 1, 2026·high confidence

Why it matters

  • →Expands support for AMD GPUs with ROCm 7.2, offering alternatives to CUDA.
  • →Enhances llama.cpp's versatility across different hardware platforms.
  • →Focuses on refining existing capabilities rather than introducing new models.

The latest b9439 release of llama.cpp focuses on expanding platform support and refining existing capabilities. Key updates include the addition of ROCm 7.2 support for Ubuntu x64, enhancing options for AMD GPU users. While some features like KleidiAI on macOS and SYCL on Windows are disabled, the release continues to position llama.cpp as a versatile tool across various hardware setups. This update does not introduce new models but strengthens the software's adaptability and usability.

Read original

More from llama.cpp Releases

Models & Labsmodels

llama.cpp adds EXAONE 4.5 implementations

The latest llama.cpp release expands its capabilities with the integration of EXAONE 4.5, bringing new vision markers and projector paths into the fold. This update aligns EXAONE 4.5 with the Qwen2.5-VL-style encode path, enhancing model loading and tensor registration processes. Developers will find improved performance and compatibility, particularly when working with EXAONE models. While no new models are introduced, the release refines existing functionalities, ensuring robust performance across various systems. This step forward is crucial for developers seeking to leverage EXAONE 4.5's full potential.

llama.cpp Releases·Jun 2, 2026
Models & Labsmodels

llama.cpp b9455 Release Adds Quantized KV Cache

The latest b9455 release of llama.cpp introduces quantized KV cache support, a notable enhancement for efficiency in AI model inference. This update also addresses a partial view fix and removes an overly strict assert, improving the overall robustness of the software. While the release includes various platform builds, the focus remains on optimizing performance across different environments. The addition of quantized KV cache support is a step forward in making AI models more resource-efficient, particularly beneficial for developers working with limited computational resources.

llama.cpp Releases·Jun 2, 2026
Models & Labsmodels

llama.cpp b9457 release focuses on Vulkan improvements

The latest b9457 release of llama.cpp brings a notable improvement in Vulkan performance by reducing host memory lock contention, which can enhance efficiency in certain workloads. This update replaces unique_lock with lock_guard, aiming to streamline operations. While the release doesn't introduce new models or major features, it continues to refine the platform's compatibility across various systems, including macOS, Linux, and Windows. The focus remains on optimizing existing capabilities rather than expanding into new territories.

llama.cpp Releases·Jun 2, 2026

More in Open Source

Cohere Command A+ Open Sourced© Lev Selector
Open Sourcemodels

Cohere Command A+ Open Sourced

Cohere has open-sourced its Command A+ model, making it accessible for public use.

Lev Selector·May 29, 2026
Reachy Mini Enables Local Speech Processing© Hugging Face Blog
Open Sourceagents

Reachy Mini Enables Local Speech Processing

Hugging Face has introduced a fully local speech processing setup for the Reachy Mini robot, eliminating the need for cloud services and enhancing privacy. By utilizing a cascaded voice pipeline, users can run speech-to-speech interactions entirely on their own hardware, ensuring that no data leaves their network. This setup leverages components like llama.cpp for LLM and Parakeet-TDT for STT, allowing for customizable and cost-effective speech processing. The move empowers users with full control over their speech processing pipeline, offering flexibility to swap components as new models become available.

Hugging Face Blog·May 27, 2026
Karpathy Open-Sources CLAUDE md© Lev Selector
Open Sourcemodels

Karpathy Open-Sources CLAUDE md

Andrej Karpathy has released CLAUDE md as open source.

Lev Selector·May 22, 2026