16 × AI: AI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

Models & Labs

llama.cpp Update Fixes iGPU Device Selection

llama.cpp Releases·May 31, 2026·high confidence

Why it matters

  • Fixes a critical bug affecting systems that use an integrated GPU as the main compute device.
  • Ensures proper tensor allocation and model loading on diverse hardware configurations.
  • Improves the reliability and usability of llama.cpp for developers using integrated GPUs.

llama.cpp has released an update that fixes a bug in its device selection logic on systems with integrated GPUs (iGPUs). When RPC servers were present, the local iGPU was ignored, which led to model loading failures. With the fix, the presence of RPC servers no longer causes the local iGPU to be dropped from device selection, which resolves the problem for systems where the iGPU is the main compute device. The change matters most on machines that rely on an integrated GPU, such as those with large unified memory.
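The selection behavior described above can be illustrated with a small sketch. This is not llama.cpp's code: `DeviceKind`, `Device`, `select_devices`, and the `igpu_fallback_ignores_rpc` flag are hypothetical names, and the exact condition used by the real fix is an assumption here (the iGPU is treated as a fallback that applies when no local discrete GPU exists, regardless of RPC servers).

```python
from dataclasses import dataclass
from enum import Enum, auto


class DeviceKind(Enum):
    GPU = auto()   # discrete GPU on the local machine
    IGPU = auto()  # integrated GPU on the local machine
    RPC = auto()   # remote device exposed by an RPC server


@dataclass(frozen=True)
class Device:
    name: str
    kind: DeviceKind


def select_devices(available, igpu_fallback_ignores_rpc=True):
    """Return the devices a model would be loaded onto (illustrative only).

    igpu_fallback_ignores_rpc=False mimics the buggy behavior described in
    the article: any already-selected device, including an RPC server,
    suppresses the iGPU. True mimics the assumed fixed behavior: only a
    local discrete GPU suppresses the iGPU.
    """
    rpc = [d for d in available if d.kind is DeviceKind.RPC]
    gpus = [d for d in available if d.kind is DeviceKind.GPU]
    igpus = [d for d in available if d.kind is DeviceKind.IGPU]

    selected = rpc + gpus
    if igpu_fallback_ignores_rpc:
        use_igpus = not gpus       # assumed fix: RPC servers do not count
    else:
        use_igpus = not selected   # buggy: RPC servers block the iGPU
    if use_igpus:
        selected += igpus
    return selected


if __name__ == "__main__":
    machine = [Device("rpc0", DeviceKind.RPC), Device("igpu0", DeviceKind.IGPU)]
    print([d.name for d in select_devices(machine, igpu_fallback_ignores_rpc=False)])
    print([d.name for d in select_devices(machine, igpu_fallback_ignores_rpc=True)])
```

On a machine with one RPC server and one iGPU, the first call selects only the remote device, while the second also keeps the local iGPU. A machine with a discrete GPU behaves the same under both settings in this sketch.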

More from llama.cpp Releases

Open Source · models

llama.cpp b9428 Release Enhances Platform Support

The b9428 release of llama.cpp focuses on platform support. It fixes the s390x release job and adds multi-threaded builds for iOS-Xcode. It also broadens support for macOS, Linux, and Windows, with specific enhancements such as Vulkan and ROCm 7.2 on Ubuntu and CUDA on Windows. Some features, such as KleidiAI on macOS, remain disabled.

llama.cpp Releases·May 31, 2026
Open Source · models

llama.cpp b9430 Release Adds LSX Support

The b9430 release of llama.cpp adds LSX support, targeting performance on LoongArch architectures. It implements native intrinsics for fp16 load/store operations and adds LSX implementations for various dot products. The release also includes improvements for macOS, Linux, and Windows, with specific enhancements for Apple Silicon and Vulkan support. Some features remain disabled.

llama.cpp Releases·May 31, 2026
Open Source · models

llama.cpp b9431 Release Updates macOS and Windows Builds

The b9431 release of llama.cpp makes targeted updates to its build processes. The iOS-Xcode release job moves to macOS-26, and the libcommon build is disabled in the xcframework. On Windows, the release includes updates for the CUDA 12 and CUDA 13 DLLs. No new features are introduced; the changes are about keeping builds compatible across operating systems.

llama.cpp Releases·May 31, 2026

More in Models & Labs

Models & Labs · models

vLLM v0.22.0 Release Enhances Model Performance

The vLLM v0.22.0 release includes 459 commits from 230 contributors. Major changes include the DeepSeek V4 model's reorganization and NVFP4 fused MoE support, which improve accuracy and efficiency. The Model Runner V2 now defaults to Qwen3 dense models and gains new features such as sleep-mode weight reload. The release also introduces a Rust frontend and batch-invariant inference improvements, reflecting a focus on speed and flexibility.

vLLM Releases·May 31, 2026
Models & Labs · models

OpenAI Updates GPT-5.5 Instant

OpenAI has released an update to GPT-5.5 Instant, enhancing its capabilities.

The AI Daily Brief·May 30, 2026
Models & Labs · models

Claude Opus 4.8 Launches with Dynamic Workflows

Claude Opus 4.8 has been released as the new default model, featuring a fast mode and dynamic ultra-code workflows.

Lev Selector·May 29, 2026