16 × AIAI signal, amplified
AI newsAboutSources
TelegramFollow on Telegram
AI newsAboutSources
16 × AIAI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

Follow on Telegram →

Subscribe

  • Telegram
  • RSS
  • All channels

Legal

  • Privacy
  • Imprint
© 2026 16 × AI. All rights reserved.Curated by Claude. Posts every 6 hours. No newsletter, no funnel.
Home/Open Source
Open Source

llama.cpp b9596 Release Expands Platform Support

llama.cpp Releases·June 12, 2026·high confidence

Why it matters

  • →Expands platform support, enhancing accessibility for AMD GPU users.
  • →Reduces performance disparity between AMD and NVIDIA GPUs.
  • →Increases llama.cpp's versatility across diverse hardware configurations.

The b9596 release of llama.cpp introduces expanded platform support, including ROCm 7.2 for Ubuntu x64, enhancing usability for AMD GPU users. This update aims to reduce the performance gap between AMD and NVIDIA GPUs, making llama.cpp more accessible across different systems. While certain features like KleidiAI on macOS remain disabled, the release still represents a significant step forward in platform compatibility. This update allows developers to explore improved performance on a wider range of hardware configurations.

Read original

More from llama.cpp Releases

Models & Labsmodels

llama.cpp b9590 Release Fixes JSON Schema Handling

The latest b9590 release of llama.cpp addresses a critical issue where the LFM2 template handler was ignoring the json_schema from response_format, focusing solely on tool-calling grammar. This update ensures more robust handling of JSON schemas, which is crucial for developers relying on precise data formatting. The release also includes a variety of platform-specific builds, though some features like KleidiAI on macOS and SYCL on Windows remain disabled. This update is a step forward in refining the tool's functionality, particularly for those working with complex data structures.

llama.cpp Releases·Jun 12, 2026
Models & Labsmodels

llama.cpp b9591 Release Enhances MTP Efficiency

The b9591 release of llama.cpp brings notable improvements to Multi-Task Processing (MTP) by removing padding and optimizing data handling. The update refines the ggml_gated_delta_net function, which now only requires the initial recurrent state and uses a snapshot count as an operational parameter, enhancing processing efficiency. These changes are implemented across all backends, addressing previous review comments and fixing CI build errors. With support for diverse hardware configurations, including macOS Apple Silicon, ROCm 7.2 on Ubuntu, and CUDA 12 and 13 on Windows, this release is a significant step forward for developers seeking improved performance and reliability.

llama.cpp Releases·Jun 12, 2026
Models & Labsmodels

llama.cpp b9601 Release Expands Platform Support

The b9601 release of llama.cpp significantly extends its reach by supporting more platforms, enhancing its utility for developers. This update includes Ubuntu builds with ROCm 7.2, which is a boon for AMD GPU users seeking alternatives to NVIDIA's CUDA. Although features like KleidiAI on macOS and SYCL on Windows are currently disabled, the release still represents a meaningful step in making llama.cpp adaptable to a wider range of hardware. While no new models are introduced, the focus on expanding runtime compatibility marks a strategic move to increase the tool's versatility.

llama.cpp Releases·Jun 12, 2026

More in Open Source

OpenEnv Gains Open Source Community Support© Hugging Face Blog
Open Sourceagents

OpenEnv Gains Open Source Community Support

OpenEnv is evolving into a pivotal open-source tool for agentic reinforcement learning (RL), now backed by a coalition of major AI organizations including Meta-PyTorch, Nvidia, and Hugging Face. This initiative aims to standardize the interface between RL environments and trainers, promoting interoperability and efficiency. By serving as a common socket for various RL components, OpenEnv facilitates seamless integration across different ecosystems. This move is set to enhance the development of specialized models and harnesses, making RL more accessible and efficient for the open-source community.

Hugging Face Blog·Jun 8, 2026
JetBrains Releases Mellum2 12B MoE Open-source© Lev Selector
Open Sourcemodels

JetBrains Releases Mellum2 12B MoE Open-source

JetBrains has open-sourced Mellum2, a 12 billion parameter mixture of experts model.

Lev Selector·Jun 5, 2026
Google Open Sources AI Hydrology Model for Flood Forecasting© Google Research Blog
Open Sourceresearch

Google Open Sources AI Hydrology Model for Flood Forecasting

Google has open-sourced its advanced AI-based hydrology model, aiming to enhance global flood forecasting capabilities. This move allows National Meteorological and Hydrological Services to integrate sophisticated AI tools into their workflows, potentially improving the accuracy and timeliness of flood warnings. By releasing the model on GitHub, Google empowers local experts to refine and adapt the technology using their own data, fostering a more resilient approach to flood management. This initiative democratizes access to cutting-edge forecasting tools, especially benefiting regions with limited resources.

Google Research Blog·Jun 3, 2026