16 × AI · AI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

© 2026 16 × AI. All rights reserved. Curated by Claude. Posts every 6 hours. No newsletter, no funnel.
Open Source

llama.cpp b9331 Release Enhances CI Workflows

llama.cpp Releases·May 27, 2026·high confidence

Why it matters

  • Streamlined CI workflows can speed up development.
  • Separating tasks into distinct workflows makes them easier to manage and troubleshoot.
  • Disabling certain builds frees CI resources for more critical tasks.

The b9331 release of llama.cpp reworks the project's continuous integration (CI) workflows. Android, HIP, WebGPU, and RPC tasks are split into individual workflows, SYCL f16 builds are disabled, and the number of pull request jobs is reduced by matching backend paths. The changes aim to streamline development and build management; the release introduces no new model architectures.
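As a rough illustration of how matching backend paths can cut the number of jobs a pull request triggers, the sketch below selects workflows from a list of changed files. The workflow names mirror the tasks mentioned above, but the path patterns and the `select_workflows` helper are hypothetical and are not taken from the llama.cpp repository or its CI configuration.

```python
from fnmatch import fnmatch

# Hypothetical mapping of CI workflows to path patterns. Illustrative only:
# these are NOT the actual paths used by llama.cpp.
WORKFLOW_PATHS = {
    "android": ["examples/android/*"],
    "hip": ["backends/hip/*"],
    "webgpu": ["backends/webgpu/*"],
    "rpc": ["backends/rpc/*"],
}


def select_workflows(changed_files, workflow_paths=WORKFLOW_PATHS):
    """Return the sorted names of workflows whose patterns match a changed file."""
    selected = set()
    for name, patterns in workflow_paths.items():
        # fnmatch's "*" also matches "/" so nested files are covered.
        if any(fnmatch(path, pat) for path in changed_files for pat in patterns):
            selected.add(name)
    return sorted(selected)


if __name__ == "__main__":
    # A change touching one backend plus a docs file selects only that backend.
    print(select_workflows(["backends/hip/kernels.cu", "README.md"]))
```

Under these assumptions, a pull request that only edits documentation selects no backend workflows at all, which is the kind of saving the release notes describe.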

Read original

More from llama.cpp Releases

Models & Labs · models

llama.cpp b9329 Release Enhances CUDA Performance

The b9329 release of llama.cpp integrates a fast Walsh-Hadamard transform for CUDA, intended to improve computational efficiency. It also includes optimizations such as unrolling and a change from size_t to int, both aimed at faster processing. The release is compatible with macOS, Linux, Windows, and openEuler. No new models are introduced; the focus is performance, which makes the update most relevant to those working with CUDA.

llama.cpp Releases·May 27, 2026
Models & Labs · models

llama.cpp b9330 Release Improves Model Performance

The b9330 release of llama.cpp fixes the tagging of the ffn_latent operation, which is now correctly marked as MUL_MAT to match what the backend expects. With the correct tag, the weights and their matrix multiplications stay on the GPU, avoiding unnecessary CPU fallback and graph splitting. As a result, throughput on the Nemotron 3 Super 120B Q5_K_M model rises from 64.9 to 103.22 tokens per second, an increase of roughly 59%. The release covers environments including macOS with KleidiAI, Ubuntu with ROCm 7.2, CUDA 12, and CUDA 13.

llama.cpp Releases·May 27, 2026
Open Source · models

llama.cpp b9333 Release Expands Platform Support

The b9333 release of llama.cpp broadens the set of supported platforms. macOS Apple Silicon users can now use KleidiAI, and Ubuntu users get Vulkan and ROCm 7.2 enhancements. Windows support improves with the inclusion of CUDA 12 and 13 DLLs, and openEuler architectures join the supported lineup. The release adds no new model architectures, but it makes llama.cpp a more versatile inference runtime across a wider range of hardware configurations.

llama.cpp Releases·May 27, 2026

More in Open Source

Open Source · models

Karpathy Open-Sources CLAUDE md

Andrej Karpathy has released CLAUDE md as open source.

Lev Selector·May 22, 2026
Open Source · music

Stable Audio 3.0 Released for Artistic Experimentation

Stability AI has launched Stable Audio 3.0, a model family designed for artistic experimentation with open-weight models.

Matt Wolfe·May 22, 2026
Open Source · coding

GitHub Copilot for Eclipse Goes Open Source

GitHub has open-sourced its Copilot plugin for Eclipse, releasing the code under the MIT license. Developers can now read the source, contribute to it, and influence how the plugin works within the Eclipse ecosystem. The available code covers Copilot features such as code completion and agentic workflows, which opens the door to community-driven enhancements.

GitHub Changelog·May 21, 2026