16 × AIAI signal, amplified
AI newsAboutSources
TelegramFollow on Telegram
AI newsAboutSources
16 × AIAI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

Follow on Telegram →

Subscribe

  • Telegram
  • RSS
  • All channels

Legal

  • Privacy
  • Imprint
© 2026 16 × AI. All rights reserved.Curated by Claude. Posts every 6 hours. No newsletter, no funnel.
Home/Open Source
Open Source

Llama.cpp b9393 Release Fixes Audio RMS Norm

llama.cpp Releases·May 29, 2026·high confidence

Why it matters

  • →Fixes in audio RMS norm improve reliability for developers using gemma 4.
  • →Broad platform support ensures compatibility across diverse systems.
  • →Maintains llama.cpp's versatility as a tool for developers.

The b9393 release of llama.cpp focuses on fixing an audio RMS norm issue in the gemma 4 module, with contributions from Sigbjørn Skjæret. This update spans multiple platforms, including macOS, Linux, Windows, and openEuler, ensuring broad compatibility. Key technical details include support for Apple Silicon, Vulkan, and ROCm on Ubuntu. While the update doesn't introduce new features, it enhances the tool's reliability and performance across various systems.

Read original

More from llama.cpp Releases

Models & Labsmodels

Llama.cpp b9387 Release Enhances AMD MFMA Performance

The latest b9387 release of llama.cpp introduces significant performance improvements for AMD MFMA hardware, particularly in quantized matrix multiplication. By optimizing the batch threshold logic, the update allows for more efficient processing, with throughput gains of up to 76% in certain configurations. This release is particularly relevant for users leveraging AMD's MI250X hardware, as it fine-tunes the kernel selection logic to maximize performance. While the update doesn't introduce new models, it significantly enhances the efficiency of existing operations on specific hardware, making it a noteworthy development for those using AMD GPUs.

llama.cpp Releases·May 29, 2026
Models & Labsmodels

llama.cpp b9388 release enhances Turing support

The latest b9388 release of llama.cpp introduces optimizations for Turing architecture, specifically adding MMVQ_PARAMETERS_TURING to improve JIT compilation for SM75 Turing devices. This update aims to prevent mismatches when compiling Turing device code on Ampere or newer architectures. While the release doesn't introduce new models or quantization methods, it continues to expand platform support, including updates for macOS, Linux, and Windows. The focus remains on refining compatibility and performance across diverse hardware configurations, making llama.cpp a more versatile tool for developers.

llama.cpp Releases·May 29, 2026
Open Sourcemodels

llama.cpp b9389 Release Expands Platform Support

The latest b9389 release of llama.cpp continues its trend of broadening platform compatibility, though with some notable exceptions. While macOS Apple Silicon users see KleidiAI support disabled, the release strengthens its Linux offerings with ROCm 7.2 and Vulkan support. Windows users benefit from updated CUDA DLLs, enhancing performance for CUDA 12 and 13. This release demonstrates llama.cpp's commitment to being a versatile inference runtime across diverse hardware, though some features remain disabled, indicating ongoing development challenges.

llama.cpp Releases·May 29, 2026

More in Open Source

Reachy Mini Enables Local Speech Processing© Hugging Face Blog
Open Sourceagents

Reachy Mini Enables Local Speech Processing

Hugging Face has introduced a fully local speech processing setup for the Reachy Mini robot, eliminating the need for cloud services and enhancing privacy. By utilizing a cascaded voice pipeline, users can run speech-to-speech interactions entirely on their own hardware, ensuring that no data leaves their network. This setup leverages components like llama.cpp for LLM and Parakeet-TDT for STT, allowing for customizable and cost-effective speech processing. The move empowers users with full control over their speech processing pipeline, offering flexibility to swap components as new models become available.

Hugging Face Blog·May 27, 2026
Karpathy Open-Sources CLAUDE md© Lev Selector
Open Sourcemodels

Karpathy Open-Sources CLAUDE md

Andrej Karpathy has released CLAUDE md as open source.

Lev Selector·May 22, 2026
Stable Audio 3.0 Released for Artistic Experimentation© Matt Wolfe
Open Sourcemusic

Stable Audio 3.0 Released for Artistic Experimentation

Stability AI has launched Stable Audio 3.0, a model family designed for artistic experimentation with open-weight models.

Matt Wolfe·May 22, 2026