16 × AIAI signal, amplified
AI newsAboutSources
TelegramFollow on Telegram
AI newsAboutSources
16 × AIAI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

Follow on Telegram →

Subscribe

  • Telegram
  • RSS
  • All channels

Legal

  • Privacy
  • Imprint
© 2026 16 × AI. All rights reserved.Curated by Claude. Posts every 6 hours. No newsletter, no funnel.
Home/Models & Labs
Models & Labs

llama.cpp b9591 Release Enhances MTP Efficiency

llama.cpp Releases·June 12, 2026·high confidence

Why it matters

  • →Enhances efficiency in Multi-Task Processing by removing unnecessary padding.
  • →Applies improvements across all backends, ensuring broad compatibility.
  • →Addresses build errors, improving reliability for developers.

The b9591 release of llama.cpp introduces significant improvements in Multi-Task Processing (MTP) by eliminating padding and optimizing data handling. The update modifies the ggml_gated_delta_net function to enhance efficiency, applying these changes across all backends. Additionally, it addresses previous review comments and resolves CI build errors. This release is particularly relevant for developers working with various hardware setups, as it aims to streamline processing and improve performance.

Read original

More from llama.cpp Releases

Models & Labsmodels

llama.cpp b9590 Release Fixes JSON Schema Handling

The latest b9590 release of llama.cpp addresses a critical issue where the LFM2 template handler was ignoring the json_schema from response_format, focusing solely on tool-calling grammar. This update ensures more robust handling of JSON schemas, which is crucial for developers relying on precise data formatting. The release also includes a variety of platform-specific builds, though some features like KleidiAI on macOS and SYCL on Windows remain disabled. This update is a step forward in refining the tool's functionality, particularly for those working with complex data structures.

llama.cpp Releases·Jun 12, 2026
Open Sourcemodels

llama.cpp b9596 Release Expands Platform Support

The b9596 release of llama.cpp marks another step in broadening its compatibility, with ROCm 7.2 now supported on Ubuntu x64, enhancing the experience for AMD GPU users. This update helps close the performance gap with NVIDIA's CUDA, making llama.cpp a more attractive option for developers using AMD hardware. Although features like KleidiAI on macOS Apple Silicon are still disabled, the release underscores llama.cpp's commitment to becoming a versatile tool across different systems. Developers can now tap into improved performance on a wider array of hardware, though some expected features remain on the horizon.

llama.cpp Releases·Jun 12, 2026
Models & Labsmodels

llama.cpp b9601 Release Expands Platform Support

The b9601 release of llama.cpp significantly extends its reach by supporting more platforms, enhancing its utility for developers. This update includes Ubuntu builds with ROCm 7.2, which is a boon for AMD GPU users seeking alternatives to NVIDIA's CUDA. Although features like KleidiAI on macOS and SYCL on Windows are currently disabled, the release still represents a meaningful step in making llama.cpp adaptable to a wider range of hardware. While no new models are introduced, the focus on expanding runtime compatibility marks a strategic move to increase the tool's versatility.

llama.cpp Releases·Jun 12, 2026

More in Models & Labs

Models & Labsmodels

Claude Code v2.1.170 Release

Claude Code's latest update introduces the Claude Fable 5, a Mythos-class model now safe for general use. This model surpasses previous offerings in capability, marking a significant step forward for developers using Claude Code. Additionally, the update resolves an issue with session transcripts not saving when launched from certain environments. This release enhances both the power and reliability of the Claude Code platform, offering developers a more robust toolset for their projects.

Claude Code Releases·Jun 12, 2026
OpenAI Aims to Transform ChatGPT into a Super App© WIRED AI
Models & Labsagents

OpenAI Aims to Transform ChatGPT into a Super App

OpenAI is taking a bold step by evolving ChatGPT into a 'super app,' a move that could revolutionize AI interaction. Under the guidance of Thibault Sottiaux, the initiative seeks to merge ChatGPT and Codex into a unified platform designed to manage diverse personal and professional tasks. The vision is to develop a proactive digital assistant that integrates seamlessly into daily life, potentially revitalizing OpenAI's growth and re-establishing its leadership in the AI sector. Although the specifics of the super app's capabilities are still unfolding, the integration of Codex indicates a focus on sophisticated task automation and user personalization.

WIRED AI·Jun 11, 2026
Anthropic Releases Claude Fable Model© Fireship
Models & Labsmodels

Anthropic Releases Claude Fable Model

Anthropic has introduced Claude Fable, a new model in their Mythos class, designed for public use. This release comes amid Anthropic's previous calls for caution in AI development, highlighting a shift towards making advanced AI more accessible. The Claude Fable model is described as 'carefully lobotomized,' suggesting it has been modified to ensure safety and compliance for broader deployment. This move indicates Anthropic's strategy to balance innovation with responsibility, offering a model that is both powerful and safe for public interaction.

Fireship·Jun 11, 2026