Models & Labs

Sakana AI launches Fugu for model orchestration

The Rundown AIJune 23, 2026high confidence

Why it matters

→Model orchestration could mitigate risks from export controls on AI models.
→Fugu's approach may inspire similar strategies in AI development.
→Early performance reviews highlight the challenges of achieving claimed benchmarks.

Sakana AI launches Fugu for model orchestration — ©The Rundown AI

Sakana AI has introduced Fugu, a model designed to orchestrate multiple AI models through a single API, as a response to export controls that affected Anthropic's models. Fugu offers two versions: a faster model for routine tasks and a more powerful one for complex applications. Despite claims of high performance, early feedback indicates it may not yet match top models. This development underscores the trend towards model orchestration, though concerns about cost and transparency persist.

Read original

More from The Rundown AI

Market & Regulationbusiness

Google's AlphaFold Lead John Jumper Joins Anthropic

In a significant talent shift, John Jumper, the Nobel Prize-winning co-creator of AlphaFold, is leaving Google DeepMind for Anthropic. This move follows closely on the heels of another high-profile departure, Noam Shazeer, to OpenAI, highlighting a trend of top AI talent migrating from Google to its rivals. Jumper's expertise in protein-structure AI, which earned him a Nobel Prize, could bolster Anthropic's scientific edge. His departure signals a potential weakening of DeepMind's dominance in AI research, particularly in scientific applications, as it faces increasing competition from Anthropic and OpenAI.

The Rundown AIJun 22, 2026

More in Models & Labs

Models & Labsmodels

llama.cpp b9767 Release Enhances MTP Inference

The b9767 release of llama.cpp introduces significant improvements to MTP inference by optimizing the mat-vec path for small batches, which enhances decoding efficiency. A new barrier in the NUM_COLS loop of the mul-mat-vec process is expected to boost performance. While no new model architectures are included, this update refines the platform's capabilities across macOS, Linux, and Windows. Notably, it supports macOS Apple Silicon, Ubuntu with ROCm 7.2, and Windows with CUDA 12 and 13. This release continues llama.cpp's focus on performance optimization and compatibility, making it a more powerful tool for developers.

llama.cpp ReleasesJun 24, 2026

Models & Labsmodels

Granite Speech Plus Support Added in b9768 Release

The b9768 release of llama.cpp expands its capabilities by integrating Granite Speech Plus, which enhances audio processing with multi-layer concatenation. This update is particularly relevant for developers focused on audio applications, as it resolves naming inconsistencies and standardizes feature layer usage. While no new models are introduced, the release fortifies the existing framework, making it more reliable for audio tasks. This iteration marks a refinement in the tool's functionality, especially for those utilizing its audio features.

llama.cpp ReleasesJun 24, 2026

Models & Labsmodels

Llama.cpp b9774 Release Enhances Vulkan Support

The latest b9774 release of llama.cpp brings significant improvements to Vulkan support, enabling backend tests for various mathematical operations like SQR, SQRT, SIN, and COS. This update also enhances the handling of noncontiguous data in norm operations, broadening the library's applicability across different platforms. While the release doesn't introduce new models, it strengthens the existing infrastructure, particularly for developers working with Vulkan and other supported platforms. This makes llama.cpp a more robust choice for those looking to leverage GPU capabilities beyond NVIDIA's CUDA ecosystem.

llama.cpp ReleasesJun 24, 2026