Models & Labs

Sakana AI Launches Fugu Model

Matt WolfeJune 26, 2026high confidence

Why it matters

→Demonstrates the potential of orchestrator systems in AI.
→Offers a competitive alternative to existing models like Fable.
→Highlights the evolving landscape of AI model performance.

Sakana AI Launches Fugu Model — ©Matt Wolfe

Sakana AI has introduced the Fugu model, an orchestrator system that routes prompts across multiple AI models. In tests, Fugu outperformed the Fable model on certain benchmarks, matched it on others, and underperformed in a few areas. The model was evaluated through real build challenges, showcasing its capabilities in practical applications.

Read original

More from Matt Wolfe

Market & Regulationbusiness

US Government Involvement in OpenAI Model Releases

The US government is now involved in overseeing OpenAI's model releases.

Matt WolfeJun 26, 2026

Open Sourcemodels

Krea 2 Releases Open Weights

Krea 2 has made its model weights open, allowing broader access.

Matt WolfeJun 26, 2026

Researchresearch

MIT Study: AI Enhances Human Critical Thinking

An MIT study finds that combining human skills with AI leads to better performance than relying on human skills alone.

Matt WolfeJun 25, 2026

More in Models & Labs

Models & Labsmodels

llama.cpp b9817 release enhances OpenVINO support

The latest b9817 release of llama.cpp brings significant updates to its OpenVINO backend, including an upgrade to OV 2026.2.1 and the introduction of self-contained release packages. These changes streamline the deployment process and improve operator handling, making it easier for developers to integrate and utilize OpenVINO in their projects. Additionally, the update removes hardcoded compute operation types, enhancing flexibility and adaptability. This release marks a step forward in making llama.cpp a more versatile and developer-friendly platform, particularly for those leveraging OpenVINO's capabilities.

llama.cpp ReleasesJun 28, 2026

Models & Labsmodels

llama.cpp b9820 Release Enhances CUDA Performance

The b9820 release of llama.cpp brings notable improvements to CUDA performance by cutting down on unnecessary synchronizations, which can streamline token processing. This update introduces asynchronous copy capabilities between CPU and CUDA, facilitating smoother data transfers and potentially speeding up computations. Backend detection has been refined to avoid linking conflicts, and synchronization adjustments have been made more general, allowing other backends like Vulkan to benefit. These enhancements aim to optimize performance across different hardware setups, making llama.cpp a more adaptable tool for developers working with diverse configurations.

llama.cpp ReleasesJun 28, 2026

Models & Labsmodels

llama.cpp b9826 Release Expands Platform Support

The b9826 release of llama.cpp continues to enhance its reach by supporting a wider array of systems, though it doesn't bring new model architectures. With ROCm 7.2 now available for Ubuntu x64, AMD GPU users gain a viable alternative to NVIDIA's CUDA, broadening their options for AI inference. The update also includes builds for macOS, Linux, Windows, and openEuler, ensuring developers can utilize llama.cpp regardless of their operating environment. While the release doesn't introduce groundbreaking features, it reinforces llama.cpp's utility as a flexible tool for AI developers working across different hardware and software configurations.

llama.cpp ReleasesJun 28, 2026