Models & Labs

Claude Opus 4.8 Released by Anthropic

Matt Wolfe | May 29, 2026 | high confidence

Why it matters

→ Enhances AI model capabilities.
→ Part of ongoing AI advancements.
→ Supports Anthropic's competitive position.

Anthropic has announced Claude Opus 4.8, the latest iteration of its Claude AI model. The update is expected to improve performance and add new features to the Claude series. The release is part of Anthropic's ongoing efforts to advance AI technology.

More from Matt Wolfe

Models & Labs | image

MAI-Image-2.5 Launches at Arena AI

Microsoft's MAI-Image-2.5 debuts at number three on Arena AI.

Matt Wolfe | May 29, 2026

General AI | productivity

Microsoft 365 Copilot Gets a Redesign

Microsoft has introduced a new design for its 365 Copilot tool.

Matt Wolfe | May 29, 2026

Video & Creative AI | music

ElevenLabs Releases Music and Dubbing v2

ElevenLabs has launched version 2 of its music and dubbing tools.

Matt Wolfe | May 29, 2026

More in Models & Labs

Models & Labs | models

vLLM v0.22.0 Release Enhances Model Performance

The vLLM v0.22.0 release brings 459 commits from 230 contributors. Major changes include a reorganization of the DeepSeek V4 model and NVFP4 fused MoE support, which improve accuracy and efficiency. The Model Runner V2 now defaults to Qwen3 dense models, offering better performance along with new features such as sleep-mode weight reload. The release also introduces a Rust frontend and batch-invariant inference improvements, reflecting its focus on speed and flexibility.

vLLM Releases | May 31, 2026

Models & Labs | models

Llama.cpp Update Fixes iGPU Device Selection

Llama.cpp has addressed a critical issue in its device selection logic that affected systems using integrated GPUs as their main compute device. Previously, the presence of any RPC server would cause the local iGPU to be ignored, leading to model loading failures. This update ensures that iGPUs are included unless no GPUs are available, allowing for proper tensor allocation and model loading on systems like the Strix Halo with significant unified memory. This fix enhances the reliability of llama.cpp on diverse hardware configurations.

llama.cpp Releases | May 31, 2026

Models & Labs | models

llama.cpp b9434 Release Focuses on GPU Granularity

The b9434 release of llama.cpp targets granularity improvements for Qwen 3.5/3.6 across three GPUs, a technical refinement rather than a major overhaul. It matters mainly to developers optimizing performance on specific GPU setups, where it improves compatibility and efficiency. The release adds no new models or major features, and it extends support to platforms like macOS, Linux, and Windows.

llama.cpp Releases | May 31, 2026