Models & Labs

NVIDIA Unveils Cosmos 3 World Foundation Model

Sam WitteveenJune 1, 2026high confidence

Why it matters

→Cosmos 3's omnimodal capabilities allow for more integrated AI applications.
→It enhances NVIDIA's position in the AI market by offering advanced multi-modal processing.
→Developers gain new tools for creating versatile AI solutions.

NVIDIA Unveils Cosmos 3 World Foundation Model — ©Sam Witteveen

NVIDIA has introduced Cosmos 3, its latest world foundation model, which is designed to be omnimodal, capable of processing five different types of inputs and generating five types of outputs. This model is part of NVIDIA's push towards enhancing physical AI and open-world reasoning capabilities. Cosmos 3 is expected to provide developers with new opportunities to create more integrated and versatile AI applications. This development underscores NVIDIA's commitment to advancing AI technology and expanding its application across various domains.

Read original

More in Models & Labs

Models & Labsmodels

llama.cpp adds EXAONE 4.5 implementations

The latest llama.cpp release expands its capabilities with the integration of EXAONE 4.5, bringing new vision markers and projector paths into the fold. This update aligns EXAONE 4.5 with the Qwen2.5-VL-style encode path, enhancing model loading and tensor registration processes. Developers will find improved performance and compatibility, particularly when working with EXAONE models. While no new models are introduced, the release refines existing functionalities, ensuring robust performance across various systems. This step forward is crucial for developers seeking to leverage EXAONE 4.5's full potential.

llama.cpp ReleasesJun 2, 2026

Models & Labsmodels

llama.cpp b9455 Release Adds Quantized KV Cache

The latest b9455 release of llama.cpp introduces quantized KV cache support, a notable enhancement for efficiency in AI model inference. This update also addresses a partial view fix and removes an overly strict assert, improving the overall robustness of the software. While the release includes various platform builds, the focus remains on optimizing performance across different environments. The addition of quantized KV cache support is a step forward in making AI models more resource-efficient, particularly beneficial for developers working with limited computational resources.

llama.cpp ReleasesJun 2, 2026

Models & Labsmodels

llama.cpp b9457 release focuses on Vulkan improvements

The latest b9457 release of llama.cpp brings a notable improvement in Vulkan performance by reducing host memory lock contention, which can enhance efficiency in certain workloads. This update replaces unique_lock with lock_guard, aiming to streamline operations. While the release doesn't introduce new models or major features, it continues to refine the platform's compatibility across various systems, including macOS, Linux, and Windows. The focus remains on optimizing existing capabilities rather than expanding into new territories.

llama.cpp ReleasesJun 2, 2026