Models & Labs

Ollama 0.30 Enhances Performance and Model Support

Ollama BlogJune 5, 2026high confidence

Why it matters

→Enhances performance on NVIDIA hardware by up to 20%, improving efficiency for developers.
→Expands GPU acceleration to AMD and Intel devices, increasing accessibility for more users.
→Broadens model compatibility, allowing more models to run out of the box, simplifying AI deployment.

Ollama 0.30 Enhances Performance and Model Support — ©Ollama Blog

Ollama has released version 0.30, bringing improved performance and expanded model support through GGUF compatibility. The update enhances performance on NVIDIA hardware by up to 20% and extends GPU acceleration to AMD and Intel devices using Vulkan. This version also increases compatibility with more models, including those from the GGUF ecosystem, allowing for easier deployment on various hardware. These improvements make it simpler for developers to utilize a broader range of models and hardware configurations.

Read original

Ollama 0.30 Enhances Performance and Model Support

Why it matters

More in Models & Labs

Llama.cpp adds GLM-5.2 speculative decoding support

Llama.cpp b10178 Release Adds Trace Logging

llama.cpp b10180 Release Enhances SYCL Performance