
Ollama has released version 0.30, bringing improved performance and expanded model support through GGUF compatibility. The update enhances performance on NVIDIA hardware by up to 20% and extends GPU acceleration to AMD and Intel devices using Vulkan. This version also increases compatibility with more models, including those from the GGUF ecosystem, allowing for easier deployment on various hardware. These improvements make it simpler for developers to utilize a broader range of models and hardware configurations.
Read originalThe b9534 release of llama.cpp brings significant improvements for Intel users, notably adding FWHT support in Vulkan with shared memory reduction. This update tackles specific driver issues by disabling features like subgroup shuffle on MoltenVK AMD and the FWHT shader on Intel Windows, ensuring smoother operation. While KleidiAI remains disabled on macOS Apple Silicon, the release continues to refine compatibility with systems such as Ubuntu and Windows. With ROCm 7.2 and CUDA 12 and 13 DLLs included, llama.cpp is steadily optimizing its performance for a variety of hardware setups. These enhancements reflect a focused effort to support diverse computing environments.
The b9536 release of llama.cpp significantly boosts OpenCL performance, refining operations like get_rows, cpy, and concat for better efficiency. It now handles multiple workgroups in large rows, optimizing processing capabilities. Although KleidiAI support for macOS Apple Silicon is currently disabled, the release continues to cater to a wide array of platforms, including Windows, Linux, and Android, with specific enhancements for Vulkan and ROCm. These updates make llama.cpp more adaptable and efficient across various hardware setups, though some features remain inactive.
The latest b9543 release of llama.cpp introduces video support for Qwen3.5, marking a significant step in expanding the capabilities of this AI framework. This update also includes support for 'frame merge' in qwen-vl-based models, enhancing the model's ability to handle video data. While the release focuses on technical improvements and bug fixes, it notably broadens the platform's utility by integrating video processing capabilities. This positions llama.cpp as a more versatile tool for developers looking to incorporate video functionalities into their AI applications.