
NVIDIA has launched its latest hardware offerings, the RTX Spark laptops and DGX Stations, designed to support the local execution of large AI models. These devices aim to provide enhanced computational power and efficiency for AI developers and researchers, facilitating more robust and scalable AI solutions.
Read original
© Lev SelectorMicrosoft has introduced Project Solara, a new chip-to-cloud platform aimed at enhancing AI integration.
© Lev SelectorJetBrains has open-sourced Mellum2, a 12 billion parameter mixture of experts model.
© Lev SelectorMiniMax has launched the M3 Multimodal Model, enhancing capabilities across multiple data types.
The b9534 release of llama.cpp brings significant improvements for Intel users, notably adding FWHT support in Vulkan with shared memory reduction. This update tackles specific driver issues by disabling features like subgroup shuffle on MoltenVK AMD and the FWHT shader on Intel Windows, ensuring smoother operation. While KleidiAI remains disabled on macOS Apple Silicon, the release continues to refine compatibility with systems such as Ubuntu and Windows. With ROCm 7.2 and CUDA 12 and 13 DLLs included, llama.cpp is steadily optimizing its performance for a variety of hardware setups. These enhancements reflect a focused effort to support diverse computing environments.
The b9536 release of llama.cpp significantly boosts OpenCL performance, refining operations like get_rows, cpy, and concat for better efficiency. It now handles multiple workgroups in large rows, optimizing processing capabilities. Although KleidiAI support for macOS Apple Silicon is currently disabled, the release continues to cater to a wide array of platforms, including Windows, Linux, and Android, with specific enhancements for Vulkan and ROCm. These updates make llama.cpp more adaptable and efficient across various hardware setups, though some features remain inactive.
The latest b9543 release of llama.cpp introduces video support for Qwen3.5, marking a significant step in expanding the capabilities of this AI framework. This update also includes support for 'frame merge' in qwen-vl-based models, enhancing the model's ability to handle video data. While the release focuses on technical improvements and bug fixes, it notably broadens the platform's utility by integrating video processing capabilities. This positions llama.cpp as a more versatile tool for developers looking to incorporate video functionalities into their AI applications.