The b9535 release of llama.cpp introduces several updates aimed at expanding platform compatibility. Notably, Vulkan support is now available for both Ubuntu and Windows, offering more GPU utilization options. The release also includes ROCm 7.2 support for Ubuntu x64, enhancing AMD GPU compatibility. However, some features like KleidiAI on macOS Apple Silicon and SYCL support remain disabled. These updates reflect llama.cpp's ongoing efforts to support a wide range of hardware configurations.
Read originalThe b9533 release of llama.cpp continues its focus on enhancing platform compatibility, though some features are notably absent. While macOS Apple Silicon users will find KleidiAI support disabled, the release introduces Vulkan support for both Ubuntu and Windows, and keeps CUDA support updated with new DLLs for Windows. The addition of ROCm 7.2 for Ubuntu x64 is particularly important for AMD GPU users, helping to close the gap with NVIDIA's CUDA. This update is more about refining existing capabilities and ensuring that llama.cpp runs smoothly across various environments, rather than unveiling new model architectures.
The b9534 release of llama.cpp brings significant improvements for Intel users, notably adding FWHT support in Vulkan with shared memory reduction. This update tackles specific driver issues by disabling features like subgroup shuffle on MoltenVK AMD and the FWHT shader on Intel Windows, ensuring smoother operation. While KleidiAI remains disabled on macOS Apple Silicon, the release continues to refine compatibility with systems such as Ubuntu and Windows. With ROCm 7.2 and CUDA 12 and 13 DLLs included, llama.cpp is steadily optimizing its performance for a variety of hardware setups. These enhancements reflect a focused effort to support diverse computing environments.
The b9536 release of llama.cpp significantly boosts OpenCL performance, refining operations like get_rows, cpy, and concat for better efficiency. It now handles multiple workgroups in large rows, optimizing processing capabilities. Although KleidiAI support for macOS Apple Silicon is currently disabled, the release continues to cater to a wide array of platforms, including Windows, Linux, and Android, with specific enhancements for Vulkan and ROCm. These updates make llama.cpp more adaptable and efficient across various hardware setups, though some features remain inactive.
© Lev SelectorJetBrains has open-sourced Mellum2, a 12 billion parameter mixture of experts model.
© Google Research BlogGoogle has open-sourced its advanced AI-based hydrology model, aiming to enhance global flood forecasting capabilities. This move allows National Meteorological and Hydrological Services to integrate sophisticated AI tools into their workflows, potentially improving the accuracy and timeliness of flood warnings. By releasing the model on GitHub, Google empowers local experts to refine and adapt the technology using their own data, fostering a more resilient approach to flood management. This initiative democratizes access to cutting-edge forecasting tools, especially benefiting regions with limited resources.
© TechCrunch AIMicrosoft's new Agent Control Specification (ACS) offers developers a unified way to manage AI agent behavior across various environments. By allowing teams to define specific policies, ACS ensures agents operate within set boundaries, reducing the risk of unintended actions. This open-source standard integrates controls into a common governance layer, making it easier to audit and reuse across different systems. With ACS, developers can maintain consistent oversight, enhancing both security and compliance in AI deployments.