The latest b9537 release of llama.cpp focuses on expanding platform support, particularly for Linux and Windows users. Notably, the release includes ROCm 7.2 and Vulkan support for Linux, alongside CUDA 12 and 13 DLLs for Windows, enhancing GPU performance capabilities. However, some features like KleidiAI on macOS Apple Silicon and SYCL on Windows remain disabled. This update highlights llama.cpp's ongoing efforts to cater to a wide range of systems, though certain configurations still need attention.
Read originalThe b9533 release of llama.cpp continues its focus on enhancing platform compatibility, though some features are notably absent. While macOS Apple Silicon users will find KleidiAI support disabled, the release introduces Vulkan support for both Ubuntu and Windows, and keeps CUDA support updated with new DLLs for Windows. The addition of ROCm 7.2 for Ubuntu x64 is particularly important for AMD GPU users, helping to close the gap with NVIDIA's CUDA. This update is more about refining existing capabilities and ensuring that llama.cpp runs smoothly across various environments, rather than unveiling new model architectures.
The b9534 release of llama.cpp brings significant improvements for Intel users, notably adding FWHT support in Vulkan with shared memory reduction. This update tackles specific driver issues by disabling features like subgroup shuffle on MoltenVK AMD and the FWHT shader on Intel Windows, ensuring smoother operation. While KleidiAI remains disabled on macOS Apple Silicon, the release continues to refine compatibility with systems such as Ubuntu and Windows. With ROCm 7.2 and CUDA 12 and 13 DLLs included, llama.cpp is steadily optimizing its performance for a variety of hardware setups. These enhancements reflect a focused effort to support diverse computing environments.
The b9535 release of llama.cpp continues to broaden its platform compatibility, though some features remain unavailable. While macOS Apple Silicon users won't see KleidiAI support this time, the release introduces Vulkan support for both Ubuntu and Windows, offering more options for GPU utilization. The addition of ROCm 7.2 for Ubuntu x64 marks a significant step towards better AMD GPU support, helping to close the gap with NVIDIA's CUDA. However, features like SYCL support are still not enabled, indicating areas where development is ongoing. This release reflects llama.cpp's ongoing efforts to become a versatile inference runtime across a wide range of hardware setups.
© Lev SelectorJetBrains has open-sourced Mellum2, a 12 billion parameter mixture of experts model.
© Google Research BlogGoogle has open-sourced its advanced AI-based hydrology model, aiming to enhance global flood forecasting capabilities. This move allows National Meteorological and Hydrological Services to integrate sophisticated AI tools into their workflows, potentially improving the accuracy and timeliness of flood warnings. By releasing the model on GitHub, Google empowers local experts to refine and adapt the technology using their own data, fostering a more resilient approach to flood management. This initiative democratizes access to cutting-edge forecasting tools, especially benefiting regions with limited resources.
© TechCrunch AIMicrosoft's new Agent Control Specification (ACS) offers developers a unified way to manage AI agent behavior across various environments. By allowing teams to define specific policies, ACS ensures agents operate within set boundaries, reducing the risk of unintended actions. This open-source standard integrates controls into a common governance layer, making it easier to audit and reuse across different systems. With ACS, developers can maintain consistent oversight, enhancing both security and compliance in AI deployments.