The latest b9533 release of llama.cpp focuses on expanding and refining platform support. Notably, it includes Vulkan support for both Ubuntu and Windows, while maintaining CUDA support with updated DLLs for Windows. However, KleidiAI support for macOS Apple Silicon is disabled in this release. The addition of ROCm 7.2 for Ubuntu x64 is a key update for AMD GPU users, enhancing compatibility. This release underscores llama.cpp's commitment to broad platform compatibility, though it doesn't introduce new model architectures.
Read originalThe b9534 release of llama.cpp brings significant improvements for Intel users, notably adding FWHT support in Vulkan with shared memory reduction. This update tackles specific driver issues by disabling features like subgroup shuffle on MoltenVK AMD and the FWHT shader on Intel Windows, ensuring smoother operation. While KleidiAI remains disabled on macOS Apple Silicon, the release continues to refine compatibility with systems such as Ubuntu and Windows. With ROCm 7.2 and CUDA 12 and 13 DLLs included, llama.cpp is steadily optimizing its performance for a variety of hardware setups. These enhancements reflect a focused effort to support diverse computing environments.
The b9535 release of llama.cpp continues to broaden its platform compatibility, though some features remain unavailable. While macOS Apple Silicon users won't see KleidiAI support this time, the release introduces Vulkan support for both Ubuntu and Windows, offering more options for GPU utilization. The addition of ROCm 7.2 for Ubuntu x64 marks a significant step towards better AMD GPU support, helping to close the gap with NVIDIA's CUDA. However, features like SYCL support are still not enabled, indicating areas where development is ongoing. This release reflects llama.cpp's ongoing efforts to become a versatile inference runtime across a wide range of hardware setups.
The b9536 release of llama.cpp significantly boosts OpenCL performance, refining operations like get_rows, cpy, and concat for better efficiency. It now handles multiple workgroups in large rows, optimizing processing capabilities. Although KleidiAI support for macOS Apple Silicon is currently disabled, the release continues to cater to a wide array of platforms, including Windows, Linux, and Android, with specific enhancements for Vulkan and ROCm. These updates make llama.cpp more adaptable and efficient across various hardware setups, though some features remain inactive.
© Lev SelectorJetBrains has open-sourced Mellum2, a 12 billion parameter mixture of experts model.
© Google Research BlogGoogle has open-sourced its advanced AI-based hydrology model, aiming to enhance global flood forecasting capabilities. This move allows National Meteorological and Hydrological Services to integrate sophisticated AI tools into their workflows, potentially improving the accuracy and timeliness of flood warnings. By releasing the model on GitHub, Google empowers local experts to refine and adapt the technology using their own data, fostering a more resilient approach to flood management. This initiative democratizes access to cutting-edge forecasting tools, especially benefiting regions with limited resources.
© TechCrunch AIMicrosoft's new Agent Control Specification (ACS) offers developers a unified way to manage AI agent behavior across various environments. By allowing teams to define specific policies, ACS ensures agents operate within set boundaries, reducing the risk of unintended actions. This open-source standard integrates controls into a common governance layer, making it easier to audit and reuse across different systems. With ACS, developers can maintain consistent oversight, enhancing both security and compliance in AI deployments.