The b9789 release of llama.cpp has been announced, focusing on expanding platform support rather than introducing new features. Key updates include support for Ubuntu x64 with ROCm 7.2, which benefits AMD GPU users, and Windows x64 with CUDA 13.3 DLLs for NVIDIA users. Although KleidiAI support for macOS Apple Silicon is disabled, the release maintains a broad compatibility across various systems, including Vulkan and OpenVINO. This update reinforces llama.cpp's role as a versatile tool for AI inference across diverse hardware environments.
Read originalThe latest b9781 release of llama.cpp continues its trend of broadening platform compatibility, though without major new features. Notably, the release includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. While KleidiAI support for macOS Apple Silicon is disabled, the release still covers a wide array of platforms, including Windows and openEuler. This update reinforces llama.cpp's position as a versatile inference runtime, though it remains focused on platform expansion rather than introducing new model architectures.
The latest b9782 release of llama.cpp continues its trend of broadening platform compatibility, though without major new features. Notably, the release includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. While KleidiAI support for Apple Silicon remains disabled, the release still covers a wide array of platforms, from Windows to openEuler. This update solidifies llama.cpp's position as a versatile inference runtime, though it doesn't introduce groundbreaking changes.
Hugging Face has streamlined its release process for the huggingface_hub Python client, moving from a 4-6 week cycle to weekly releases. This shift is powered by a combination of open-source tools and AI, which drafts release notes and automates mechanical tasks, while humans oversee critical judgment areas. The process is designed to be replicable by other maintainers, emphasizing transparency and adaptability. This change not only accelerates the release cycle but also ensures that updates are consistently delivered without the need for proprietary tools.
© Matt WolfePewDiePie has invested $41,000 in creating a private, self-hosted AI workspace using open-source tools.