The b9296 release of llama.cpp introduces expanded support across multiple platforms, including macOS, Linux, Windows, and Android. Key updates include enabling KleidiAI on macOS Apple Silicon and adding CUDA 12 and 13 DLLs for Windows x64. The release also supports ROCm 7.2 on Ubuntu x64, enhancing performance for AMD GPUs. This update reinforces llama.cpp's adaptability, making it a reliable choice for developers working with varied hardware setups.
Read originalThe b9297 release of llama.cpp brings a notable enhancement with the introduction of NVFP4 MTP scale tensors, boosting its tensor processing capabilities. This update also integrates Qwen3.5 MTP tensors, which improves performance across a spectrum of hardware configurations, including Apple Silicon, Vulkan, and ROCm on Ubuntu, as well as CUDA on Windows. The release supports a wide array of architectures, from macOS to Linux and Windows, ensuring compatibility with both CPU and GPU setups. While there are no new model architectures, the inclusion of KleidiAI on Apple Silicon and ROCm 7.2 on Ubuntu highlights llama.cpp's commitment to optimizing for diverse environments. This update reinforces llama.cpp's role as a flexible inference runtime, catering to a broad range of hardware setups.
The b9309 release of llama.cpp tackles significant integer overflow issues in its perplexity calculations, co-authored by Stanisław Szymczyk. This update is vital for enhancing the accuracy and reliability of the model's performance metrics, which are crucial for developers. By resolving these overflows, the release ensures that users can depend on precise data outputs. This fix is a testament to the ongoing efforts to improve the tool's robustness, allowing developers to trust the integrity of their AI computations. While it might seem like a minor adjustment, it plays a critical role in maintaining the tool's reliability.
The b9283 release of llama.cpp tackles significant build issues, particularly enhancing support for Apple systems and ensuring proper installation of implementation libraries. By adding install functionality for shared libraries, the update prevents runtime errors that previously disrupted operations. Developers using macOS, Windows, and Linux can now expect more reliable performance, with specific improvements for Apple Silicon and KleidiAI. The update also addresses issues with CUDA and ROCm builds, reinforcing llama.cpp's stability. While no new features are introduced, this release is a crucial step in refining the software's cross-environment functionality.
© Lev SelectorAndrej Karpathy has released CLAUDE md as open source.
© Matt WolfeStability AI has launched Stable Audio 3.0, a model family designed for artistic experimentation with open-weight models.
© GitHub ChangelogGitHub has open-sourced its Copilot plugin for Eclipse, marking a significant step in integrating AI-powered tools within the Eclipse ecosystem. By releasing the code under the MIT license, GitHub invites developers to explore, contribute, and innovate on how AI enhances developer experiences in Eclipse. This move not only promotes transparency but also encourages community-driven development, allowing developers to understand and influence the plugin's functionality. With the source code available, developers can now delve into the mechanics of Copilot's features like code completion and agentic workflows, fostering a collaborative environment for future enhancements.