The b9692 release of llama.cpp has been announced, featuring expanded support across multiple platforms including macOS, Linux, Windows, and openEuler. Key highlights include the addition of ROCm 7.2 support on Ubuntu x64, enhancing options for AMD GPU users. The release also continues to support Vulkan and OpenVINO, providing developers with flexibility in hardware choices. This update reinforces llama.cpp's role as a comprehensive inference runtime solution.
Read originalThe b9684 release of llama.cpp marks a significant enhancement with the integration of 3D convolution, boosting its ability to handle complex data processing tasks. This update also brings optimizations and a cleaner codebase, enhancing overall efficiency. The release extends support across a broad spectrum of platforms, including macOS, Linux, and Windows, with specific configurations like Vulkan, ROCm, and SYCL. By expanding its platform compatibility and functionality, llama.cpp becomes an even more versatile tool for developers tackling diverse AI challenges.
The b9685 release of llama.cpp brings notable advancements in SYCL support, particularly with the addition of device-to-device memory copy via the SYCL API. This update also refines the detection method for peer-to-peer communication, resolving previous conflicts. While there are no new model architectures introduced, the release enhances the platform's adaptability across macOS, Linux, and Windows. With ROCm 7.2 support on Ubuntu and CUDA 12 and 13 DLLs for Windows, llama.cpp becomes a more robust choice for developers working with diverse hardware configurations. The inclusion of KleidiAI on Apple Silicon further optimizes performance for M-series Macs. These improvements make llama.cpp a more versatile tool for developers.
© GitHub ChangelogGitHub has introduced a new feature allowing repository maintainers to set a cap on the number of open pull requests from users without write access. This change aims to streamline the management of contributions by reducing the clutter of low-quality or drive-by pull requests. Maintainers can also designate trusted contributors who can exceed this limit without needing full collaborator access. This update is designed to help maintainers focus on meaningful contributions and reduce unnecessary review and CI overhead.
© Hugging Face BlogThe Strands Robots SDK, an open-source toolkit from AWS, simplifies the process of deploying AI models from the Hugging Face Hub to robot hardware. By integrating the LeRobot stack as AgentTools, developers can now create a single agent that handles simulation, policy inference, and deployment to physical robots with minimal code changes. This integration allows for seamless coordination across multiple robots using a peer mesh network. The SDK's ability to maintain consistent dataset formats between simulation and hardware ensures that developers can easily transition from testing to real-world applications.