The b9691 release of llama.cpp introduces conditional support for the POWER11 backend, ensuring compatibility with future hardware developments. By guarding the POWER11 backend creation behind a compiler flag check, the update prevents build failures on current GCC and Clang toolchains. Additionally, the CMakeLists.txt has been updated to optimize for both POWER10 and POWER11 architectures. This release highlights llama.cpp's proactive approach to supporting emerging hardware platforms.
Read originalThe b9684 release of llama.cpp marks a significant enhancement with the integration of 3D convolution, boosting its ability to handle complex data processing tasks. This update also brings optimizations and a cleaner codebase, enhancing overall efficiency. The release extends support across a broad spectrum of platforms, including macOS, Linux, and Windows, with specific configurations like Vulkan, ROCm, and SYCL. By expanding its platform compatibility and functionality, llama.cpp becomes an even more versatile tool for developers tackling diverse AI challenges.
The b9685 release of llama.cpp brings notable advancements in SYCL support, particularly with the addition of device-to-device memory copy via the SYCL API. This update also refines the detection method for peer-to-peer communication, resolving previous conflicts. While there are no new model architectures introduced, the release enhances the platform's adaptability across macOS, Linux, and Windows. With ROCm 7.2 support on Ubuntu and CUDA 12 and 13 DLLs for Windows, llama.cpp becomes a more robust choice for developers working with diverse hardware configurations. The inclusion of KleidiAI on Apple Silicon further optimizes performance for M-series Macs. These improvements make llama.cpp a more versatile tool for developers.
The latest release candidate for vLLM, version 0.22.1rc1, introduces a change in the Docker setup by removing the use of extra-index-url for the flashinfer-jit-cache. This adjustment simplifies the Docker configuration, potentially reducing dependency management issues and improving build reliability. While this update might seem minor, it reflects ongoing efforts to streamline the development process and enhance the usability of vLLM for developers. This change is particularly relevant for those maintaining Docker environments and looking for more efficient ways to manage dependencies.
© Hugging Face BlogMolmoMotion is a breakthrough in 3D motion forecasting, offering a new way to predict object trajectories based on video frames and language instructions. By using a sparse set of 3D points attached to objects, it efficiently forecasts motion without rendering full video, making it highly applicable for robotics and video generation. The release includes the MolmoMotion-1M dataset, the largest of its kind, and the PointMotionBench benchmark for accuracy testing. This model sets a new standard in motion prediction, outperforming existing methods and opening new possibilities for AI-driven applications.