The b9571 release of llama.cpp has been announced, featuring expanded support across various platforms. Notably, ROCm 7.2 is now supported on Ubuntu x64, enhancing the experience for AMD GPU users. The release continues to support a wide range of operating systems, including macOS, Windows, and openEuler, though some features remain disabled. This update does not introduce new models but reinforces llama.cpp's role as a flexible inference tool across different environments.
Read originalThe b9561 release of llama.cpp continues to enhance its platform reach, adding Vulkan support for Ubuntu and Windows, and ROCm 7.2 for Ubuntu, which is a significant boost for AMD GPU users. While features like KleidiAI on macOS and SYCL on Windows remain inactive, this update reinforces llama.cpp's role as a flexible inference runtime across various systems. Although no new models are introduced, the release focuses on strengthening the existing infrastructure, making it more adaptable for developers working with different hardware setups. This ongoing expansion of capabilities ensures that llama.cpp remains a vital tool for AI inference across a broad spectrum of environments.
The latest b9562 release of llama.cpp introduces video input support, marking a significant step in expanding its capabilities. This update includes a new mtmd_helper_video feature and allows video input on servers via base64 encoding. The CLI has been updated to support video arguments, enhancing user interaction. While the release doesn't introduce new models, it broadens the scope of llama.cpp by integrating video processing, making it more versatile for developers working with multimedia inputs.
OpenEnv is evolving into a pivotal open-source tool for agentic reinforcement learning (RL), now backed by a coalition of major AI organizations including Meta-PyTorch, Nvidia, and Hugging Face. This initiative aims to standardize the interface between RL environments and trainers, promoting interoperability and efficiency. By serving as a common socket for various RL components, OpenEnv facilitates seamless integration across different ecosystems. This move is set to enhance the development of specialized models and harnesses, making RL more accessible and efficient for the open-source community.
© Lev SelectorJetBrains has open-sourced Mellum2, a 12 billion parameter mixture of experts model.