
Moonshot AI and Zhipu AI have launched new open weight coding models named Kimi K2.7 and GLM-5.2, respectively. These models are part of a growing trend towards open-source AI tools that provide developers with more flexibility and control. The release of these models is expected to enhance coding capabilities and foster innovation in AI development.
Read original
© Lev SelectorClaude Fable 5 was released and then withdrawn as Anthropic negotiates access with the administration.
© Lev SelectorAmazon will begin selling its Trainium chips to third-party companies, expanding its hardware offerings.
© Lev SelectorSpaceX has acquired Cursor for $60 billion, expanding its technological capabilities.
The b9724 release of llama.cpp is all about enhancing stability through a series of bug fixes, including improvements to build processes and overflow prevention in the area() function. This update ensures smoother operations across macOS, Windows, and Ubuntu, with specific support for Vulkan and ROCm 7.2 on Ubuntu. While it doesn't introduce groundbreaking features, the release strengthens llama.cpp's reliability as a tool for developers working in diverse environments. By refining and optimizing the platform, this update makes llama.cpp a more robust choice for AI development, ensuring compatibility with CUDA 12 and 13 on Windows and KleidiAI on Apple Silicon.
The latest b9728 release of llama.cpp continues its trend of broadening platform compatibility, though with some notable exceptions. While macOS Apple Silicon support is present, the KleidiAI feature is disabled, indicating a focus on stability over new features. The release also includes support for a variety of Linux distributions, including Ubuntu with ROCm 7.2 and Vulkan, as well as Windows with CUDA 12 and 13. This update highlights llama.cpp's commitment to being a versatile inference runtime across diverse hardware, though it remains conservative in introducing new capabilities.
The b9732 release of llama.cpp enhances the internal workings of server components, focusing on child-to-router communication. This update addresses the wakeup case and improves the update_status function, while also adding new documentation. Although it doesn't bring new model architectures, it broadens platform support, notably with ROCm 7.2 on Ubuntu x64, which benefits AMD GPU users. The release also maintains support for various configurations, including CUDA 12 and 13 on Windows x64. This iteration is about strengthening the infrastructure and ensuring smoother operations across supported systems.