The b9591 release of llama.cpp introduces significant improvements in Multi-Task Processing (MTP) by eliminating padding and optimizing data handling. The update modifies the ggml_gated_delta_net function to enhance efficiency, applying these changes across all backends. Additionally, it addresses previous review comments and resolves CI build errors. This release is particularly relevant for developers working with various hardware setups, as it aims to streamline processing and improve performance.
Read originalThe latest b9590 release of llama.cpp addresses a critical issue where the LFM2 template handler was ignoring the json_schema from response_format, focusing solely on tool-calling grammar. This update ensures more robust handling of JSON schemas, which is crucial for developers relying on precise data formatting. The release also includes a variety of platform-specific builds, though some features like KleidiAI on macOS and SYCL on Windows remain disabled. This update is a step forward in refining the tool's functionality, particularly for those working with complex data structures.
The b9596 release of llama.cpp marks another step in broadening its compatibility, with ROCm 7.2 now supported on Ubuntu x64, enhancing the experience for AMD GPU users. This update helps close the performance gap with NVIDIA's CUDA, making llama.cpp a more attractive option for developers using AMD hardware. Although features like KleidiAI on macOS Apple Silicon are still disabled, the release underscores llama.cpp's commitment to becoming a versatile tool across different systems. Developers can now tap into improved performance on a wider array of hardware, though some expected features remain on the horizon.
The b9601 release of llama.cpp significantly extends its reach by supporting more platforms, enhancing its utility for developers. This update includes Ubuntu builds with ROCm 7.2, which is a boon for AMD GPU users seeking alternatives to NVIDIA's CUDA. Although features like KleidiAI on macOS and SYCL on Windows are currently disabled, the release still represents a meaningful step in making llama.cpp adaptable to a wider range of hardware. While no new models are introduced, the focus on expanding runtime compatibility marks a strategic move to increase the tool's versatility.
Claude Code's latest update introduces the Claude Fable 5, a Mythos-class model now safe for general use. This model surpasses previous offerings in capability, marking a significant step forward for developers using Claude Code. Additionally, the update resolves an issue with session transcripts not saving when launched from certain environments. This release enhances both the power and reliability of the Claude Code platform, offering developers a more robust toolset for their projects.
© WIRED AIOpenAI is taking a bold step by evolving ChatGPT into a 'super app,' a move that could revolutionize AI interaction. Under the guidance of Thibault Sottiaux, the initiative seeks to merge ChatGPT and Codex into a unified platform designed to manage diverse personal and professional tasks. The vision is to develop a proactive digital assistant that integrates seamlessly into daily life, potentially revitalizing OpenAI's growth and re-establishing its leadership in the AI sector. Although the specifics of the super app's capabilities are still unfolding, the integration of Codex indicates a focus on sophisticated task automation and user personalization.
© FireshipAnthropic has introduced Claude Fable, a new model in their Mythos class, designed for public use. This release comes amid Anthropic's previous calls for caution in AI development, highlighting a shift towards making advanced AI more accessible. The Claude Fable model is described as 'carefully lobotomized,' suggesting it has been modified to ensure safety and compliance for broader deployment. This move indicates Anthropic's strategy to balance innovation with responsibility, offering a model that is both powerful and safe for public interaction.