
NVIDIA has announced a collaboration with Ineffable Intelligence, an AI lab founded by AlphaGo architect David Silver, to develop infrastructure for large-scale reinforcement learning. The partnership focuses on creating systems that learn continuously from experience, a step beyond traditional AI models. Utilizing NVIDIA's Grace Blackwell and the upcoming Vera Rubin platform, the project aims to build a pipeline that supports the unique demands of reinforcement learning. This effort could enable AI systems to autonomously discover new knowledge, potentially leading to significant advancements in AI capabilities.
Read original
© NVIDIA BlogHermes Agent, developed by Nous Research, is making waves in the AI community with its self-improving capabilities and robust performance on NVIDIA hardware. Unlike traditional agents, Hermes can autonomously refine its skills, making it a standout in the field of agentic AI. Its design allows for seamless integration with local systems, leveraging NVIDIA RTX and DGX Spark for optimal performance. This development signifies a shift towards more reliable and efficient AI agents that can operate continuously and improve over time, offering a new level of autonomy and efficiency for developers.
© NVIDIA BlogThe latest b9133 release of llama.cpp introduces significant improvements for reasoning models, particularly in server and web UI environments. By removing the blocking assistant prefill and orchestrating thinking tags, the update ensures smoother continuation of generation tasks. This release also drops the reasoning guard on the Continue button, allowing for persistent reasoning content even after reloads. While the update focuses on templates with simple thinking tags, it sets the stage for future enhancements in reasoning model capabilities.
The latest b9142 release of llama.cpp introduces significant updates for OpenCL, particularly enhancing support for Adreno GPUs with the addition of q5_0 and q5_1 Mixture of Experts (MoE) models. This update also addresses potential memory leaks and suppresses warnings for unused variables when building for non-Adreno platforms. These improvements make llama.cpp more robust and versatile, especially for developers working with diverse hardware configurations. The release continues to solidify llama.cpp's position as a flexible inference runtime across multiple operating systems and architectures.
NVIDIA and SAP are collaborating to enhance the security and governance of AI agents within enterprise systems. By integrating NVIDIA's OpenShell into the SAP Business AI Platform, they provide a secure runtime for developing and deploying autonomous agents. This partnership aims to ensure that AI agents can operate safely within enterprise environments, addressing critical needs for policy enforcement and audit trails. This development marks a significant step in making AI agents trustworthy and ready for production use in complex business workflows.