
Anthropic has released a report on recursive self-improvement (RSI) in AI, focusing on how their AI system, Claude, is advancing its own development. The report indicates that Claude authored over 80% of the company's code merges, suggesting a faster-than-expected pace of AI evolution. This raises concerns about the readiness of institutions to manage fully self-improving AI systems. Anthropic proposes a potential pause in AI development across the industry to address these risks, highlighting the need for policy discussions. This development underscores the urgency of aligning AI innovation with regulatory frameworks.
Read original
© MIT News AIThe National Science Foundation has renewed its support for the MIT-led Institute for Artificial Intelligence and Fundamental Interactions (IAIFI), increasing its annual funding to nearly $5 million. This renewal marks a significant phase for IAIFI, which has been pioneering a model where AI and physics mutually enhance each other. The institute's work has led to breakthroughs in particle physics, nuclear physics, and astrophysics, demonstrating AI's potential to tackle complex scientific challenges. With this funding, IAIFI aims to deepen its exploration of the 'physics of AI,' fostering a community that bridges disciplines and pushes the boundaries of scientific discovery.
EVA-Bench Data 2.0 significantly broadens its scope by expanding from one to three enterprise domains, covering Airline Customer Service Management, Enterprise IT Service Management, and Healthcare HR Service Delivery. This update quadruples the scenario coverage to 213, offering a robust benchmark for evaluating voice agents across diverse workflows. The scenarios are meticulously validated against leading models like OpenAI GPT-5.4 and Google Gemini 3.1 Pro, ensuring they are both challenging and fair. This release not only enhances the realism and variety of the dataset but also sets a new standard for reproducibility and authentication in voice agent evaluation.
OpenAI has released an action plan focused on leveraging artificial intelligence to enhance biological resilience. This initiative aims to integrate AI technologies into biodefense strategies, potentially transforming how biological threats are detected and managed. By harnessing AI's predictive capabilities, the plan seeks to improve early warning systems and response mechanisms against biological hazards. This development marks a significant step in applying AI to public health and safety, offering new tools for anticipating and mitigating biological risks.