OpenAI has launched the Economic Research Exchange, a platform dedicated to studying the impact of AI on the economy. The initiative invites applications for research projects focused on AI's effects on jobs, productivity, and economic dynamics. This move aims to deepen understanding of how AI technologies are reshaping economic landscapes. By facilitating targeted research, OpenAI hopes to inform policy and strategic decisions in the evolving AI-driven economy.
Read original
© The Rundown AIAnthropic's latest report delves into the emerging concept of recursive self-improvement (RSI) in AI systems, highlighting how their AI, Claude, is accelerating its own development. The report reveals that over 80% of Anthropic's code merges were authored by Claude, suggesting a rapid pace of AI evolution. This raises concerns about the readiness of institutions to handle fully self-improving AI. Anthropic suggests a potential industry-wide pause in AI development to address these risks, emphasizing the need for coordinated policy discussions. This marks a significant moment in AI development, where the pace of innovation might outstrip regulatory and ethical frameworks.
© MIT News AIThe National Science Foundation has renewed its support for the MIT-led Institute for Artificial Intelligence and Fundamental Interactions (IAIFI), increasing its annual funding to nearly $5 million. This renewal marks a significant phase for IAIFI, which has been pioneering a model where AI and physics mutually enhance each other. The institute's work has led to breakthroughs in particle physics, nuclear physics, and astrophysics, demonstrating AI's potential to tackle complex scientific challenges. With this funding, IAIFI aims to deepen its exploration of the 'physics of AI,' fostering a community that bridges disciplines and pushes the boundaries of scientific discovery.
EVA-Bench Data 2.0 significantly broadens its scope by expanding from one to three enterprise domains, covering Airline Customer Service Management, Enterprise IT Service Management, and Healthcare HR Service Delivery. This update quadruples the scenario coverage to 213, offering a robust benchmark for evaluating voice agents across diverse workflows. The scenarios are meticulously validated against leading models like OpenAI GPT-5.4 and Google Gemini 3.1 Pro, ensuring they are both challenging and fair. This release not only enhances the realism and variety of the dataset but also sets a new standard for reproducibility and authentication in voice agent evaluation.