
Chinese AI firm SenseTime has launched a new image model designed for speed, focusing on compatibility with Chinese-made chips due to US tech restrictions. The model emphasizes open-source development.
Read original
© WIRED AIIn a landmark decision, a German court has ruled that Google is liable for false statements generated by its AI Overviews feature. This ruling challenges the traditional view of search engines as mere conduits of third-party content, arguing that AI-generated summaries create new, independent statements. The court emphasized that Google's warnings about potential errors in AI outputs do not absolve it of responsibility, as these AI-generated claims can mislead users without any basis in the original sources. This decision could set a precedent for how AI-generated content is regulated globally, potentially impacting other tech companies using similar technologies.
© WIRED AIAnthropic has removed its AI models, Claude Fable 5 and Mythos 5, from availability following a directive from the US government, which cited national security concerns. This action reflects ongoing friction between Anthropic and the Trump administration, which had previously labeled the company a 'supply chain risk.' The government order suggests a potential method to bypass the models' safeguards, though Anthropic maintains that the vulnerabilities are minor and not unique to their models. The situation highlights the complex relationship between AI development and regulatory oversight, raising questions about the transparency and fairness of such government interventions.
© WIRED AIMeta's Applied AI team is embroiled in internal conflict, with employees voicing dissatisfaction over their roles and tasks. Formed to bolster AI research at Meta Superintelligence Labs, the unit is criticized for assigning tasks perceived as unchallenging and unfulfilling. This unrest is part of a larger morale issue at Meta following recent layoffs and restructuring efforts. CEO Mark Zuckerberg has acknowledged these difficulties and promised to provide more stability, but the situation underscores the ongoing tension between Meta's ambitious AI goals and the well-being of its workforce.
The vLLM v0.23.0 release marks a significant step forward with enhancements across various components. DeepSeek-V4 has been optimized further, decoupling its metadata from previous versions and adding new attention kernels. Model Runner V2 now supports more dense models by default, improving performance for Llama and Mistral. The Rust frontend has matured with new endpoints and tool parsers, while compatibility with Transformers v5 ensures broader model support. These updates collectively enhance the robustness and versatility of vLLM, making it a more powerful tool for developers working with large language models.
The latest b9626 release of llama.cpp introduces architectural support for the cohere2-MoE model, marking a significant update for developers working with this model. This release also includes various technical improvements such as the removal of redundant checks and enhancements in tensor handling, which streamline the model's performance. By adding cohere2moe to the Llama Model Saver supported list, the update broadens the toolkit available for AI practitioners. While these changes may seem incremental, they collectively enhance the robustness and flexibility of llama.cpp, making it a more versatile tool for AI development.
The b9627 release of llama.cpp continues to enhance its platform reach, though it doesn't introduce any groundbreaking features. This update includes support for a wide array of systems, from macOS and iOS to various Linux distributions and Windows configurations, including CUDA and Vulkan support. Notably, the release maintains its focus on making llama.cpp a versatile tool across different hardware setups, but it doesn't introduce new model architectures or quantization methods. This iteration is more about solidifying its presence across multiple operating systems rather than introducing novel capabilities.