The b9562 release of llama.cpp has added support for video input, a notable enhancement for the platform. This update includes the mtmd_helper_video feature and allows video input on servers using base64 encoding. The command-line interface has been updated to accommodate video arguments, improving user interaction. This release expands llama.cpp's functionality beyond text, enabling developers to work with video data more effectively.
Read originalThe b9561 release of llama.cpp continues to enhance its platform reach, adding Vulkan support for Ubuntu and Windows, and ROCm 7.2 for Ubuntu, which is a significant boost for AMD GPU users. While features like KleidiAI on macOS and SYCL on Windows remain inactive, this update reinforces llama.cpp's role as a flexible inference runtime across various systems. Although no new models are introduced, the release focuses on strengthening the existing infrastructure, making it more adaptable for developers working with different hardware setups. This ongoing expansion of capabilities ensures that llama.cpp remains a vital tool for AI inference across a broad spectrum of environments.
The b9564 release of llama.cpp marks a notable enhancement in WebGPU capabilities, specifically through the implementation of 2D workgroups for operations like scale, binary, and unary functions. This update is designed to boost performance across macOS, Linux, and Windows systems. While the KleidiAI feature on Apple Silicon remains inactive, the release broadens hardware compatibility, including Vulkan and ROCm 7.2 support on Ubuntu. By refining these technical aspects, llama.cpp becomes a more flexible tool for developers dealing with a range of computing environments, making it a valuable asset for those working with CUDA and other advanced configurations.
The b9565 release of llama.cpp brings crucial improvements to WebGPU, specifically tackling buffer overlap and aliasing for the concat operator. This update is vital for developers relying on WebGPU, as it enhances the reliability and efficiency of their operations. The release also includes updates to build workflows and shader files, demonstrating a focus on refining the development process. Although there are no new groundbreaking features, these enhancements make llama.cpp a more dependable tool for developers working on macOS, Linux, and Windows. The inclusion of ROCm 7.2 and CUDA 12 and 13 DLLs further supports diverse hardware configurations. By addressing these technical challenges, llama.cpp continues to solidify its position as a versatile and robust development tool.
© TechCrunch AIApple's WWDC 2026 showcased significant advancements in AI, particularly with Siri, which now integrates Google Gemini for enhanced conversational abilities and visual intelligence. This marks a pivotal shift as Apple aims to revitalize its AI offerings, emphasizing privacy with data usage transparency. The event also introduced iOS 27, extending support back to the iPhone 11, and highlighted new AI-driven features in apps like Photos and Shortcuts. These updates reflect Apple's commitment to integrating AI more deeply into its ecosystem, offering users a more seamless and intelligent experience.
© The Verge AIApple has introduced a revamped Siri AI, marking a significant step in its AI strategy. This new version of Siri is more conversational and capable, with features like a customizable voice and systemwide accessibility. It can interact with apps, read onscreen content, and manage tasks like writing messages and organizing calendars. While these capabilities echo existing AI tools, Apple's focus on privacy and integration across its ecosystem sets it apart. However, the rollout is limited, with initial availability only in English and restricted to certain devices and regions.
© The Verge AIGoogle's NotebookLM has received a significant upgrade with the integration of the Gemini 3.5 model, enhancing its ability to provide more accurate and reliable information. This update introduces a cloud computing feature, allowing users to start research projects directly through chat, leveraging Google Search for sourcing. The app now supports a variety of output formats, including PDFs and data visualizations, thanks to its integration with Google's Antigravity coding platform. This makes NotebookLM a more versatile tool for research and note-taking, particularly for users on Google's AI Ultra plan and Workspace customers.