The b9544 release of llama.cpp focuses on fixing reasoning round-trip issues and memory leaks in LFM2 and LFM2.5 models. This update is crucial for developers as it improves the stability and performance of these models across multiple platforms, including macOS, Linux, and Windows. The release continues to support a variety of hardware configurations, such as Apple Silicon, CUDA, and ROCm. Although no new models are introduced, the update's emphasis on resolving existing issues makes it a significant improvement for users of llama.cpp.
Read originalThe b9533 release of llama.cpp continues its focus on enhancing platform compatibility, though some features are notably absent. While macOS Apple Silicon users will find KleidiAI support disabled, the release introduces Vulkan support for both Ubuntu and Windows, and keeps CUDA support updated with new DLLs for Windows. The addition of ROCm 7.2 for Ubuntu x64 is particularly important for AMD GPU users, helping to close the gap with NVIDIA's CUDA. This update is more about refining existing capabilities and ensuring that llama.cpp runs smoothly across various environments, rather than unveiling new model architectures.
The b9534 release of llama.cpp brings significant improvements for Intel users, notably adding FWHT support in Vulkan with shared memory reduction. This update tackles specific driver issues by disabling features like subgroup shuffle on MoltenVK AMD and the FWHT shader on Intel Windows, ensuring smoother operation. While KleidiAI remains disabled on macOS Apple Silicon, the release continues to refine compatibility with systems such as Ubuntu and Windows. With ROCm 7.2 and CUDA 12 and 13 DLLs included, llama.cpp is steadily optimizing its performance for a variety of hardware setups. These enhancements reflect a focused effort to support diverse computing environments.
The b9535 release of llama.cpp continues to broaden its platform compatibility, though some features remain unavailable. While macOS Apple Silicon users won't see KleidiAI support this time, the release introduces Vulkan support for both Ubuntu and Windows, offering more options for GPU utilization. The addition of ROCm 7.2 for Ubuntu x64 marks a significant step towards better AMD GPU support, helping to close the gap with NVIDIA's CUDA. However, features like SYCL support are still not enabled, indicating areas where development is ongoing. This release reflects llama.cpp's ongoing efforts to become a versatile inference runtime across a wide range of hardware setups.
© TechCrunch AIOpenAI has launched Lockdown Mode, a new feature aimed at enhancing security against prompt injection attacks in ChatGPT. This mode restricts certain functionalities like live web browsing and image retrieval, focusing on reducing the risk of sensitive data exposure. While it doesn't completely eliminate vulnerabilities, it offers an added layer of protection for users handling sensitive information. Currently, Lockdown Mode is being rolled out to ChatGPT Business accounts and select personal accounts, marking a step towards more secure AI interactions.
© GitHub ChangelogGitHub has deprecated the GPT-5.2 and GPT-5.2-Codex models across most Copilot experiences, signaling a shift in their AI offerings. While these models are no longer available for general use in Copilot Chat, inline edits, and code completions, they remain accessible for Copilot code review. This move requires users and administrators to update their workflows and enable alternative models through Copilot settings. The deprecation reflects GitHub's ongoing evolution of its AI tools, pushing users towards newer or more efficient models.
© The AI AdvantageChatGPT's memory update is a significant development, allowing the AI to retain information across sessions, which enhances user interaction by making conversations more coherent and personalized. This update is part of a broader trend of integrating AI more deeply into everyday tools, as seen with upcoming AI features in YouTube and the merger of Codex and ChatGPT apps. These changes reflect a growing emphasis on making AI more intuitive and useful in various contexts. The ability to remember past interactions could transform how users engage with AI, making it a more seamless part of daily digital life.