Models & Labs

llama.cpp b9826 Release Expands Platform Support

llama.cpp Releases | June 28, 2026 | high confidence

Why it matters

→ Expands platform support, making llama.cpp more versatile for developers.
→ Enhances AMD GPU support with ROCm 7.2, offering alternatives to CUDA.
→ Solidifies llama.cpp's position as a key tool for AI inference across diverse systems.

The b9826 release of llama.cpp focuses on expanding platform support rather than introducing new features. It includes ROCm 7.2 support for Ubuntu x64, giving AMD GPU users more options. The release covers macOS, Linux, Windows, and openEuler, keeping llama.cpp broadly accessible to developers. Although it introduces no new model architectures, the release strengthens llama.cpp's role as a flexible AI inference tool.

More from llama.cpp Releases

Models & Labs | models

llama.cpp b9817 release enhances OpenVINO support

The b9817 release of llama.cpp updates the OpenVINO backend, upgrading it to OpenVINO (OV) 2026.2.1 and introducing self-contained release packages. These changes streamline deployment and improve operator handling, making OpenVINO easier for developers to integrate into their projects. The update also removes hardcoded compute operation types, which makes the backend more flexible.

llama.cpp Releases | Jun 28, 2026

Models & Labs | models

llama.cpp b9820 Release Enhances CUDA Performance

The b9820 release of llama.cpp improves CUDA performance by cutting unnecessary synchronizations, which can streamline token processing. It introduces asynchronous copies between CPU and CUDA, which can smooth data transfers and potentially speed up computation. Backend detection has been refined to avoid linking conflicts, and the synchronization changes have been generalized so that other backends, such as Vulkan, can benefit. The changes aim to improve performance across different hardware setups.

llama.cpp Releases | Jun 28, 2026

Open Source | models

llama.cpp b9821 Release Expands Platform Support

The b9821 release of llama.cpp adds new command-line options, including --version, --licenses, and --help. It also broadens platform compatibility, adding support for Vulkan and ROCm 7.2 on Ubuntu, and for CUDA 12 and 13 on Windows. KleidiAI support is currently disabled for macOS on Apple Silicon, but the release still covers numerous operating systems and architectures.

llama.cpp Releases | Jun 28, 2026

More in Models & Labs

Models & Labs | models

Claude Tag Introduced for AI Models

Claude Tag is a new feature introduced for AI models, enhancing their functionality.

The AI Daily Brief | Jun 27, 2026

Models & Labs | models

Asian AI Startups Launch Models Amid Anthropic Ban

Asian AI startups are stepping into the spotlight as the U.S. export ban on Anthropic's Mythos and Fable models continues. Chinese cybersecurity firm 360 has introduced Tulongfeng, an AI tool for software vulnerability detection, while Tokyo-based Sakana AI has launched Fugu, a model designed for agent orchestration and optimized for Japanese language and culture. These launches reflect a growing trend of regional AI development that offers alternatives to U.S. models and addresses local needs while access to U.S. AI technologies remains restricted.

TechCrunch AI | Jun 27, 2026

Models & Labs | business

GitHub Enhances AI Adoption Metrics for Enterprises

GitHub has expanded its Copilot usage metrics API to include total pull requests merged by AI adoption phase. Previously, only per-user averages were available; now enterprise administrators and organization owners can also see the total number of pull requests merged daily by users in each adoption phase. With both totals and averages available, administrators can better analyze how AI adoption affects development throughput and user behavior.

GitHub Changelog | Jun 26, 2026