Models & Labs

Llama.cpp b9626 Release Adds Cohere2-MoE Support

llama.cpp ReleasesJune 14, 2026high confidence

Why it matters

→The update enhances support for the cohere2-MoE model, broadening its applicability.
→Technical improvements streamline performance and reduce redundancy.
→Adding cohere2moe to the supported list increases the framework's versatility.

The b9626 release of llama.cpp has been announced, featuring architectural support for the cohere2-MoE model. This update includes several technical improvements, such as the removal of redundant checks and enhancements in tensor handling. The release also adds cohere2moe to the Llama Model Saver supported list, expanding its utility for developers. These changes aim to improve the performance and flexibility of the llama.cpp framework, making it a more robust tool for AI development.

Read original

Llama.cpp b9626 Release Adds Cohere2-MoE Support

Why it matters

More from llama.cpp Releases

Llama.cpp adds GLM-5.2 speculative decoding support

llama.cpp b10175 Release Expands Platform Support

More in Models & Labs

Microsoft to Launch Copilot 'Super App' This Year

llama.cpp b10176 Release Expands Platform Support

OpenAI Plans 'Family of Devices' for AI Interaction

Anthropic's Opus 5 Release Raises Concerns for Indie Hackers