The latest release of Llama.cpp, version b8998, has been announced, with prebuilt binaries for multiple operating systems: macOS (Apple Silicon and Intel), various Linux distributions, Android, and Windows, in both CPU-only and GPU-enabled configurations. Notably, the release ships CUDA, Vulkan, and SYCL builds across different architectures, expanding the accessibility of Llama.cpp for developers working on diverse platforms.
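As a rough sketch of how such backend-specific builds are typically produced from source, the standard ggml CMake backend flags can be used (these are general llama.cpp build options, not details taken from this release's notes):

```shell
# Configure and build llama.cpp with a GPU backend enabled.
# Pick ONE of the backend flags below; all are standard ggml CMake options.
cmake -B build -DGGML_CUDA=ON      # NVIDIA GPUs via CUDA
# cmake -B build -DGGML_VULKAN=ON  # cross-vendor GPUs via Vulkan
# cmake -B build -DGGML_SYCL=ON    # Intel GPUs via SYCL
cmake --build build --config Release -j
```

Omitting all backend flags produces the CPU-only build that the prebuilt CPU packages correspond to.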
The latest update for llama-quant fixes a tensor-type issue that occurred when the default qtype is overridden. The release includes builds for various platforms.
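For context, overriding the default qtype on a per-tensor basis is done through the quantize tool. A minimal sketch, assuming the `--tensor-type` per-tensor override option present in recent llama-quantize builds (file names here are placeholders):

```shell
# Quantize to Q4_K_M overall, but override the attn_v tensors to Q6_K.
# --tensor-type PATTERN=TYPE is the kind of per-tensor override
# this fix relates to; model file names are placeholders.
./build/bin/llama-quantize \
    --tensor-type attn_v=q6_k \
    model-f16.gguf model-q4_k_m.gguf Q4_K_M
```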
The latest Llama.cpp release adds Vulkan support for asymmetric flash attention (FA) in the coopmat2 path, enhancing mixed-quantization capabilities.
The latest update to ggml-webgpu fixes vectorized handling in the mul_mat and mul_mat_id operations. The release includes builds for various operating systems and architectures.