The llama-quant project has released an update that fixes a tensor-type issue that occurs when the default qtype is overridden. The fix resolves a previously reported issue (#22544) and includes contributions from @Anai-Guo. The release supports macOS, Linux, Android, and Windows, with specific configurations for each platform.
The b8998 release of Llama.cpp introduces support for various platforms including macOS, Linux, Android, and Windows.
The latest Llama.cpp release introduces Vulkan support for asymmetric flash attention (FA) in the coopmat2 path, enhancing mixed quantization capabilities.
The v0.18.2rc0 release includes a fix for handling the max_pixels parameter in the PaddleOCR-VL image processor across transformations.
© Lev Selector
Anthropic has released a suite of plugins that enhance the Claude ecosystem.
The latest update to ggml-webgpu addresses vectorized handling in the mul-mat and mul-mat-id functions. This release includes support for various operating systems and architectures.
Google makes its Gemma 4 AI model available to the public.