
DeepSeek has launched its latest version, DeepSeek V4, which includes PRO and Flash variants featuring million-token context windows. This new offering is priced significantly lower than leading Western models, shifting the focus towards more affordable and deployable AI solutions rather than solely on high-performance metrics. The introduction of these models could democratize access to advanced AI capabilities for a broader range of users.
Read originalThe latest version b8991 of llama.cpp has been released, featuring updates for various operating systems.
The latest update to llama-mmap improves compatibility with various platforms and model sizes. Key enhancements include support for 32-bit wasm and updates to gguf.cpp style.
