
Ollama has launched a new feature that lets users enable or disable a model's thinking behavior, giving them more control over how the model responds across different applications.
The latest version b8991 of llama.cpp has been released, featuring updates for various operating systems.
The latest update to llama-mmap improves compatibility across platforms and model sizes. Key changes include support for 32-bit wasm and style updates to gguf.cpp.
