
Ollama is now available on Windows in preview, allowing users to pull, run, and create large language models with built-in GPU acceleration and access to a full model library.
Read originalThe latest version b8991 of llama.cpp has been released, featuring updates for various operating systems.
The latest update to llama-mmap improves compatibility with various platforms and model sizes. Key enhancements include support for 32-bit wasm and updates to gguf.cpp style.
