
Google has launched Gemini 3.1 Ultra, a new AI model boasting a remarkable 2 million token window. This advancement allows for more extensive context handling, enhancing the model's performance in various applications. The release signifies Google's commitment to pushing the boundaries of AI capabilities.
Read originalThe latest version b8991 of llama.cpp has been released, featuring updates for various operating systems.
The latest update to llama-mmap improves compatibility with various platforms and model sizes. Key enhancements include support for 32-bit wasm and updates to gguf.cpp style.
