
Meta's Llama 3 model has significantly lowered false refusal rates, refusing less than one-third of the prompts that Llama 2 would have refused. This indicates a shift towards less censorship in the model's responses.
Read originalThe latest version b8991 of llama.cpp has been released, featuring updates for various operating systems.
The latest update to llama-mmap improves compatibility with various platforms and model sizes. Key enhancements include support for 32-bit wasm and updates to gguf.cpp style.
