The Yi series models, developed by 01.AI, are large language models that can be run in the cloud with a simple API call. The blog provides a guide on how to implement this.
Read originalThe latest version b8991 of llama.cpp has been released, featuring updates for various operating systems.
The latest update to llama-mmap improves compatibility with various platforms and model sizes. Key enhancements include support for 32-bit wasm and updates to gguf.cpp style.
