
Hugging Face has launched a local speech processing solution for the Reachy Mini robot, allowing users to run speech-to-speech interactions without relying on cloud services. This setup uses a cascaded voice pipeline, including components like llama.cpp and Parakeet-TDT, to process speech entirely on local hardware. The approach enhances privacy and reduces costs by keeping data within the user's network. Users can customize the pipeline by swapping components, providing flexibility as new models are released. This development offers a significant step towards more private and customizable AI interactions.
Read originalThe latest b9389 release of llama.cpp continues its trend of broadening platform compatibility, though with some notable exceptions. While macOS Apple Silicon users see KleidiAI support disabled, the release strengthens its Linux offerings with ROCm 7.2 and Vulkan support. Windows users benefit from updated CUDA DLLs, enhancing performance for CUDA 12 and 13. This release demonstrates llama.cpp's commitment to being a versatile inference runtime across diverse hardware, though some features remain disabled, indicating ongoing development challenges.
The b9391 release of llama.cpp continues to broaden its platform support, making it more accessible to a diverse range of users. Notably, this update includes support for Ubuntu x64 with ROCm 7.2, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. While some features like KleidiAI on macOS Apple Silicon and SYCL FP32 on Ubuntu are disabled, the release still marks a step forward in making llama.cpp a versatile tool across different operating systems. This update doesn't introduce new models but enhances the existing infrastructure, ensuring more users can leverage llama.cpp's capabilities.
The b9393 release of llama.cpp resolves a critical issue with the audio RMS norm in the gemma 4 module, enhancing its stability. This update, with contributions from Sigbjørn Skjæret, impacts a wide array of systems, including macOS, Linux, Windows, and openEuler. It continues to support architectures like Apple Silicon, Vulkan, and ROCm on Ubuntu, ensuring developers can rely on it across different environments. While it doesn't introduce new features, the update focuses on improving performance and compatibility, reinforcing llama.cpp's position as a reliable tool for developers working with diverse hardware configurations.