
Hugging Face's DharmaOCR has applied Direct Preference Optimization (DPO) to tackle text degeneration in OCR tasks. This method uses the model's own degenerate outputs as negative training signals, achieving an average degeneration reduction of 59.4%. Unlike supervised fine-tuning, which doesn't directly address degeneration, DPO targets the structural failure modes of models. This innovative approach shows promise for improving model performance in structured tasks, offering a new direction for AI training methodologies.
Read originalReachy Mini, a conversational robot, now supports remote tools, expanding its capabilities beyond local Python scripts. This update allows the robot to access external tools like web search and weather information, enhancing its ability to provide real-time responses. By integrating these remote tools, users can easily share and update functionalities without altering the core app. This development marks a significant step in making Reachy Mini more versatile and interactive, as it can now handle complex queries involving both local and remote data sources.
Holo3.1 marks a significant advancement in the deployment of computer-use agents across various environments, including web, desktop, and mobile. By introducing quantized checkpoints like FP8, Q4 GGUF, and NVFP4, it enables fast local inference with minimal performance loss. This release is particularly notable for its improvements in mobile environments, with substantial performance gains on Android devices. The ability to run agents locally on consumer hardware while maintaining privacy is a key feature. Holo3.1's enhancements make it a versatile tool for developers aiming to integrate AI agents into diverse workflows.