
A recent study reveals that state-of-the-art speech recognition models struggle with accurately transcribing street names, particularly for speakers with diverse linguistic backgrounds, showing a 39% average error rate. The research introduces two new benchmarks, SF Streets and US Streets, to evaluate named entity recognition in real-world scenarios. By employing a synthetic data generation technique called cross-lingual style transfer, the study demonstrates a potential 60% improvement in accuracy with fewer than 1,000 training samples. This highlights a critical gap in current speech models and suggests a viable path forward for enhancing their reliability in high-stakes applications.
Read original
© Together AI BlogTogether AI and Adaption have formed a partnership to integrate Together Fine-Tuning into Adaptive Data, enabling teams to optimize datasets and deploy stronger open models.
© Together AI BlogTogether AI has shut down the vulnerable crypto socket interface Copy Fail across its infrastructure to mitigate risks associated with a logic bug in the Linux kernel.