Models & Labs

NVIDIA Releases Nemotron 3.5 ASR for Multilingual Streaming

Sam WitteveenJune 7, 2026high confidence

Why it matters

→Nemotron 3.5 ASR enhances multilingual streaming capabilities with advanced features like word boosting and diarization.
→The model's cache-aware and quantized versions improve efficiency, making it suitable for long-running applications.
→This release strengthens NVIDIA's position in the ASR market, offering developers more powerful tools for audio processing.

NVIDIA Releases Nemotron 3.5 ASR for Multilingual Streaming — ©Sam Witteveen

NVIDIA has unveiled Nemotron 3.5 ASR, a new automatic speech recognition model designed for live multilingual streaming. This release includes features such as word boosting and speaker diarization, enhancing its application in diverse audio scenarios. The model is also cache-aware and available in quantized versions, which improve its efficiency and performance. This development underscores NVIDIA's commitment to advancing ASR technology, providing developers with more robust tools for building sophisticated audio processing systems.

Read original

NVIDIA Releases Nemotron 3.5 ASR for Multilingual Streaming

Why it matters

More in Models & Labs

vLLM v0.23.0 Release Enhances Model Support

Llama.cpp b9626 Release Adds Cohere2-MoE Support

llama.cpp b9627 Release Expands Platform Support