
OpenAI has launched new voice intelligence features in its API that let applications interact with users through speech. The new GPT-Realtime-2 model offers advanced vocal simulation with GPT-5-class reasoning, GPT-Realtime-Translate supports real-time translation in over 70 languages, and GPT-Realtime-Whisper provides live speech-to-text. The features aim to move voice interfaces beyond basic interactions so that they can perform tasks and respond dynamically. OpenAI has also included safeguards to prevent misuse of these tools.
© TechCrunch AI
Basata is automating the referral process between primary care doctors and specialists, a critical bottleneck in healthcare. Founded by Kaled Alhanafi and Chetan Patel, the company uses AI to process referrals and schedule appointments, aiming to close the gap that often leaves patients waiting. With a recent $21 million Series A led by Basis Set Ventures, Basata is positioned to expand its operations, focusing initially on cardiology and urology. The approach is meant to reduce administrative burden and to improve patient care by speeding up scheduling.
Pit, a new AI startup from the co-founders of Voi, has secured a $16 million seed round led by Andreessen Horowitz. The company builds custom software that automates enterprises' internal processes, freeing employees to focus on core business tasks. Unlike many competitors, Pit offers an 'AI product team as a service,' letting enterprises guide the AI toward their specific automation needs. This could set Pit apart in a crowded market, especially as it draws on its European roots to serve industrial sectors.
OpenAI has introduced Trusted Contact, an optional ChatGPT safety feature that alerts a designated person if self-harm is mentioned in a user's conversations. Users select a trusted contact, who is notified when the AI detects a potential self-harm discussion and is encouraged to check in with the user. The move comes in response to lawsuits alleging that ChatGPT has been involved in cases of suicide. The feature is part of OpenAI's ongoing efforts to manage AI interactions responsibly and to work with experts on safety measures.
The b9060 release of llama.cpp adds several SYCL operations, including FILL, CUMSUM, and DIAG. It also fixes an issue that caused aborts during test-backend-ops, and it adds scope_dbg_print to both the new and the existing SYCL operations to aid debugging.
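For readers unfamiliar with the three operation names, the following is a minimal sketch of what FILL, CUMSUM, and DIAG conventionally compute. It is plain Python and not the SYCL kernels from the release; the function names `fill`, `cumsum`, and `diag` and their signatures are hypothetical, and the assumption that CUMSUM runs along a single row is ours, not something the release notes state.

```python
from itertools import accumulate


def fill(shape, value):
    """Return a rows x cols matrix with every element set to `value`."""
    rows, cols = shape
    return [[value] * cols for _ in range(rows)]


def cumsum(row):
    """Inclusive running sum along one row (assumed axis, for illustration)."""
    return list(accumulate(row))


def diag(vec):
    """Square matrix with `vec` on the main diagonal and zeros elsewhere."""
    n = len(vec)
    return [[vec[i] if i == j else 0.0 for j in range(n)] for i in range(n)]
```

For example, `cumsum([1, 2, 3, 4])` yields `[1, 3, 6, 10]`, and `diag([1.0, 2.0])` yields a 2 x 2 matrix with 1.0 and 2.0 on the diagonal.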
The b9066 release of llama.cpp integrates cublasSgemmStridedBatched for CUDA users, optimizing the inner loops of batched operations with the aim of improving performance. The release also extends compatibility to macOS on Apple Silicon, Ubuntu with ROCm, and Windows with CUDA 12 and 13. No new models are introduced.
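To make the idea of a strided batched GEMM concrete, here is a reference sketch in plain Python of the semantics that cuBLAS documents for cublasSgemmStridedBatched in the no-transpose case: for each batch index i, C_i = alpha * A_i * B_i + beta * C_i, with every matrix stored column-major and matrix i starting at offset i * stride in one flat buffer. The Python function name and argument order are illustrative only, and the sketch says nothing about how llama.cpp actually calls the routine.

```python
def sgemm_strided_batched(m, n, k, alpha, A, lda, stride_a,
                          B, ldb, stride_b, beta, C, ldc, stride_c,
                          batch_count):
    """Reference semantics, no transposes: C_i = alpha * A_i @ B_i + beta * C_i.

    A, B, C are flat lists holding column-major matrices; matrix i of each
    operand begins at index i * stride. One call replaces a host-side loop
    that would otherwise launch one GEMM per matrix.
    """
    for i in range(batch_count):
        a0, b0, c0 = i * stride_a, i * stride_b, i * stride_c
        for col in range(n):
            for row in range(m):
                acc = 0.0
                for p in range(k):
                    # Column-major indexing: element (r, c) lives at r + c * ld.
                    acc += A[a0 + row + p * lda] * B[b0 + p + col * ldb]
                idx = c0 + row + col * ldc
                C[idx] = alpha * acc + beta * C[idx]
    return C


# Two 2x2 products in one call: identity @ B0 and (2 * identity) @ B1.
A = [1.0, 0.0, 0.0, 1.0, 2.0, 0.0, 0.0, 2.0]
B = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
C = [0.0] * 8
result = sgemm_strided_batched(2, 2, 2, 1.0, A, 2, 4, B, 2, 4, 0.0, C, 2, 4, 2)
```

The benefit on a GPU comes from issuing the whole batch as one library call instead of looping over small GEMMs on the host.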
The b9070 release of llama.cpp adds Q4_0 MoE GEMM support for Adreno GPUs via OpenCL. The update matters most to developers targeting mobile platforms, since it optimizes performance on Qualcomm's Adreno graphics. It also includes housekeeping changes such as whitespace fixes and the removal of unused code. No new models are introduced.
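As background on the Q4_0 format named above, the sketch below shows block quantization as it is commonly described for ggml: 32 weights share one scale, and each weight is stored as a 4-bit value with an offset of 8, two per byte. This is an illustration under those assumptions, not the OpenCL kernel from the release; the function names are hypothetical, the scale is kept as a Python float rather than a 16-bit float, and the MoE GEMM itself is not shown.

```python
QK4_0 = 32  # weights per block (assumed Q4_0 block size)


def quantize_q4_0_block(x):
    """Quantize 32 floats to a (scale, 16 packed bytes) pair."""
    assert len(x) == QK4_0
    # Use the element with the largest magnitude, keeping its sign,
    # so that it maps exactly to the 4-bit code 0.
    vmax = max(x, key=abs)
    d = vmax / -8.0
    inv_d = 1.0 / d if d != 0.0 else 0.0
    half = QK4_0 // 2
    qs = []
    for j in range(half):
        lo = min(15, int(x[j] * inv_d + 8.5))         # first half: low nibble
        hi = min(15, int(x[j + half] * inv_d + 8.5))  # second half: high nibble
        qs.append(lo | (hi << 4))
    return d, bytes(qs)


def dequantize_q4_0_block(d, qs):
    """Recover 32 approximate floats from a scale and 16 packed bytes."""
    half = QK4_0 // 2
    out = [0.0] * QK4_0
    for j, byte in enumerate(qs):
        out[j] = ((byte & 0x0F) - 8) * d
        out[j + half] = ((byte >> 4) - 8) * d
    return out


weights = [float(i - 16) for i in range(QK4_0)]
scale, packed = quantize_q4_0_block(weights)
restored = dequantize_q4_0_block(scale, packed)
```

Each block therefore costs 16 bytes of codes plus one scale, and every restored weight lies within one scale step of the original.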