Models & Labs

OpenAI and Broadcom unveil LLM-optimized chip

OpenAIJune 24, 2026high confidence

Why it matters

→Jalapeño is tailored for LLM inference, potentially reducing computational costs.
→The chip enhances performance and efficiency, aiding in AI system scalability.
→Specialized hardware like Jalapeño could make advanced AI models more accessible.

OpenAI and Broadcom have announced the release of Jalapeño, a custom AI chip designed to optimize inference for large language models (LLMs). The chip is engineered to improve performance and efficiency, which could help AI systems scale more effectively. This collaboration highlights a move towards specialized hardware that can handle the demanding computational needs of AI models. Jalapeño's introduction could lead to reduced costs and increased accessibility for developers working with LLMs.

Read original

More from OpenAI

Researchagents

OpenAI Paper Explores AI Agents in Work Transformation

OpenAI's latest research paper examines the transformative potential of AI agents in the workplace. These agents are not merely automating simple tasks; they are enabling longer and more complex workflows, which could significantly boost productivity across various roles. The study reveals how AI agents can manage multi-step tasks, potentially reshaping how work is structured and executed. This development suggests a future where AI agents are integral to workplace efficiency, offering a glimpse into how roles might evolve with AI integration.

OpenAIJun 25, 2026

Researchresearch

GPT-5 aids in solving immunology mystery

GPT-5 Pro has made a notable impact in the field of immunology by resolving a complex issue related to T cell behavior that had puzzled researchers for three years. This achievement opens new avenues for cancer and autoimmune disease research, demonstrating AI's potential to contribute to scientific breakthroughs. By offering innovative data analysis and insights, GPT-5 Pro proves its value beyond conventional applications, potentially speeding up medical discoveries. This development signifies a shift in how AI can be utilized to tackle intricate biological challenges, setting the stage for future advancements in healthcare.

OpenAIJun 23, 2026

More in Models & Labs

Models & Labsmodels

Llama.cpp b9784 Release Enhances Hexagon Performance

The latest b9784 release of llama.cpp brings significant optimizations to Hexagon's matrix multiplication capabilities. By reworking the MUL_MAT and MUL_MAT_ID operations, the update introduces a 32x32 tiled weight repack and improved kernel parameters, enhancing performance and efficiency. These changes aim to optimize register usage and streamline activation processing, particularly benefiting users leveraging Hexagon's architecture. This release doesn't introduce new models but focuses on refining existing processes, making llama.cpp more robust for developers working with diverse hardware configurations.

llama.cpp ReleasesJun 26, 2026

Models & Labsmodels

llama.cpp b9788 release enhances dual-GPU support

The latest release of llama.cpp, b9788, introduces significant improvements for dual-GPU setups with SYCL support, particularly enhancing tensor parallelism. By implementing a degenerate ring all-reduce for dual-GPU configurations, the update optimizes performance for both small and large tensor operations, mirroring CUDA's NCCL allreduce pattern. This release notably boosts performance metrics, with Llama-3.3-70B and Qwen3-Coder-Next-80B-A3B models showing substantial speed improvements. The update positions llama.cpp as a more competitive option for multi-GPU environments, without adding new dependencies or altering build configurations.

llama.cpp ReleasesJun 26, 2026

Models & Labsmodels

OpenAI Develops Custom Chip 'Jalapeño'

OpenAI has announced the development of its first custom chip, named 'Jalapeño'.

The AI Daily BriefJun 25, 2026