16 × AIAI signal, amplified
AI newsAboutSources
TelegramFollow on Telegram
AI newsAboutSources
16 × AIAI signal, amplified

An AI news engine that ingests trusted sources, scores with Claude, and posts only what clears the bar.

Follow on Telegram →

Subscribe

  • Telegram
  • RSS
  • All channels

Legal

  • Privacy
  • Imprint
© 2026 16 × AI. All rights reserved.Curated by Claude. Posts every 6 hours. No newsletter, no funnel.
Home/Models & Labs
Models & Labs

NVIDIA Unveils AI Factories for Continuous Intelligence

NVIDIA Blog·May 27, 2026·high confidence

Why it matters

  • →AI factories represent a shift from traditional data centers to continuous intelligence production.
  • →They optimize performance per watt, significantly reducing the cost of AI inference at scale.
  • →This infrastructure enables enterprises to integrate AI more deeply into daily operations, enhancing productivity.
NVIDIA Unveils AI Factories for Continuous Intelligence
©NVIDIA Blog

NVIDIA has introduced the concept of AI factories, a new class of infrastructure aimed at producing continuous intelligence. These factories convert energy into tokens, optimizing performance per watt to enhance efficiency and reduce costs. Utilizing advanced technologies like the NVIDIA Blackwell Ultra GPU and the Vera Rubin platform, AI factories promise up to 50x higher throughput per megawatt compared to previous generations. This development signifies a shift from traditional data centers to AI as a core infrastructure, enabling enterprises to scale AI capabilities more effectively.

Read original

More from NVIDIA Blog

NVIDIA Advances Robotics with Simulation-to-Real Transfer© NVIDIA Blog
Researchresearch

NVIDIA Advances Robotics with Simulation-to-Real Transfer

NVIDIA's latest research is pushing the boundaries of robotics by enhancing the transition from simulation to real-world applications. At the ICRA conference, NVIDIA showcased eight papers that highlight advancements in robotic perception, reasoning, and action across unpredictable environments. These innovations include multi-arm coordination, adaptive grasping, and navigation across diverse robot bodies, all trained in simulation without real-world data. This approach not only speeds up robotic processes but also improves success rates significantly, marking a step forward in creating adaptable and reliable autonomous robots.

NVIDIA Blog·May 28, 2026
NVIDIA Vera CPU Challenges Intel and AMD© NVIDIA Blog
Models & Labsmodels

NVIDIA Vera CPU Challenges Intel and AMD

NVIDIA's new Vera CPU is making waves with its impressive performance in AI-centric workloads, challenging the dominance of Intel and AMD. Featuring 88 custom Olympus cores and a remarkable 1.2TB/s memory bandwidth, Vera is designed to handle the demanding tasks of modern AI factories efficiently. Initial benchmarks by Phoronix highlight its superior memory performance and power efficiency, particularly in comparison to traditional x86 CPUs. This positions Vera as a formidable competitor in the CPU market, offering a significant generational leap over NVIDIA's previous Grace CPU. As Vera becomes available through partners, it promises to redefine performance standards in AI infrastructure.

NVIDIA Blog·May 26, 2026

More in Models & Labs

Models & Labsmodels

vLLM v0.20.2 Patch Release

The vLLM v0.20.2 release is a minor update focusing on bug fixes for DeepSeek V4, gpt-oss, and Qwen3-VL. This patch addresses specific issues such as the MTP=1 hang on DeepSeek V4 by re-enabling the persistent topk path and fixing a KV cache allocation error. For gpt-oss, the update ensures compatibility with MXFP4 under torch.compile, while Qwen3-VL sees the removal of an invalid boundary check. These fixes enhance the stability and performance of the models, ensuring smoother operations under various conditions.

vLLM Releases·May 29, 2026
Models & Labsmodels

Llama.cpp b9387 Release Enhances AMD MFMA Performance

The latest b9387 release of llama.cpp introduces significant performance improvements for AMD MFMA hardware, particularly in quantized matrix multiplication. By optimizing the batch threshold logic, the update allows for more efficient processing, with throughput gains of up to 76% in certain configurations. This release is particularly relevant for users leveraging AMD's MI250X hardware, as it fine-tunes the kernel selection logic to maximize performance. While the update doesn't introduce new models, it significantly enhances the efficiency of existing operations on specific hardware, making it a noteworthy development for those using AMD GPUs.

llama.cpp Releases·May 29, 2026
Models & Labsmodels

llama.cpp b9388 release enhances Turing support

The latest b9388 release of llama.cpp introduces optimizations for Turing architecture, specifically adding MMVQ_PARAMETERS_TURING to improve JIT compilation for SM75 Turing devices. This update aims to prevent mismatches when compiling Turing device code on Ampere or newer architectures. While the release doesn't introduce new models or quantization methods, it continues to expand platform support, including updates for macOS, Linux, and Windows. The focus remains on refining compatibility and performance across diverse hardware configurations, making llama.cpp a more versatile tool for developers.

llama.cpp Releases·May 29, 2026