Open Source

Latest AI signals in this category

llama.cpp b10175 Release Expands Platform Support

The latest b10175 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile tool for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for Windows, macOS, and Linux, ensuring that developers can leverage llama.cpp's capabilities regardless of their hardware setup. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime option.

The b10088 release of llama.cpp marks another step in broadening its platform compatibility, making it a valuable tool for developers working across different systems. This update introduces support for Ubuntu with ROCm 7.2, which is particularly beneficial for those using AMD GPUs, offering enhanced performance. The release continues to support a wide array of platforms, including Windows, macOS, and Linux, ensuring developers can utilize llama.cpp's capabilities on their preferred systems. While there are no groundbreaking new features, the ongoing expansion of platform support strengthens llama.cpp's role as a flexible inference runtime for various computing environments.

llama.cpp ReleasesJul 23, 2026

Open Sourcemodels

llama.cpp b10059 Release Expands Platform Support

The b10059 release of llama.cpp enhances its platform compatibility, now supporting numerous operating systems and architectures. A key change is the defaulting of Hadamard multiplication to a CPU routine, which may lead to more consistent performance across setups. Although KleidiAI support for Apple Silicon is currently disabled, the release still accommodates platforms like macOS, Windows, and Linux, with configurations such as Vulkan and ROCm 7.2. While no new models are introduced, this update solidifies llama.cpp's role as a flexible inference runtime across diverse hardware environments.

The b9950 release of llama.cpp is a technical update that addresses specific platform issues and enhances code reliability with new unit tests for llama-batch. It resolves build problems on Win32 and introduces assertions for methods that are not yet implemented. While this update doesn't bring new models or groundbreaking features, it ensures compatibility across a wide array of systems, including macOS, Linux, Windows, and openEuler. This release is a step towards refining the software's robustness and usability across different hardware configurations, making it more stable and reliable for developers.

llama.cpp ReleasesJul 11, 2026

Open Sourcecoding

llama.cpp b9957 Release Enhances Tools and Builds

The b9957 release of llama.cpp brings notable improvements to its server tools and build processes, enhancing the development experience. With the introduction of a new tools_io abstraction and improvements to the edit tool, developers can expect more streamlined workflows. The update also addresses build issues and reorganizes utilities into class members, indicating a move towards a more structured codebase. While it doesn't introduce groundbreaking features, the inclusion of ROCm 7.2 and support for CUDA 12 and 13 DLLs highlights a focus on compatibility and performance. These enhancements make llama.cpp a more stable and reliable option for developers working on Apple Silicon, Windows, and Linux systems.

llama.cpp ReleasesJul 11, 2026

Open Sourceresearch

Hugging Face CEO Advocates for Open Source AI

Clem Delangue, CEO of Hugging Face, underscores the critical role of open source AI, comparing the platform to a GitHub for AI models and datasets. He observes that as companies expand, they often move from expensive proprietary APIs to more affordable open source options, which he believes is essential for democratizing AI technology. Delangue voices concerns about the risk of a few large companies dominating the AI landscape, advocating for openness and transparency, particularly in the field of robotics. This approach is reflected in Hugging Face's decision to focus on capital efficiency rather than traditional fundraising, even declining a significant investment offer from Nvidia to stay true to its open source principles.

TechCrunch AIJul 10, 2026

Open Sourcemodels

llama.cpp b9930 Release Expands Platform Support

The b9930 release of llama.cpp marks another step in broadening its platform reach, now covering macOS, Linux, Windows, and openEuler. This update includes Ubuntu builds with ROCm 7.2, which boosts performance for AMD GPU users, and Windows builds with CUDA 12 and 13, catering to NVIDIA users. While no new models are introduced, the focus is on enhancing compatibility across various hardware setups. By supporting both CPU and GPU environments, llama.cpp is positioning itself as a versatile inference runtime for a wide range of users.

llama.cpp ReleasesJul 9, 2026

Open Sourcemodels

Llama.cpp b9932 Release Enhances Performance

The b9932 release of llama.cpp is all about boosting performance, particularly by turning off the FA mask_opt on GCN for Vulkan, which should lead to better efficiency. It also brings back mask optimization for attention head sizes over 256, showing a clear focus on computational refinement. While no new models are introduced, the update broadens compatibility with macOS, Linux, Windows, and openEuler, offering specific builds for Vulkan, ROCm, and CUDA. This release is a significant step in making llama.cpp more adaptable and efficient across different hardware setups, ensuring it meets the needs of diverse computing environments.

The latest b9776 release of llama.cpp continues its trend of broadening platform compatibility, making it a versatile choice for developers across different systems. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains a wide array of builds for macOS, Windows, and Linux, ensuring that developers can leverage llama.cpp's capabilities on their preferred platforms. While there are no groundbreaking new features, the consistent expansion of platform support solidifies llama.cpp's position as a flexible inference runtime.

The b9686 release of llama.cpp focuses on enhancing compatibility across a wide array of systems, though it doesn't introduce major new features. This update includes ROCm 7.2 support on Ubuntu x64, providing a significant boost for AMD GPU users who prefer alternatives to NVIDIA's CUDA. Developers can now utilize llama.cpp on various configurations, including macOS, Linux, Windows, and openEuler, ensuring they have the tools needed for AI inference tasks. While the release lacks groundbreaking changes, it strengthens llama.cpp's reputation as a flexible and accessible tool for AI developers working on different hardware setups.

llama.cpp ReleasesJun 18, 2026

Open Sourcemodels

llama.cpp b9692 Release Expands Platform Support

The latest b9692 release of llama.cpp continues its trend of broadening platform compatibility, now supporting a wide array of systems including macOS, Linux, Windows, and openEuler. Notably, this update includes support for ROCm 7.2 on Ubuntu x64, which is significant for AMD GPU users seeking alternatives to NVIDIA's CUDA. The release also maintains support for Vulkan and OpenVINO across different environments, ensuring flexibility for developers working with diverse hardware. While no new model architectures are introduced, this update solidifies llama.cpp's position as a versatile inference runtime across various environments.

llama.cpp ReleasesJun 18, 2026

Open Sourcecoding

GitHub Limits Open Pull Requests for Non-Writers

GitHub has introduced a new feature allowing repository maintainers to set a cap on the number of open pull requests from users without write access. This change aims to streamline the management of contributions by reducing the clutter of low-quality or drive-by pull requests. Maintainers can also designate trusted contributors who can exceed this limit without needing full collaborator access. This update is designed to help maintainers focus on meaningful contributions and reduce unnecessary review and CI overhead.

The latest b9571 release of llama.cpp continues its trend of broadening platform compatibility, notably adding support for ROCm 7.2 on Ubuntu x64. This update ensures that AMD GPU users can leverage llama.cpp more effectively, narrowing the gap with NVIDIA's CUDA. The release also maintains a focus on diverse operating systems, including macOS, Windows, and openEuler, though some features like KleidiAI on Apple Silicon remain disabled. This iteration doesn't introduce new models but solidifies llama.cpp's position as a versatile inference runtime across multiple environments.

The b9496 release of llama.cpp continues to broaden its platform compatibility, although some features are notably absent. MacOS Apple Silicon users will find KleidiAI support disabled, while Ubuntu gains strength with ROCm 7.2 and Vulkan support. Windows users benefit from the inclusion of CUDA 12 and 13 DLLs, which enhance GPU performance options. Despite certain features being disabled, this release highlights llama.cpp's ongoing commitment to being a versatile inference runtime across diverse systems. The focus remains on improving accessibility and performance across various hardware configurations.

llama.cpp ReleasesJun 4, 2026

Open Sourceresearch

Google Open Sources AI Hydrology Model for Flood Forecasting

Google has open-sourced its advanced AI-based hydrology model, aiming to enhance global flood forecasting capabilities. This move allows National Meteorological and Hydrological Services to integrate sophisticated AI tools into their workflows, potentially improving the accuracy and timeliness of flood warnings. By releasing the model on GitHub, Google empowers local experts to refine and adapt the technology using their own data, fostering a more resilient approach to flood management. This initiative democratizes access to cutting-edge forecasting tools, especially benefiting regions with limited resources.

The b9331 release of llama.cpp brings a strategic overhaul to its continuous integration workflows, focusing on efficiency by isolating tasks into separate workflows. This update includes the extraction of Android and HIP tasks, alongside the relocation of WebGPU and RPC tasks into distinct workflows. Additionally, the release halts SYCL f16 builds and optimizes pull request jobs by aligning backend paths. While there are no new model architectures introduced, this release aims to streamline development processes and enhance build management across diverse environments.

llama.cpp ReleasesMay 27, 2026

Open Sourcemodels

llama.cpp b9333 release expands platform support

The b9333 release of llama.cpp marks a significant expansion in its platform reach, enhancing its utility across various systems. With this update, macOS Apple Silicon users can now leverage KleidiAI, while Ubuntu users benefit from Vulkan and ROCm 7.2 enhancements. Windows compatibility is also improved with the inclusion of CUDA 12 and 13 DLLs, and openEuler architectures are now part of the supported lineup. Although there are no new model architectures in this release, llama.cpp is becoming a more versatile inference runtime, catering to a broader range of hardware configurations.

llama.cpp ReleasesMay 27, 2026

Open Sourcemodels

llama.cpp b9351 Release Expands Platform Support

The b9351 release of llama.cpp continues to broaden its platform compatibility, notably integrating ROCm 7.2 on Ubuntu x64, which enhances performance for AMD GPU users. This update also includes KleidiAI support for macOS Apple Silicon, making it easier for developers on M-series Macs to leverage ARM-tuned capabilities. While some features like SYCL FP32 on Ubuntu and Windows remain disabled, the release highlights llama.cpp's commitment to being a versatile inference runtime across diverse systems. This update doesn't introduce new models but strengthens the infrastructure for existing ones.

The b9273 release of llama.cpp marks a significant step in broadening its reach, now supporting a wider array of systems. Developers using macOS Apple Silicon can now benefit from KleidiAI, while Ubuntu users gain access to ROCm 7.2, enhancing GPU performance. Windows developers aren't left out, with new support for CUDA 12 and 13, making it easier to integrate llama.cpp into existing workflows. Although no new models are introduced, the focus on improving the runtime environment makes it a more adaptable tool for AI inference. This release underscores llama.cpp's commitment to being a versatile solution for developers seeking robust AI capabilities.

The b9150 release of llama.cpp continues its trend of broadening platform compatibility, now including support for macOS Apple Silicon with KleidiAI enabled and a variety of Linux configurations such as Ubuntu with ROCm 7.2 and Vulkan. This update also enhances Windows support with CUDA 12 and 13 DLLs, making it more versatile for developers working across different environments. While there are no groundbreaking new features, the release solidifies llama.cpp's position as a flexible inference runtime for diverse hardware setups. Developers can now leverage these updates to optimize performance across a wider range of systems.

llama.cpp ReleasesMay 15, 2026

Open Sourcemodels

llama.cpp b9159 Release Expands Platform Support

The latest b9159 release of llama.cpp significantly broadens its platform compatibility, making it more accessible to a diverse range of users. With new builds for macOS, Linux, Windows, and Android, the update includes support for Apple Silicon, Vulkan, ROCm 7.2, and CUDA 13. This expansion means developers can now leverage llama.cpp across more environments, enhancing its utility for AI inference tasks. While there are no new model architectures, the focus on platform diversity ensures that llama.cpp remains a versatile tool for developers working with different hardware configurations.

The b9144 release of llama.cpp enhances its adaptability by optimizing for specific hardware setups, particularly through the ggml-webgpu update. This ensures subgroup-matrix paths are utilized only when head dimensions meet certain divisibility conditions, improving efficiency. The release broadens support across macOS, Linux, Windows, and Android, with significant improvements for Apple Silicon, Vulkan, and CUDA environments. By focusing on these enhancements, llama.cpp strengthens its role as a flexible tool for developers working with a wide range of hardware configurations, even if no groundbreaking features are introduced.

Hugging Face BlogAug 14, 2025

Open Sourcevideo

Open source video model Wan 2.2 released

Replicate has announced Wan 2.2, their fastest and cheapest open source video model to date.

Replicate BlogJul 31, 2025

Open Sourceother

The Frontier is Open

Together AI has announced the opening of their new platform, allowing developers to access and utilize their AI tools more freely.

Together AI BlogJun 9, 2025

Open Sourceresearch

Common Pile v0.1 Dataset Released

EleutherAI has announced the release of Common Pile v0.1, an 8TB dataset consisting of public domain and openly licensed text.

EleutherAI BlogJun 5, 2025

Open Sourcevideo

Fine-tune open-source video models now available

Users can now train their own versions of Tencent's HunyuanVideo for style, motion, and character customization on the Replicate platform.

Replicate BlogJan 24, 2025

Open Sourcemodels

FLUX fine-tunes optimization announced

Replicate has improved the speed of running fine-tunes for FLUX, and these optimizations are available as open-source.

Replicate BlogNov 26, 2024

Open Sourceother

FLUX Optimizations Released as Open Source

FLUX has been optimized for speed on Replicate, and these improvements have been made available as open-source for further development.

Replicate BlogOct 10, 2024

Open Sourceimage

New Open Source Image Model and Tools Released

Replicate has announced an open source frontier image model that allows users to cut objects from videos, along with a new Python web framework developed by Jeremy Howard.

Replicate BlogAug 2, 2024

Open Sourceother

Open Source Pipeline for Auto-Interpretability Released

EleutherAI has announced the development of an open-source pipeline aimed at enhancing the interpretability of sparse autoencoder features.

EleutherAI BlogJul 30, 2024

Open Sourceimage

Run Stable Diffusion 3 Locally with ComfyUI

Users can now run Stable Diffusion 3 on their own machines using ComfyUI by executing a few terminal commands. This allows for local experimentation with the model on GPU-powered systems.

Replicate BlogJun 14, 2024

Open Sourceother

Replicate Intelligence #1 Overview

The Replicate Blog discusses a DIY implementation of Llama 3, introduces open-source smart glasses, and explores steering language models using dictionary learning techniques.

Replicate BlogMay 24, 2024

Open Sourcemusic

Voice Cloning with Open-Source Models

Replicate has introduced fine-tuning for realistic voice cloning (RVC), allowing users to train models on their own datasets from YouTube videos using a simple code interface.

Replicate BlogDec 6, 2023

Open Sourceother

Minetester: Open RL Environment on Minetest

Minetester is introduced as a fully open reinforcement learning environment built on the Minetest platform, along with an overview of its preliminary work.

EleutherAI BlogJul 8, 2023

Open Sourceother

EleutherAI Yearly Retrospective Released

EleutherAI has published a detailed retrospective covering their activities over the past year.

EleutherAI BlogMar 26, 2023

Open Sourcecoding

Hugging Face Introduces Skops for scikit-learn Models

Hugging Face has launched Skops, a new library designed to streamline the process of hosting scikit-learn models on the Hugging Face Hub. This tool allows developers to create detailed model cards, enhancing documentation and collaboration. By integrating Skops, users can easily serialize models, generate configuration files, and push them to the Hub, making them accessible for inference and further development. This release marks a significant step in making machine learning models more shareable and reproducible, particularly for those working with scikit-learn.

Hugging Face BlogAug 12, 2022

Open Sourceother

Open Source Large Language Model for AI Safety

EleutherAI discusses the benefits of releasing a large language model as a means to enhance AI safety. The blog outlines their reasoning behind this belief.

EleutherAI BlogJun 2, 2021