Models & Labs

ChatGPT Enhances Memory for Improved Context

OpenAIJune 4, 2026high confidence

Why it matters

→Enhances personalization by remembering user preferences across sessions.
→Improves AI's ability to maintain context, leading to more relevant interactions.
→Represents a step towards more human-like AI communication.

OpenAI has introduced a new memory system for ChatGPT, enhancing its ability to remember user preferences and maintain context across conversations. This update aims to make the AI more helpful by providing personalized and relevant responses based on past interactions. The improvement addresses a common limitation in AI chatbots, which often struggle to retain context over time. This advancement could significantly improve user experience, making ChatGPT a more effective and reliable tool for ongoing interactions.

Read original

More from OpenAI

Models & Labsmodels

GPT-5.6 Triples Scores on ARC-AGI-3 Benchmark

OpenAI's GPT-5.6 has made a notable leap in performance on the ARC-AGI-3 benchmark by activating two particular API settings. These settings, which focus on maintaining reasoning capabilities and enabling compaction, have resulted in a threefold increase in the model's scores. This achievement illustrates how targeted configuration changes can significantly enhance AI performance without the need for extensive architectural modifications. The improvement not only boosts the model's efficiency but also highlights the potential of optimizing existing systems to achieve superior results.

OpenAIJul 29, 2026

Models & Labsmodels

OpenAI Offers Free ChatGPT Access to Researchers

OpenAI is making a significant move by providing 100,000 academic researchers with free access to its most advanced ChatGPT models. This initiative aims to enhance scientific research and collaboration by leveraging AI's capabilities in data analysis and hypothesis generation. By removing financial barriers, OpenAI is fostering an environment where researchers can explore new ideas and accelerate discoveries. This could lead to breakthroughs across various scientific fields, as researchers now have a powerful tool at their disposal without the usual cost constraints.

OpenAIJul 29, 2026

Models & Labsmodels

GPT-5.6 Enhances AI Efficiency and Intelligence

OpenAI's release of GPT-5.6 marks a notable step in AI development by enhancing efficiency across various models and workflows. This version promises to deliver more intelligence per dollar, making AI applications more cost-effective and accessible. By optimizing inference and agentic workflows, GPT-5.6 aims to streamline processes and improve performance. While it doesn't introduce groundbreaking new features, it represents a significant refinement in how AI can be deployed more economically. This release is particularly relevant for developers looking to maximize the utility of AI without escalating costs.

OpenAIJul 29, 2026

More in Models & Labs

Models & Labsmodels

Llama.cpp adds GLM-5.2 speculative decoding support

Llama.cpp's latest update introduces speculative decoding support for GLM-5.2, enhancing its capabilities with NextN/MTP features. This addition allows for more efficient tensor loading and context management, particularly benefiting models using the GLM_DSA architecture. The update also includes options for exporting models with or without the MTP feature, providing flexibility for developers. This release marks a step forward in optimizing model performance and adaptability, especially for those leveraging the GLM-5.2 framework.

llama.cpp ReleasesJul 30, 2026

Models & Labsmodels

Llama.cpp b10178 Release Adds Trace Logging

The b10178 release of llama.cpp enhances its server capabilities by adding trace logging for slot similarity checking, offering developers detailed insights into prompt cache slot selection processes. This update includes specifics on skip reasons and similarity calculations, which can aid in performance optimization. While no new model architectures are introduced, the release continues to support a wide array of platforms, such as macOS with KleidiAI, Ubuntu with ROCm 7.2, and Windows with CUDA 12 and 13. This makes llama.cpp a more versatile tool for developers working on different systems, reinforcing its position as a comprehensive inference runtime.

llama.cpp ReleasesJul 30, 2026

Models & Labsmodels

llama.cpp b10180 Release Enhances SYCL Performance

The b10180 release of llama.cpp brings notable improvements to SYCL performance, focusing on unary elementwise operations. By introducing a contiguous fast path and employing 32-bit index math, the update aims to boost computational efficiency. The integration of fastdiv for elementwise index math further enhances processing speed. Although there are no new models in this release, llama.cpp continues to evolve as a flexible inference runtime, now more efficient on systems like macOS, Linux, and Windows. Developers working with SYCL can expect smoother and faster operations, reinforcing llama.cpp's adaptability across different computing environments.

llama.cpp ReleasesJul 30, 2026