Models & Labs

Kimi K2.7 Code Rivals Claude Fable 5 in Landing Page Test

Together AI BlogJune 17, 2026high confidence

Why it matters

→Open-source models like Kimi K2.7 Code can deliver competitive quality at a fraction of the cost of proprietary models.
→Providing models with visual context can significantly enhance the quality of AI-generated content.
→Cost-effective AI solutions enable more iterations and experimentation, crucial for design and development workflows.

Kimi K2.7 Code Rivals Claude Fable 5 in Landing Page Test — ©Together AI Blog

Together AI conducted an experiment comparing the open-source Kimi K2.7 Code with the proprietary Claude Fable 5 in generating landing pages. Kimi proved to be significantly more cost-effective, being 16 times cheaper on average, while delivering comparable quality. The use of a custom MCP server to provide visual inspiration improved Kimi's output, making it a viable alternative for developers. This study underscores the potential of open-source models to deliver high-quality results at a fraction of the cost.

Read original

More in Models & Labs

Models & Labsmodels

v0.22.1rc1: Docker Update for vLLM

The latest release candidate for vLLM, version 0.22.1rc1, introduces a change in the Docker setup by removing the use of extra-index-url for the flashinfer-jit-cache. This adjustment simplifies the Docker configuration, potentially reducing dependency management issues and improving build reliability. While this update might seem minor, it reflects ongoing efforts to streamline the development process and enhance the usability of vLLM for developers. This change is particularly relevant for those maintaining Docker environments and looking for more efficient ways to manage dependencies.

vLLM ReleasesJun 18, 2026

Models & Labsmodels

Llama.cpp b9688 Release Enhances Model Management

The latest b9688 release of llama.cpp introduces significant updates to its server capabilities, including a new model management API and real-time SSE updates. These enhancements aim to streamline the deployment and management of AI models, making it easier for developers to integrate and maintain models in various environments. The update also includes a download API and a delete endpoint, providing more control over model assets. While the release doesn't introduce new models, it strengthens the infrastructure, making llama.cpp a more robust choice for developers working with diverse hardware configurations.

llama.cpp ReleasesJun 18, 2026

Models & Labsmodels

Llama.cpp b9689 Release Adds Metal Backend Support

The latest release of llama.cpp, version b9689, enhances its Metal backend by adding support for f16 and bf16 tensor types in the concat operator. This update broadens the compatibility of the Metal backend, which previously supported only f32 and i32 types. By templating the kernel_concat on type T and adding type-specific pipeline getters, the release ensures more efficient processing across different data types. This development is particularly relevant for developers working on macOS and iOS platforms, as it expands the capabilities of AI models running on Apple Silicon and other supported devices.

llama.cpp ReleasesJun 18, 2026