Hugging Face Introduces Falcon Perception Model

Hugging Face Blog, April 1, 2026 (medium confidence)

Why it matters

→ Falcon Perception handles open-vocabulary grounding and segmentation with a single 0.6-billion-parameter early-fusion model that processes image and text together.

Hugging Face has launched Falcon Perception, a 0.6-billion-parameter early-fusion Transformer that processes image and text together for open-vocabulary grounding and segmentation. It uses a hybrid attention mask and a structured token interface, and reaches a Macro-F1 score of 68.0 on the SA-Co benchmark, which the announcement reports as higher than previous models. The release also includes Falcon OCR, a 0.3-billion-parameter model for OCR tasks that is reported to score highly on OCR benchmarks.
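
The summary does not say how the hybrid attention mask is laid out. One common pattern in early-fusion models, shown here purely as an illustration, lets image tokens attend to each other bidirectionally while text tokens attend to all image tokens and only to earlier text tokens. The function name, the image-first token ordering, and the masking rule below are assumptions for this sketch, not details confirmed for Falcon Perception.

```python
def hybrid_attention_mask(num_image_tokens: int, num_text_tokens: int) -> list[list[bool]]:
    """Return mask[q][k], True when query position q may attend to key position k.

    ASSUMPTION (not from the source): image tokens come first in the sequence,
    attend to each other bidirectionally, and are visible to every text token;
    text tokens attend causally to other text tokens.
    """
    total = num_image_tokens + num_text_tokens
    mask = [[False] * total for _ in range(total)]
    for q in range(total):
        for k in range(total):
            if k < num_image_tokens:
                # Image keys are visible to every query (image and text alike).
                mask[q][k] = True
            else:
                # Text keys are visible only to text queries at or after them.
                mask[q][k] = q >= k
    return mask


if __name__ == "__main__":
    # Two image tokens followed by two text tokens.
    for row in hybrid_attention_mask(2, 2):
        print("".join("1" if allowed else "0" for allowed in row))
```

With two image tokens and two text tokens, the image rows can see only the image block, and each text row sees the image block plus the text tokens up to and including itself.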

