Models & Labs

CyberSecQwen-4B: Specialized Cybersecurity Model Released

Hugging Face BlogMay 8, 2026high confidence

Why it matters

→CyberSecQwen-4B offers a deployable solution for cybersecurity tasks on consumer-grade hardware.
→It provides a cost-effective alternative to larger models without significant loss in accuracy.
→The model addresses privacy concerns by allowing sensitive data to remain internal.

CyberSecQwen-4B: Specialized Cybersecurity Model Released — ©Hugging Face Blog

Hugging Face has introduced CyberSecQwen-4B, a specialized AI model for defensive cybersecurity tasks. This model is designed to run locally on consumer-grade GPUs, making it accessible for environments where data privacy and cost are concerns. It retains 97.3% of the accuracy of larger models like Cisco's Foundation-Sec-Instruct-8B while using half the parameters. CyberSecQwen-4B is tailored for tasks such as CWE classification and CTI Q&A, providing a focused tool for cybersecurity professionals. This release highlights the importance of specialized, locally-runnable models in the cybersecurity domain.

Read original

More from Hugging Face Blog

Models & Labsmodels

Hugging Face Unveils EMO MoE Model

Hugging Face has introduced EMO, a new mixture-of-experts model that allows for emergent modularity without predefined human biases. Unlike traditional models that require the full model for optimal performance, EMO can achieve near full-model performance using only 12.5% of its experts for specific tasks. This innovation addresses the inefficiencies of large language models by enabling selective expert use, reducing computational costs while maintaining versatility. EMO's design encourages coherent expert grouping, making it a flexible and efficient tool for diverse applications.

Hugging Face BlogMay 8, 2026

More in Models & Labs

Models & Labsmodels

Llama.cpp b9075 Release Enhances CUDA Snake Activation

The b9075 release of llama.cpp brings a notable improvement for CUDA users by integrating the snake activation function into a single elementwise kernel. This enhancement is particularly advantageous for audio decoders like BigVGAN and Vocos, which previously depended on a more complex five-operation sequence. By streamlining these operations, the update promises better performance and efficiency across data types such as F32, F16, and BF16. This development reflects llama.cpp's ongoing focus on refining its CUDA capabilities, making it a more compelling option for developers dealing with complex activation functions.

llama.cpp ReleasesMay 9, 2026

Models & Labsmodels

Llama.cpp b9076 Release Expands Platform Support

The latest b9076 release of llama.cpp quietly expands its platform support, making it more versatile for developers across various systems. Notably, it now exposes child model information from the router's /v1/models endpoint, enhancing transparency and control for users. The update includes support for macOS Apple Silicon with KleidiAI enabled, as well as expanded compatibility with Ubuntu and Windows systems, including Vulkan and ROCm 7.2. This release doesn't introduce new models but strengthens llama.cpp's position as a flexible inference runtime across diverse hardware configurations.

llama.cpp ReleasesMay 9, 2026

Models & Labsmodels

llama.cpp b9077 release supports Vertex AI API

The b9077 release of llama.cpp now aligns with a Vertex AI compatible API, enhancing its integration with Google's AI platform. This update also brings a series of fixes and improvements across various operating systems, including macOS, Linux, and Windows. Developers can now leverage support for environments ranging from Apple Silicon to Vulkan and ROCm on Ubuntu. While there are no new model architectures, this release reinforces llama.cpp's role as a versatile tool for developers working across diverse platforms. The update ensures a more robust experience, particularly for those utilizing CUDA and SYCL technologies. Overall, llama.cpp continues to evolve as a reliable choice for AI development in a wide array of scenarios.

llama.cpp ReleasesMay 9, 2026