Qwen: Qwen3 4B

qwen/qwen3-4b

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.

Modalities

Context

128K

Released

Apr 30, 2025

Knowledge Cutoff

Mar 2025