Model Class

Qwen3

Alibaba's open-weight large language model family launched April 2025 and evolved through multiple 2026 releases. Range covers six dense models (0.6B–32B parameters) and two Mixture-of-Experts models (30B-A3B and 235B-A22B at launch; Qwen 3.5 escalated to 397B-A17B with native vision + 201 languages in February 2026; Qwen3.6-27B added a hybrid Gated DeltaNet linear-attention + traditional-self-attention architecture in April 2026; Qwen3.7-Max preview ships 1M-token context). All releases under Apache 2.0 with downloadable weights.

4 connections

Definition

What it is

Alibaba's open-weight large language model family launched April 2025 and evolved through multiple 2026 releases. Range covers six dense models (0.6B–32B parameters) and two Mixture-of-Experts models (30B-A3B and 235B-A22B at launch; Qwen 3.5 escalated to 397B-A17B with native vision + 201 languages in February 2026; Qwen3.6-27B added a hybrid Gated DeltaNet linear-attention + traditional-self-attention architecture in April 2026; Qwen3.7-Max preview ships 1M-token context). All releases under Apache 2.0 with downloadable weights.

Why it exists

Through 2025, the open-weight frontier was dominated by aggressively-sparse MoE designs (DeepSeek V3, Kimi K2) — and Alibaba's Qwen team was specifically targeting the same agentic-coding workload class. The 2026 evolution diverged: Qwen3.6-27B is dense (not MoE) but uses hybrid linear+traditional attention to match much-larger MoE models on agentic coding. The framing: density + hybrid attention can compete with sparse MoE at the workload class where most enterprise spend lives (agentic coding, retrieval-augmented chat).

Primary use cases

Agentic coding (Qwen3-Coder-Next specifically optimizes for this workload), multilingual workloads (Qwen 3.5 supports 201 languages natively), long-horizon tasks via Qwen3.7-Max's 1M-token context + extended-thinking mode, on-prem deployment via the Apache-2.0 weights, and migration target for teams moving off proprietary frontier APIs.

Recent developments

Latest signals
  • Qwen3.6-27B (April 22, 2026) — dense 27B outperforms 397B MoE on coding. SWE-bench Verified 77.2% (vs Qwen3.5-397B-A17B's 76.2%), SWE-bench Pro 53.5% (vs 50.9%), Terminal-Bench 2.0 59.3% (vs 52.5%). Hybrid Gated DeltaNet linear attention + traditional self-attention. Per MarkTechPost — Qwen3.6-27B release.
  • Qwen 3.5 (February 2026) — 397B/17B-active MoE, native vision, 201 languages. Beats Alibaba's own larger trillion-parameter model at a fraction of the cost. Per VentureBeat — Qwen 3.5 beats trillion-parameter.
  • Qwen3.7-Max preview — 1M-token context + extended-thinking mode. Announced at the 2026 Alibaba Cloud Summit, designed for long-horizon coding/debugging/multi-step workflow automation. Per BuildMVPFast — Qwen 3.5 agentic benchmark.
  • Qwen3-Coder-Next — local agentic-coding deployment guide. Complete 2026 guide to running Qwen3-Coder-Next locally as an agentic coding agent. Per DEV — Qwen3-Coder-Next 2026 guide.
  • Apache 2.0 license — Hugging Face Hub + ModelScope. All Qwen3 family releases under Apache 2.0 with downloadable weights, distributed via Hugging Face Hub and Alibaba's ModelScope. Per GitHub (QwenLM/Qwen3.6).

Connections 4

Outbound 4