Qwen3.6 35B-A3B

qwen3.6-35b-a3b

Qwen3.6 35B A3B is a compact active-parameter MoE model: sources describe 35B total parameters with around 3B activated parameters, plus native long context that can be extended toward 1M tokens. Official materials compare it favorably with larger dense models on coding tasks despite the small active footprint. Its value proposition is efficient deployment with surprisingly strong coding and long-context ability.

Total Context

262.1Ktokens

Max Output

65.5Ktokens

Released

Apr 17, 2026

Modalities

Qwen3.6 35B-A3B Price

Input PriceOutput Price
$0.2571/M$1.5429/M

Qwen3.6 35B-A3B API

POST /v1/chat/completions

Qwen3.6 35B-A3B Benchmark

33

/100

Artificial Analysis Intelligence Index

Artificial Analysis broad capability aggregate

Index score

35.2

/100

Artificial Analysis Coding Index

Artificial Analysis software task aggregate

Index score

Knowledge & Reasoning

GPQA

Advanced science problem solving

84.1%

HLE

Broad expert-level exam set

20.2%

Coding & Engineering

SciCode

Scientific coding challenges

35.8%

Terminal-Bench Hard

Hard terminal task execution

34.8%

Instruction Following & Agent Tasks

IFBench

Prompt constraint adherence

64.4%

AA-LCR

Long-context reasoning

63.7%

τ²-Bench

Agent workflow tasks

95.3%

Metrics sourced from Artificial Analysis

Media and Discussions

Selected public videos and posts related to this model.

X (Twitter)

View post on X
View post on X
View post on X

Reddit

YouTube

Watch on YouTube
Watch on YouTube
Watch on YouTube

Qwen 3.6 35B A3B FAQ

Qwen 3.6 35B A3B: capabilities, use cases, limits, and TokenHub guidance.

What does Qwen 3.6 35B A3B focus on?+

Qwen 3.6 35B A3B is a Alibaba Qwen model for open-model multimodal reasoning and efficient deployment.

Which projects fit Qwen 3.6 35B A3B?+

Best for self-hosted deployment, image and video understanding and routine coding assistance, especially when deployment control is the priority.

What is special about Qwen 3.6 35B A3B?+

Key strength: an open MoE variant with a small active-parameter footprint and hybrid thinking that can switch between deliberate and direct responses.

When is another model better?+

Open deployment requires infrastructure, serving, and evaluation work. For stable production behavior matters, consider Qwen 3.6 Plus.

How do I avoid ID mistakes?+

Use TokenHub's exact ID; hosted behavior may differ from self-hosting.