Qwen3.5 35B-A3B

qwen3.5-35b-a3b

Qwen3.5 35B A3B is a native vision-language MoE model designed to approximate much larger model behavior with a smaller active footprint. Model cards describe hybrid linear attention, sparse experts, and comparable results to larger Qwen3.5 dense variants. The strongest positioning is efficient multimodal reasoning and coding without the cost of always activating a large dense model.

Total Context

262.1Ktokens

Max Output

65.5Ktokens

Released

Feb 23, 2026

Modalities

Qwen3.5 35B-A3B Price

Token TierInput PriceOutput Price
<=128K$0.0571/M$0.4571/M
>128K$0.2286/M$1.8286/M

Qwen3.5 35B-A3B API

POST /v1/chat/completions

Qwen3.5 35B-A3B Benchmark

23.4

/100

Artificial Analysis Intelligence Index

Artificial Analysis broad capability aggregate

Index score

16.8

/100

Artificial Analysis Coding Index

Artificial Analysis software task aggregate

Index score

Knowledge & Reasoning

GPQA

Advanced science problem solving

81.9%

HLE

Broad expert-level exam set

12.8%

Coding & Engineering

SciCode

Scientific coding challenges

29.3%

Terminal-Bench Hard

Hard terminal task execution

10.6%

Instruction Following & Agent Tasks

IFBench

Prompt constraint adherence

44.5%

AA-LCR

Long-context reasoning

55.3%

τ²-Bench

Agent workflow tasks

86.3%

Metrics sourced from Artificial Analysis

Media and Discussions

Selected public videos and posts related to this model.

X (Twitter)

View post on X
View post on X
View post on X

Reddit

YouTube

Watch on YouTube
Watch on YouTube
Watch on YouTube

Qwen 3.5 35B A3B FAQ

Qwen 3.5 35B A3B: capabilities, use cases, limits, and TokenHub guidance.

How should teams view Qwen 3.5 35B A3B?+

Qwen 3.5 35B A3B is a Alibaba Qwen model for open-model multimodal reasoning and efficient deployment.

What is Qwen 3.5 35B A3B best for?+

Best for self-hosted deployment, visual reasoning and routine coding assistance, especially when deployment control is the priority.

What is Qwen 3.5 35B A3B's main strength?+

Key strength: an open MoE variant with a small active-parameter footprint and hybrid thinking that can switch between deliberate and direct responses.

Is Qwen 3.5 35B A3B always the best choice?+

It belongs to an older generation and may lack newer capabilities. For the latest capabilities matter, consider Qwen 3.6 35B A3B.

What is the safest setup?+

Use TokenHub's exact ID; hosted behavior may differ from self-hosting.