ai/qwen3.6

Verified Publisher · By Docker · Updated 7 days ago

Multimodal AI model with 35B MoE architecture for coding agents, reasoning, and vision tasks


ai/qwen3.6 repository overview

Qwen3.6


Qwen3.6-35B-A3B is a cutting-edge multimodal AI model from Alibaba Cloud's Qwen team, delivering substantial upgrades in agentic coding, reasoning preservation, and real-world development workflows. Built on direct community feedback following the Qwen3.5 series, this model prioritizes stability and utility, offering developers an intuitive and productive coding experience with enhanced frontend workflows and repository-level reasoning capabilities.

This release features a Mixture of Experts (MoE) architecture with 35 billion total parameters and 3 billion activated parameters, supporting both text and vision inputs. With a native context length of 262,144 tokens (extensible to over 1 million), Qwen3.6 excels at handling complex, multi-turn interactions while maintaining efficiency. The model demonstrates exceptional performance on coding agent benchmarks like SWE-bench Verified, Terminal-Bench, and web development tasks, making it particularly valuable for software engineering, code generation, and multimodal applications.

Key innovations include thinking preservation to retain reasoning context across iterations, improved tool calling with better nested object parsing, and enhanced support for developer roles. Whether you're building coding assistants, multimodal applications, or agentic systems, Qwen3.6 delivers state-of-the-art performance with open-source accessibility.


Characteristics

| Attribute | Value |
|---|---|
| Provider | Alibaba Cloud / Qwen Team |
| Architecture | Qwen3.5 MoE (Mixture of Experts) |
| Parameters | 35B total, 3B activated |
| Context Length | 262,144 tokens (native), extensible to 1,010,000 tokens |
| Languages | Multilingual (English, Chinese, and others) |
| Input modalities | Text, Image |
| Output modalities | Text |
| License | Apache 2.0 |

Using this model with Docker Model Runner

docker model run ai/qwen3.6

For more information, check out the Docker Model Runner docs.
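Once the model is running, Docker Model Runner also exposes an OpenAI-compatible chat completions API, so any OpenAI-style client can talk to it. The sketch below builds a request body in that standard shape; the base URL shown is an assumption (it depends on whether TCP host access is enabled in your Model Runner settings), and the prompt is purely illustrative.

```python
import json

# Assumed endpoint for Docker Model Runner's OpenAI-compatible API when
# TCP host access is enabled; check your Model Runner settings for the
# actual address in your setup.
BASE_URL = "http://localhost:12434/engines/v1"

# Request body in the standard OpenAI chat-completions shape.
request_body = {
    "model": "ai/qwen3.6",
    "messages": [
        {"role": "system", "content": "You are a concise coding assistant."},
        {"role": "user", "content": "Write a one-liner that reverses a string."},
    ],
}

# In a real script you would POST this to f"{BASE_URL}/chat/completions",
# e.g. with urllib.request or an OpenAI client pointed at BASE_URL.
print(json.dumps(request_body, indent=2))
```

The same payload works from inside a container against the Model Runner's internal endpoint, since the API surface is the same either way.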

Benchmarks


Coding Agent Benchmarks

| Benchmark | Qwen3.5-27B | Gemma4-31B | Qwen3.5-35B-A3B | Gemma4-26B-A4B | Qwen3.6-35B-A3B |
|---|---|---|---|---|---|
| SWE-bench Verified | 75.0 | 52.0 | 70.0 | 17.4 | 73.4 |
| SWE-bench Multilingual | 69.3 | 51.7 | 60.3 | 17.3 | 67.2 |
| SWE-bench Pro | 51.2 | 35.7 | 44.6 | 13.8 | 49.5 |
| Terminal-Bench 2.0 | 41.6 | 42.9 | 40.5 | 34.2 | 51.5 |
| Claw-Eval (Avg) | 64.3 | 48.5 | 65.4 | 58.8 | 68.7 |
| Claw-Eval (Pass³) | 46.2 | 25.0 | 51.0 | 28.0 | 50.0 |
| SkillsBench (Avg5) | 27.2 | 23.6 | 4.4 | 12.3 | 28.7 |
| QwenClawBench | 52.2 | 41.7 | 47.7 | 38.7 | 52.6 |
| NL2Repo | 27.3 | 15.5 | 20.5 | 11.6 | 29.4 |
| QwenWebBench | 1068 | 1197 | 978 | 1178 | 1397 |
General Agent Benchmarks

| Benchmark | Qwen3.5-27B | Gemma4-31B | Qwen3.5-35B-A3B | Gemma4-26B-A4B | Qwen3.6-35B-A3B |
|---|---|---|---|---|---|
| TAU3-Bench | 68.4 | 67.5 | 68.9 | 59.0 | 67.2 |
| VITA-Bench | 41.8 | 43.0 | 29.1 | 36.9 | 35.6 |
| DeepPlanning | 22.6 | 24.0 | 22.8 | 16.2 | 25.9 |
| Tool Decathlon | 31.5 | 21.2 | 28.7 | 12.0 | 26.9 |
| MCPMark | 36.3 | 18.1 | 27.0 | 14.2 | 37.0 |
| MCP-Atlas | 68.4 | 57.2 | 62.4 | 50.0 | 62.8 |
| WideSearch | 66.4 | 35.2 | 59.1 | 38.3 | 60.1 |
Knowledge Benchmarks

| Benchmark | Qwen3.5-27B | Gemma4-31B | Qwen3.5-35B-A3B | Gemma4-26B-A4B | Qwen3.6-35B-A3B |
|---|---|---|---|---|---|
| MMLU-Pro | 86.1 | 85.2 | 85.3 | 82.6 | 85.2 |
| MMLU-Redux | 93.2 | 93.7 | 93.3 | 92.7 | 93.3 |
| SuperGPQA | 65.6 | 65.7 | 63.4 | 61.4 | 64.7 |
| C-Eval | 90.5 | 82.6 | 90.2 | 82.5 | 90.0 |
STEM & Reasoning Benchmarks

| Benchmark | Qwen3.5-27B | Gemma4-31B | Qwen3.5-35B-A3B | Gemma4-26B-A4B | Qwen3.6-35B-A3B |
|---|---|---|---|---|---|
| GPQA | 85.5 | 84.3 | 84.2 | 82.3 | 86.0 |
| HLE | 24.3 | 19.5 | 22.4 | 8.7 | 21.4 |
| LiveCodeBench v6 | 80.7 | 80.0 | 74.6 | 77.1 | 80.4 |
| HMMT Feb 25 | 92.0 | 88.7 | 89.0 | 91.7 | 90.7 |
| HMMT Nov 25 | 89.8 | 87.5 | 89.2 | 87.5 | 89.1 |
| HMMT Feb 26 | 84.3 | 77.2 | 78.7 | 79.0 | 83.6 |
| IMOAnswerBench | 79.9 | 74.5 | 76.8 | 74.3 | 78.9 |
| AIME26 | 92.6 | 89.2 | 91.0 | 88.3 | 92.7 |
Vision Language Benchmarks

| Benchmark | Qwen3.5-27B | Claude-Sonnet-4.5 | Gemma4-31B | Gemma4-26B-A4B | Qwen3.5-35B-A3B | Qwen3.6-35B-A3B |
|---|---|---|---|---|---|---|
| MMMU | 82.3 | 79.6 | 80.4 | 78.4 | 81.4 | 81.7 |
| MMMU-Pro | 75.0 | 68.4 | 76.9 | 73.8 | 75.1 | 75.3 |
| MathVista (mini) | 87.8 | 79.8 | 87.5 | 86.3 | 87.1 | 87.6 |
| MathVision | 32.4 | 31.5 | 29.7 | 24.8 | 32.0 | 33.3 |
| AI2D | 98.0 | 96.2 | 98.0 | 97.2 | 97.8 | 98.1 |

Considerations

  • Hardware Requirements: Due to the MoE architecture with 35B total parameters, running this model efficiently requires adequate GPU memory. GGUF quantized versions are available to reduce memory requirements while maintaining performance.
  • Context Window: The native context length is 262K tokens and can be extended beyond 1M, but quality may degrade at extreme lengths; for most applications, the native 262K window is sufficient.
  • Multimodal Capabilities: The model supports both text and image inputs. Vision capabilities require the appropriate multimodal projection files (mmproj files available in GGUF format).
  • Tool Calling: Qwen3.6 includes improved tool calling support with better handling of nested objects. Follow the documented chat template format for optimal tool use.
  • Developer Role Support: This model includes enhanced support for developer roles, making it particularly effective for coding assistants and agentic workflows.
  • Thinking Preservation: When using the model for iterative development, enable thinking preservation to maintain reasoning context across multiple turns.
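The nested-object tool-calling improvement mentioned above can be illustrated with a standard OpenAI-style tool schema. The `create_issue` function and its fields below are hypothetical, chosen only to show a nested object inside `parameters`; the model's tool-call `arguments` arrive as a JSON string, so nested objects must survive one round of encoding and decoding.

```python
import json

# Hypothetical tool definition with a nested-object parameter ("assignee"),
# the case that Qwen3.6's improved tool-call parsing is meant to handle.
create_issue_tool = {
    "type": "function",
    "function": {
        "name": "create_issue",
        "description": "File an issue in a tracker.",
        "parameters": {
            "type": "object",
            "properties": {
                "title": {"type": "string"},
                "labels": {"type": "array", "items": {"type": "string"}},
                "assignee": {  # nested object argument
                    "type": "object",
                    "properties": {
                        "username": {"type": "string"},
                        "team": {"type": "string"},
                    },
                    "required": ["username"],
                },
            },
            "required": ["title"],
        },
    },
}

# A well-formed tool call the model might emit: `arguments` is a JSON
# string, so the nested "assignee" object is decoded from it intact.
raw_arguments = json.dumps({
    "title": "Flaky test in CI",
    "labels": ["bug", "ci"],
    "assignee": {"username": "alice", "team": "infra"},
})
parsed = json.loads(raw_arguments)
print(parsed["assignee"]["username"])  # -> alice
```

Following the documented chat template format, as the considerations above advise, is what lets schemas like this round-trip reliably.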
Generated by

This model card was automatically generated using cagent-action. Want to learn more about Docker Model Runner? Check out the project repository: https://github.com/docker/model-runner.

Tag summary

Content type: Model
Digest: sha256:8ca73612f…
Size: 17.3 GB
Last updated: 7 days ago

docker model pull ai/qwen3.6:27B

Pulls this week: 10,969