Models

166 in catalog

Largest context

1M

Qwen3.5-Flash

Avg context

243K

Cheapest input

Free/M

Qwen3 4b Instruct 2507 Gguf

Largest output

262K

Qwen3 1p7b Fp8 Draft

Speed tiers

fast 57, balanced 90, deep 19

Capabilities

What Alibaba models support across the catalog.

Vision

38 / 166

23% of models

Tool use

105 / 166

63% of models

Function calling

105 / 166

63% of models

Extended thinking

68 / 166

41% of models

Streaming

166 / 166

100% of models

Prompt caching

19 / 166

11% of models
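
The percentages above appear to be each capability count divided by the 166 models in the catalog, rounded to the nearest whole percent. A minimal Python sketch, assuming that rounding rule (the `CAPABILITY_COUNTS` dictionary and the `share_percent` helper are illustrative names, not part of the catalog):

```python
# Counts copied from the capability cards above; total is the catalog size.
TOTAL_MODELS = 166

CAPABILITY_COUNTS = {
    "Vision": 38,
    "Tool use": 105,
    "Function calling": 105,
    "Extended thinking": 68,
    "Streaming": 166,
    "Prompt caching": 19,
}


def share_percent(count: int, total: int = TOTAL_MODELS) -> int:
    """Return count/total as a whole-number percentage (assumed rounding rule)."""
    return round(100 * count / total)


if __name__ == "__main__":
    for name, count in CAPABILITY_COUNTS.items():
        print(f"{name}: {count} / {TOTAL_MODELS} = {share_percent(count)}% of models")
```

Under that assumption, the computed shares match the figures listed on the cards.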

All Alibaba models

Sorted by context window, largest first.

Model                               Context
Qwen3.5-Flash                       1M
Qwen3.5 Plus 2026-02-15             1M
Qwen3.6 Flash                       1M
Qwen3.6 Plus                        1M
Qwen3.6 Plus (free)                 1M
Qwen Coder                          1M
Qwen Plus 0728 (thinking)           1M
Qwen Turbo 2024 11 01               1M
Qwen Turbo 2025 04 28               1M
Qwen Turbo Latest                   1M
Qwen3 Coder Flash                   998K
Qwen3 Coder Flash 2025 07 28        998K
Qwen3 Coder Plus                    998K
Qwen3 Coder Plus 2025 07 22         998K
Qwen Flash                          998K
Qwen Flash 2025 07 28               998K
Qwen Plus 0728                      998K
Qwen Plus 2025 09 11                998K
Qwen Plus Latest                    998K
Qwen3 5 Plus                        992K
Qwen3 1p7b Fp8 Draft                262K
Qwen3 235b A22b Instruct 2507       262K
Qwen3 235B A22B Instruct 2507       262K
Qwen3 235b A22b Instruct 2507 Maas  262K
Qwen3 235B A22B Thinking 2507       262K

Showing 25 of 166 models
