Models: 166 in catalog
Largest context: 1M (Qwen3.5-Flash)
Avg context: 243K
Cheapest input: Free/M (Qwen3 4b Instruct 2507 Gguf)
Largest output: 262K (Qwen3 1p7b Fp8 Draft)
Speed tiers: fast 57, balanced 90, deep 19
Capabilities
What Alibaba models support across the catalog.
Vision: 38 / 166 (23% of models)
Tool use: 105 / 166 (63% of models)
Function calling: 105 / 166 (63% of models)
Extended thinking: 68 / 166 (41% of models)
Streaming: 166 / 166 (100% of models)
Prompt caching: 19 / 166 (11% of models)
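The percentages above appear to be each capability count divided by the 166 models in the catalog, rounded to the nearest whole number. A minimal sketch of that arithmetic follows; the names `capability_counts` and `share_percent` are hypothetical and used only for illustration, and the rounding rule is an assumption inferred from the figures shown.

```python
# Counts copied from the capability figures above; the catalog holds 166 models.
TOTAL_MODELS = 166

capability_counts = {
    "Vision": 38,
    "Tool use": 105,
    "Function calling": 105,
    "Extended thinking": 68,
    "Streaming": 166,
    "Prompt caching": 19,
}


def share_percent(count: int, total: int = TOTAL_MODELS) -> int:
    """Return count/total as a percentage rounded to the nearest whole number."""
    return round(100 * count / total)


# Assumed rule: percentage = round(100 * count / total).
shares = {name: share_percent(n) for name, n in capability_counts.items()}

for name, pct in shares.items():
    print(f"{name}: {capability_counts[name]} / {TOTAL_MODELS} = {pct}% of models")
```

Under that assumption, the computed shares match the listed figures for all six capabilities.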
All Alibaba models
Sorted by context window, largest first.
See rankings
How Alibaba models rank against the full catalog.