Best for Chatbots

Fast, cost-efficient models ideal for real-time conversational applications.

Models ranked: 1105

Top context: 33K (Qwen2 5 Coder 7b)

Leading provider: OpenAI (171 models in this ranking)

Leaderboard

Rank	Model	Provider	Context	Max output	Input $/M tokens	Chat score
1	Qwen2 5 Coder 7b	Alibaba	33K	—	$0.010	fast
2	Llama3 2 11b Vision	Meta	131K	131K	$0.015	fast
3	Llama3 2 3b	Meta	131K	131K	$0.015	fast
4	Granite 4.0 Micro	IBM	131K	131K	$0.017	balanced
5	Meta Llama 3 1 8b	Meta	128K	2K	$0.020	fast
6	Llama 3 2 3b	Meta	131K	131K	$0.020	fast
7	Llama Guard 3 8B	Meta	131K	131K	$0.020	fast
8	Meta Llama 3 1 8b Instruct	Meta	131K	131K	$0.020	fast
9	Qwen2 Vl 7b	Alibaba	131K	131K	$0.020	fast
10	Llama 3 1 8b	Meta	131K	16K	$0.020	fast
11	Mistral Nemo Instruct 2407	Mistral	131K	131K	$0.020	balanced
12	gpt-oss-20b	OpenAI	131K	131K	$0.020	balanced
13	Llama3 1 8b	Meta	128K	128K	$0.025	fast
14	Hermes3 8b	Nous Research	131K	131K	$0.025	fast
15	Deepseek R1 Distill Llama 8b	Meta	131K	—	$0.025	fast
16	Llama 3 2 1b	Meta	128K	128K	$0.027	fast
17	Meta Llama 3 8b	Meta	8K	8K	$0.030	fast
18	Granite 3 3 8b	IBM	8K	—	$0.030	fast
19	Qwen3 4b Fp8	Alibaba	128K	20K	$0.030	balanced
20	Deepseek Ocr	Deepseek	8K	8K	$0.030	balanced
21	Qwen3 8b Fp8	Alibaba	128K	20K	$0.035	fast
22	Command R7B (12-2024)	Cohere	128K	4K	$0.037	fast
23	Amazon Nova Micro	Amazon	128K	10K	$0.035	balanced
24	Nova Micro 1.0	Amazon	128K	10K	$0.035	balanced
25	Autoglm Phone 9b Multilingual	Z.ai	66K	66K	$0.035	balanced

Showing 25 of 1105 models
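
The input prices in the leaderboard are quoted in USD per million input tokens, so a rough input-side cost estimate is just a proportion. The sketch below is only illustrative: the function name `input_cost_usd` and the workload figures (2,000 input tokens per turn, 10,000 turns per day) are assumptions, not part of the ranking. Output-token prices are not listed in the table, so they are left out.

```python
def input_cost_usd(input_tokens: int, price_per_million_usd: float) -> float:
    """Estimated input cost in USD for a given number of input tokens."""
    return input_tokens / 1_000_000 * price_per_million_usd


# Input prices copied from three leaderboard rows (USD per million input tokens).
prices = {
    "Qwen2 5 Coder 7b": 0.010,
    "Granite 4.0 Micro": 0.017,
    "gpt-oss-20b": 0.020,
}

# Hypothetical workload (an assumption for illustration only):
# 2,000 input tokens per chat turn, 10,000 turns per day.
daily_tokens = 2_000 * 10_000

for model, price in prices.items():
    print(f"{model}: ${input_cost_usd(daily_tokens, price):.2f} per day")
```

Under these assumed numbers the three models come to roughly $0.20, $0.34, and $0.40 per day for input tokens alone.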
