| 注册会员 | 1205 |
| 主题 | 846 |
| 模型 | 3026 |
| 技能包 | 13874 |
| 数据集 | 1047 |
| 论文 | 380 |
| 开源项目 | 602 |
| 模型名称 | 厂商 | 发布时间 | 智能评分 |
|---|---|---|---|
| GLM-5.2 (max) | 2026-06-16 |
50.67
|
|
| Kimi K2.7 Code | 2026-06-12 |
41.95
|
|
| Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) | 2026-06-09 |
59.86
|
|
| North Mini Code | 2026-06-09 |
20.56
|
|
| Nemotron 3 Ultra 550B A55B (Reasoning) | 2026-06-04 |
37.76
|
|
| Gemma 4 12B (Non-reasoning) | 2026-06-03 |
13.19
|
|
| Gemma 4 12B (Reasoning) | 2026-06-03 |
22.00
|
|
| MiniMax-M3 | 2026-06-01 |
44.44
|
|
| Qwen3.7 Plus | 2026-06-01 |
38.98
|
|
| Step 3.7 Flash | 2026-05-29 |
29.73
|
|
| Claude Opus 4.8 (Adaptive Reasoning, Max Effort) | 2026-05-28 |
55.69
|
|
| LFM2.5-8B-A1B | 2026-05-28 |
8.32
|
|
| HyperNova 60B 2605 | 2026-05-26 |
22.14
|
|
| MiniCPM5-1B (Non-reasoning) | 2026-05-25 |
11.74
|
|
| MiniCPM5-1B (Reasoning) | 2026-05-25 |
11.96
|
|
| Command A+ | 2026-05-20 |
29.29
|
|
| Gemini 3.5 Flash (high) | 2026-05-19 |
50.20
|
|
| Gemini 3.5 Flash (medium) | 2026-05-19 |
45.38
|
|
| Gemini 3.5 Flash (minimal) | 2026-05-19 |
34.91
|
|
| Qwen3.7 Max | 2026-05-19 |
45.99
|
|
| JT-35B-Flash | 2026-05-14 |
28.36
|
|
| MiniCPM-V 4.6 1.3B | 2026-05-11 |
6.92
|
|
| Ring-2.6-1T | 2026-05-08 |
30.57
|
|
| GPT-5.5 Instant (May 2026) | 2026-05-05 |
33.52
|
|
| Grok 4.3 (Non-reasoning) | 2026-04-30 |
24.75
|
|
| Grok 4.3 (high) | 2026-04-30 |
37.58
|
|
| Grok 4.3 (low) | 2026-04-30 |
35.44
|
|
| Grok 4.3 (medium) | 2026-04-30 |
36.00
|
|
| Granite 4.1 30B | 2026-04-29 |
8.86
|
|
| Granite 4.1 3B | 2026-04-29 |
3.17
|
|
| Granite 4.1 8B | 2026-04-29 |
6.67
|
|
| Mistral Medium 3.5 | 2026-04-29 |
29.95
|
|
| Nemotron 3 Nano Omni 30B A3B Reasoning | 2026-04-29 |
14.93
|
|
| DeepSeek V4 Flash (Non-reasoning) | 2026-04-24 |
28.65
|
|
| DeepSeek V4 Flash (Reasoning, High Effort) | 2026-04-24 |
37.36
|
|
| DeepSeek V4 Flash (Reasoning, Max Effort) | 2026-04-24 |
40.28
|
|
| DeepSeek V4 Pro (Non-reasoning) | 2026-04-24 |
31.22
|
|
| DeepSeek V4 Pro (Reasoning, High Effort) | 2026-04-24 |
40.83
|
|
| DeepSeek V4 Pro (Reasoning, Max Effort) | 2026-04-24 |
44.27
|
|
| GPT-5.5 (Non-reasoning) | 2026-04-23 |
32.74
|
|
| GPT-5.5 (high) | 2026-04-23 |
53.13
|
|
| GPT-5.5 (low) | 2026-04-23 |
41.73
|
|
| GPT-5.5 (medium) | 2026-04-23 |
47.14
|
|
| GPT-5.5 (xhigh) | 2026-04-23 |
54.84
|
|
| GPT-5.5 Pro (xhigh) | 2026-04-23 |
—
|
|
| Hy3-preview (Non-reasoning) | 2026-04-23 |
26.10
|
|
| Hy3-preview (Reasoning) | 2026-04-23 |
33.58
|
|
| Ling-2.6-1T | 2026-04-23 |
26.05
|
|
| MiMo-V2.5 | 2026-04-22 |
40.14
|
|
| MiMo-V2.5-Pro | 2026-04-22 |
42.24
|
|
| MiMo-V2.5-Pro (Non-reasoning) | 2026-04-22 |
27.86
|
|
| Qwen3.6 27B (Non-reasoning) | 2026-04-22 |
29.28
|
|
| Qwen3.6 27B (Reasoning) | 2026-04-22 |
37.05
|
|
| Ling 2.6 Flash | 2026-04-21 |
19.25
|
|
| Kimi K2.6 | 2026-04-20 |
42.84
|
|
| Kimi K2.6 (Non-reasoning) | 2026-04-20 |
34.58
|
|
| Qwen3.6 Max Preview | 2026-04-20 |
40.00
|
|
| Claude Opus 4.7 (Adaptive Reasoning, Max Effort) | 2026-04-16 |
53.53
|
|
| Claude Opus 4.7 (Non-reasoning, High Effort) | 2026-04-16 |
42.68
|
|
| Qwen3.6 35B A3B (Non-reasoning) | 2026-04-16 |
24.15
|
|
| Qwen3.6 35B A3B (Reasoning) | 2026-04-16 |
31.65
|
|
| JT-MINI | 2026-04-15 |
18.53
|
|
| EXAONE 4.5 33B | 2026-04-09 |
22.96
|
|
| EXAONE 4.5 33B (Non-reasoning) | 2026-04-09 |
—
|
|
| Muse Spark | 2026-04-08 |
43.06
|
|
| GLM-5.1 (Non-reasoning) | 2026-04-07 |
35.37
|
|
| GLM-5.1 (Reasoning) | 2026-04-07 |
40.16
|
|
| Grok 4.20 0309 v2 (Non-reasoning) | 2026-04-07 |
21.84
|
|
| Grok 4.20 0309 v2 (Reasoning) | 2026-04-07 |
37.00
|
|
| Solar Pro 3 | 2026-04-06 |
14.14
|
|
| Gemma 4 E4B (Non-reasoning) | 2026-04-03 |
8.91
|
|
| Gemma 4 E4B (Reasoning) | 2026-04-03 |
12.50
|
|
| Gemma 4 26B A4B (Non-reasoning) | 2026-04-02 |
20.10
|
|
| Gemma 4 26B A4B (Reasoning) | 2026-04-02 |
25.69
|
|
| Gemma 4 31B (Non-reasoning) | 2026-04-02 |
24.85
|
|
| Gemma 4 31B (Reasoning) | 2026-04-02 |
29.35
|
|
| Gemma 4 E2B (Non-reasoning) | 2026-04-02 |
6.41
|
|
| Gemma 4 E2B (Reasoning) | 2026-04-02 |
9.26
|
|
| Qwen3.6 Plus | 2026-04-02 |
39.56
|
|
| Step 3.5 Flash 2603 | 2026-04-02 |
26.00
|
|
| GLM 5V Turbo (Reasoning) | 2026-04-01 |
34.49
|
|
| Trinity Large Thinking | 2026-04-01 |
24.47
|
|
| Qwen3.5 Omni Flash | 2026-03-30 |
18.99
|
|
| Qwen3.5 Omni Plus | 2026-03-30 |
30.64
|
|
| MiMo-V2-Omni-0327 | 2026-03-27 |
36.39
|
|
| MiMo-V2-Omni | 2026-03-19 |
34.99
|
|
| Nemotron Cascade 2 30B A3B | 2026-03-19 |
21.25
|
|
| MiMo-V2-Pro | 2026-03-18 |
40.29
|
|
| MiniMax-M2.7 | 2026-03-18 |
38.13
|
|
| GPT-5.4 mini (Non-Reasoning) | 2026-03-17 |
16.62
|
|
| GPT-5.4 mini (medium) | 2026-03-17 |
29.81
|
|
| GPT-5.4 mini (xhigh) | 2026-03-17 |
39.98
|
|
| GPT-5.4 nano (Non-Reasoning) | 2026-03-17 |
17.61
|
|
| GPT-5.4 nano (medium) | 2026-03-17 |
30.16
|
|
| GPT-5.4 nano (xhigh) | 2026-03-17 |
38.24
|
|
| Mistral Small 4 (Non-reasoning) | 2026-03-16 |
12.37
|
|
| Mistral Small 4 (Reasoning) | 2026-03-16 |
20.75
|
|
| NVIDIA Nemotron 3 Nano 4B | 2026-03-16 |
8.77
|
|
| GLM-5-Turbo | 2026-03-15 |
38.06
|
|
| NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | 2026-03-11 |
25.41
|
|
| Grok 4.20 0309 (Non-reasoning) | 2026-03-10 |
22.48
|
|
| Grok 4.20 0309 (Reasoning) | 2026-03-10 |
36.50
|
|
| Sarvam 105B (high) | 2026-03-06 |
11.94
|
|
| Sarvam 30B (high) | 2026-03-06 |
6.64
|
|
| GPT-5.4 (Non-reasoning) | 2026-03-05 |
27.68
|
|
| GPT-5.4 (low) | 2026-03-05 |
39.14
|
|
| GPT-5.4 (xhigh) | 2026-03-05 |
51.40
|
|
| GPT-5.4 Pro (xhigh) | 2026-03-05 |
—
|
|
| Gemini 3.1 Flash-Lite | 2026-03-03 |
25.04
|
|
| Qwen3.5 0.8B (Non-reasoning) | 2026-03-02 |
4.42
|
|
| Qwen3.5 0.8B (Reasoning) | 2026-03-02 |
4.97
|
|
| Qwen3.5 2B (Non-reasoning) | 2026-03-02 |
8.76
|
|
| Qwen3.5 2B (Reasoning) | 2026-03-02 |
10.24
|
|
| Qwen3.5 4B (Non-reasoning) | 2026-03-02 |
16.00
|
|
| Qwen3.5 4B (Reasoning) | 2026-03-02 |
20.09
|
|
| Qwen3.5 9B (Non-reasoning) | 2026-03-02 |
20.32
|
|
| Qwen3.5 9B (Reasoning) | 2026-03-02 |
24.97
|
|
| LFM2 24B A2B | 2026-02-25 |
4.95
|
|
| Qwen3.5 122B A10B (Non-reasoning) | 2026-02-24 |
28.12
|
|
| Qwen3.5 122B A10B (Reasoning) | 2026-02-24 |
32.28
|
|
| Qwen3.5 27B (Non-reasoning) | 2026-02-24 |
29.31
|
|
| Qwen3.5 27B (Reasoning) | 2026-02-24 |
33.78
|
|
| Qwen3.5 35B A3B (Non-reasoning) | 2026-02-24 |
23.38
|
|
| Qwen3.5 35B A3B (Reasoning) | 2026-02-24 |
29.26
|
|
| Mercury 2 | 2026-02-20 |
25.33
|
|
| Gemini 3.1 Pro Preview | 2026-02-19 |
46.46
|
|
| Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | 2026-02-17 |
47.21
|
|
| Claude Sonnet 4.6 (Non-reasoning, High Effort) | 2026-02-17 |
35.89
|
|
| Claude Sonnet 4.6 (Non-reasoning, Low Effort) | 2026-02-17 |
34.26
|
|
| Tiny Aya Global | 2026-02-17 |
1.00
|
|
| Qwen3.5 397B A17B (Non-reasoning) | 2026-02-16 |
31.98
|
|
| Qwen3.5 397B A17B (Reasoning) | 2026-02-16 |
33.68
|
|
| MiniMax-M2.5 | 2026-02-12 |
33.65
|
|
| GLM-5 (Non-reasoning) | 2026-02-11 |
32.41
|
|
| GLM-5 (Reasoning) | 2026-02-11 |
39.50
|
|
| Nanbeige4.1-3B | 2026-02-11 |
10.05
|
|
| Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | 2026-02-05 |
43.71
|
|
| Claude Opus 4.6 (Non-reasoning, High Effort) | 2026-02-05 |
37.79
|
|
| GPT-5.3 Codex (xhigh) | 2026-02-05 |
44.27
|
|
| Gemini 3 Deep Think | 2026-02-05 |
—
|
|
| Qwen3 Coder Next | 2026-02-03 |
21.18
|
|
| Step 3.5 Flash | 2026-02-02 |
25.50
|
|
| LongCat Flash Lite | 2026-01-28 |
17.22
|
|
| Kimi K2.5 (Non-reasoning) | 2026-01-27 |
29.40
|
|
| Kimi K2.5 (Reasoning) | 2026-01-27 |
38.11
|
|
| Qwen3 Max Thinking | 2026-01-26 |
31.75
|
|
| LFM2.5-1.2B-Thinking | 2026-01-20 |
2.75
|
|
| Step3 VL 10B | 2026-01-20 |
9.47
|
|
| GLM-4.7-Flash (Non-reasoning) | 2026-01-19 |
15.52
|
|
| GLM-4.7-Flash (Reasoning) | 2026-01-19 |
22.89
|
|
| LFM2.5-1.2B-Instruct | 2026-01-05 |
2.71
|
|
| LFM2.5-VL-1.6B | 2026-01-05 |
1.01
|
|
| Falcon-H1R-7B | 2026-01-04 |
9.79
|
|
| K-EXAONE (Non-reasoning) | 2025-12-31 |
16.74
|
|
| K-EXAONE (Reasoning) | 2025-12-31 |
24.70
|
|
| MiniMax-M2.1 | 2025-12-23 |
31.36
|
|
| GLM-4.7 (Non-reasoning) | 2025-12-22 |
26.56
|
|
| GLM-4.7 (Reasoning) | 2025-12-22 |
33.81
|
|
| Gemini 3 Flash Preview (Non-reasoning) | 2025-12-17 |
27.37
|
|
| Gemini 3 Flash Preview (Reasoning) | 2025-12-17 |
37.76
|
|
| Solar Open 100B (Reasoning) | 2025-12-17 |
15.15
|
|
| MiMo-V2-Flash (Feb 2026) | 2025-12-16 |
33.22
|
|
| MiMo-V2-Flash (Non-reasoning) | 2025-12-16 |
23.07
|
|
| MiMo-V2-Flash (Reasoning) | 2025-12-16 |
31.20
|
|
| NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | 2025-12-15 |
7.39
|
|
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | 2025-12-15 |
17.53
|
|
| GPT-5.2 (Non-reasoning) | 2025-12-11 |
26.02
|
|
| GPT-5.2 (medium) | 2025-12-11 |
37.95
|
|
| GPT-5.2 (xhigh) | 2025-12-11 |
42.18
|
|
| GPT-5.2 Codex (xhigh) | 2025-12-11 |
40.14
|
|
| Mi:dm K 2.5 Pro | 2025-12-11 |
16.42
|
|
| Mi:dm K 2.5 Pro Preview | 2025-12-11 |
—
|
|
| Devstral 2 | 2025-12-09 |
15.49
|
|
| Devstral Small 2 | 2025-12-09 |
13.14
|
|
| GLM-4.6V (Non-reasoning) | 2025-12-08 |
10.98
|
|
| GLM-4.6V (Reasoning) | 2025-12-08 |
16.75
|
|
| Motif-2-12.7B-Reasoning | 2025-12-04 |
12.79
|
|
| Ministral 3 14B | 2025-12-02 |
9.96
|
|
| Ministral 3 3B | 2025-12-02 |
5.63
|
|
| Ministral 3 8B | 2025-12-02 |
8.92
|
|
| Mistral Large 3 | 2025-12-02 |
16.19
|
|
| DeepSeek V3.2 (Non-reasoning) | 2025-12-01 |
24.66
|
|
| DeepSeek V3.2 (Reasoning) | 2025-12-01 |
33.45
|
|
| DeepSeek V3.2 Speciale | 2025-12-01 |
22.24
|
|
| INTELLECT-3 | 2025-11-27 |
15.61
|
|
| Nova 2.0 Pro Preview (Non-reasoning) | 2025-11-27 |
16.42
|
|
| Nova 2.0 Pro Preview (low) | 2025-11-27 |
19.50
|
|
| Nova 2.0 Pro Preview (medium) | 2025-11-27 |
21.77
|
|
| Nova 2.0 Omni (Non-reasoning) | 2025-11-26 |
10.53
|
|
| Nova 2.0 Omni (low) | 2025-11-26 |
16.56
|
|
| Nova 2.0 Omni (medium) | 2025-11-26 |
20.95
|
|
| Claude Opus 4.5 (Non-reasoning) | 2025-11-24 |
34.71
|
|
| Claude Opus 4.5 (Reasoning) | 2025-11-24 |
40.77
|
|
| Grok 4.1 Fast (Non-reasoning) | 2025-11-19 |
16.88
|
|
| Grok 4.1 Fast (Reasoning) | 2025-11-19 |
30.62
|
|
| Gemini 3 Pro Preview (high) | 2025-11-18 |
39.55
|
|
| Gemini 3 Pro Preview (low) | 2025-11-18 |
33.07
|
|
| ERNIE 5.0 Thinking Preview | 2025-11-13 |
21.92
|
|
| GPT-5.1 (Non-reasoning) | 2025-11-13 |
20.40
|
|
| GPT-5.1 (high) | 2025-11-13 |
38.91
|
|
| GPT-5.1 Codex (high) | 2025-11-13 |
34.73
|
|
| GPT-5.1 Codex mini (high) | 2025-11-13 |
30.63
|
|
| Doubao Seed Code | 2025-11-11 |
25.98
|
|
| Kimi K2 Thinking | 2025-11-06 |
32.70
|
|
| Qwen3 Max Thinking (Preview) | 2025-11-03 |
25.03
|
|
| Kimi Linear 48B A3B Instruct | 2025-10-30 |
8.53
|
|
| Nova 2.0 Lite (Non-reasoning) | 2025-10-29 |
11.83
|
|
| Nova 2.0 Lite (high) | 2025-10-29 |
20.50
|
|
| Nova 2.0 Lite (low) | 2025-10-29 |
17.81
|
|
| Nova 2.0 Lite (medium) | 2025-10-29 |
19.00
|
|
| Granite 4.0 1B | 2025-10-28 |
2.07
|
|
| Granite 4.0 350M | 2025-10-28 |
1.00
|
|
| Granite 4.0 H 1B | 2025-10-28 |
2.66
|
|
| Granite 4.0 H 350M | 2025-10-28 |
1.00
|
|
| NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) | 2025-10-28 |
4.58
|
|
| NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | 2025-10-28 |
8.96
|
|
| MiniMax-M2 | 2025-10-26 |
28.31
|
|
| Qwen3 VL 32B (Reasoning) | 2025-10-21 |
17.93
|
|
| Qwen3 VL 32B Instruct | 2025-10-21 |
11.06
|
|
| Claude 4.5 Haiku (Non-reasoning) | 2025-10-15 |
23.71
|
|
| Claude 4.5 Haiku (Reasoning) | 2025-10-15 |
29.58
|
|
| Qwen3 VL 4B (Reasoning) | 2025-10-14 |
7.90
|
|
| Qwen3 VL 4B Instruct | 2025-10-14 |
4.09
|
|
| Qwen3 VL 8B (Reasoning) | 2025-10-14 |
10.58
|
|
| Qwen3 VL 8B Instruct | 2025-10-14 |
8.42
|
|
| Ring-1T | 2025-10-13 |
16.16
|
|
| Jamba Reasoning 3B | 2025-10-08 |
4.13
|
|
| Ling-1T | 2025-10-08 |
12.75
|
|
| LFM2 8B A1B | 2025-10-07 |
1.78
|
|
| Qwen3 VL 30B A3B (Reasoning) | 2025-10-03 |
13.34
|
|
| Qwen3 VL 30B A3B Instruct | 2025-10-03 |
10.02
|
|
| GLM-4.6 (Non-reasoning) | 2025-09-30 |
22.98
|
|
| GLM-4.6 (Reasoning) | 2025-09-30 |
25.05
|
|
| Claude 4.5 Sonnet (Non-reasoning) | 2025-09-29 |
29.28
|
|
| Claude 4.5 Sonnet (Reasoning) | 2025-09-29 |
34.65
|
|
| DeepSeek V3.2 Exp (Non-reasoning) | 2025-09-29 |
21.33
|
|
| DeepSeek V3.2 Exp (Reasoning) | 2025-09-29 |
25.45
|
|
| Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) | 2025-09-25 |
18.83
|
|
| Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | 2025-09-25 |
23.80
|
|
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | 2025-09-25 |
13.10
|
|
| GPT-5 Codex (high) | 2025-09-23 |
36.12
|
|
| LFM2 2.6B | 2025-09-23 |
2.71
|
|
| Qwen3 Max | 2025-09-23 |
24.01
|
|
| Qwen3 VL 235B A22B (Reasoning) | 2025-09-23 |
20.60
|
|
| Qwen3 VL 235B A22B Instruct | 2025-09-23 |
14.31
|
|
| DeepSeek V3.1 Terminus (Non-reasoning) | 2025-09-22 |
21.41
|
|
| DeepSeek V3.1 Terminus (Reasoning) | 2025-09-22 |
26.35
|
|
| Granite 4.0 H Small | 2025-09-22 |
5.24
|
|
| Granite 4.0 Micro | 2025-09-22 |
2.37
|
|
| Qwen3 Omni 30B A3B (Reasoning) | 2025-09-22 |
9.63
|
|
| Qwen3 Omni 30B A3B Instruct | 2025-09-22 |
5.11
|
|
| Grok 4 Fast (Non-reasoning) | 2025-09-19 |
16.47
|
|
| Grok 4 Fast (Reasoning) | 2025-09-19 |
27.37
|
|
| Ring-flash-2.0 | 2025-09-19 |
8.16
|
|
| Magistral Medium 1.2 | 2025-09-18 |
20.11
|
|
| Ling-flash-2.0 | 2025-09-17 |
9.74
|
|
| Magistral Small 1.2 | 2025-09-17 |
11.95
|
|
| Qwen3 Next 80B A3B (Reasoning) | 2025-09-11 |
19.77
|
|
| Qwen3 Next 80B A3B Instruct | 2025-09-11 |
13.72
|
|
| Ling-mini-2.0 | 2025-09-09 |
3.75
|
|
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | 2025-09-08 |
15.13
|
|
| Kimi K2 0905 | 2025-09-05 |
23.54
|
|
| Qwen3 Max (Preview) | 2025-09-05 |
19.18
|
|
| Apertus 70B Instruct | 2025-09-02 |
2.40
|
|
| Apertus 8B Instruct | 2025-09-02 |
1.00
|
|
| Grok Code Fast 1 | 2025-08-28 |
21.61
|
|
| DeepSeek V3.1 (Non-reasoning) | 2025-08-21 |
21.05
|
|
| DeepSeek V3.1 (Reasoning) | 2025-08-21 |
20.67
|
|
| Seed-OSS-36B-Instruct | 2025-08-20 |
18.34
|
|
| NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | 2025-08-18 |
7.38
|
|
| NVIDIA Nemotron Nano 9B V2 (Reasoning) | 2025-08-18 |
8.84
|
|
| Gemma 3 270M | 2025-08-14 |
2.41
|
|
| Mistral Medium 3.1 | 2025-08-12 |
14.77
|
|
| GLM-4.5V (Non-reasoning) | 2025-08-11 |
7.00
|
|
| GLM-4.5V (Reasoning) | 2025-08-11 |
9.15
|
|
| GPT-5 (ChatGPT) | 2025-08-07 |
15.30
|
|
| GPT-5 (high) | 2025-08-07 |
36.11
|
|
| GPT-5 (low) | 2025-08-07 |
31.15
|
|
| GPT-5 (medium) | 2025-08-07 |
33.74
|
|
| GPT-5 (minimal) | 2025-08-07 |
17.18
|
|
| GPT-5 mini (high) | 2025-08-07 |
32.96
|
|
| GPT-5 mini (medium) | 2025-08-07 |
30.92
|
|
| GPT-5 mini (minimal) | 2025-08-07 |
14.25
|
|
| GPT-5 nano (high) | 2025-08-07 |
19.87
|
|
| GPT-5 nano (medium) | 2025-08-07 |
19.00
|
|
| GPT-5 nano (minimal) | 2025-08-07 |
8.00
|
|
| Qwen3 4B 2507 (Reasoning) | 2025-08-06 |
11.96
|
|
| Qwen3 4B 2507 Instruct | 2025-08-06 |
7.12
|
|
| Claude 4.1 Opus (Non-reasoning) | 2025-08-05 |
28.24
|
|
| Claude 4.1 Opus (Reasoning) | 2025-08-05 |
33.71
|
|
| gpt-oss-120b (high) | 2025-08-05 |
23.83
|
|
| gpt-oss-120b (low) | 2025-08-05 |
17.70
|
|
| gpt-oss-20B (high) | 2025-08-05 |
14.89
|
|
| gpt-oss-20B (low) | 2025-08-05 |
14.34
|
|
| Qwen3 Coder 30B A3B Instruct | 2025-07-31 |
13.61
|
|
| Qwen3 30B A3B 2507 (Reasoning) | 2025-07-30 |
15.83
|
|
| Qwen3 30B A3B 2507 Instruct | 2025-07-29 |
9.06
|
|
| GLM-4.5 (Reasoning) | 2025-07-28 |
19.49
|
|
| GLM-4.5-Air | 2025-07-28 |
16.52
|
|
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | 2025-07-25 |
8.68
|
|
| Llama Nemotron Super 49B v1.5 (Reasoning) | 2025-07-25 |
12.42
|
|
| Qwen3 235B A22B 2507 (Reasoning) | 2025-07-25 |
22.34
|
|
| Qwen3 Coder 480B A35B Instruct | 2025-07-22 |
17.98
|
|
| Qwen3 235B A22B 2507 Instruct | 2025-07-21 |
18.16
|
|
| EXAONE 4.0 32B (Non-reasoning) | 2025-07-15 |
6.01
|
|
| EXAONE 4.0 32B (Reasoning) | 2025-07-15 |
10.59
|
|
| Exaone 4.0 1.2B (Non-reasoning) | 2025-07-15 |
2.77
|
|
| Exaone 4.0 1.2B (Reasoning) | 2025-07-15 |
2.90
|
|
| Kimi K2 | 2025-07-11 |
19.40
|
|
| Devstral Medium | 2025-07-10 |
12.40
|
|
| Devstral Small (Jul '25) | 2025-07-10 |
9.26
|
|
| Grok 4 | 2025-07-10 |
33.28
|
|
| LFM2 1.2B | 2025-07-10 |
1.15
|
|
| Solar Pro 2 (Non-reasoning) | 2025-07-09 |
7.77
|
|
| Solar Pro 2 (Reasoning) | 2025-07-09 |
8.99
|
|
| Jamba 1.7 Large | 2025-07-07 |
5.30
|
|
| Jamba 1.7 Mini | 2025-07-07 |
2.73
|
|
| ERNIE 4.5 300B A47B | 2025-06-30 |
9.02
|
|
| Gemma 3n E2B Instruct | 2025-06-26 |
1.00
|
|
| Gemma 3n E4B Instruct | 2025-06-26 |
1.19
|
|
| Mistral Small 3.2 | 2025-06-20 |
9.12
|
|
| Gemini 2.5 Flash-Lite (Non-reasoning) | 2025-06-17 |
6.92
|
|
| Gemini 2.5 Flash-Lite (Reasoning) | 2025-06-17 |
11.41
|
|
| MiniMax M1 40k | 2025-06-17 |
14.41
|
|
| MiniMax M1 80k | 2025-06-17 |
17.67
|
|
| Magistral Medium 1 | 2025-06-10 |
12.51
|
|
| Magistral Small 1 | 2025-06-10 |
10.70
|
|
| o3-pro | 2025-06-10 |
32.52
|
|
| Gemini 2.5 Pro | 2025-06-05 |
26.98
|
|
| DeepSeek R1 0528 Qwen3 8B | 2025-05-29 |
10.37
|
|
| DeepSeek R1 0528 (May '25) | 2025-05-28 |
20.08
|
|
| Sarvam M (Reasoning) | 2025-05-23 |
3.03
|
|
| Claude 4 Opus (Non-reasoning) | 2025-05-22 |
25.50
|
|
| Claude 4 Opus (Reasoning) | 2025-05-22 |
30.97
|
|
| Claude 4 Sonnet (Non-reasoning) | 2025-05-22 |
25.49
|
|
| Claude 4 Sonnet (Reasoning) | 2025-05-22 |
30.67
|
|
| Devstral Small (May '25) | 2025-05-21 |
11.82
|
|
| Gemini 2.5 Flash (Non-reasoning) | 2025-05-20 |
14.14
|
|
| Gemini 2.5 Flash (Reasoning) | 2025-05-20 |
20.06
|
|
| Gemma 3n E4B Instruct Preview (May '25) | 2025-05-20 |
4.55
|
|
| Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | 2025-05-20 |
8.54
|
|
| Solar Pro 2 (Preview) (Non-reasoning) | 2025-05-20 |
9.97
|
|
| Solar Pro 2 (Preview) (Reasoning) | 2025-05-20 |
12.54
|
|
| Mistral Medium 3 | 2025-05-07 |
12.49
|
|
| Gemini 2.5 Pro Preview (May' 25) | 2025-05-06 |
22.34
|
|
| Nova Premier | 2025-04-30 |
12.73
|
|
| Qwen3 0.6B (Non-reasoning) | 2025-04-28 |
1.00
|
|
| Qwen3 0.6B (Reasoning) | 2025-04-28 |
1.28
|
|
| Qwen3 1.7B (Non-reasoning) | 2025-04-28 |
1.53
|
|
| Qwen3 1.7B (Reasoning) | 2025-04-28 |
2.63
|
|
| Qwen3 14B (Non-reasoning) | 2025-04-28 |
7.02
|
|
| Qwen3 14B (Reasoning) | 2025-04-28 |
10.15
|
|
| Qwen3 235B A22B (Non-reasoning) | 2025-04-28 |
10.86
|
|
| Qwen3 235B A22B (Reasoning) | 2025-04-28 |
13.43
|
|
| Qwen3 30B A3B (Non-reasoning) | 2025-04-28 |
6.80
|
|
| Qwen3 30B A3B (Reasoning) | 2025-04-28 |
9.31
|
|
| Qwen3 32B (Non-reasoning) | 2025-04-28 |
8.63
|
|
| Qwen3 32B (Reasoning) | 2025-04-28 |
10.46
|
|
| Qwen3 4B (Non-reasoning) | 2025-04-28 |
6.77
|
|
| Qwen3 4B (Reasoning) | 2025-04-28 |
8.35
|
|
| Qwen3 8B (Non-reasoning) | 2025-04-28 |
5.07
|
|
| Qwen3 8B (Reasoning) | 2025-04-28 |
7.40
|
|
| Gemini 2.5 Flash Preview (Non-reasoning) | 2025-04-17 |
11.66
|
|
| Gemini 2.5 Flash Preview (Reasoning) | 2025-04-17 |
17.55
|
|
| Granite 3.3 8B (Non-reasoning) | 2025-04-16 |
1.75
|
|
| o3 | 2025-04-16 |
30.40
|
|
| o4-mini (high) | 2025-04-16 |
25.55
|
|
| GPT-4.1 | 2025-04-14 |
19.36
|
|
| GPT-4.1 mini | 2025-04-14 |
16.27
|
|
| GPT-4.1 nano | 2025-04-14 |
7.27
|
|
| Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | 2025-04-07 |
9.08
|
|
| Llama 4 Maverick | 2025-04-05 |
14.27
|
|
| Llama 4 Scout | 2025-04-05 |
10.04
|
|
| GPT-4o (March 2025, chatgpt-4o-latest) | 2025-03-27 |
12.31
|
|
| DeepSeek V3 0324 | 2025-03-25 |
15.71
|
|
| Gemini 2.5 Pro Preview (Mar' 25) | 2025-03-25 |
23.03
|
|
| o1-pro | 2025-03-19 |
18.89
|
|
| Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) | 2025-03-18 |
8.47
|
|
| Llama 3.3 Nemotron Super 49B v1 (Reasoning) | 2025-03-18 |
12.25
|
|
| Mistral Small 3.1 | 2025-03-17 |
8.59
|
|
| Command A | 2025-03-13 |
7.67
|
|
| Gemma 3 1B Instruct | 2025-03-13 |
1.00
|
|
| Gemma 3 12B Instruct | 2025-03-12 |
3.39
|
|
| Gemma 3 27B Instruct | 2025-03-12 |
4.78
|
|
| Gemma 3 4B Instruct | 2025-03-12 |
1.12
|
|
| Reka Flash 3 | 2025-03-10 |
4.06
|
|
| Jamba 1.6 Large | 2025-03-06 |
5.00
|
|
| Jamba 1.6 Mini | 2025-03-06 |
2.55
|
|
| QwQ 32B | 2025-03-05 |
13.37
|
|
| GPT-4.5 (Preview) | 2025-02-27 |
13.59
|
|
| Phi-4 Multimodal Instruct | 2025-02-26 |
4.53
|
|
| Gemini 2.0 Flash-Lite (Feb '25) | 2025-02-25 |
8.79
|
|
| Claude 3.7 Sonnet (Non-reasoning) | 2025-02-24 |
23.50
|
|
| Claude 3.7 Sonnet (Reasoning) | 2025-02-24 |
27.06
|
|
| Grok 3 | 2025-02-19 |
18.35
|
|
| Grok 3 Reasoning Beta | 2025-02-19 |
15.13
|
|
| Grok 3 mini Reasoning (high) | 2025-02-19 |
22.50
|
|
| R1 1776 | 2025-02-18 |
6.31
|
|
| Mistral Saba | 2025-02-17 |
6.44
|
|
| GPT-4o (ChatGPT) | 2025-02-15 |
8.25
|
|
| Gemini 2.0 Flash (Feb '25) | 2025-02-05 |
12.27
|
|
| Gemini 2.0 Flash-Lite (Preview) | 2025-02-05 |
8.59
|
|
| Gemini 2.0 Pro Experimental (Feb '25) | 2025-02-05 |
11.85
|
|
| o3-mini | 2025-01-31 |
18.98
|
|
| o3-mini (high) | 2025-01-31 |
18.38
|
|
| Mistral Small 3 | 2025-01-30 |
6.93
|
|
| Qwen2.5 Max | 2025-01-28 |
10.23
|
|
| Sonar Reasoning | 2025-01-28 |
11.69
|
|
| Sonar Reasoning Pro | 2025-01-28 |
17.85
|
|
| Gemini 2.0 Flash Thinking Experimental (Jan '25) | 2025-01-21 |
13.26
|
|
| Sonar | 2025-01-21 |
9.51
|
|
| Sonar Pro | 2025-01-21 |
9.27
|
|
| DeepSeek R1 (Jan '25) | 2025-01-20 |
12.57
|
|
| DeepSeek R1 Distill Llama 70B | 2025-01-20 |
9.93
|
|
| DeepSeek R1 Distill Llama 8B | 2025-01-20 |
6.41
|
|
| DeepSeek R1 Distill Qwen 1.5B | 2025-01-20 |
3.65
|
|
| DeepSeek R1 Distill Qwen 14B | 2025-01-20 |
9.83
|
|
| DeepSeek R1 Distill Qwen 32B | 2025-01-20 |
11.04
|
|
| DeepSeek V3 (Dec '24) | 2024-12-26 |
10.39
|
|
| Gemini 2.0 Flash Thinking Experimental (Dec '24) | 2024-12-19 |
6.63
|
|
| GPT-4o Realtime (Dec '24) | 2024-12-17 |
—
|
|
| GPT-4o mini Realtime (Dec '24) | 2024-12-17 |
—
|
|
| Grok 2 (Dec '24) | 2024-12-12 |
8.04
|
|
| Phi-4 | 2024-12-12 |
4.87
|
|
| Gemini 2.0 Flash (experimental) | 2024-12-11 |
10.68
|
|
| DeepSeek-V2.5 (Dec '24) | 2024-12-10 |
6.79
|
|
| Llama 3.3 Instruct 70B | 2024-12-06 |
8.59
|
|
| o1 | 2024-12-05 |
23.44
|
|
| Nova Lite | 2024-12-03 |
6.92
|
|
| Nova Micro | 2024-12-03 |
4.74
|
|
| Nova Pro | 2024-12-03 |
7.67
|
|
| QwQ 32B-Preview | 2024-11-27 |
9.22
|
|
| GPT-4o (Nov '24) | 2024-11-20 |
11.18
|
|
| Mistral Large 2 (Nov '24) | 2024-11-18 |
9.15
|
|
| Pixtral Large | 2024-11-18 |
8.15
|
|
| Qwen2.5 Turbo | 2024-11-18 |
6.30
|
|
| Qwen2.5 Coder Instruct 32B | 2024-11-11 |
7.11
|
|
| Claude 3.5 Haiku | 2024-10-22 |
12.41
|
|
| Claude 3.5 Sonnet (Oct '24) | 2024-10-22 |
9.91
|
|
| Llama 3.1 Nemotron Instruct 70B | 2024-10-15 |
7.64
|
|
| Reka Flash (Sep '24) | 2024-10-04 |
6.29
|
|
| Gemini 1.5 Flash-8B | 2024-10-03 |
5.53
|
|
| LFM 40B | 2024-09-30 |
3.37
|
|
| Llama 3.2 Instruct 11B (Vision) | 2024-09-25 |
3.33
|
|
| Llama 3.2 Instruct 1B | 2024-09-25 |
1.10
|
|
| Llama 3.2 Instruct 3B | 2024-09-25 |
4.22
|
|
| Llama 3.2 Instruct 90B (Vision) | 2024-09-25 |
6.23
|
|
| Gemini 1.5 Flash (Sep '24) | 2024-09-24 |
7.96
|
|
| Gemini 1.5 Pro (Sep '24) | 2024-09-24 |
9.97
|
|
| Qwen2.5 Coder Instruct 7B | 2024-09-19 |
4.48
|
|
| Qwen2.5 Instruct 32B | 2024-09-19 |
7.45
|
|
| Qwen2.5 Instruct 72B | 2024-09-19 |
9.57
|
|
| Mistral Small (Sep '24) | 2024-09-17 |
4.66
|
|
| o1-mini | 2024-09-12 |
13.98
|
|
| o1-preview | 2024-09-12 |
17.04
|
|
| DeepSeek-V2.5 | 2024-09-06 |
6.62
|
|
| Jamba 1.5 Large | 2024-08-22 |
5.13
|
|
| Jamba 1.5 Mini | 2024-08-22 |
2.70
|
|
| Grok Beta | 2024-08-13 |
7.49
|
|
| GPT-4o (Aug '24) | 2024-08-06 |
9.58
|
|
| Mistral Large 2 (Jul '24) | 2024-07-24 |
7.27
|
|
| Llama 3.1 Instruct 405B | 2024-07-23 |
8.50
|
|
| Llama 3.1 Instruct 70B | 2024-07-23 |
6.75
|
|
| Llama 3.1 Instruct 8B | 2024-07-23 |
6.10
|
|
| GPT-4o mini | 2024-07-18 |
6.91
|
|
| Claude 3.5 Sonnet (June '24) | 2024-06-21 |
8.30
|
|
| DeepSeek Coder V2 Lite Instruct | 2024-06-17 |
3.11
|
|
| DeepSeek-Coder-V2 | 2024-06-17 |
5.05
|
|
| Qwen2 Instruct 72B | 2024-06-07 |
6.01
|
|
| Gemini 1.5 Pro (May '24) | 2024-05-15 |
6.32
|
|
| Gemini 1.5 Flash (May '24) | 2024-05-14 |
4.92
|
|
| GPT-4o (May '24) | 2024-05-13 |
8.60
|
|
| DeepSeek-V2-Chat | 2024-05-06 |
3.64
|
|
| Qwen1.5 Chat 110B | 2024-04-25 |
4.08
|
|
| Arctic Instruct | 2024-04-24 |
3.42
|
|
| Phi-3 Mini Instruct 3.8B | 2024-04-23 |
4.59
|
|
| Llama 3 Instruct 70B | 2024-04-18 |
3.47
|
|
| Llama 3 Instruct 8B | 2024-04-18 |
1.19
|
|
| Mixtral 8x22B Instruct | 2024-04-17 |
4.35
|
|
| Command-R+ (Apr '24) | 2024-04-04 |
2.99
|
|
| DBRX Instruct | 2024-03-27 |
2.96
|
|
| Grok-1 | 2024-03-17 |
6.04
|
|
| Command-R (Mar '24) | 2024-03-12 |
2.14
|
|
| Claude 3 Haiku | 2024-03-04 |
3.86
|
|
| Claude 3 Opus | 2024-03-04 |
11.80
|
|
| Claude 3 Sonnet | 2024-03-04 |
4.74
|
|
| Mistral Large (Feb '24) | 2024-02-26 |
4.41
|
|
| Mistral Small (Feb '24) | 2024-02-26 |
3.62
|
|
| Phi-4 Mini Instruct | 2024-02-26 |
3.03
|
|
| Solar Mini | 2024-01-25 |
6.23
|
|
| OpenChat 3.5 (1210) | 2023-12-18 |
2.96
|
|
| Mistral Medium | 2023-12-11 |
3.59
|
|
| Mixtral 8x7B Instruct | 2023-12-11 |
2.43
|
|
| Gemini 1.0 Pro | 2023-12-06 |
3.13
|
|
| Gemini 1.0 Ultra | 2023-12-06 |
4.63
|
|
| Qwen Chat 72B | 2023-11-30 |
3.42
|
|
| DeepSeek LLM 67B Chat (V1) | 2023-11-29 |
3.01
|
|
| Claude 2.1 | 2023-11-21 |
3.88
|
|
| GPT-4 Turbo | 2023-11-06 |
7.89
|
|
| Mistral 7B Instruct | 2023-09-27 |
2.14
|
|
| Qwen Chat 14B | 2023-09-25 |
2.14
|
|
| Llama 2 Chat 13B | 2023-07-18 |
3.00
|
|
| Llama 2 Chat 70B | 2023-07-18 |
3.01
|
|
| Llama 2 Chat 7B | 2023-07-18 |
4.26
|
|
| Claude 2.0 | 2023-07-11 |
3.64
|
|
| GPT-3.5 Turbo (0613) | 2023-06-13 |
—
|
|
| PALM-2 | 2023-05-10 |
3.21
|
|
| Claude Instant | 2023-03-14 |
2.14
|
|
| GPT-4 | 2023-03-14 |
7.01
|
|
| Llama 65B | 2023-02-24 |
2.14
|
|
| GPT-3.5 Turbo | 2022-11-30 |
3.57
|