| 注册会员 | 1100 |
| 主题 | 846 |
| 模型 | 3026 |
| 技能包 | 13874 |
| 数据集 | 1047 |
| 论文 | 331 |
| 开源项目 | 532 |
| 模型名称 | 厂商 | 发布时间 | 智能评分 |
|---|---|---|---|
| Grok 4.3 | 2026-04-30 |
53.20
|
|
| Granite 4.1 30B | 2026-04-29 |
14.69
|
|
| Granite 4.1 3B | 2026-04-29 |
8.54
|
|
| Granite 4.1 8B | 2026-04-29 |
12.38
|
|
| Mistral Medium 3.5 | 2026-04-29 |
39.23
|
|
| Nemotron 3 Nano Omni 30B A3B Reasoning | 2026-04-29 |
21.43
|
|
| DeepSeek V4 Flash (Non-reasoning) | 2026-04-24 |
36.46
|
|
| DeepSeek V4 Flash (Reasoning, High Effort) | 2026-04-24 |
44.87
|
|
| DeepSeek V4 Flash (Reasoning, Max Effort) | 2026-04-24 |
46.52
|
|
| DeepSeek V4 Pro (Non-reasoning) | 2026-04-24 |
39.27
|
|
| DeepSeek V4 Pro (Reasoning, High Effort) | 2026-04-24 |
49.79
|
|
| DeepSeek V4 Pro (Reasoning, Max Effort) | 2026-04-24 |
51.51
|
|
| GPT-5.5 (Non-reasoning) | 2026-04-23 |
40.94
|
|
| GPT-5.5 (high) | 2026-04-23 |
58.87
|
|
| GPT-5.5 (low) | 2026-04-23 |
50.78
|
|
| GPT-5.5 (medium) | 2026-04-23 |
56.71
|
|
| GPT-5.5 (xhigh) | 2026-04-23 |
60.24
|
|
| GPT-5.5 Pro (xhigh) | 2026-04-23 |
—
|
|
| Hy3-preview (Non-reasoning) | 2026-04-23 |
33.66
|
|
| Hy3-preview (Reasoning) | 2026-04-23 |
41.85
|
|
| Ling-2.6-1T | 2026-04-23 |
33.61
|
|
| MiMo-V2.5 | 2026-04-22 |
49.03
|
|
| MiMo-V2.5-Pro | 2026-04-22 |
53.83
|
|
| MiMo-V2.5-Pro (Non-reasoning) | 2026-04-22 |
35.59
|
|
| Qwen3.6 27B (Non-reasoning) | 2026-04-22 |
37.14
|
|
| Qwen3.6 27B (Reasoning) | 2026-04-22 |
45.82
|
|
| Ling 2.6 Flash | 2026-04-21 |
26.16
|
|
| Kimi K2.6 | 2026-04-20 |
53.90
|
|
| Kimi K2.6 (Non-reasoning) | 2026-04-20 |
42.95
|
|
| Qwen3.6 Max Preview | 2026-04-20 |
51.81
|
|
| Claude Opus 4.7 (Adaptive Reasoning, Max Effort) | 2026-04-16 |
57.28
|
|
| Claude Opus 4.7 (Non-reasoning, High Effort) | 2026-04-16 |
51.82
|
|
| Qwen3.6 35B A3B (Non-reasoning) | 2026-04-16 |
31.53
|
|
| Qwen3.6 35B A3B (Reasoning) | 2026-04-16 |
43.49
|
|
| EXAONE 4.5 33B | 2026-04-09 |
30.23
|
|
| EXAONE 4.5 33B (Non-reasoning) | 2026-04-09 |
—
|
|
| Muse Spark | 2026-04-08 |
52.15
|
|
| GLM-5.1 (Non-reasoning) | 2026-04-07 |
43.82
|
|
| GLM-5.1 (Reasoning) | 2026-04-07 |
51.41
|
|
| Grok 4.20 0309 v2 (Non-reasoning) | 2026-04-07 |
28.99
|
|
| Grok 4.20 0309 v2 (Reasoning) | 2026-04-07 |
49.33
|
|
| Solar Pro 3 | 2026-04-06 |
25.87
|
|
| Gemma 4 E4B (Non-reasoning) | 2026-04-03 |
14.83
|
|
| Gemma 4 E4B (Reasoning) | 2026-04-03 |
18.76
|
|
| Gemma 4 26B A4B (Non-reasoning) | 2026-04-02 |
27.09
|
|
| Gemma 4 26B A4B (Reasoning) | 2026-04-02 |
31.21
|
|
| Gemma 4 31B (Non-reasoning) | 2026-04-02 |
32.29
|
|
| Gemma 4 31B (Reasoning) | 2026-04-02 |
39.18
|
|
| Gemma 4 E2B (Non-reasoning) | 2026-04-02 |
12.10
|
|
| Gemma 4 E2B (Reasoning) | 2026-04-02 |
15.21
|
|
| Qwen3.6 Plus | 2026-04-02 |
49.98
|
|
| Step 3.5 Flash 2603 | 2026-04-02 |
38.47
|
|
| GLM 5V Turbo (Reasoning) | 2026-04-01 |
42.85
|
|
| Trinity Large Thinking | 2026-04-01 |
31.87
|
|
| Qwen3.5 Omni Flash | 2026-03-30 |
25.87
|
|
| Qwen3.5 Omni Plus | 2026-03-30 |
38.63
|
|
| MiMo-V2-Omni-0327 | 2026-03-27 |
44.93
|
|
| MiMo-V2-Omni | 2026-03-19 |
43.40
|
|
| Nemotron Cascade 2 30B A3B | 2026-03-19 |
28.35
|
|
| MiMo-V2-Pro | 2026-03-18 |
49.20
|
|
| MiniMax-M2.7 | 2026-03-18 |
49.62
|
|
| GPT-5.4 mini (Non-Reasoning) | 2026-03-17 |
23.28
|
|
| GPT-5.4 mini (medium) | 2026-03-17 |
37.73
|
|
| GPT-5.4 mini (xhigh) | 2026-03-17 |
48.90
|
|
| GPT-5.4 nano (Non-Reasoning) | 2026-03-17 |
24.36
|
|
| GPT-5.4 nano (medium) | 2026-03-17 |
38.11
|
|
| GPT-5.4 nano (xhigh) | 2026-03-17 |
43.98
|
|
| Mistral Small 4 (Non-reasoning) | 2026-03-16 |
18.62
|
|
| Mistral Small 4 (Reasoning) | 2026-03-16 |
27.80
|
|
| NVIDIA Nemotron 3 Nano 4B | 2026-03-16 |
14.68
|
|
| GLM-5-Turbo | 2026-03-15 |
46.76
|
|
| NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | 2026-03-11 |
35.97
|
|
| Grok 4.20 0309 (Non-reasoning) | 2026-03-10 |
29.69
|
|
| Grok 4.20 0309 (Reasoning) | 2026-03-10 |
48.48
|
|
| Sarvam 105B (high) | 2026-03-06 |
18.16
|
|
| Sarvam 30B (high) | 2026-03-06 |
12.34
|
|
| GPT-5.4 (Non-reasoning) | 2026-03-05 |
35.39
|
|
| GPT-5.4 (low) | 2026-03-05 |
47.94
|
|
| GPT-5.4 (xhigh) | 2026-03-05 |
56.80
|
|
| GPT-5.4 Pro (xhigh) | 2026-03-05 |
—
|
|
| Gemini 3.1 Flash-Lite Preview | 2026-03-03 |
33.52
|
|
| Qwen3.5 0.8B (Non-reasoning) | 2026-03-02 |
9.91
|
|
| Qwen3.5 0.8B (Reasoning) | 2026-03-02 |
10.52
|
|
| Qwen3.5 2B (Non-reasoning) | 2026-03-02 |
14.67
|
|
| Qwen3.5 2B (Reasoning) | 2026-03-02 |
16.29
|
|
| Qwen3.5 4B (Non-reasoning) | 2026-03-02 |
22.60
|
|
| Qwen3.5 4B (Reasoning) | 2026-03-02 |
27.08
|
|
| Qwen3.5 9B (Non-reasoning) | 2026-03-02 |
27.33
|
|
| Qwen3.5 9B (Reasoning) | 2026-03-02 |
32.43
|
|
| LFM2 24B A2B | 2026-02-25 |
10.49
|
|
| Qwen3.5 122B A10B (Non-reasoning) | 2026-02-24 |
35.87
|
|
| Qwen3.5 122B A10B (Reasoning) | 2026-02-24 |
41.60
|
|
| Qwen3.5 27B (Non-reasoning) | 2026-02-24 |
37.18
|
|
| Qwen3.5 27B (Reasoning) | 2026-02-24 |
42.07
|
|
| Qwen3.5 35B A3B (Non-reasoning) | 2026-02-24 |
30.69
|
|
| Qwen3.5 35B A3B (Reasoning) | 2026-02-24 |
37.12
|
|
| Mercury 2 | 2026-02-20 |
32.82
|
|
| Gemini 3.1 Pro Preview | 2026-02-19 |
57.18
|
|
| Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | 2026-02-17 |
51.72
|
|
| Claude Sonnet 4.6 (Non-reasoning, High Effort) | 2026-02-17 |
44.38
|
|
| Claude Sonnet 4.6 (Non-reasoning, Low Effort) | 2026-02-17 |
42.60
|
|
| Tiny Aya Global | 2026-02-17 |
4.74
|
|
| Qwen3.5 397B A17B (Non-reasoning) | 2026-02-16 |
40.10
|
|
| Qwen3.5 397B A17B (Reasoning) | 2026-02-16 |
45.05
|
|
| MiniMax-M2.5 | 2026-02-12 |
41.93
|
|
| GLM-5 (Non-reasoning) | 2026-02-11 |
40.57
|
|
| GLM-5 (Reasoning) | 2026-02-11 |
49.77
|
|
| Nanbeige4.1-3B | 2026-02-11 |
16.08
|
|
| Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | 2026-02-05 |
52.95
|
|
| Claude Opus 4.6 (Non-reasoning, High Effort) | 2026-02-05 |
46.46
|
|
| GPT-5.3 Codex (xhigh) | 2026-02-05 |
53.56
|
|
| Gemini 3 Deep Think | 2026-02-05 |
—
|
|
| Qwen3 Coder Next | 2026-02-03 |
28.28
|
|
| Step 3.5 Flash | 2026-02-02 |
37.80
|
|
| LongCat Flash Lite | 2026-01-28 |
23.93
|
|
| Kimi K2.5 (Non-reasoning) | 2026-01-27 |
37.27
|
|
| Kimi K2.5 (Reasoning) | 2026-01-27 |
46.81
|
|
| Qwen3 Max Thinking | 2026-01-26 |
39.85
|
|
| LFM2.5-1.2B-Thinking | 2026-01-20 |
8.08
|
|
| Step3 VL 10B | 2026-01-20 |
15.45
|
|
| GLM-4.7-Flash (Non-reasoning) | 2026-01-19 |
22.07
|
|
| GLM-4.7-Flash (Reasoning) | 2026-01-19 |
30.15
|
|
| LFM2.5-1.2B-Instruct | 2026-01-05 |
8.04
|
|
| LFM2.5-VL-1.6B | 2026-01-05 |
6.18
|
|
| Falcon-H1R-7B | 2026-01-04 |
15.80
|
|
| K-EXAONE (Non-reasoning) | 2025-12-31 |
23.41
|
|
| K-EXAONE (Reasoning) | 2025-12-31 |
32.12
|
|
| MiniMax-M2.1 | 2025-12-23 |
39.42
|
|
| GLM-4.7 (Non-reasoning) | 2025-12-22 |
34.16
|
|
| GLM-4.7 (Reasoning) | 2025-12-22 |
42.11
|
|
| Gemini 3 Flash Preview (Non-reasoning) | 2025-12-17 |
35.05
|
|
| Gemini 3 Flash Preview (Reasoning) | 2025-12-17 |
46.43
|
|
| Solar Open 100B (Reasoning) | 2025-12-17 |
21.67
|
|
| MiMo-V2-Flash (Feb 2026) | 2025-12-16 |
41.46
|
|
| MiMo-V2-Flash (Non-reasoning) | 2025-12-16 |
30.35
|
|
| MiMo-V2-Flash (Reasoning) | 2025-12-16 |
39.24
|
|
| NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | 2025-12-15 |
13.17
|
|
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | 2025-12-15 |
24.27
|
|
| GPT-5.2 (Non-reasoning) | 2025-12-11 |
33.57
|
|
| GPT-5.2 (medium) | 2025-12-11 |
46.64
|
|
| GPT-5.2 (xhigh) | 2025-12-11 |
51.28
|
|
| GPT-5.2 Codex (xhigh) | 2025-12-11 |
49.03
|
|
| Mi:dm K 2.5 Pro | 2025-12-11 |
23.06
|
|
| Mi:dm K 2.5 Pro Preview | 2025-12-11 |
—
|
|
| Devstral 2 | 2025-12-09 |
22.04
|
|
| Devstral Small 2 | 2025-12-09 |
19.47
|
|
| GLM-4.6V (Non-reasoning) | 2025-12-08 |
17.10
|
|
| GLM-4.6V (Reasoning) | 2025-12-08 |
23.42
|
|
| Motif-2-12.7B-Reasoning | 2025-12-04 |
19.08
|
|
| Ministral 3 14B | 2025-12-02 |
15.98
|
|
| Ministral 3 3B | 2025-12-02 |
11.24
|
|
| Ministral 3 8B | 2025-12-02 |
14.84
|
|
| Mistral Large 3 | 2025-12-02 |
22.80
|
|
| DeepSeek V3.2 (Non-reasoning) | 2025-12-01 |
32.09
|
|
| DeepSeek V3.2 (Reasoning) | 2025-12-01 |
41.71
|
|
| DeepSeek V3.2 Speciale | 2025-12-01 |
29.43
|
|
| INTELLECT-3 | 2025-11-27 |
22.17
|
|
| Nova 2.0 Pro Preview (Non-reasoning) | 2025-11-27 |
23.06
|
|
| Nova 2.0 Pro Preview (low) | 2025-11-27 |
31.90
|
|
| Nova 2.0 Pro Preview (medium) | 2025-11-27 |
35.71
|
|
| Nova 2.0 Omni (Non-reasoning) | 2025-11-26 |
16.61
|
|
| Nova 2.0 Omni (low) | 2025-11-26 |
23.22
|
|
| Nova 2.0 Omni (medium) | 2025-11-26 |
28.02
|
|
| Claude Opus 4.5 (Non-reasoning) | 2025-11-24 |
43.09
|
|
| Claude Opus 4.5 (Reasoning) | 2025-11-24 |
49.73
|
|
| Grok 4.1 Fast (Non-reasoning) | 2025-11-19 |
23.56
|
|
| Grok 4.1 Fast (Reasoning) | 2025-11-19 |
38.61
|
|
| Gemini 3 Pro Preview (high) | 2025-11-18 |
48.39
|
|
| Gemini 3 Pro Preview (low) | 2025-11-18 |
41.30
|
|
| ERNIE 5.0 Thinking Preview | 2025-11-13 |
29.09
|
|
| GPT-5.1 (Non-reasoning) | 2025-11-13 |
27.42
|
|
| GPT-5.1 (high) | 2025-11-13 |
47.70
|
|
| GPT-5.1 Codex (high) | 2025-11-13 |
43.11
|
|
| GPT-5.1 Codex mini (high) | 2025-11-13 |
38.63
|
|
| Doubao Seed Code | 2025-11-11 |
33.52
|
|
| Kimi K2 Thinking | 2025-11-06 |
40.89
|
|
| Qwen3 Max Thinking (Preview) | 2025-11-03 |
32.48
|
|
| Kimi Linear 48B A3B Instruct | 2025-10-30 |
14.41
|
|
| Nova 2.0 Lite (Non-reasoning) | 2025-10-29 |
18.03
|
|
| Nova 2.0 Lite (high) | 2025-10-29 |
34.54
|
|
| Nova 2.0 Lite (low) | 2025-10-29 |
24.59
|
|
| Nova 2.0 Lite (medium) | 2025-10-29 |
29.73
|
|
| Granite 4.0 1B | 2025-10-28 |
7.34
|
|
| Granite 4.0 350M | 2025-10-28 |
6.10
|
|
| Granite 4.0 H 1B | 2025-10-28 |
7.99
|
|
| Granite 4.0 H 350M | 2025-10-28 |
5.44
|
|
| NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) | 2025-10-28 |
10.09
|
|
| NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | 2025-10-28 |
14.89
|
|
| MiniMax-M2 | 2025-10-26 |
36.09
|
|
| Qwen3 VL 32B (Reasoning) | 2025-10-21 |
24.72
|
|
| Qwen3 VL 32B Instruct | 2025-10-21 |
17.19
|
|
| Claude 4.5 Haiku (Non-reasoning) | 2025-10-15 |
31.05
|
|
| Claude 4.5 Haiku (Reasoning) | 2025-10-15 |
37.09
|
|
| Qwen3 VL 4B (Reasoning) | 2025-10-14 |
13.73
|
|
| Qwen3 VL 4B Instruct | 2025-10-14 |
9.55
|
|
| Qwen3 VL 8B (Reasoning) | 2025-10-14 |
16.66
|
|
| Qwen3 VL 8B Instruct | 2025-10-14 |
14.30
|
|
| Ring-1T | 2025-10-13 |
22.78
|
|
| Jamba Reasoning 3B | 2025-10-08 |
9.60
|
|
| Ling-1T | 2025-10-08 |
19.04
|
|
| LFM2 8B A1B | 2025-10-07 |
7.03
|
|
| Qwen3 VL 30B A3B (Reasoning) | 2025-10-03 |
19.68
|
|
| Qwen3 VL 30B A3B Instruct | 2025-10-03 |
16.05
|
|
| GLM-4.6 (Non-reasoning) | 2025-09-30 |
30.24
|
|
| GLM-4.6 (Reasoning) | 2025-09-30 |
32.51
|
|
| Claude 4.5 Sonnet (Non-reasoning) | 2025-09-29 |
37.14
|
|
| Claude 4.5 Sonnet (Reasoning) | 2025-09-29 |
43.03
|
|
| DeepSeek V3.2 Exp (Non-reasoning) | 2025-09-29 |
28.44
|
|
| DeepSeek V3.2 Exp (Reasoning) | 2025-09-29 |
32.94
|
|
| Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) | 2025-09-25 |
25.70
|
|
| Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | 2025-09-25 |
31.14
|
|
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | 2025-09-25 |
19.42
|
|
| GPT-5 Codex (high) | 2025-09-23 |
44.63
|
|
| LFM2 2.6B | 2025-09-23 |
8.04
|
|
| Qwen3 Max | 2025-09-23 |
31.38
|
|
| Qwen3 VL 235B A22B (Reasoning) | 2025-09-23 |
27.64
|
|
| Qwen3 VL 235B A22B Instruct | 2025-09-23 |
20.75
|
|
| DeepSeek V3.1 Terminus (Non-reasoning) | 2025-09-22 |
28.52
|
|
| DeepSeek V3.1 Terminus (Reasoning) | 2025-09-22 |
33.93
|
|
| Granite 4.0 H Small | 2025-09-22 |
10.81
|
|
| Granite 4.0 Micro | 2025-09-22 |
7.67
|
|
| Qwen3 Omni 30B A3B (Reasoning) | 2025-09-22 |
15.62
|
|
| Qwen3 Omni 30B A3B Instruct | 2025-09-22 |
10.68
|
|
| Grok 4 Fast (Non-reasoning) | 2025-09-19 |
23.12
|
|
| Grok 4 Fast (Reasoning) | 2025-09-19 |
35.06
|
|
| Ring-flash-2.0 | 2025-09-19 |
14.02
|
|
| Magistral Medium 1.2 | 2025-09-18 |
27.10
|
|
| Ling-flash-2.0 | 2025-09-17 |
15.74
|
|
| Magistral Small 1.2 | 2025-09-17 |
18.16
|
|
| Qwen3 Next 80B A3B (Reasoning) | 2025-09-11 |
26.72
|
|
| Qwen3 Next 80B A3B Instruct | 2025-09-11 |
20.11
|
|
| Ling-mini-2.0 | 2025-09-09 |
9.19
|
|
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | 2025-09-08 |
21.65
|
|
| Kimi K2 0905 | 2025-09-05 |
30.85
|
|
| Qwen3 Max (Preview) | 2025-09-05 |
26.08
|
|
| Apertus 70B Instruct | 2025-09-02 |
7.70
|
|
| Apertus 8B Instruct | 2025-09-02 |
5.88
|
|
| Grok Code Fast 1 | 2025-08-28 |
28.74
|
|
| DeepSeek V3.1 (Non-reasoning) | 2025-08-21 |
28.13
|
|
| DeepSeek V3.1 (Reasoning) | 2025-08-21 |
27.71
|
|
| Seed-OSS-36B-Instruct | 2025-08-20 |
25.16
|
|
| NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | 2025-08-18 |
13.16
|
|
| NVIDIA Nemotron Nano 9B V2 (Reasoning) | 2025-08-18 |
14.76
|
|
| Gemma 3 270M | 2025-08-14 |
7.71
|
|
| Mistral Medium 3.1 | 2025-08-12 |
21.25
|
|
| GLM-4.5V (Non-reasoning) | 2025-08-11 |
12.74
|
|
| GLM-4.5V (Reasoning) | 2025-08-11 |
15.09
|
|
| GPT-5 (ChatGPT) | 2025-08-07 |
21.83
|
|
| GPT-5 (high) | 2025-08-07 |
44.63
|
|
| GPT-5 (low) | 2025-08-07 |
39.20
|
|
| GPT-5 (medium) | 2025-08-07 |
42.03
|
|
| GPT-5 (minimal) | 2025-08-07 |
23.89
|
|
| GPT-5 mini (high) | 2025-08-07 |
41.17
|
|
| GPT-5 mini (medium) | 2025-08-07 |
38.94
|
|
| GPT-5 mini (minimal) | 2025-08-07 |
20.68
|
|
| GPT-5 nano (high) | 2025-08-07 |
26.83
|
|
| GPT-5 nano (medium) | 2025-08-07 |
25.88
|
|
| GPT-5 nano (minimal) | 2025-08-07 |
13.84
|
|
| Qwen3 4B 2507 (Reasoning) | 2025-08-06 |
18.18
|
|
| Qwen3 4B 2507 Instruct | 2025-08-06 |
12.88
|
|
| Claude 4.1 Opus (Non-reasoning) | 2025-08-05 |
36.00
|
|
| Claude 4.1 Opus (Reasoning) | 2025-08-05 |
42.00
|
|
| gpt-oss-120B (high) | 2025-08-05 |
33.27
|
|
| gpt-oss-120B (low) | 2025-08-05 |
24.47
|
|
| gpt-oss-20B (high) | 2025-08-05 |
24.47
|
|
| gpt-oss-20B (low) | 2025-08-05 |
20.79
|
|
| Qwen3 Coder 30B A3B Instruct | 2025-07-31 |
19.98
|
|
| Qwen3 30B A3B 2507 (Reasoning) | 2025-07-30 |
22.41
|
|
| Qwen3 30B A3B 2507 Instruct | 2025-07-29 |
15.00
|
|
| GLM-4.5 (Reasoning) | 2025-07-28 |
26.42
|
|
| GLM-4.5-Air | 2025-07-28 |
23.17
|
|
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | 2025-07-25 |
14.59
|
|
| Llama Nemotron Super 49B v1.5 (Reasoning) | 2025-07-25 |
18.68
|
|
| Qwen3 235B A22B 2507 (Reasoning) | 2025-07-25 |
29.54
|
|
| Qwen3 Coder 480B A35B Instruct | 2025-07-22 |
24.77
|
|
| Qwen3 235B A22B 2507 Instruct | 2025-07-21 |
24.96
|
|
| EXAONE 4.0 32B (Non-reasoning) | 2025-07-15 |
11.66
|
|
| EXAONE 4.0 32B (Reasoning) | 2025-07-15 |
16.68
|
|
| Exaone 4.0 1.2B (Non-reasoning) | 2025-07-15 |
8.11
|
|
| Exaone 4.0 1.2B (Reasoning) | 2025-07-15 |
8.26
|
|
| Kimi K2 | 2025-07-11 |
26.32
|
|
| Devstral Medium | 2025-07-10 |
18.66
|
|
| Devstral Small (Jul '25) | 2025-07-10 |
15.21
|
|
| Grok 4 | 2025-07-10 |
41.52
|
|
| LFM2 1.2B | 2025-07-10 |
6.33
|
|
| Solar Pro 2 (Non-reasoning) | 2025-07-09 |
13.59
|
|
| Solar Pro 2 (Reasoning) | 2025-07-09 |
14.92
|
|
| Jamba 1.7 Large | 2025-07-07 |
10.88
|
|
| Jamba 1.7 Mini | 2025-07-07 |
8.07
|
|
| ERNIE 4.5 300B A47B | 2025-06-30 |
14.96
|
|
| Gemma 3n E2B Instruct | 2025-06-26 |
4.76
|
|
| Gemma 3n E4B Instruct | 2025-06-26 |
6.38
|
|
| Mistral Small 3.2 | 2025-06-20 |
15.07
|
|
| Gemini 2.5 Flash-Lite (Non-reasoning) | 2025-06-17 |
12.66
|
|
| Gemini 2.5 Flash-Lite (Reasoning) | 2025-06-17 |
17.57
|
|
| MiniMax M1 40k | 2025-06-17 |
20.86
|
|
| MiniMax M1 80k | 2025-06-17 |
24.43
|
|
| Magistral Medium 1 | 2025-06-10 |
18.77
|
|
| Magistral Small 1 | 2025-06-10 |
16.79
|
|
| o3-pro | 2025-06-10 |
40.69
|
|
| Gemini 2.5 Pro | 2025-06-05 |
34.63
|
|
| DeepSeek R1 0528 Qwen3 8B | 2025-05-29 |
16.43
|
|
| DeepSeek R1 0528 (May '25) | 2025-05-28 |
27.07
|
|
| Sarvam M (Reasoning) | 2025-05-23 |
8.39
|
|
| Claude 4 Opus (Non-reasoning) | 2025-05-22 |
33.00
|
|
| Claude 4 Opus (Reasoning) | 2025-05-22 |
39.00
|
|
| Claude 4 Sonnet (Non-reasoning) | 2025-05-22 |
33.00
|
|
| Claude 4 Sonnet (Reasoning) | 2025-05-22 |
38.66
|
|
| Devstral Small (May '25) | 2025-05-21 |
18.03
|
|
| Gemini 2.5 Flash (Non-reasoning) | 2025-05-20 |
20.56
|
|
| Gemini 2.5 Flash (Reasoning) | 2025-05-20 |
27.04
|
|
| Gemma 3n E4B Instruct Preview (May '25) | 2025-05-20 |
10.06
|
|
| Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | 2025-05-20 |
14.43
|
|
| Solar Pro 2 (Preview) (Non-reasoning) | 2025-05-20 |
15.99
|
|
| Solar Pro 2 (Preview) (Reasoning) | 2025-05-20 |
18.81
|
|
| Mistral Medium 3 | 2025-05-07 |
18.76
|
|
| Gemini 2.5 Pro Preview (May' 25) | 2025-05-06 |
29.55
|
|
| Nova Premier | 2025-04-30 |
19.01
|
|
| Qwen3 0.6B (Non-reasoning) | 2025-04-28 |
5.68
|
|
| Qwen3 0.6B (Reasoning) | 2025-04-28 |
6.47
|
|
| Qwen3 1.7B (Non-reasoning) | 2025-04-28 |
6.76
|
|
| Qwen3 1.7B (Reasoning) | 2025-04-28 |
7.96
|
|
| Qwen3 14B (Non-reasoning) | 2025-04-28 |
12.76
|
|
| Qwen3 14B (Reasoning) | 2025-04-28 |
16.19
|
|
| Qwen3 235B A22B (Non-reasoning) | 2025-04-28 |
16.96
|
|
| Qwen3 235B A22B (Reasoning) | 2025-04-28 |
19.79
|
|
| Qwen3 30B A3B (Non-reasoning) | 2025-04-28 |
12.53
|
|
| Qwen3 30B A3B (Reasoning) | 2025-04-28 |
15.28
|
|
| Qwen3 32B (Non-reasoning) | 2025-04-28 |
14.53
|
|
| Qwen3 32B (Reasoning) | 2025-04-28 |
16.53
|
|
| Qwen3 4B (Non-reasoning) | 2025-04-28 |
12.49
|
|
| Qwen3 4B (Reasoning) | 2025-04-28 |
14.22
|
|
| Qwen3 8B (Non-reasoning) | 2025-04-28 |
10.63
|
|
| Qwen3 8B (Reasoning) | 2025-04-28 |
13.18
|
|
| Gemini 2.5 Flash Preview (Non-reasoning) | 2025-04-17 |
17.84
|
|
| Gemini 2.5 Flash Preview (Reasoning) | 2025-04-17 |
24.29
|
|
| Granite 3.3 8B (Non-reasoning) | 2025-04-16 |
7.00
|
|
| o3 | 2025-04-16 |
38.37
|
|
| o4-mini (high) | 2025-04-16 |
33.06
|
|
| GPT-4.1 | 2025-04-14 |
26.28
|
|
| GPT-4.1 mini | 2025-04-14 |
22.90
|
|
| GPT-4.1 nano | 2025-04-14 |
13.04
|
|
| Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | 2025-04-07 |
15.02
|
|
| Llama 4 Maverick | 2025-04-05 |
18.36
|
|
| Llama 4 Scout | 2025-04-05 |
13.52
|
|
| GPT-4o (March 2025, chatgpt-4o-latest) | 2025-03-27 |
18.56
|
|
| DeepSeek V3 0324 | 2025-03-25 |
22.28
|
|
| Gemini 2.5 Pro Preview (Mar' 25) | 2025-03-25 |
30.30
|
|
| o1-pro | 2025-03-19 |
25.76
|
|
| Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) | 2025-03-18 |
14.35
|
|
| Llama 3.3 Nemotron Super 49B v1 (Reasoning) | 2025-03-18 |
18.49
|
|
| Mistral Small 3.1 | 2025-03-17 |
14.48
|
|
| Command A | 2025-03-13 |
13.48
|
|
| Gemma 3 1B Instruct | 2025-03-13 |
5.55
|
|
| Gemma 3 12B Instruct | 2025-03-12 |
8.79
|
|
| Gemma 3 27B Instruct | 2025-03-12 |
10.31
|
|
| Gemma 3 4B Instruct | 2025-03-12 |
6.30
|
|
| Reka Flash 3 | 2025-03-10 |
9.52
|
|
| Jamba 1.6 Large | 2025-03-06 |
10.56
|
|
| Jamba 1.6 Mini | 2025-03-06 |
7.87
|
|
| QwQ 32B | 2025-03-05 |
19.72
|
|
| GPT-4.5 (Preview) | 2025-02-27 |
19.96
|
|
| Phi-4 Multimodal Instruct | 2025-02-26 |
10.04
|
|
| Gemini 2.0 Flash-Lite (Feb '25) | 2025-02-25 |
14.70
|
|
| Claude 3.7 Sonnet (Non-reasoning) | 2025-02-24 |
30.81
|
|
| Claude 3.7 Sonnet (Reasoning) | 2025-02-24 |
34.71
|
|
| Grok 3 | 2025-02-19 |
25.17
|
|
| Grok 3 Reasoning Beta | 2025-02-19 |
21.65
|
|
| Grok 3 mini Reasoning (high) | 2025-02-19 |
32.08
|
|
| R1 1776 | 2025-02-18 |
11.99
|
|
| Mistral Saba | 2025-02-17 |
12.13
|
|
| GPT-4o (ChatGPT) | 2025-02-15 |
14.11
|
|
| Gemini 2.0 Flash (Feb '25) | 2025-02-05 |
18.51
|
|
| Gemini 2.0 Flash-Lite (Preview) | 2025-02-05 |
14.49
|
|
| Gemini 2.0 Pro Experimental (Feb '25) | 2025-02-05 |
18.05
|
|
| o3-mini | 2025-01-31 |
25.86
|
|
| o3-mini (high) | 2025-01-31 |
25.21
|
|
| Mistral Small 3 | 2025-01-30 |
12.67
|
|
| Qwen2.5 Max | 2025-01-28 |
16.28
|
|
| Sonar Reasoning | 2025-01-28 |
17.88
|
|
| Sonar Reasoning Pro | 2025-01-28 |
24.62
|
|
| Gemini 2.0 Flash Thinking Experimental (Jan '25) | 2025-01-21 |
19.60
|
|
| Sonar | 2025-01-21 |
15.49
|
|
| Sonar Pro | 2025-01-21 |
15.23
|
|
| DeepSeek R1 (Jan '25) | 2025-01-20 |
18.84
|
|
| DeepSeek R1 Distill Llama 70B | 2025-01-20 |
15.95
|
|
| DeepSeek R1 Distill Llama 8B | 2025-01-20 |
12.10
|
|
| DeepSeek R1 Distill Qwen 1.5B | 2025-01-20 |
9.08
|
|
| DeepSeek R1 Distill Qwen 14B | 2025-01-20 |
15.84
|
|
| DeepSeek R1 Distill Qwen 32B | 2025-01-20 |
17.17
|
|
| DeepSeek V3 (Dec '24) | 2024-12-26 |
16.46
|
|
| Gemini 2.0 Flash Thinking Experimental (Dec '24) | 2024-12-19 |
12.33
|
|
| GPT-4o Realtime (Dec '24) | 2024-12-17 |
—
|
|
| GPT-4o mini Realtime (Dec '24) | 2024-12-17 |
—
|
|
| Grok 2 (Dec '24) | 2024-12-12 |
13.89
|
|
| Phi-4 | 2024-12-12 |
10.41
|
|
| Gemini 2.0 Flash (experimental) | 2024-12-11 |
16.77
|
|
| DeepSeek-V2.5 (Dec '24) | 2024-12-10 |
12.51
|
|
| Llama 3.3 Instruct 70B | 2024-12-06 |
14.49
|
|
| o1 | 2024-12-05 |
30.75
|
|
| Nova Lite | 2024-12-03 |
12.65
|
|
| Nova Micro | 2024-12-03 |
10.27
|
|
| Nova Pro | 2024-12-03 |
13.48
|
|
| QwQ 32B-Preview | 2024-11-27 |
15.17
|
|
| GPT-4o (Nov '24) | 2024-11-20 |
17.32
|
|
| Mistral Large 2 (Nov '24) | 2024-11-18 |
15.09
|
|
| Pixtral Large | 2024-11-18 |
14.00
|
|
| Qwen2.5 Turbo | 2024-11-18 |
11.97
|
|
| Qwen2.5 Coder Instruct 32B | 2024-11-11 |
12.87
|
|
| Claude 3.5 Haiku | 2024-10-22 |
18.66
|
|
| Claude 3.5 Sonnet (Oct '24) | 2024-10-22 |
15.93
|
|
| Llama 3.1 Nemotron Instruct 70B | 2024-10-15 |
13.44
|
|
| Reka Flash (Sep '24) | 2024-10-04 |
11.97
|
|
| Gemini 1.5 Flash-8B | 2024-10-03 |
11.13
|
|
| LFM 40B | 2024-09-30 |
8.76
|
|
| Llama 3.2 Instruct 11B (Vision) | 2024-09-25 |
8.73
|
|
| Llama 3.2 Instruct 1B | 2024-09-25 |
6.28
|
|
| Llama 3.2 Instruct 3B | 2024-09-25 |
9.70
|
|
| Llama 3.2 Instruct 90B (Vision) | 2024-09-25 |
11.90
|
|
| Gemini 1.5 Flash (Sep '24) | 2024-09-24 |
13.79
|
|
| Gemini 1.5 Pro (Sep '24) | 2024-09-24 |
15.99
|
|
| Qwen2.5 Coder Instruct 7B | 2024-09-19 |
9.98
|
|
| Qwen2.5 Instruct 32B | 2024-09-19 |
13.24
|
|
| Qwen2.5 Instruct 72B | 2024-09-19 |
15.56
|
|
| Mistral Small (Sep '24) | 2024-09-17 |
10.18
|
|
| o1-mini | 2024-09-12 |
20.39
|
|
| o1-preview | 2024-09-12 |
23.74
|
|
| DeepSeek-V2.5 | 2024-09-06 |
12.33
|
|
| Jamba 1.5 Large | 2024-08-22 |
10.70
|
|
| Jamba 1.5 Mini | 2024-08-22 |
8.03
|
|
| Grok Beta | 2024-08-13 |
13.28
|
|
| GPT-4o (Aug '24) | 2024-08-06 |
18.64
|
|
| Mistral Large 2 (Jul '24) | 2024-07-24 |
13.03
|
|
| Llama 3.1 Instruct 405B | 2024-07-23 |
17.38
|
|
| Llama 3.1 Instruct 70B | 2024-07-23 |
12.47
|
|
| Llama 3.1 Instruct 8B | 2024-07-23 |
11.76
|
|
| GPT-4o mini | 2024-07-18 |
12.65
|
|
| Claude 3.5 Sonnet (June '24) | 2024-06-21 |
14.17
|
|
| DeepSeek Coder V2 Lite Instruct | 2024-06-17 |
8.48
|
|
| DeepSeek-Coder-V2 | 2024-06-17 |
10.61
|
|
| Qwen2 Instruct 72B | 2024-06-07 |
11.66
|
|
| Gemini 1.5 Pro (May '24) | 2024-05-15 |
12.00
|
|
| Gemini 1.5 Flash (May '24) | 2024-05-14 |
10.46
|
|
| GPT-4o (May '24) | 2024-05-13 |
14.50
|
|
| DeepSeek-V2-Chat | 2024-05-06 |
9.06
|
|
| Qwen1.5 Chat 110B | 2024-04-25 |
9.55
|
|
| Arctic Instruct | 2024-04-24 |
8.82
|
|
| Phi-3 Mini Instruct 3.8B | 2024-04-23 |
10.10
|
|
| Llama 3 Instruct 70B | 2024-04-18 |
8.88
|
|
| Llama 3 Instruct 8B | 2024-04-18 |
6.38
|
|
| Mixtral 8x22B Instruct | 2024-04-17 |
9.84
|
|
| Command-R+ (Apr '24) | 2024-04-04 |
8.35
|
|
| DBRX Instruct | 2024-03-27 |
8.32
|
|
| Grok-1 | 2024-03-17 |
11.69
|
|
| Command-R (Mar '24) | 2024-03-12 |
7.41
|
|
| Claude 3 Haiku | 2024-03-04 |
12.26
|
|
| Claude 3 Opus | 2024-03-04 |
18.00
|
|
| Claude 3 Sonnet | 2024-03-04 |
10.27
|
|
| Mistral Large (Feb '24) | 2024-02-26 |
9.91
|
|
| Mistral Small (Feb '24) | 2024-02-26 |
9.04
|
|
| Phi-4 Mini Instruct | 2024-02-26 |
8.39
|
|
| Solar Mini | 2024-01-25 |
11.90
|
|
| OpenChat 3.5 (1210) | 2023-12-18 |
8.32
|
|
| Mistral Medium | 2023-12-11 |
9.01
|
|
| Mixtral 8x7B Instruct | 2023-12-11 |
7.73
|
|
| Gemini 1.0 Pro | 2023-12-06 |
8.50
|
|
| Gemini 1.0 Ultra | 2023-12-06 |
10.15
|
|
| Qwen Chat 72B | 2023-11-30 |
8.82
|
|
| DeepSeek LLM 67B Chat (V1) | 2023-11-29 |
8.37
|
|
| Claude 2.1 | 2023-11-21 |
9.32
|
|
| GPT-4 Turbo | 2023-11-06 |
13.72
|
|
| Mistral 7B Instruct | 2023-09-27 |
7.41
|
|
| Qwen Chat 14B | 2023-09-25 |
7.41
|
|
| Llama 2 Chat 13B | 2023-07-18 |
8.36
|
|
| Llama 2 Chat 70B | 2023-07-18 |
8.37
|
|
| Llama 2 Chat 7B | 2023-07-18 |
9.74
|
|
| Claude 2.0 | 2023-07-11 |
9.06
|
|
| GPT-3.5 Turbo (0613) | 2023-06-13 |
—
|
|
| PALM-2 | 2023-05-10 |
8.59
|
|
| Claude Instant | 2023-03-14 |
7.41
|
|
| GPT-4 | 2023-03-14 |
12.75
|
|
| Llama 65B | 2023-02-24 |
7.41
|
|
| GPT-3.5 Turbo | 2022-11-30 |
8.99
|
|
| JT-MINI |
25.37
|