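| Metric | Count |
|---|---|
| Registered members | 1032 |
| Topics | 361 |
| Models | 2962 |
| Skill packs | 6701 |
| Datasets | 1026 |
| Papers | 236 |
| Open-source projects | 319 |

| Model | Vendor | Release date | Intelligence score |
|---|---|---|---|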
| Gemini 3.1 Pro Preview | | 2026-02-19 | 57.18 |
| GPT-5.4 (xhigh) | | 2026-03-05 | 56.95 |
| GPT-5.3 Codex (xhigh) | | 2026-02-05 | 53.97 |
| Claude Opus 4.6 (max) | | 2026-02-05 | 52.95 |
| Claude Sonnet 4.6 (max) | | 2026-02-17 | 51.72 |
| GPT-5.2 (xhigh) | | 2025-12-11 | 51.28 |
| GLM-5 | | 2026-02-11 | 49.77 |
| Claude Opus 4.5 | | 2025-11-24 | 49.73 |
| GPT-5.2 Codex (xhigh) | | 2025-12-11 | 49.03 |
| Gemini 3 Pro Preview (high) | | 2025-11-18 | 48.39 |
| GPT-5.1 (high) | | 2025-11-13 | 47.7 |
| Kimi K2.5 | | 2026-01-27 | 46.81 |
| GPT-5.2 (medium) | | 2025-12-11 | 46.64 |
| Claude Opus 4.6 | | 2026-02-05 | 46.46 |
| Gemini 3 Flash | | 2025-12-17 | 46.43 |
| Qwen3.5 397B A17B | | 2026-02-16 | 45.05 |
| GPT-5 (high) | | 2025-08-07 | 44.63 |
| GPT-5 Codex (high) | | 2025-09-23 | 44.63 |
| Claude Sonnet 4.6 | | 2026-02-17 | 44.38 |
| GPT-5.1 Codex (high) | | 2025-11-13 | 43.11 |
| Claude Opus 4.5 | | 2025-11-24 | 43.09 |
| Claude 4.5 Sonnet | | 2025-09-29 | 43.03 |
| Claude Sonnet 4.6 (Non-reasoning Low Effort) | | 2026-02-17 | 42.6 |
| GLM-4.7 | | 2025-12-22 | 42.11 |
| Qwen3.5 27B | | 2026-02-24 | 42.07 |
| GPT-5 (medium) | | 2025-08-07 | 42.03 |
| MiniMax-M2.5 | | 2026-02-12 | 41.93 |
| DeepSeek V3.2 | | 2025-12-01 | 41.71 |
| Qwen3.5 122B A10B | | 2026-02-24 | 41.6 |
| Grok 4 | | 2025-07-10 | 41.52 |
| MiMo-V2-Flash (Feb 2026) | | 2025-12-16 | 41.46 |
| Gemini 3 Pro Preview (low) | | 2025-11-18 | 41.3 |
| GPT-5 mini (high) | | 2025-08-07 | 41.17 |
| Kimi K2 Thinking | | 2025-11-06 | 40.89 |
| o3-pro | | 2025-06-10 | 40.68 |
| GLM-5 | | 2026-02-11 | 40.57 |
| Qwen3.5 397B A17B | | 2026-02-16 | 40.1 |
| Qwen3 Max Thinking | | 2026-01-26 | 39.85 |
| MiniMax-M2.1 | | 2025-12-23 | 39.42 |
| MiMo-V2-Flash | | 2025-12-16 | 39.24 |
| GPT-5 (low) | | 2025-08-07 | 39.2 |
| GPT-5 mini (medium) | | 2025-08-07 | 38.94 |
| Claude 4 Sonnet | | 2025-05-22 | 38.66 |
| GPT-5.1 Codex mini (high) | | 2025-11-13 | 38.63 |
| Grok 4.1 Fast | | 2025-11-19 | 38.61 |
| o3 | | 2025-04-16 | 38.37 |
| Kimi K2.5 | | 2026-01-27 | 37.27 |
| Qwen3.5 27B | | 2026-02-24 | 37.18 |
| Claude 4.5 Sonnet | | 2025-09-29 | 37.14 |
| Qwen3.5 35B A3B | | 2026-02-24 | 37.12 |
| Claude 4.5 Haiku | | 2025-10-15 | 37.09 |
| MiniMax-M2 | | 2025-10-26 | 36.09 |
| KAT-Coder-Pro V1 | | 2025-11-11 | 36.03 |
| Qwen3.5 122B A10B | | 2026-02-24 | 35.87 |
| Nova 2.0 Pro Preview (medium) | | 2025-11-27 | 35.71 |
| Grok 4 Fast | | 2025-09-19 | 35.06 |
| Gemini 3 Flash | | 2025-12-17 | 35.05 |
| Claude 3.7 Sonnet | | 2025-02-24 | 34.71 |
| Gemini 2.5 Pro | | 2025-06-05 | 34.63 |
| GLM-4.7 | | 2025-12-22 | 34.16 |
| DeepSeek V3.2 Speciale | | 2025-12-01 | 34.07 |
| DeepSeek V3.1 Terminus | | 2025-09-22 | 33.93 |
| GPT-5.2 | | 2025-12-11 | 33.57 |
| Doubao Seed Code | | 2025-11-11 | 33.52 |
| Gemini 3.1 Flash-Lite Preview | | 2026-03-03 | 33.52 |
| gpt-oss-120B (high) | | 2025-08-05 | 33.27 |
| o4-mini (high) | | 2025-04-16 | 33.06 |
| Claude 4 Sonnet | | 2025-05-22 | 33.0 |
| DeepSeek V3.2 Exp | | 2025-09-29 | 32.94 |
| Mercury 2 | | 2026-02-20 | 32.82 |
| GLM-4.6 | | 2025-09-30 | 32.51 |
| Qwen3 Max Thinking (Preview) | | 2025-11-03 | 32.48 |
| K-EXAONE | | 2025-12-31 | 32.12 |
| DeepSeek V3.2 | | 2025-12-01 | 32.09 |
| Grok 3 mini Reasoning (high) | | 2025-02-19 | 32.08 |
| Nova 2.0 Pro Preview (low) | | 2025-11-27 | 31.9 |
| Claude 4.1 Opus | | 2025-08-05 | 31.88 |
| Qwen3 Max | | 2025-09-23 | 31.38 |
| Gemini 2.5 Flash (Sep) | | 2025-09-25 | 31.14 |
| Claude 4.5 Haiku | | 2025-10-15 | 31.05 |
| Kimi K2 0905 | | 2025-09-05 | 30.85 |
| Claude 3.7 Sonnet | | 2025-02-24 | 30.81 |
| o1 | | 2024-12-05 | 30.75 |
| Qwen3.5 35B A3B | | 2026-02-24 | 30.69 |
| MiMo-V2-Flash | | 2025-12-16 | 30.35 |
| Gemini 2.5 Pro (Mar) | | 2025-03-25 | 30.29 |
| GLM-4.6 | | 2025-09-30 | 30.24 |
| GLM-4.7-Flash | | 2026-01-19 | 30.15 |
| Nova 2.0 Lite (medium) | | 2025-10-29 | 29.73 |
| Gemini 2.5 Pro (May) | | 2025-05-06 | 29.54 |
| Qwen3 235B A22B 2507 | | 2025-07-25 | 29.54 |
| ERNIE 5.0 Thinking Preview | | 2025-11-13 | 29.09 |
| Grok Code Fast 1 | | 2025-08-28 | 28.74 |
| DeepSeek V3.1 Terminus | | 2025-09-22 | 28.52 |
| DeepSeek V3.2 Exp | | 2025-09-29 | 28.44 |
| Apriel-v1.5-15B-Thinker | | 2025-09-30 | 28.33 |
| Qwen3 Coder Next | | 2026-02-03 | 28.28 |
| DeepSeek V3.1 | | 2025-08-21 | 28.13 |
| Nova 2.0 Omni (medium) | | 2025-11-26 | 28.02 |
| DeepSeek V3.1 | | 2025-08-21 | 27.71 |
| Qwen3 VL 235B A22B | | 2025-09-23 | 27.64 |
| Apriel-v1.6-15B-Thinker | | 2025-11-25 | 27.58 |
| GPT-5.1 | | 2025-11-13 | 27.42 |
| Claude 4 Opus | | 2025-05-22 | 27.36 |
| Magistral Medium 1.2 | | 2025-09-18 | 27.1 |
| DeepSeek R1 0528 | | 2025-05-28 | 27.07 |
| Gemini 2.5 Flash | | 2025-05-20 | 27.04 |
| GPT-5 nano (high) | | 2025-08-07 | 26.83 |
| Qwen3 Next 80B A3B | | 2025-09-11 | 26.72 |
| GLM-4.5 | | 2025-07-28 | 26.42 |
| Kimi K2 | | 2025-07-11 | 26.32 |
| GPT-4.1 | | 2025-04-14 | 26.28 |
| Qwen3 Max (Preview) | | 2025-09-05 | 26.08 |
| GPT-5 nano (medium) | | 2025-08-07 | 25.88 |
| o3-mini | | 2025-01-31 | 25.86 |
| o1-pro | | 2025-03-19 | 25.76 |
| Gemini 2.5 Flash (Sep) | | 2025-09-25 | 25.7 |
| o3-mini (high) | | 2025-01-31 | 25.21 |
| Grok 3 | | 2025-02-19 | 25.17 |
| Seed-OSS-36B-Instruct | | 2025-08-20 | 25.16 |
| Qwen3 235B 2507 | | 2025-07-21 | 24.96 |
| Qwen3 Coder 480B | | 2025-07-22 | 24.77 |
| Qwen3 VL 32B | | 2025-10-21 | 24.72 |
| Sonar Reasoning Pro | | 2025-01-28 | 24.62 |
| Nova 2.0 Lite (low) | | 2025-10-29 | 24.59 |
| gpt-oss-120B (low) | | 2025-08-05 | 24.47 |
| gpt-oss-20B (high) | | 2025-08-05 | 24.47 |
| MiniMax M1 80k | | 2025-06-17 | 24.43 |
| Gemini 2.5 Flash | | 2025-04-17 | 24.29 |
| NVIDIA Nemotron 3 Nano | | 2025-12-15 | 24.27 |
| K2 Think V2 | | 2025-12-15 | 24.12 |
| GPT-5 (minimal) | | 2025-08-07 | 23.89 |
| o1-preview | | 2024-09-12 | 23.74 |
| HyperCLOVA X SEED Think (32B) | | 2025-12-26 | 23.72 |
| Claude 4.1 Opus | | 2025-08-05 | 23.56 |
| Grok 4.1 Fast | | 2025-11-19 | 23.56 |
| GLM-4.6V | | 2025-12-08 | 23.42 |
| K-EXAONE | | 2025-12-31 | 23.41 |
| Nova 2.0 Omni (low) | | 2025-11-26 | 23.22 |
| GLM-4.5-Air | | 2025-07-28 | 23.17 |
| Grok 4 Fast | | 2025-09-19 | 23.12 |
| Mi:dm K 2.5 Pro | | 2025-12-11 | 23.06 |
| Nova 2.0 Pro Preview | | 2025-11-27 | 23.06 |
| GPT-4.1 mini | | 2025-04-14 | 22.9 |
| Mistral Large 3 | | 2025-12-02 | 22.8 |
| Ring-1T | | 2025-10-13 | 22.78 |
| Qwen3 30B A3B 2507 | | 2025-07-30 | 22.41 |
| DeepSeek V3 0324 | | 2025-03-25 | 22.28 |
| Claude 4 Opus | | 2025-05-22 | 22.17 |
| INTELLECT-3 | | 2025-11-27 | 22.17 |
| GLM-4.7-Flash | | 2026-01-19 | 22.07 |
| Devstral 2 | | 2025-12-09 | 22.04 |
| GPT-5 (ChatGPT) | | 2025-08-07 | 21.83 |
| Solar Open 100B | | 2025-12-17 | 21.67 |
| Gemini 2.5 Flash-Lite (Sep) | | 2025-09-08 | 21.65 |
| Grok 3 Reasoning Beta | | 2025-02-19 | 21.64 |
| Mistral Medium 3.1 | | 2025-08-12 | 21.25 |
| NVIDIA Nemotron Nano 12B v2 VL | | 2025-10-28 | 21.2 |
| MiniMax M1 40k | | 2025-06-17 | 20.85 |
| gpt-oss-20B (low) | | 2025-08-05 | 20.79 |
| Qwen3 VL 235B A22B | | 2025-09-23 | 20.75 |
| GPT-5 mini (minimal) | | 2025-08-07 | 20.68 |
| K2-V2 (high) | | 2025-12-05 | 20.61 |
| Gemini 2.5 Flash | | 2025-05-20 | 20.56 |
| o1-mini | | 2024-09-12 | 20.38 |
| Qwen3 Next 80B A3B | | 2025-09-11 | 20.11 |
| Tri-21B-think Preview | | 2026-02-10 | 19.99 |
| Qwen3 Coder 30B A3B | | 2025-07-31 | 19.98 |
| GPT-4.5 (Preview) | | 2025-02-27 | 19.95 |
| Qwen3 235B | | 2025-04-28 | 19.79 |
| QwQ-32B | | 2025-03-05 | 19.72 |
| Qwen3 VL 30B A3B | | 2025-10-03 | 19.68 |
| Gemini 2.0 Flash Thinking exp. (Jan) | | 2025-01-21 | 19.6 |
| Devstral Small 2 | | 2025-12-09 | 19.47 |
| Gemini 2.5 Flash-Lite (Sep) | | 2025-09-25 | 19.42 |
| GLM-4.5V | | 2025-08-11 | 19.26 |
| Motif-2-12.7B | | 2025-12-04 | 19.08 |
| Ling-1T | | 2025-10-08 | 19.04 |
| Nova Premier | | 2025-04-30 | 19.01 |
| Olmo 3 32B Think | | 2025-11-20 | 18.88 |
| DeepSeek R1 (Jan) | | 2025-01-20 | 18.84 |
| Solar Pro 2 | | 2025-05-20 | 18.8 |
| NVIDIA Nemotron Nano 9B V2 | | 2025-08-18 | 18.78 |
| Magistral Medium 1 | | 2025-06-10 | 18.77 |
| Mistral Medium 3 | | 2025-05-07 | 18.76 |
| K2-V2 (medium) | | 2025-12-05 | 18.68 |
| Llama Nemotron Super 49B v1.5 | | 2025-07-25 | 18.68 |
| Claude 3.5 Haiku | | 2024-10-22 | 18.66 |
| Devstral Medium | | 2025-07-10 | 18.66 |
| GPT-4o (Aug) | | 2024-08-06 | 18.64 |
| Tri-21B-Think | | 2026-02-10 | 18.62 |
| Hermes 4 405B | | 2025-08-27 | 18.56 |
| GPT-4o (Mar) | | 2025-03-27 | 18.55 |
| Gemini 2.0 Flash | | 2025-02-05 | 18.51 |
| Llama 3.3 Nemotron Super 49B | | 2025-03-18 | 18.49 |
| Llama 4 Maverick | | 2025-04-05 | 18.36 |
| Qwen3 4B 2507 | | 2025-08-06 | 18.18 |
| Magistral Small 1.2 | | 2025-09-17 | 18.16 |
| Gemini 2.0 Pro Experimental | | 2025-02-05 | 18.05 |
| Devstral Small (May) | | 2025-05-21 | 18.03 |
| Nova 2.0 Lite | | 2025-10-29 | 18.03 |
| Sonar Reasoning | | 2025-01-28 | 17.87 |
| Gemini 2.5 Flash | | 2025-04-17 | 17.84 |
| Hermes 4 405B | | 2025-08-27 | 17.63 |
| Gemini 2.5 Flash-Lite | | 2025-06-17 | 17.57 |
| GPT-4o (Nov) | | 2024-11-20 | 17.32 |
| Qwen3 VL 32B | | 2025-10-21 | 17.19 |
| DeepSeek R1 Distill Qwen 32B | | 2025-01-20 | 17.16 |
| GLM-4.6V | | 2025-12-08 | 17.1 |
| Qwen3 235B | | 2025-04-28 | 16.96 |
| Magistral Small 1 | | 2025-06-10 | 16.79 |
| Olmo 3 7B Think | | 2025-11-20 | 16.79 |
| Gemini 2.0 Flash (exp) | | 2024-12-11 | 16.77 |
| EXAONE 4.0 32B | | 2025-07-15 | 16.68 |
| Qwen3 VL 8B | | 2025-10-14 | 16.66 |
| Nova 2.0 Omni | | 2025-11-26 | 16.61 |
| Qwen3 32B | | 2025-04-28 | 16.53 |
| DeepSeek V3 (Dec) | | 2024-12-26 | 16.46 |
| K2-V2 (low) | | 2025-12-05 | 16.46 |
| DeepSeek R1 0528 Qwen3 8B | | 2025-05-29 | 16.43 |
| Qwen2.5 Max | | 2025-01-28 | 16.28 |
| Qwen3 14B | | 2025-04-28 | 16.19 |
| Qwen3 Omni 30B A3B | | 2025-09-22 | 16.06 |
| Qwen3 VL 30B A3B | | 2025-10-03 | 16.05 |
| Gemini 2.5 Flash-Lite | | 2025-06-17 | 16.0 |
| Gemini 1.5 Pro (Sep) | | 2024-09-24 | 15.99 |
| Hermes 4 70B | | 2025-08-27 | 15.99 |
| Solar Pro 2 | | 2025-05-20 | 15.99 |
| Ministral 3 14B | | 2025-12-02 | 15.98 |
| DeepSeek R1 Distill Llama 70B | | 2025-01-20 | 15.95 |
| Claude 3.5 Sonnet (Oct) | | 2024-10-22 | 15.92 |
| DeepSeek R1 Distill Qwen 14B | | 2025-01-20 | 15.84 |
| Falcon-H1R-7B | | 2026-01-04 | 15.8 |
| Ling-flash-2.0 | | 2025-09-17 | 15.74 |
| Qwen3 Omni 30B A3B | | 2025-09-22 | 15.62 |
| GPT-5 nano (minimal) | | 2025-08-07 | 15.59 |
| Qwen2.5 72B | | 2024-09-19 | 15.55 |
| Sonar | | 2025-01-21 | 15.49 |
| Step3 VL 10B | | 2026-01-20 | 15.45 |
| Qwen3 30B | | 2025-04-28 | 15.28 |
| Qwen3 8B | | 2025-04-28 | 15.27 |
| Ministral 3 8B | | 2025-12-02 | 15.25 |
| Sonar Pro | | 2025-01-21 | 15.22 |
| Devstral Small | | 2025-07-10 | 15.21 |
| Llama 3.1 405B | | 2024-07-23 | 15.2 |
| QwQ 32B-Preview | | 2024-11-27 | 15.17 |
| Mistral Large 2 (Nov) | | 2024-11-18 | 15.09 |
| Mistral Small 3.2 | | 2025-06-20 | 15.07 |
| Llama Nemotron Ultra | | 2025-04-07 | 15.02 |
| Qwen3 30B A3B 2507 | | 2025-07-29 | 15.0 |
| ERNIE 4.5 300B A47B | | 2025-06-30 | 14.96 |
| Solar Pro 2 | | 2025-07-09 | 14.92 |
| GPT-4.1 nano | | 2025-04-14 | 14.89 |
| NVIDIA Nemotron Nano 9B V2 | | 2025-08-18 | 14.76 |
| Gemini 2.0 Flash-Lite (Feb) | | 2025-02-25 | 14.7 |
| Exaone 4.0 1.2B | | 2025-07-15 | 14.63 |
| Llama Nemotron Super 49B v1.5 | | 2025-07-25 | 14.59 |
| Qwen3 32B | | 2025-04-28 | 14.53 |
| GPT-4o (May) | | 2024-05-13 | 14.49 |
| Llama 3.3 70B | | 2024-12-06 | 14.49 |
| Gemini 2.0 Flash-Lite (Preview) | | 2025-02-05 | 14.48 |
| Mistral Small 3.1 | | 2025-03-17 | 14.48 |
| Llama 3.1 Nemotron Nano 4B v1.1 | | 2025-05-20 | 14.43 |
| Kimi Linear 48B A3B Instruct | | 2025-10-30 | 14.41 |
| Llama 3.3 Nemotron Super 49B | | 2025-03-18 | 14.34 |
| Qwen3 VL 8B | | 2025-10-14 | 14.3 |
| Qwen3 4B | | 2025-04-28 | 14.22 |
| Claude 3.5 Sonnet (June) | | 2024-06-21 | 14.17 |
| NVIDIA Nemotron Nano 12B v2 VL | | 2025-10-28 | 14.16 |
| Tulu3 405B | | 2025-01-30 | 14.14 |
| GPT-4o (ChatGPT) | | 2025-02-15 | 14.1 |
| Ring-flash-2.0 | | 2025-09-19 | 14.02 |
| Pixtral Large | | 2024-11-18 | 14.0 |
| Olmo 3.1 32B Think | | 2025-12-12 | 13.94 |
| Grok 2 | | 2024-12-12 | 13.88 |
| Gemini 1.5 Flash (Sep) | | 2024-09-24 | 13.79 |
| Qwen3 VL 4B | | 2025-10-14 | 13.73 |
| GPT-4 Turbo | | 2023-11-06 | 13.71 |
| Solar Pro 2 | | 2025-07-09 | 13.59 |
| Llama 4 Scout | | 2025-04-05 | 13.52 |
| Command A | | 2025-03-13 | 13.48 |
| Nova Pro | | 2024-12-03 | 13.48 |
| Llama 3.1 Nemotron 70B | | 2024-10-15 | 13.44 |
| Grok Beta | | 2024-08-13 | 13.28 |
| Qwen3 8B | | 2025-04-28 | 13.24 |
| Qwen2.5 Instruct 32B | | 2024-09-19 | 13.23 |
| Granite 4.0 H Small | | 2025-09-22 | 13.18 |
| NVIDIA Nemotron 3 Nano | | 2025-12-15 | 13.17 |
| Qwen3 1.7B | | 2025-04-28 | 13.06 |
| Mistral Large 2 (Jul) | | 2024-07-24 | 13.03 |
| Olmo 3 7B | | 2025-11-20 | 12.98 |
| Qwen3 4B 2507 | | 2025-08-06 | 12.88 |
| Qwen2.5 Coder 32B | | 2024-11-11 | 12.86 |
| Ministral 3 3B | | 2025-12-02 | 12.85 |
| Qwen3 14B | | 2025-04-28 | 12.76 |
| GLM-4.5V | | 2025-08-11 | 12.74 |
| Mistral Small 3 | | 2025-01-30 | 12.66 |
| Nova Lite | | 2024-12-03 | 12.65 |
| GPT-4o mini | | 2024-07-18 | 12.64 |
| Hermes 4 70B | | 2025-08-27 | 12.63 |
| Qwen3 30B | | 2025-04-28 | 12.53 |
| DeepSeek-V2.5 (Dec) | | 2024-12-10 | 12.51 |
| Jamba 1.7 Large | | 2025-07-07 | 12.51 |
| Qwen3 4B | | 2025-04-28 | 12.49 |
| Llama 3.1 70B | | 2024-07-23 | 12.47 |
| Claude 3 Opus | | 2024-03-04 | 12.45 |
| Exaone 4.0 1.2B | | 2025-07-15 | 12.42 |
| Gemma 3 12B | | 2025-03-12 | 12.38 |
| Gemini 2.0 Flash Thinking exp. (Dec) | | 2024-12-19 | 12.33 |
| DeepSeek-V2.5 | | 2024-09-06 | 12.32 |
| Claude 3 Haiku | | 2024-03-04 | 12.26 |
| Olmo 3.1 32B Instruct | | 2026-01-13 | 12.16 |
| Mistral Saba | | 2025-02-17 | 12.12 |
| DeepSeek R1 Distill Llama 8B | | 2025-01-20 | 12.1 |
| Gemini 1.5 Pro (May) | | 2024-05-15 | 11.99 |
| R1 1776 | | 2025-02-18 | 11.98 |
| Qwen2.5 Turbo | | 2024-11-18 | 11.97 |
| Reka Flash | | 2024-10-04 | 11.96 |
| Llama 3.2 90B (Vision) | | 2024-09-25 | 11.9 |
| Llama 3.1 8B | | 2024-07-23 | 11.76 |
| EXAONE 4.0 32B | | 2025-07-15 | 11.66 |
| Qwen2 72B | | 2024-06-07 | 11.66 |
| Nova Micro | | 2024-12-03 | 11.55 |
| Gemini 1.5 Flash-8B | | 2024-10-03 | 11.13 |
| Phi-4 Mini | | 2024-02-26 | 10.93 |
| DeepHermes 3 - Mistral 24B | | 2025-03-13 | 10.88 |
| Jamba 1.5 Large | | 2024-08-22 | 10.69 |
| Hermes 3 - Llama-3.1 70B | | 2024-08-15 | 10.64 |
| DeepSeek-Coder-V2 | | 2024-06-17 | 10.6 |
| Jamba 1.6 Large | | 2025-03-06 | 10.55 |
| LFM2 24B A2B | | 2026-02-25 | 10.49 |
| Gemini 1.5 Flash (May) | | 2024-05-14 | 10.46 |
| Phi-4 | | 2024-12-12 | 10.41 |
| Granite 4.0 H 1B | | 2025-10-28 | 10.38 |
| Gemma 3 27B | | 2025-03-12 | 10.31 |
| Claude 3 Sonnet | | 2024-03-04 | 10.27 |
| Mistral Small (Sep) | | 2024-09-17 | 10.17 |
| Gemini 1.0 Ultra | | 2023-12-06 | 10.14 |
| Gemma 3n E4B (May) | | 2025-05-20 | 10.05 |
| Phi-4 Multimodal | | 2025-02-26 | 10.04 |
| Qwen2.5 Coder 7B | | 2024-09-19 | 9.98 |
| Mistral Large (Feb) | | 2024-02-26 | 9.91 |
| Mixtral 8x22B | | 2024-04-17 | 9.84 |
| Llama 3.2 3B | | 2024-09-25 | 9.7 |
| Jamba Reasoning 3B | | 2025-10-08 | 9.6 |
| Qwen3 VL 4B | | 2025-10-14 | 9.55 |
| Qwen1.5 Chat 110B | | 2024-04-25 | 9.55 |
| Reka Flash 3 | | 2025-03-10 | 9.52 |
| Claude 2.1 | | 2023-11-21 | 9.32 |
| Ling-mini-2.0 | | 2025-09-09 | 9.19 |
| DeepSeek R1 Distill Qwen 1.5B | | 2025-01-20 | 9.07 |
| Claude 2.0 | | 2023-07-11 | 9.06 |
| DeepSeek-V2 | | 2024-05-06 | 9.06 |
| Mistral Small (Feb) | | 2024-02-26 | 9.04 |
| Mistral Medium | | 2023-12-11 | 9.01 |
| Granite 4.0 350M | | 2025-10-28 | 8.84 |
| Qwen Chat 72B | | 2023-11-30 | 8.82 |
| LFM 40B | | 2024-09-30 | 8.76 |
| Llama 3.2 11B (Vision) | | 2024-09-25 | 8.73 |
| Gemini 1.0 Pro | | 2023-12-06 | 8.5 |
| DeepSeek Coder V2 Lite | | 2024-06-17 | 8.48 |
| Command-R+ (Apr) | | 2024-04-04 | 8.35 |
| DBRX | | 2024-03-27 | 8.31 |
| LFM2.5-1.2B-Thinking | | 2026-01-20 | 8.08 |
| Jamba 1.7 Mini | | 2025-07-07 | 8.07 |
| LFM2 2.6B | | 2025-09-23 | 8.04 |
| LFM2.5-1.2B-Instruct | | 2026-01-05 | 8.04 |
| Jamba 1.5 Mini | | 2024-08-22 | 8.03 |
| Jamba 1.6 Mini | | 2025-03-06 | 7.87 |
| Mixtral 8x7B | | 2023-12-11 | 7.73 |
| Gemma 3 270M | | 2025-08-14 | 7.71 |
| Granite 4.0 Micro | | 2025-09-22 | 7.67 |
| DeepHermes 3 - Llama-3.1 8B | | 2025-02-13 | 7.58 |
| Claude Instant | | 2023-03-14 | 7.41 |
| Command-R (Mar) | | 2024-03-12 | 7.41 |
| Granite 4.0 1B | | 2025-10-28 | 7.34 |
| LFM2 8B A1B | | 2025-10-07 | 7.03 |
| Granite 3.3 8B | | 2025-04-16 | 7.0 |
| Qwen3 1.7B | | 2025-04-28 | 6.76 |
| Qwen3 0.6B | | 2025-04-28 | 6.47 |
| Gemma 3n E4B | | 2025-06-26 | 6.38 |
| LFM2 1.2B | | 2025-07-10 | 6.33 |
| Gemma 3 4B | | 2025-03-12 | 6.3 |
| Llama 3.2 1B | | 2024-09-25 | 6.28 |
| LFM2.5-VL-1.6B | | 2026-01-05 | 6.18 |
| Qwen3 0.6B | | 2025-04-28 | 5.68 |
| Gemma 3 1B | | 2025-03-13 | 5.55 |
| Granite 4.0 H 350M | | 2025-10-28 | 5.44 |
| Gemma 3n E2B | | 2025-06-26 | 4.76 |
| Cogito v2.1 | | 2025-11-18 | N/A |
| GPT-4o Realtime (Dec) | | 2024-12-17 | N/A |
| GPT-4o mini Realtime (Dec) | | 2024-12-17 | N/A |
| Grok Voice Agent | | 2025-12-17 | N/A |
| Mi:dm K 2.5 Pro Preview | | 2025-12-11 | N/A |
| Molmo2-8B | | 2025-12-11 | N/A |