OA0
OA0 is a community for exploring AI
Community status
Registered members 1205
Topics 846
Models 3026
Skill packs 13874
Datasets 1047
Papers 380
Open-source projects 602
Model name Vendor Release date Intelligence score
GLM-5.2 (max) Z AI 2026-06-16
50.67
Kimi K2.7 Code Kimi 2026-06-12
41.95
Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) Anthropic 2026-06-09
59.86
North Mini Code Cohere 2026-06-09
20.56
Nemotron 3 Ultra 550B A55B (Reasoning) NVIDIA 2026-06-04
37.76
Gemma 4 12B (Non-reasoning) Google 2026-06-03
13.19
Gemma 4 12B (Reasoning) Google 2026-06-03
22.00
MiniMax-M3 MiniMax 2026-06-01
44.44
Qwen3.7 Plus Alibaba 2026-06-01
38.98
Step 3.7 Flash StepFun 2026-05-29
29.73
Claude Opus 4.8 (Adaptive Reasoning, Max Effort) Anthropic 2026-05-28
55.69
LFM2.5-8B-A1B Liquid AI 2026-05-28
8.32
HyperNova 60B 2605 Multiverse Computing 2026-05-26
22.14
MiniCPM5-1B (Non-reasoning) OpenBMB 2026-05-25
11.74
MiniCPM5-1B (Reasoning) OpenBMB 2026-05-25
11.96
Command A+ Cohere 2026-05-20
29.29
Gemini 3.5 Flash (high) Google 2026-05-19
50.20
Gemini 3.5 Flash (medium) Google 2026-05-19
45.38
Gemini 3.5 Flash (minimal) Google 2026-05-19
34.91
Qwen3.7 Max Alibaba 2026-05-19
45.99
JT-35B-Flash China Mobile 2026-05-14
28.36
MiniCPM-V 4.6 1.3B OpenBMB 2026-05-11
6.92
Ring-2.6-1T InclusionAI 2026-05-08
30.57
GPT-5.5 Instant (May 2026) OpenAI 2026-05-05
33.52
Grok 4.3 (Non-reasoning) xAI 2026-04-30
24.75
Grok 4.3 (high) xAI 2026-04-30
37.58
Grok 4.3 (low) xAI 2026-04-30
35.44
Grok 4.3 (medium) xAI 2026-04-30
36.00
Granite 4.1 30B IBM 2026-04-29
8.86
Granite 4.1 3B IBM 2026-04-29
3.17
Granite 4.1 8B IBM 2026-04-29
6.67
Mistral Medium 3.5 Mistral 2026-04-29
29.95
Nemotron 3 Nano Omni 30B A3B Reasoning NVIDIA 2026-04-29
14.93
DeepSeek V4 Flash (Non-reasoning) DeepSeek 2026-04-24
28.65
DeepSeek V4 Flash (Reasoning, High Effort) DeepSeek 2026-04-24
37.36
DeepSeek V4 Flash (Reasoning, Max Effort) DeepSeek 2026-04-24
40.28
DeepSeek V4 Pro (Non-reasoning) DeepSeek 2026-04-24
31.22
DeepSeek V4 Pro (Reasoning, High Effort) DeepSeek 2026-04-24
40.83
DeepSeek V4 Pro (Reasoning, Max Effort) DeepSeek 2026-04-24
44.27
GPT-5.5 (Non-reasoning) OpenAI 2026-04-23
32.74
GPT-5.5 (high) OpenAI 2026-04-23
53.13
GPT-5.5 (low) OpenAI 2026-04-23
41.73
GPT-5.5 (medium) OpenAI 2026-04-23
47.14
GPT-5.5 (xhigh) OpenAI 2026-04-23
54.84
GPT-5.5 Pro (xhigh) OpenAI 2026-04-23
Hy3-preview (Non-reasoning) Tencent 2026-04-23
26.10
Hy3-preview (Reasoning) Tencent 2026-04-23
33.58
Ling-2.6-1T InclusionAI 2026-04-23
26.05
MiMo-V2.5 Xiaomi 2026-04-22
40.14
MiMo-V2.5-Pro Xiaomi 2026-04-22
42.24
MiMo-V2.5-Pro (Non-reasoning) Xiaomi 2026-04-22
27.86
Qwen3.6 27B (Non-reasoning) Alibaba 2026-04-22
29.28
Qwen3.6 27B (Reasoning) Alibaba 2026-04-22
37.05
Ling 2.6 Flash InclusionAI 2026-04-21
19.25
Kimi K2.6 Kimi 2026-04-20
42.84
Kimi K2.6 (Non-reasoning) Kimi 2026-04-20
34.58
Qwen3.6 Max Preview Alibaba 2026-04-20
40.00
Claude Opus 4.7 (Adaptive Reasoning, Max Effort) Anthropic 2026-04-16
53.53
Claude Opus 4.7 (Non-reasoning, High Effort) Anthropic 2026-04-16
42.68
Qwen3.6 35B A3B (Non-reasoning) Alibaba 2026-04-16
24.15
Qwen3.6 35B A3B (Reasoning) Alibaba 2026-04-16
31.65
JT-MINI China Mobile 2026-04-15
18.53
EXAONE 4.5 33B LG AI Research 2026-04-09
22.96
EXAONE 4.5 33B (Non-reasoning) LG AI Research 2026-04-09
Muse Spark Meta 2026-04-08
43.06
GLM-5.1 (Non-reasoning) Z AI 2026-04-07
35.37
GLM-5.1 (Reasoning) Z AI 2026-04-07
40.16
Grok 4.20 0309 v2 (Non-reasoning) xAI 2026-04-07
21.84
Grok 4.20 0309 v2 (Reasoning) xAI 2026-04-07
37.00
Solar Pro 3 Upstage 2026-04-06
14.14
Gemma 4 E4B (Non-reasoning) Google 2026-04-03
8.91
Gemma 4 E4B (Reasoning) Google 2026-04-03
12.50
Gemma 4 26B A4B (Non-reasoning) Google 2026-04-02
20.10
Gemma 4 26B A4B (Reasoning) Google 2026-04-02
25.69
Gemma 4 31B (Non-reasoning) Google 2026-04-02
24.85
Gemma 4 31B (Reasoning) Google 2026-04-02
29.35
Gemma 4 E2B (Non-reasoning) Google 2026-04-02
6.41
Gemma 4 E2B (Reasoning) Google 2026-04-02
9.26
Qwen3.6 Plus Alibaba 2026-04-02
39.56
Step 3.5 Flash 2603 StepFun 2026-04-02
26.00
GLM 5V Turbo (Reasoning) Z AI 2026-04-01
34.49
Trinity Large Thinking Arcee AI 2026-04-01
24.47
Qwen3.5 Omni Flash Alibaba 2026-03-30
18.99
Qwen3.5 Omni Plus Alibaba 2026-03-30
30.64
MiMo-V2-Omni-0327 Xiaomi 2026-03-27
36.39
MiMo-V2-Omni Xiaomi 2026-03-19
34.99
Nemotron Cascade 2 30B A3B NVIDIA 2026-03-19
21.25
MiMo-V2-Pro Xiaomi 2026-03-18
40.29
MiniMax-M2.7 MiniMax 2026-03-18
38.13
GPT-5.4 mini (Non-Reasoning) OpenAI 2026-03-17
16.62
GPT-5.4 mini (medium) OpenAI 2026-03-17
29.81
GPT-5.4 mini (xhigh) OpenAI 2026-03-17
39.98
GPT-5.4 nano (Non-Reasoning) OpenAI 2026-03-17
17.61
GPT-5.4 nano (medium) OpenAI 2026-03-17
30.16
GPT-5.4 nano (xhigh) OpenAI 2026-03-17
38.24
Mistral Small 4 (Non-reasoning) Mistral 2026-03-16
12.37
Mistral Small 4 (Reasoning) Mistral 2026-03-16
20.75
NVIDIA Nemotron 3 Nano 4B NVIDIA 2026-03-16
8.77
GLM-5-Turbo Z AI 2026-03-15
38.06
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) NVIDIA 2026-03-11
25.41
Grok 4.20 0309 (Non-reasoning) xAI 2026-03-10
22.48
Grok 4.20 0309 (Reasoning) xAI 2026-03-10
36.50
Sarvam 105B (high) Sarvam 2026-03-06
11.94
Sarvam 30B (high) Sarvam 2026-03-06
6.64
GPT-5.4 (Non-reasoning) OpenAI 2026-03-05
27.68
GPT-5.4 (low) OpenAI 2026-03-05
39.14
GPT-5.4 (xhigh) OpenAI 2026-03-05
51.40
GPT-5.4 Pro (xhigh) OpenAI 2026-03-05
Gemini 3.1 Flash-Lite Google 2026-03-03
25.04
Qwen3.5 0.8B (Non-reasoning) Alibaba 2026-03-02
4.42
Qwen3.5 0.8B (Reasoning) Alibaba 2026-03-02
4.97
Qwen3.5 2B (Non-reasoning) Alibaba 2026-03-02
8.76
Qwen3.5 2B (Reasoning) Alibaba 2026-03-02
10.24
Qwen3.5 4B (Non-reasoning) Alibaba 2026-03-02
16.00
Qwen3.5 4B (Reasoning) Alibaba 2026-03-02
20.09
Qwen3.5 9B (Non-reasoning) Alibaba 2026-03-02
20.32
Qwen3.5 9B (Reasoning) Alibaba 2026-03-02
24.97
LFM2 24B A2B Liquid AI 2026-02-25
4.95
Qwen3.5 122B A10B (Non-reasoning) Alibaba 2026-02-24
28.12
Qwen3.5 122B A10B (Reasoning) Alibaba 2026-02-24
32.28
Qwen3.5 27B (Non-reasoning) Alibaba 2026-02-24
29.31
Qwen3.5 27B (Reasoning) Alibaba 2026-02-24
33.78
Qwen3.5 35B A3B (Non-reasoning) Alibaba 2026-02-24
23.38
Qwen3.5 35B A3B (Reasoning) Alibaba 2026-02-24
29.26
Mercury 2 Inception 2026-02-20
25.33
Gemini 3.1 Pro Preview Google 2026-02-19
46.46
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) Anthropic 2026-02-17
47.21
Claude Sonnet 4.6 (Non-reasoning, High Effort) Anthropic 2026-02-17
35.89
Claude Sonnet 4.6 (Non-reasoning, Low Effort) Anthropic 2026-02-17
34.26
Tiny Aya Global Cohere 2026-02-17
1.00
Qwen3.5 397B A17B (Non-reasoning) Alibaba 2026-02-16
31.98
Qwen3.5 397B A17B (Reasoning) Alibaba 2026-02-16
33.68
MiniMax-M2.5 MiniMax 2026-02-12
33.65
GLM-5 (Non-reasoning) Z AI 2026-02-11
32.41
GLM-5 (Reasoning) Z AI 2026-02-11
39.50
Nanbeige4.1-3B Nanbeige 2026-02-11
10.05
Claude Opus 4.6 (Adaptive Reasoning, Max Effort) Anthropic 2026-02-05
43.71
Claude Opus 4.6 (Non-reasoning, High Effort) Anthropic 2026-02-05
37.79
GPT-5.3 Codex (xhigh) OpenAI 2026-02-05
44.27
Gemini 3 Deep Think Google 2026-02-05
Qwen3 Coder Next Alibaba 2026-02-03
21.18
Step 3.5 Flash StepFun 2026-02-02
25.50
LongCat Flash Lite LongCat 2026-01-28
17.22
Kimi K2.5 (Non-reasoning) Kimi 2026-01-27
29.40
Kimi K2.5 (Reasoning) Kimi 2026-01-27
38.11
Qwen3 Max Thinking Alibaba 2026-01-26
31.75
LFM2.5-1.2B-Thinking Liquid AI 2026-01-20
2.75
Step3 VL 10B StepFun 2026-01-20
9.47
GLM-4.7-Flash (Non-reasoning) Z AI 2026-01-19
15.52
GLM-4.7-Flash (Reasoning) Z AI 2026-01-19
22.89
LFM2.5-1.2B-Instruct Liquid AI 2026-01-05
2.71
LFM2.5-VL-1.6B Liquid AI 2026-01-05
1.01
Falcon-H1R-7B TII UAE 2026-01-04
9.79
K-EXAONE (Non-reasoning) LG AI Research 2025-12-31
16.74
K-EXAONE (Reasoning) LG AI Research 2025-12-31
24.70
MiniMax-M2.1 MiniMax 2025-12-23
31.36
GLM-4.7 (Non-reasoning) Z AI 2025-12-22
26.56
GLM-4.7 (Reasoning) Z AI 2025-12-22
33.81
Gemini 3 Flash Preview (Non-reasoning) Google 2025-12-17
27.37
Gemini 3 Flash Preview (Reasoning) Google 2025-12-17
37.76
Solar Open 100B (Reasoning) Upstage 2025-12-17
15.15
MiMo-V2-Flash (Feb 2026) Xiaomi 2025-12-16
33.22
MiMo-V2-Flash (Non-reasoning) Xiaomi 2025-12-16
23.07
MiMo-V2-Flash (Reasoning) Xiaomi 2025-12-16
31.20
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) NVIDIA 2025-12-15
7.39
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) NVIDIA 2025-12-15
17.53
GPT-5.2 (Non-reasoning) OpenAI 2025-12-11
26.02
GPT-5.2 (medium) OpenAI 2025-12-11
37.95
GPT-5.2 (xhigh) OpenAI 2025-12-11
42.18
GPT-5.2 Codex (xhigh) OpenAI 2025-12-11
40.14
Mi:dm K 2.5 Pro Korea Telecom 2025-12-11
16.42
Mi:dm K 2.5 Pro Preview Korea Telecom 2025-12-11
Devstral 2 Mistral 2025-12-09
15.49
Devstral Small 2 Mistral 2025-12-09
13.14
GLM-4.6V (Non-reasoning) Z AI 2025-12-08
10.98
GLM-4.6V (Reasoning) Z AI 2025-12-08
16.75
Motif-2-12.7B-Reasoning Motif Technologies 2025-12-04
12.79
Ministral 3 14B Mistral 2025-12-02
9.96
Ministral 3 3B Mistral 2025-12-02
5.63
Ministral 3 8B Mistral 2025-12-02
8.92
Mistral Large 3 Mistral 2025-12-02
16.19
DeepSeek V3.2 (Non-reasoning) DeepSeek 2025-12-01
24.66
DeepSeek V3.2 (Reasoning) DeepSeek 2025-12-01
33.45
DeepSeek V3.2 Speciale DeepSeek 2025-12-01
22.24
INTELLECT-3 Prime Intellect 2025-11-27
15.61
Nova 2.0 Pro Preview (Non-reasoning) Amazon 2025-11-27
16.42
Nova 2.0 Pro Preview (low) Amazon 2025-11-27
19.50
Nova 2.0 Pro Preview (medium) Amazon 2025-11-27
21.77
Nova 2.0 Omni (Non-reasoning) Amazon 2025-11-26
10.53
Nova 2.0 Omni (low) Amazon 2025-11-26
16.56
Nova 2.0 Omni (medium) Amazon 2025-11-26
20.95
Claude Opus 4.5 (Non-reasoning) Anthropic 2025-11-24
34.71
Claude Opus 4.5 (Reasoning) Anthropic 2025-11-24
40.77
Grok 4.1 Fast (Non-reasoning) xAI 2025-11-19
16.88
Grok 4.1 Fast (Reasoning) xAI 2025-11-19
30.62
Gemini 3 Pro Preview (high) Google 2025-11-18
39.55
Gemini 3 Pro Preview (low) Google 2025-11-18
33.07
ERNIE 5.0 Thinking Preview Baidu 2025-11-13
21.92
GPT-5.1 (Non-reasoning) OpenAI 2025-11-13
20.40
GPT-5.1 (high) OpenAI 2025-11-13
38.91
GPT-5.1 Codex (high) OpenAI 2025-11-13
34.73
GPT-5.1 Codex mini (high) OpenAI 2025-11-13
30.63
Doubao Seed Code ByteDance Seed 2025-11-11
25.98
Kimi K2 Thinking Kimi 2025-11-06
32.70
Qwen3 Max Thinking (Preview) Alibaba 2025-11-03
25.03
Kimi Linear 48B A3B Instruct Kimi 2025-10-30
8.53
Nova 2.0 Lite (Non-reasoning) Amazon 2025-10-29
11.83
Nova 2.0 Lite (high) Amazon 2025-10-29
20.50
Nova 2.0 Lite (low) Amazon 2025-10-29
17.81
Nova 2.0 Lite (medium) Amazon 2025-10-29
19.00
Granite 4.0 1B IBM 2025-10-28
2.07
Granite 4.0 350M IBM 2025-10-28
1.00
Granite 4.0 H 1B IBM 2025-10-28
2.66
Granite 4.0 H 350M IBM 2025-10-28
1.00
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) NVIDIA 2025-10-28
4.58
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) NVIDIA 2025-10-28
8.96
MiniMax-M2 MiniMax 2025-10-26
28.31
Qwen3 VL 32B (Reasoning) Alibaba 2025-10-21
17.93
Qwen3 VL 32B Instruct Alibaba 2025-10-21
11.06
Claude 4.5 Haiku (Non-reasoning) Anthropic 2025-10-15
23.71
Claude 4.5 Haiku (Reasoning) Anthropic 2025-10-15
29.58
Qwen3 VL 4B (Reasoning) Alibaba 2025-10-14
7.90
Qwen3 VL 4B Instruct Alibaba 2025-10-14
4.09
Qwen3 VL 8B (Reasoning) Alibaba 2025-10-14
10.58
Qwen3 VL 8B Instruct Alibaba 2025-10-14
8.42
Ring-1T InclusionAI 2025-10-13
16.16
Jamba Reasoning 3B AI21 Labs 2025-10-08
4.13
Ling-1T InclusionAI 2025-10-08
12.75
LFM2 8B A1B Liquid AI 2025-10-07
1.78
Qwen3 VL 30B A3B (Reasoning) Alibaba 2025-10-03
13.34
Qwen3 VL 30B A3B Instruct Alibaba 2025-10-03
10.02
GLM-4.6 (Non-reasoning) Z AI 2025-09-30
22.98
GLM-4.6 (Reasoning) Z AI 2025-09-30
25.05
Claude 4.5 Sonnet (Non-reasoning) Anthropic 2025-09-29
29.28
Claude 4.5 Sonnet (Reasoning) Anthropic 2025-09-29
34.65
DeepSeek V3.2 Exp (Non-reasoning) DeepSeek 2025-09-29
21.33
DeepSeek V3.2 Exp (Reasoning) DeepSeek 2025-09-29
25.45
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) Google 2025-09-25
18.83
Gemini 2.5 Flash Preview (Sep '25) (Reasoning) Google 2025-09-25
23.80
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) Google 2025-09-25
13.10
GPT-5 Codex (high) OpenAI 2025-09-23
36.12
LFM2 2.6B Liquid AI 2025-09-23
2.71
Qwen3 Max Alibaba 2025-09-23
24.01
Qwen3 VL 235B A22B (Reasoning) Alibaba 2025-09-23
20.60
Qwen3 VL 235B A22B Instruct Alibaba 2025-09-23
14.31
DeepSeek V3.1 Terminus (Non-reasoning) DeepSeek 2025-09-22
21.41
DeepSeek V3.1 Terminus (Reasoning) DeepSeek 2025-09-22
26.35
Granite 4.0 H Small IBM 2025-09-22
5.24
Granite 4.0 Micro IBM 2025-09-22
2.37
Qwen3 Omni 30B A3B (Reasoning) Alibaba 2025-09-22
9.63
Qwen3 Omni 30B A3B Instruct Alibaba 2025-09-22
5.11
Grok 4 Fast (Non-reasoning) xAI 2025-09-19
16.47
Grok 4 Fast (Reasoning) xAI 2025-09-19
27.37
Ring-flash-2.0 InclusionAI 2025-09-19
8.16
Magistral Medium 1.2 Mistral 2025-09-18
20.11
Ling-flash-2.0 InclusionAI 2025-09-17
9.74
Magistral Small 1.2 Mistral 2025-09-17
11.95
Qwen3 Next 80B A3B (Reasoning) Alibaba 2025-09-11
19.77
Qwen3 Next 80B A3B Instruct Alibaba 2025-09-11
13.72
Ling-mini-2.0 InclusionAI 2025-09-09
3.75
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) Google 2025-09-08
15.13
Kimi K2 0905 Kimi 2025-09-05
23.54
Qwen3 Max (Preview) Alibaba 2025-09-05
19.18
Apertus 70B Instruct Swiss AI Initiative 2025-09-02
2.40
Apertus 8B Instruct Swiss AI Initiative 2025-09-02
1.00
Grok Code Fast 1 xAI 2025-08-28
21.61
DeepSeek V3.1 (Non-reasoning) DeepSeek 2025-08-21
21.05
DeepSeek V3.1 (Reasoning) DeepSeek 2025-08-21
20.67
Seed-OSS-36B-Instruct ByteDance Seed 2025-08-20
18.34
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) NVIDIA 2025-08-18
7.38
NVIDIA Nemotron Nano 9B V2 (Reasoning) NVIDIA 2025-08-18
8.84
Gemma 3 270M Google 2025-08-14
2.41
Mistral Medium 3.1 Mistral 2025-08-12
14.77
GLM-4.5V (Non-reasoning) Z AI 2025-08-11
7.00
GLM-4.5V (Reasoning) Z AI 2025-08-11
9.15
GPT-5 (ChatGPT) OpenAI 2025-08-07
15.30
GPT-5 (high) OpenAI 2025-08-07
36.11
GPT-5 (low) OpenAI 2025-08-07
31.15
GPT-5 (medium) OpenAI 2025-08-07
33.74
GPT-5 (minimal) OpenAI 2025-08-07
17.18
GPT-5 mini (high) OpenAI 2025-08-07
32.96
GPT-5 mini (medium) OpenAI 2025-08-07
30.92
GPT-5 mini (minimal) OpenAI 2025-08-07
14.25
GPT-5 nano (high) OpenAI 2025-08-07
19.87
GPT-5 nano (medium) OpenAI 2025-08-07
19.00
GPT-5 nano (minimal) OpenAI 2025-08-07
8.00
Qwen3 4B 2507 (Reasoning) Alibaba 2025-08-06
11.96
Qwen3 4B 2507 Instruct Alibaba 2025-08-06
7.12
Claude 4.1 Opus (Non-reasoning) Anthropic 2025-08-05
28.24
Claude 4.1 Opus (Reasoning) Anthropic 2025-08-05
33.71
gpt-oss-120b (high) OpenAI 2025-08-05
23.83
gpt-oss-120b (low) OpenAI 2025-08-05
17.70
gpt-oss-20B (high) OpenAI 2025-08-05
14.89
gpt-oss-20B (low) OpenAI 2025-08-05
14.34
Qwen3 Coder 30B A3B Instruct Alibaba 2025-07-31
13.61
Qwen3 30B A3B 2507 (Reasoning) Alibaba 2025-07-30
15.83
Qwen3 30B A3B 2507 Instruct Alibaba 2025-07-29
9.06
GLM-4.5 (Reasoning) Z AI 2025-07-28
19.49
GLM-4.5-Air Z AI 2025-07-28
16.52
Llama Nemotron Super 49B v1.5 (Non-reasoning) NVIDIA 2025-07-25
8.68
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA 2025-07-25
12.42
Qwen3 235B A22B 2507 (Reasoning) Alibaba 2025-07-25
22.34
Qwen3 Coder 480B A35B Instruct Alibaba 2025-07-22
17.98
Qwen3 235B A22B 2507 Instruct Alibaba 2025-07-21
18.16
EXAONE 4.0 32B (Non-reasoning) LG AI Research 2025-07-15
6.01
EXAONE 4.0 32B (Reasoning) LG AI Research 2025-07-15
10.59
Exaone 4.0 1.2B (Non-reasoning) LG AI Research 2025-07-15
2.77
Exaone 4.0 1.2B (Reasoning) LG AI Research 2025-07-15
2.90
Kimi K2 Kimi 2025-07-11
19.40
Devstral Medium Mistral 2025-07-10
12.40
Devstral Small (Jul '25) Mistral 2025-07-10
9.26
Grok 4 xAI 2025-07-10
33.28
LFM2 1.2B Liquid AI 2025-07-10
1.15
Solar Pro 2 (Non-reasoning) Upstage 2025-07-09
7.77
Solar Pro 2 (Reasoning) Upstage 2025-07-09
8.99
Jamba 1.7 Large AI21 Labs 2025-07-07
5.30
Jamba 1.7 Mini AI21 Labs 2025-07-07
2.73
ERNIE 4.5 300B A47B Baidu 2025-06-30
9.02
Gemma 3n E2B Instruct Google 2025-06-26
1.00
Gemma 3n E4B Instruct Google 2025-06-26
1.19
Mistral Small 3.2 Mistral 2025-06-20
9.12
Gemini 2.5 Flash-Lite (Non-reasoning) Google 2025-06-17
6.92
Gemini 2.5 Flash-Lite (Reasoning) Google 2025-06-17
11.41
MiniMax M1 40k MiniMax 2025-06-17
14.41
MiniMax M1 80k MiniMax 2025-06-17
17.67
Magistral Medium 1 Mistral 2025-06-10
12.51
Magistral Small 1 Mistral 2025-06-10
10.70
o3-pro OpenAI 2025-06-10
32.52
Gemini 2.5 Pro Google 2025-06-05
26.98
DeepSeek R1 0528 Qwen3 8B DeepSeek 2025-05-29
10.37
DeepSeek R1 0528 (May '25) DeepSeek 2025-05-28
20.08
Sarvam M (Reasoning) Sarvam 2025-05-23
3.03
Claude 4 Opus (Non-reasoning) Anthropic 2025-05-22
25.50
Claude 4 Opus (Reasoning) Anthropic 2025-05-22
30.97
Claude 4 Sonnet (Non-reasoning) Anthropic 2025-05-22
25.49
Claude 4 Sonnet (Reasoning) Anthropic 2025-05-22
30.67
Devstral Small (May '25) Mistral 2025-05-21
11.82
Gemini 2.5 Flash (Non-reasoning) Google 2025-05-20
14.14
Gemini 2.5 Flash (Reasoning) Google 2025-05-20
20.06
Gemma 3n E4B Instruct Preview (May '25) Google 2025-05-20
4.55
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA 2025-05-20
8.54
Solar Pro 2 (Preview) (Non-reasoning) Upstage 2025-05-20
9.97
Solar Pro 2 (Preview) (Reasoning) Upstage 2025-05-20
12.54
Mistral Medium 3 Mistral 2025-05-07
12.49
Gemini 2.5 Pro Preview (May' 25) Google 2025-05-06
22.34
Nova Premier Amazon 2025-04-30
12.73
Qwen3 0.6B (Non-reasoning) Alibaba 2025-04-28
1.00
Qwen3 0.6B (Reasoning) Alibaba 2025-04-28
1.28
Qwen3 1.7B (Non-reasoning) Alibaba 2025-04-28
1.53
Qwen3 1.7B (Reasoning) Alibaba 2025-04-28
2.63
Qwen3 14B (Non-reasoning) Alibaba 2025-04-28
7.02
Qwen3 14B (Reasoning) Alibaba 2025-04-28
10.15
Qwen3 235B A22B (Non-reasoning) Alibaba 2025-04-28
10.86
Qwen3 235B A22B (Reasoning) Alibaba 2025-04-28
13.43
Qwen3 30B A3B (Non-reasoning) Alibaba 2025-04-28
6.80
Qwen3 30B A3B (Reasoning) Alibaba 2025-04-28
9.31
Qwen3 32B (Non-reasoning) Alibaba 2025-04-28
8.63
Qwen3 32B (Reasoning) Alibaba 2025-04-28
10.46
Qwen3 4B (Non-reasoning) Alibaba 2025-04-28
6.77
Qwen3 4B (Reasoning) Alibaba 2025-04-28
8.35
Qwen3 8B (Non-reasoning) Alibaba 2025-04-28
5.07
Qwen3 8B (Reasoning) Alibaba 2025-04-28
7.40
Gemini 2.5 Flash Preview (Non-reasoning) Google 2025-04-17
11.66
Gemini 2.5 Flash Preview (Reasoning) Google 2025-04-17
17.55
Granite 3.3 8B (Non-reasoning) IBM 2025-04-16
1.75
o3 OpenAI 2025-04-16
30.40
o4-mini (high) OpenAI 2025-04-16
25.55
GPT-4.1 OpenAI 2025-04-14
19.36
GPT-4.1 mini OpenAI 2025-04-14
16.27
GPT-4.1 nano OpenAI 2025-04-14
7.27
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA 2025-04-07
9.08
Llama 4 Maverick Meta 2025-04-05
14.27
Llama 4 Scout Meta 2025-04-05
10.04
GPT-4o (March 2025, chatgpt-4o-latest) OpenAI 2025-03-27
12.31
DeepSeek V3 0324 DeepSeek 2025-03-25
15.71
Gemini 2.5 Pro Preview (Mar' 25) Google 2025-03-25
23.03
o1-pro OpenAI 2025-03-19
18.89
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) NVIDIA 2025-03-18
8.47
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA 2025-03-18
12.25
Mistral Small 3.1 Mistral 2025-03-17
8.59
Command A Cohere 2025-03-13
7.67
Gemma 3 1B Instruct Google 2025-03-13
1.00
Gemma 3 12B Instruct Google 2025-03-12
3.39
Gemma 3 27B Instruct Google 2025-03-12
4.78
Gemma 3 4B Instruct Google 2025-03-12
1.12
Reka Flash 3 Reka AI 2025-03-10
4.06
Jamba 1.6 Large AI21 Labs 2025-03-06
5.00
Jamba 1.6 Mini AI21 Labs 2025-03-06
2.55
QwQ 32B Alibaba 2025-03-05
13.37
GPT-4.5 (Preview) OpenAI 2025-02-27
13.59
Phi-4 Multimodal Instruct Microsoft 2025-02-26
4.53
Gemini 2.0 Flash-Lite (Feb '25) Google 2025-02-25
8.79
Claude 3.7 Sonnet (Non-reasoning) Anthropic 2025-02-24
23.50
Claude 3.7 Sonnet (Reasoning) Anthropic 2025-02-24
27.06
Grok 3 xAI 2025-02-19
18.35
Grok 3 Reasoning Beta xAI 2025-02-19
15.13
Grok 3 mini Reasoning (high) xAI 2025-02-19
22.50
R1 1776 Perplexity 2025-02-18
6.31
Mistral Saba Mistral 2025-02-17
6.44
GPT-4o (ChatGPT) OpenAI 2025-02-15
8.25
Gemini 2.0 Flash (Feb '25) Google 2025-02-05
12.27
Gemini 2.0 Flash-Lite (Preview) Google 2025-02-05
8.59
Gemini 2.0 Pro Experimental (Feb '25) Google 2025-02-05
11.85
o3-mini OpenAI 2025-01-31
18.98
o3-mini (high) OpenAI 2025-01-31
18.38
Mistral Small 3 Mistral 2025-01-30
6.93
Qwen2.5 Max Alibaba 2025-01-28
10.23
Sonar Reasoning Perplexity 2025-01-28
11.69
Sonar Reasoning Pro Perplexity 2025-01-28
17.85
Gemini 2.0 Flash Thinking Experimental (Jan '25) Google 2025-01-21
13.26
Sonar Perplexity 2025-01-21
9.51
Sonar Pro Perplexity 2025-01-21
9.27
DeepSeek R1 (Jan '25) DeepSeek 2025-01-20
12.57
DeepSeek R1 Distill Llama 70B DeepSeek 2025-01-20
9.93
DeepSeek R1 Distill Llama 8B DeepSeek 2025-01-20
6.41
DeepSeek R1 Distill Qwen 1.5B DeepSeek 2025-01-20
3.65
DeepSeek R1 Distill Qwen 14B DeepSeek 2025-01-20
9.83
DeepSeek R1 Distill Qwen 32B DeepSeek 2025-01-20
11.04
DeepSeek V3 (Dec '24) DeepSeek 2024-12-26
10.39
Gemini 2.0 Flash Thinking Experimental (Dec '24) Google 2024-12-19
6.63
GPT-4o Realtime (Dec '24) OpenAI 2024-12-17
GPT-4o mini Realtime (Dec '24) OpenAI 2024-12-17
Grok 2 (Dec '24) xAI 2024-12-12
8.04
Phi-4 Microsoft 2024-12-12
4.87
Gemini 2.0 Flash (experimental) Google 2024-12-11
10.68
DeepSeek-V2.5 (Dec '24) DeepSeek 2024-12-10
6.79
Llama 3.3 Instruct 70B Meta 2024-12-06
8.59
o1 OpenAI 2024-12-05
23.44
Nova Lite Amazon 2024-12-03
6.92
Nova Micro Amazon 2024-12-03
4.74
Nova Pro Amazon 2024-12-03
7.67
QwQ 32B-Preview Alibaba 2024-11-27
9.22
GPT-4o (Nov '24) OpenAI 2024-11-20
11.18
Mistral Large 2 (Nov '24) Mistral 2024-11-18
9.15
Pixtral Large Mistral 2024-11-18
8.15
Qwen2.5 Turbo Alibaba 2024-11-18
6.30
Qwen2.5 Coder Instruct 32B Alibaba 2024-11-11
7.11
Claude 3.5 Haiku Anthropic 2024-10-22
12.41
Claude 3.5 Sonnet (Oct '24) Anthropic 2024-10-22
9.91
Llama 3.1 Nemotron Instruct 70B NVIDIA 2024-10-15
7.64
Reka Flash (Sep '24) Reka AI 2024-10-04
6.29
Gemini 1.5 Flash-8B Google 2024-10-03
5.53
LFM 40B Liquid AI 2024-09-30
3.37
Llama 3.2 Instruct 11B (Vision) Meta 2024-09-25
3.33
Llama 3.2 Instruct 1B Meta 2024-09-25
1.10
Llama 3.2 Instruct 3B Meta 2024-09-25
4.22
Llama 3.2 Instruct 90B (Vision) Meta 2024-09-25
6.23
Gemini 1.5 Flash (Sep '24) Google 2024-09-24
7.96
Gemini 1.5 Pro (Sep '24) Google 2024-09-24
9.97
Qwen2.5 Coder Instruct 7B Alibaba 2024-09-19
4.48
Qwen2.5 Instruct 32B Alibaba 2024-09-19
7.45
Qwen2.5 Instruct 72B Alibaba 2024-09-19
9.57
Mistral Small (Sep '24) Mistral 2024-09-17
4.66
o1-mini OpenAI 2024-09-12
13.98
o1-preview OpenAI 2024-09-12
17.04
DeepSeek-V2.5 DeepSeek 2024-09-06
6.62
Jamba 1.5 Large AI21 Labs 2024-08-22
5.13
Jamba 1.5 Mini AI21 Labs 2024-08-22
2.70
Grok Beta xAI 2024-08-13
7.49
GPT-4o (Aug '24) OpenAI 2024-08-06
9.58
Mistral Large 2 (Jul '24) Mistral 2024-07-24
7.27
Llama 3.1 Instruct 405B Meta 2024-07-23
8.50
Llama 3.1 Instruct 70B Meta 2024-07-23
6.75
Llama 3.1 Instruct 8B Meta 2024-07-23
6.10
GPT-4o mini OpenAI 2024-07-18
6.91
Claude 3.5 Sonnet (June '24) Anthropic 2024-06-21
8.30
DeepSeek Coder V2 Lite Instruct DeepSeek 2024-06-17
3.11
DeepSeek-Coder-V2 DeepSeek 2024-06-17
5.05
Qwen2 Instruct 72B Alibaba 2024-06-07
6.01
Gemini 1.5 Pro (May '24) Google 2024-05-15
6.32
Gemini 1.5 Flash (May '24) Google 2024-05-14
4.92
GPT-4o (May '24) OpenAI 2024-05-13
8.60
DeepSeek-V2-Chat DeepSeek 2024-05-06
3.64
Qwen1.5 Chat 110B Alibaba 2024-04-25
4.08
Arctic Instruct Snowflake 2024-04-24
3.42
Phi-3 Mini Instruct 3.8B Microsoft 2024-04-23
4.59
Llama 3 Instruct 70B Meta 2024-04-18
3.47
Llama 3 Instruct 8B Meta 2024-04-18
1.19
Mixtral 8x22B Instruct Mistral 2024-04-17
4.35
Command-R+ (Apr '24) Cohere 2024-04-04
2.99
DBRX Instruct Databricks 2024-03-27
2.96
Grok-1 xAI 2024-03-17
6.04
Command-R (Mar '24) Cohere 2024-03-12
2.14
Claude 3 Haiku Anthropic 2024-03-04
3.86
Claude 3 Opus Anthropic 2024-03-04
11.80
Claude 3 Sonnet Anthropic 2024-03-04
4.74
Mistral Large (Feb '24) Mistral 2024-02-26
4.41
Mistral Small (Feb '24) Mistral 2024-02-26
3.62
Phi-4 Mini Instruct Microsoft 2024-02-26
3.03
Solar Mini Upstage 2024-01-25
6.23
OpenChat 3.5 (1210) OpenChat 2023-12-18
2.96
Mistral Medium Mistral 2023-12-11
3.59
Mixtral 8x7B Instruct Mistral 2023-12-11
2.43
Gemini 1.0 Pro Google 2023-12-06
3.13
Gemini 1.0 Ultra Google 2023-12-06
4.63
Qwen Chat 72B Alibaba 2023-11-30
3.42
DeepSeek LLM 67B Chat (V1) DeepSeek 2023-11-29
3.01
Claude 2.1 Anthropic 2023-11-21
3.88
GPT-4 Turbo OpenAI 2023-11-06
7.89
Mistral 7B Instruct Mistral 2023-09-27
2.14
Qwen Chat 14B Alibaba 2023-09-25
2.14
Llama 2 Chat 13B Meta 2023-07-18
3.00
Llama 2 Chat 70B Meta 2023-07-18
3.01
Llama 2 Chat 7B Meta 2023-07-18
4.26
Claude 2.0 Anthropic 2023-07-11
3.64
GPT-3.5 Turbo (0613) OpenAI 2023-06-13
PALM-2 Google 2023-05-10
3.21
Claude Instant Anthropic 2023-03-14
2.14
GPT-4 OpenAI 2023-03-14
7.01
Llama 65B Meta 2023-02-24
2.14
GPT-3.5 Turbo OpenAI 2022-11-30
3.57
AI model leaderboard data source: Artificial Analysis - Comparison of AI Models
Methodology: /r/docs/methodology
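The listing above puts each model's name, vendor, and release date on one line and the intelligence score on the next, and a few rows have no score at all. The following is a minimal sketch of how such text could be parsed into records, not the site's own code. The names `Entry`, `split_vendor`, `parse_leaderboard`, and the short `VENDORS` tuple are hypothetical and chosen for illustration; the vendor tuple would need to be extended to cover every vendor in the listing.

```python
import re
from typing import List, NamedTuple, Optional, Tuple

# Hypothetical, incomplete vendor list (assumption): extend to match the listing.
VENDORS = ("LG AI Research", "Z AI", "OpenAI", "Anthropic", "Google", "Kimi", "Tencent")

# A row line ends with an ISO date; a score line is a bare decimal number.
ROW = re.compile(r"^(?P<rest>.+) (?P<date>\d{4}-\d{2}-\d{2})$")
SCORE = re.compile(r"^\d+\.\d+$")


class Entry(NamedTuple):
    model: str
    vendor: str
    released: str
    score: Optional[float]  # None when the listing shows no score


def split_vendor(rest: str) -> Tuple[str, str]:
    """Split 'model vendor' on the longest known vendor suffix."""
    for vendor in sorted(VENDORS, key=len, reverse=True):
        if rest.endswith(" " + vendor):
            return rest[: -len(vendor) - 1], vendor
    return rest, ""  # unknown vendor: keep the whole text as the model name


def parse_leaderboard(lines: List[str]) -> List[Entry]:
    entries: List[Entry] = []
    for raw in lines:
        line = raw.strip()
        match = ROW.match(line)
        if match:
            model, vendor = split_vendor(match.group("rest"))
            entries.append(Entry(model, vendor, match.group("date"), None))
        elif SCORE.match(line) and entries and entries[-1].score is None:
            # Attach the score to the row line that precedes it.
            entries[-1] = entries[-1]._replace(score=float(line))
    return entries


sample = [
    "GLM-5.2 (max) Z AI 2026-06-16",
    "50.67",
    "GPT-5.5 Pro (xhigh) OpenAI 2026-04-23",
    "Hy3-preview (Non-reasoning) Tencent 2026-04-23",
    "26.10",
]
entries = parse_leaderboard(sample)
ranked = sorted(
    (e for e in entries if e.score is not None),
    key=lambda e: e.score,
    reverse=True,
)
```

Rows without a score are kept with `score=None` rather than dropped, so they can be filtered out explicitly before ranking, as `ranked` does here.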
OA0 - Omni AI 0, a community for exploring AI
Shanghai ICP Registration No. 2024103595-2