OA0
OA0 是一个探索 AI 的社区
现在注册
已注册用户请  登录
社区运行状况
注册会员 1100
主题 846
模型 3026
技能包 13874
数据集 1047
论文 331
开源项目 532
模型名称 厂商 发布时间 智能评分
Grok 4.3 xAI 2026-04-30
53.20
Granite 4.1 30B IBM 2026-04-29
14.69
Granite 4.1 3B IBM 2026-04-29
8.54
Granite 4.1 8B IBM 2026-04-29
12.38
Mistral Medium 3.5 Mistral 2026-04-29
39.23
Nemotron 3 Nano Omni 30B A3B Reasoning NVIDIA 2026-04-29
21.43
DeepSeek V4 Flash (Non-reasoning) DeepSeek 2026-04-24
36.46
DeepSeek V4 Flash (Reasoning, High Effort) DeepSeek 2026-04-24
44.87
DeepSeek V4 Flash (Reasoning, Max Effort) DeepSeek 2026-04-24
46.52
DeepSeek V4 Pro (Non-reasoning) DeepSeek 2026-04-24
39.27
DeepSeek V4 Pro (Reasoning, High Effort) DeepSeek 2026-04-24
49.79
DeepSeek V4 Pro (Reasoning, Max Effort) DeepSeek 2026-04-24
51.51
GPT-5.5 (Non-reasoning) OpenAI 2026-04-23
40.94
GPT-5.5 (high) OpenAI 2026-04-23
58.87
GPT-5.5 (low) OpenAI 2026-04-23
50.78
GPT-5.5 (medium) OpenAI 2026-04-23
56.71
GPT-5.5 (xhigh) OpenAI 2026-04-23
60.24
GPT-5.5 Pro (xhigh) OpenAI 2026-04-23
Hy3-preview (Non-reasoning) Tencent 2026-04-23
33.66
Hy3-preview (Reasoning) Tencent 2026-04-23
41.85
Ling-2.6-1T InclusionAI 2026-04-23
33.61
MiMo-V2.5 Xiaomi 2026-04-22
49.03
MiMo-V2.5-Pro Xiaomi 2026-04-22
53.83
MiMo-V2.5-Pro (Non-reasoning) Xiaomi 2026-04-22
35.59
Qwen3.6 27B (Non-reasoning) Alibaba 2026-04-22
37.14
Qwen3.6 27B (Reasoning) Alibaba 2026-04-22
45.82
Ling 2.6 Flash InclusionAI 2026-04-21
26.16
Kimi K2.6 Kimi 2026-04-20
53.90
Kimi K2.6 (Non-reasoning) Kimi 2026-04-20
42.95
Qwen3.6 Max Preview Alibaba 2026-04-20
51.81
Claude Opus 4.7 (Adaptive Reasoning, Max Effort) Anthropic 2026-04-16
57.28
Claude Opus 4.7 (Non-reasoning, High Effort) Anthropic 2026-04-16
51.82
Qwen3.6 35B A3B (Non-reasoning) Alibaba 2026-04-16
31.53
Qwen3.6 35B A3B (Reasoning) Alibaba 2026-04-16
43.49
EXAONE 4.5 33B LG AI Research 2026-04-09
30.23
EXAONE 4.5 33B (Non-reasoning) LG AI Research 2026-04-09
Muse Spark Meta 2026-04-08
52.15
GLM-5.1 (Non-reasoning) Z AI 2026-04-07
43.82
GLM-5.1 (Reasoning) Z AI 2026-04-07
51.41
Grok 4.20 0309 v2 (Non-reasoning) xAI 2026-04-07
28.99
Grok 4.20 0309 v2 (Reasoning) xAI 2026-04-07
49.33
Solar Pro 3 Upstage 2026-04-06
25.87
Gemma 4 E4B (Non-reasoning) Google 2026-04-03
14.83
Gemma 4 E4B (Reasoning) Google 2026-04-03
18.76
Gemma 4 26B A4B (Non-reasoning) Google 2026-04-02
27.09
Gemma 4 26B A4B (Reasoning) Google 2026-04-02
31.21
Gemma 4 31B (Non-reasoning) Google 2026-04-02
32.29
Gemma 4 31B (Reasoning) Google 2026-04-02
39.18
Gemma 4 E2B (Non-reasoning) Google 2026-04-02
12.10
Gemma 4 E2B (Reasoning) Google 2026-04-02
15.21
Qwen3.6 Plus Alibaba 2026-04-02
49.98
Step 3.5 Flash 2603 StepFun 2026-04-02
38.47
GLM 5V Turbo (Reasoning) Z AI 2026-04-01
42.85
Trinity Large Thinking Arcee AI 2026-04-01
31.87
Qwen3.5 Omni Flash Alibaba 2026-03-30
25.87
Qwen3.5 Omni Plus Alibaba 2026-03-30
38.63
MiMo-V2-Omni-0327 Xiaomi 2026-03-27
44.93
MiMo-V2-Omni Xiaomi 2026-03-19
43.40
Nemotron Cascade 2 30B A3B NVIDIA 2026-03-19
28.35
MiMo-V2-Pro Xiaomi 2026-03-18
49.20
MiniMax-M2.7 MiniMax 2026-03-18
49.62
GPT-5.4 mini (Non-Reasoning) OpenAI 2026-03-17
23.28
GPT-5.4 mini (medium) OpenAI 2026-03-17
37.73
GPT-5.4 mini (xhigh) OpenAI 2026-03-17
48.90
GPT-5.4 nano (Non-Reasoning) OpenAI 2026-03-17
24.36
GPT-5.4 nano (medium) OpenAI 2026-03-17
38.11
GPT-5.4 nano (xhigh) OpenAI 2026-03-17
43.98
Mistral Small 4 (Non-reasoning) Mistral 2026-03-16
18.62
Mistral Small 4 (Reasoning) Mistral 2026-03-16
27.80
NVIDIA Nemotron 3 Nano 4B NVIDIA 2026-03-16
14.68
GLM-5-Turbo Z AI 2026-03-15
46.76
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) NVIDIA 2026-03-11
35.97
Grok 4.20 0309 (Non-reasoning) xAI 2026-03-10
29.69
Grok 4.20 0309 (Reasoning) xAI 2026-03-10
48.48
Sarvam 105B (high) Sarvam 2026-03-06
18.16
Sarvam 30B (high) Sarvam 2026-03-06
12.34
GPT-5.4 (Non-reasoning) OpenAI 2026-03-05
35.39
GPT-5.4 (low) OpenAI 2026-03-05
47.94
GPT-5.4 (xhigh) OpenAI 2026-03-05
56.80
GPT-5.4 Pro (xhigh) OpenAI 2026-03-05
Gemini 3.1 Flash-Lite Preview Google 2026-03-03
33.52
Qwen3.5 0.8B (Non-reasoning) Alibaba 2026-03-02
9.91
Qwen3.5 0.8B (Reasoning) Alibaba 2026-03-02
10.52
Qwen3.5 2B (Non-reasoning) Alibaba 2026-03-02
14.67
Qwen3.5 2B (Reasoning) Alibaba 2026-03-02
16.29
Qwen3.5 4B (Non-reasoning) Alibaba 2026-03-02
22.60
Qwen3.5 4B (Reasoning) Alibaba 2026-03-02
27.08
Qwen3.5 9B (Non-reasoning) Alibaba 2026-03-02
27.33
Qwen3.5 9B (Reasoning) Alibaba 2026-03-02
32.43
LFM2 24B A2B Liquid AI 2026-02-25
10.49
Qwen3.5 122B A10B (Non-reasoning) Alibaba 2026-02-24
35.87
Qwen3.5 122B A10B (Reasoning) Alibaba 2026-02-24
41.60
Qwen3.5 27B (Non-reasoning) Alibaba 2026-02-24
37.18
Qwen3.5 27B (Reasoning) Alibaba 2026-02-24
42.07
Qwen3.5 35B A3B (Non-reasoning) Alibaba 2026-02-24
30.69
Qwen3.5 35B A3B (Reasoning) Alibaba 2026-02-24
37.12
Mercury 2 Inception 2026-02-20
32.82
Gemini 3.1 Pro Preview Google 2026-02-19
57.18
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) Anthropic 2026-02-17
51.72
Claude Sonnet 4.6 (Non-reasoning, High Effort) Anthropic 2026-02-17
44.38
Claude Sonnet 4.6 (Non-reasoning, Low Effort) Anthropic 2026-02-17
42.60
Tiny Aya Global Cohere 2026-02-17
4.74
Qwen3.5 397B A17B (Non-reasoning) Alibaba 2026-02-16
40.10
Qwen3.5 397B A17B (Reasoning) Alibaba 2026-02-16
45.05
MiniMax-M2.5 MiniMax 2026-02-12
41.93
GLM-5 (Non-reasoning) Z AI 2026-02-11
40.57
GLM-5 (Reasoning) Z AI 2026-02-11
49.77
Nanbeige4.1-3B Nanbeige 2026-02-11
16.08
Claude Opus 4.6 (Adaptive Reasoning, Max Effort) Anthropic 2026-02-05
52.95
Claude Opus 4.6 (Non-reasoning, High Effort) Anthropic 2026-02-05
46.46
GPT-5.3 Codex (xhigh) OpenAI 2026-02-05
53.56
Gemini 3 Deep Think Google 2026-02-05
Qwen3 Coder Next Alibaba 2026-02-03
28.28
Step 3.5 Flash StepFun 2026-02-02
37.80
LongCat Flash Lite LongCat 2026-01-28
23.93
Kimi K2.5 (Non-reasoning) Kimi 2026-01-27
37.27
Kimi K2.5 (Reasoning) Kimi 2026-01-27
46.81
Qwen3 Max Thinking Alibaba 2026-01-26
39.85
LFM2.5-1.2B-Thinking Liquid AI 2026-01-20
8.08
Step3 VL 10B StepFun 2026-01-20
15.45
GLM-4.7-Flash (Non-reasoning) Z AI 2026-01-19
22.07
GLM-4.7-Flash (Reasoning) Z AI 2026-01-19
30.15
LFM2.5-1.2B-Instruct Liquid AI 2026-01-05
8.04
LFM2.5-VL-1.6B Liquid AI 2026-01-05
6.18
Falcon-H1R-7B TII UAE 2026-01-04
15.80
K-EXAONE (Non-reasoning) LG AI Research 2025-12-31
23.41
K-EXAONE (Reasoning) LG AI Research 2025-12-31
32.12
MiniMax-M2.1 MiniMax 2025-12-23
39.42
GLM-4.7 (Non-reasoning) Z AI 2025-12-22
34.16
GLM-4.7 (Reasoning) Z AI 2025-12-22
42.11
Gemini 3 Flash Preview (Non-reasoning) Google 2025-12-17
35.05
Gemini 3 Flash Preview (Reasoning) Google 2025-12-17
46.43
Solar Open 100B (Reasoning) Upstage 2025-12-17
21.67
MiMo-V2-Flash (Feb 2026) Xiaomi 2025-12-16
41.46
MiMo-V2-Flash (Non-reasoning) Xiaomi 2025-12-16
30.35
MiMo-V2-Flash (Reasoning) Xiaomi 2025-12-16
39.24
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) NVIDIA 2025-12-15
13.17
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) NVIDIA 2025-12-15
24.27
GPT-5.2 (Non-reasoning) OpenAI 2025-12-11
33.57
GPT-5.2 (medium) OpenAI 2025-12-11
46.64
GPT-5.2 (xhigh) OpenAI 2025-12-11
51.28
GPT-5.2 Codex (xhigh) OpenAI 2025-12-11
49.03
Mi:dm K 2.5 Pro Korea Telecom 2025-12-11
23.06
Mi:dm K 2.5 Pro Preview Korea Telecom 2025-12-11
Devstral 2 Mistral 2025-12-09
22.04
Devstral Small 2 Mistral 2025-12-09
19.47
GLM-4.6V (Non-reasoning) Z AI 2025-12-08
17.10
GLM-4.6V (Reasoning) Z AI 2025-12-08
23.42
Motif-2-12.7B-Reasoning Motif Technologies 2025-12-04
19.08
Ministral 3 14B Mistral 2025-12-02
15.98
Ministral 3 3B Mistral 2025-12-02
11.24
Ministral 3 8B Mistral 2025-12-02
14.84
Mistral Large 3 Mistral 2025-12-02
22.80
DeepSeek V3.2 (Non-reasoning) DeepSeek 2025-12-01
32.09
DeepSeek V3.2 (Reasoning) DeepSeek 2025-12-01
41.71
DeepSeek V3.2 Speciale DeepSeek 2025-12-01
29.43
INTELLECT-3 Prime Intellect 2025-11-27
22.17
Nova 2.0 Pro Preview (Non-reasoning) Amazon 2025-11-27
23.06
Nova 2.0 Pro Preview (low) Amazon 2025-11-27
31.90
Nova 2.0 Pro Preview (medium) Amazon 2025-11-27
35.71
Nova 2.0 Omni (Non-reasoning) Amazon 2025-11-26
16.61
Nova 2.0 Omni (low) Amazon 2025-11-26
23.22
Nova 2.0 Omni (medium) Amazon 2025-11-26
28.02
Claude Opus 4.5 (Non-reasoning) Anthropic 2025-11-24
43.09
Claude Opus 4.5 (Reasoning) Anthropic 2025-11-24
49.73
Grok 4.1 Fast (Non-reasoning) xAI 2025-11-19
23.56
Grok 4.1 Fast (Reasoning) xAI 2025-11-19
38.61
Gemini 3 Pro Preview (high) Google 2025-11-18
48.39
Gemini 3 Pro Preview (low) Google 2025-11-18
41.30
ERNIE 5.0 Thinking Preview Baidu 2025-11-13
29.09
GPT-5.1 (Non-reasoning) OpenAI 2025-11-13
27.42
GPT-5.1 (high) OpenAI 2025-11-13
47.70
GPT-5.1 Codex (high) OpenAI 2025-11-13
43.11
GPT-5.1 Codex mini (high) OpenAI 2025-11-13
38.63
Doubao Seed Code ByteDance Seed 2025-11-11
33.52
Kimi K2 Thinking Kimi 2025-11-06
40.89
Qwen3 Max Thinking (Preview) Alibaba 2025-11-03
32.48
Kimi Linear 48B A3B Instruct Kimi 2025-10-30
14.41
Nova 2.0 Lite (Non-reasoning) Amazon 2025-10-29
18.03
Nova 2.0 Lite (high) Amazon 2025-10-29
34.54
Nova 2.0 Lite (low) Amazon 2025-10-29
24.59
Nova 2.0 Lite (medium) Amazon 2025-10-29
29.73
Granite 4.0 1B IBM 2025-10-28
7.34
Granite 4.0 350M IBM 2025-10-28
6.10
Granite 4.0 H 1B IBM 2025-10-28
7.99
Granite 4.0 H 350M IBM 2025-10-28
5.44
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning) NVIDIA 2025-10-28
10.09
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) NVIDIA 2025-10-28
14.89
MiniMax-M2 MiniMax 2025-10-26
36.09
Qwen3 VL 32B (Reasoning) Alibaba 2025-10-21
24.72
Qwen3 VL 32B Instruct Alibaba 2025-10-21
17.19
Claude 4.5 Haiku (Non-reasoning) Anthropic 2025-10-15
31.05
Claude 4.5 Haiku (Reasoning) Anthropic 2025-10-15
37.09
Qwen3 VL 4B (Reasoning) Alibaba 2025-10-14
13.73
Qwen3 VL 4B Instruct Alibaba 2025-10-14
9.55
Qwen3 VL 8B (Reasoning) Alibaba 2025-10-14
16.66
Qwen3 VL 8B Instruct Alibaba 2025-10-14
14.30
Ring-1T InclusionAI 2025-10-13
22.78
Jamba Reasoning 3B AI21 Labs 2025-10-08
9.60
Ling-1T InclusionAI 2025-10-08
19.04
LFM2 8B A1B Liquid AI 2025-10-07
7.03
Qwen3 VL 30B A3B (Reasoning) Alibaba 2025-10-03
19.68
Qwen3 VL 30B A3B Instruct Alibaba 2025-10-03
16.05
GLM-4.6 (Non-reasoning) Z AI 2025-09-30
30.24
GLM-4.6 (Reasoning) Z AI 2025-09-30
32.51
Claude 4.5 Sonnet (Non-reasoning) Anthropic 2025-09-29
37.14
Claude 4.5 Sonnet (Reasoning) Anthropic 2025-09-29
43.03
DeepSeek V3.2 Exp (Non-reasoning) DeepSeek 2025-09-29
28.44
DeepSeek V3.2 Exp (Reasoning) DeepSeek 2025-09-29
32.94
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning) Google 2025-09-25
25.70
Gemini 2.5 Flash Preview (Sep '25) (Reasoning) Google 2025-09-25
31.14
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) Google 2025-09-25
19.42
GPT-5 Codex (high) OpenAI 2025-09-23
44.63
LFM2 2.6B Liquid AI 2025-09-23
8.04
Qwen3 Max Alibaba 2025-09-23
31.38
Qwen3 VL 235B A22B (Reasoning) Alibaba 2025-09-23
27.64
Qwen3 VL 235B A22B Instruct Alibaba 2025-09-23
20.75
DeepSeek V3.1 Terminus (Non-reasoning) DeepSeek 2025-09-22
28.52
DeepSeek V3.1 Terminus (Reasoning) DeepSeek 2025-09-22
33.93
Granite 4.0 H Small IBM 2025-09-22
10.81
Granite 4.0 Micro IBM 2025-09-22
7.67
Qwen3 Omni 30B A3B (Reasoning) Alibaba 2025-09-22
15.62
Qwen3 Omni 30B A3B Instruct Alibaba 2025-09-22
10.68
Grok 4 Fast (Non-reasoning) xAI 2025-09-19
23.12
Grok 4 Fast (Reasoning) xAI 2025-09-19
35.06
Ring-flash-2.0 InclusionAI 2025-09-19
14.02
Magistral Medium 1.2 Mistral 2025-09-18
27.10
Ling-flash-2.0 InclusionAI 2025-09-17
15.74
Magistral Small 1.2 Mistral 2025-09-17
18.16
Qwen3 Next 80B A3B (Reasoning) Alibaba 2025-09-11
26.72
Qwen3 Next 80B A3B Instruct Alibaba 2025-09-11
20.11
Ling-mini-2.0 InclusionAI 2025-09-09
9.19
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) Google 2025-09-08
21.65
Kimi K2 0905 Kimi 2025-09-05
30.85
Qwen3 Max (Preview) Alibaba 2025-09-05
26.08
Apertus 70B Instruct Swiss AI Initiative 2025-09-02
7.70
Apertus 8B Instruct Swiss AI Initiative 2025-09-02
5.88
Grok Code Fast 1 xAI 2025-08-28
28.74
DeepSeek V3.1 (Non-reasoning) DeepSeek 2025-08-21
28.13
DeepSeek V3.1 (Reasoning) DeepSeek 2025-08-21
27.71
Seed-OSS-36B-Instruct ByteDance Seed 2025-08-20
25.16
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) NVIDIA 2025-08-18
13.16
NVIDIA Nemotron Nano 9B V2 (Reasoning) NVIDIA 2025-08-18
14.76
Gemma 3 270M Google 2025-08-14
7.71
Mistral Medium 3.1 Mistral 2025-08-12
21.25
GLM-4.5V (Non-reasoning) Z AI 2025-08-11
12.74
GLM-4.5V (Reasoning) Z AI 2025-08-11
15.09
GPT-5 (ChatGPT) OpenAI 2025-08-07
21.83
GPT-5 (high) OpenAI 2025-08-07
44.63
GPT-5 (low) OpenAI 2025-08-07
39.20
GPT-5 (medium) OpenAI 2025-08-07
42.03
GPT-5 (minimal) OpenAI 2025-08-07
23.89
GPT-5 mini (high) OpenAI 2025-08-07
41.17
GPT-5 mini (medium) OpenAI 2025-08-07
38.94
GPT-5 mini (minimal) OpenAI 2025-08-07
20.68
GPT-5 nano (high) OpenAI 2025-08-07
26.83
GPT-5 nano (medium) OpenAI 2025-08-07
25.88
GPT-5 nano (minimal) OpenAI 2025-08-07
13.84
Qwen3 4B 2507 (Reasoning) Alibaba 2025-08-06
18.18
Qwen3 4B 2507 Instruct Alibaba 2025-08-06
12.88
Claude 4.1 Opus (Non-reasoning) Anthropic 2025-08-05
36.00
Claude 4.1 Opus (Reasoning) Anthropic 2025-08-05
42.00
gpt-oss-120B (high) OpenAI 2025-08-05
33.27
gpt-oss-120B (low) OpenAI 2025-08-05
24.47
gpt-oss-20B (high) OpenAI 2025-08-05
24.47
gpt-oss-20B (low) OpenAI 2025-08-05
20.79
Qwen3 Coder 30B A3B Instruct Alibaba 2025-07-31
19.98
Qwen3 30B A3B 2507 (Reasoning) Alibaba 2025-07-30
22.41
Qwen3 30B A3B 2507 Instruct Alibaba 2025-07-29
15.00
GLM-4.5 (Reasoning) Z AI 2025-07-28
26.42
GLM-4.5-Air Z AI 2025-07-28
23.17
Llama Nemotron Super 49B v1.5 (Non-reasoning) NVIDIA 2025-07-25
14.59
Llama Nemotron Super 49B v1.5 (Reasoning) NVIDIA 2025-07-25
18.68
Qwen3 235B A22B 2507 (Reasoning) Alibaba 2025-07-25
29.54
Qwen3 Coder 480B A35B Instruct Alibaba 2025-07-22
24.77
Qwen3 235B A22B 2507 Instruct Alibaba 2025-07-21
24.96
EXAONE 4.0 32B (Non-reasoning) LG AI Research 2025-07-15
11.66
EXAONE 4.0 32B (Reasoning) LG AI Research 2025-07-15
16.68
Exaone 4.0 1.2B (Non-reasoning) LG AI Research 2025-07-15
8.11
Exaone 4.0 1.2B (Reasoning) LG AI Research 2025-07-15
8.26
Kimi K2 Kimi 2025-07-11
26.32
Devstral Medium Mistral 2025-07-10
18.66
Devstral Small (Jul '25) Mistral 2025-07-10
15.21
Grok 4 xAI 2025-07-10
41.52
LFM2 1.2B Liquid AI 2025-07-10
6.33
Solar Pro 2 (Non-reasoning) Upstage 2025-07-09
13.59
Solar Pro 2 (Reasoning) Upstage 2025-07-09
14.92
Jamba 1.7 Large AI21 Labs 2025-07-07
10.88
Jamba 1.7 Mini AI21 Labs 2025-07-07
8.07
ERNIE 4.5 300B A47B Baidu 2025-06-30
14.96
Gemma 3n E2B Instruct Google 2025-06-26
4.76
Gemma 3n E4B Instruct Google 2025-06-26
6.38
Mistral Small 3.2 Mistral 2025-06-20
15.07
Gemini 2.5 Flash-Lite (Non-reasoning) Google 2025-06-17
12.66
Gemini 2.5 Flash-Lite (Reasoning) Google 2025-06-17
17.57
MiniMax M1 40k MiniMax 2025-06-17
20.86
MiniMax M1 80k MiniMax 2025-06-17
24.43
Magistral Medium 1 Mistral 2025-06-10
18.77
Magistral Small 1 Mistral 2025-06-10
16.79
o3-pro OpenAI 2025-06-10
40.69
Gemini 2.5 Pro Google 2025-06-05
34.63
DeepSeek R1 0528 Qwen3 8B DeepSeek 2025-05-29
16.43
DeepSeek R1 0528 (May '25) DeepSeek 2025-05-28
27.07
Sarvam M (Reasoning) Sarvam 2025-05-23
8.39
Claude 4 Opus (Non-reasoning) Anthropic 2025-05-22
33.00
Claude 4 Opus (Reasoning) Anthropic 2025-05-22
39.00
Claude 4 Sonnet (Non-reasoning) Anthropic 2025-05-22
33.00
Claude 4 Sonnet (Reasoning) Anthropic 2025-05-22
38.66
Devstral Small (May '25) Mistral 2025-05-21
18.03
Gemini 2.5 Flash (Non-reasoning) Google 2025-05-20
20.56
Gemini 2.5 Flash (Reasoning) Google 2025-05-20
27.04
Gemma 3n E4B Instruct Preview (May '25) Google 2025-05-20
10.06
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) NVIDIA 2025-05-20
14.43
Solar Pro 2 (Preview) (Non-reasoning) Upstage 2025-05-20
15.99
Solar Pro 2 (Preview) (Reasoning) Upstage 2025-05-20
18.81
Mistral Medium 3 Mistral 2025-05-07
18.76
Gemini 2.5 Pro Preview (May' 25) Google 2025-05-06
29.55
Nova Premier Amazon 2025-04-30
19.01
Qwen3 0.6B (Non-reasoning) Alibaba 2025-04-28
5.68
Qwen3 0.6B (Reasoning) Alibaba 2025-04-28
6.47
Qwen3 1.7B (Non-reasoning) Alibaba 2025-04-28
6.76
Qwen3 1.7B (Reasoning) Alibaba 2025-04-28
7.96
Qwen3 14B (Non-reasoning) Alibaba 2025-04-28
12.76
Qwen3 14B (Reasoning) Alibaba 2025-04-28
16.19
Qwen3 235B A22B (Non-reasoning) Alibaba 2025-04-28
16.96
Qwen3 235B A22B (Reasoning) Alibaba 2025-04-28
19.79
Qwen3 30B A3B (Non-reasoning) Alibaba 2025-04-28
12.53
Qwen3 30B A3B (Reasoning) Alibaba 2025-04-28
15.28
Qwen3 32B (Non-reasoning) Alibaba 2025-04-28
14.53
Qwen3 32B (Reasoning) Alibaba 2025-04-28
16.53
Qwen3 4B (Non-reasoning) Alibaba 2025-04-28
12.49
Qwen3 4B (Reasoning) Alibaba 2025-04-28
14.22
Qwen3 8B (Non-reasoning) Alibaba 2025-04-28
10.63
Qwen3 8B (Reasoning) Alibaba 2025-04-28
13.18
Gemini 2.5 Flash Preview (Non-reasoning) Google 2025-04-17
17.84
Gemini 2.5 Flash Preview (Reasoning) Google 2025-04-17
24.29
Granite 3.3 8B (Non-reasoning) IBM 2025-04-16
7.00
o3 OpenAI 2025-04-16
38.37
o4-mini (high) OpenAI 2025-04-16
33.06
GPT-4.1 OpenAI 2025-04-14
26.28
GPT-4.1 mini OpenAI 2025-04-14
22.90
GPT-4.1 nano OpenAI 2025-04-14
13.04
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) NVIDIA 2025-04-07
15.02
Llama 4 Maverick Meta 2025-04-05
18.36
Llama 4 Scout Meta 2025-04-05
13.52
GPT-4o (March 2025, chatgpt-4o-latest) OpenAI 2025-03-27
18.56
DeepSeek V3 0324 DeepSeek 2025-03-25
22.28
Gemini 2.5 Pro Preview (Mar' 25) Google 2025-03-25
30.30
o1-pro OpenAI 2025-03-19
25.76
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) NVIDIA 2025-03-18
14.35
Llama 3.3 Nemotron Super 49B v1 (Reasoning) NVIDIA 2025-03-18
18.49
Mistral Small 3.1 Mistral 2025-03-17
14.48
Command A Cohere 2025-03-13
13.48
Gemma 3 1B Instruct Google 2025-03-13
5.55
Gemma 3 12B Instruct Google 2025-03-12
8.79
Gemma 3 27B Instruct Google 2025-03-12
10.31
Gemma 3 4B Instruct Google 2025-03-12
6.30
Reka Flash 3 Reka AI 2025-03-10
9.52
Jamba 1.6 Large AI21 Labs 2025-03-06
10.56
Jamba 1.6 Mini AI21 Labs 2025-03-06
7.87
QwQ 32B Alibaba 2025-03-05
19.72
GPT-4.5 (Preview) OpenAI 2025-02-27
19.96
Phi-4 Multimodal Instruct Microsoft 2025-02-26
10.04
Gemini 2.0 Flash-Lite (Feb '25) Google 2025-02-25
14.70
Claude 3.7 Sonnet (Non-reasoning) Anthropic 2025-02-24
30.81
Claude 3.7 Sonnet (Reasoning) Anthropic 2025-02-24
34.71
Grok 3 xAI 2025-02-19
25.17
Grok 3 Reasoning Beta xAI 2025-02-19
21.65
Grok 3 mini Reasoning (high) xAI 2025-02-19
32.08
R1 1776 Perplexity 2025-02-18
11.99
Mistral Saba Mistral 2025-02-17
12.13
GPT-4o (ChatGPT) OpenAI 2025-02-15
14.11
Gemini 2.0 Flash (Feb '25) Google 2025-02-05
18.51
Gemini 2.0 Flash-Lite (Preview) Google 2025-02-05
14.49
Gemini 2.0 Pro Experimental (Feb '25) Google 2025-02-05
18.05
o3-mini OpenAI 2025-01-31
25.86
o3-mini (high) OpenAI 2025-01-31
25.21
Mistral Small 3 Mistral 2025-01-30
12.67
Qwen2.5 Max Alibaba 2025-01-28
16.28
Sonar Reasoning Perplexity 2025-01-28
17.88
Sonar Reasoning Pro Perplexity 2025-01-28
24.62
Gemini 2.0 Flash Thinking Experimental (Jan '25) Google 2025-01-21
19.60
Sonar Perplexity 2025-01-21
15.49
Sonar Pro Perplexity 2025-01-21
15.23
DeepSeek R1 (Jan '25) DeepSeek 2025-01-20
18.84
DeepSeek R1 Distill Llama 70B DeepSeek 2025-01-20
15.95
DeepSeek R1 Distill Llama 8B DeepSeek 2025-01-20
12.10
DeepSeek R1 Distill Qwen 1.5B DeepSeek 2025-01-20
9.08
DeepSeek R1 Distill Qwen 14B DeepSeek 2025-01-20
15.84
DeepSeek R1 Distill Qwen 32B DeepSeek 2025-01-20
17.17
DeepSeek V3 (Dec '24) DeepSeek 2024-12-26
16.46
Gemini 2.0 Flash Thinking Experimental (Dec '24) Google 2024-12-19
12.33
GPT-4o Realtime (Dec '24) OpenAI 2024-12-17
GPT-4o mini Realtime (Dec '24) OpenAI 2024-12-17
Grok 2 (Dec '24) xAI 2024-12-12
13.89
Phi-4 Microsoft 2024-12-12
10.41
Gemini 2.0 Flash (experimental) Google 2024-12-11
16.77
DeepSeek-V2.5 (Dec '24) DeepSeek 2024-12-10
12.51
Llama 3.3 Instruct 70B Meta 2024-12-06
14.49
o1 OpenAI 2024-12-05
30.75
Nova Lite Amazon 2024-12-03
12.65
Nova Micro Amazon 2024-12-03
10.27
Nova Pro Amazon 2024-12-03
13.48
QwQ 32B-Preview Alibaba 2024-11-27
15.17
GPT-4o (Nov '24) OpenAI 2024-11-20
17.32
Mistral Large 2 (Nov '24) Mistral 2024-11-18
15.09
Pixtral Large Mistral 2024-11-18
14.00
Qwen2.5 Turbo Alibaba 2024-11-18
11.97
Qwen2.5 Coder Instruct 32B Alibaba 2024-11-11
12.87
Claude 3.5 Haiku Anthropic 2024-10-22
18.66
Claude 3.5 Sonnet (Oct '24) Anthropic 2024-10-22
15.93
Llama 3.1 Nemotron Instruct 70B NVIDIA 2024-10-15
13.44
Reka Flash (Sep '24) Reka AI 2024-10-04
11.97
Gemini 1.5 Flash-8B Google 2024-10-03
11.13
LFM 40B Liquid AI 2024-09-30
8.76
Llama 3.2 Instruct 11B (Vision) Meta 2024-09-25
8.73
Llama 3.2 Instruct 1B Meta 2024-09-25
6.28
Llama 3.2 Instruct 3B Meta 2024-09-25
9.70
Llama 3.2 Instruct 90B (Vision) Meta 2024-09-25
11.90
Gemini 1.5 Flash (Sep '24) Google 2024-09-24
13.79
Gemini 1.5 Pro (Sep '24) Google 2024-09-24
15.99
Qwen2.5 Coder Instruct 7B Alibaba 2024-09-19
9.98
Qwen2.5 Instruct 32B Alibaba 2024-09-19
13.24
Qwen2.5 Instruct 72B Alibaba 2024-09-19
15.56
Mistral Small (Sep '24) Mistral 2024-09-17
10.18
o1-mini OpenAI 2024-09-12
20.39
o1-preview OpenAI 2024-09-12
23.74
DeepSeek-V2.5 DeepSeek 2024-09-06
12.33
Jamba 1.5 Large AI21 Labs 2024-08-22
10.70
Jamba 1.5 Mini AI21 Labs 2024-08-22
8.03
Grok Beta xAI 2024-08-13
13.28
GPT-4o (Aug '24) OpenAI 2024-08-06
18.64
Mistral Large 2 (Jul '24) Mistral 2024-07-24
13.03
Llama 3.1 Instruct 405B Meta 2024-07-23
17.38
Llama 3.1 Instruct 70B Meta 2024-07-23
12.47
Llama 3.1 Instruct 8B Meta 2024-07-23
11.76
GPT-4o mini OpenAI 2024-07-18
12.65
Claude 3.5 Sonnet (June '24) Anthropic 2024-06-21
14.17
DeepSeek Coder V2 Lite Instruct DeepSeek 2024-06-17
8.48
DeepSeek-Coder-V2 DeepSeek 2024-06-17
10.61
Qwen2 Instruct 72B Alibaba 2024-06-07
11.66
Gemini 1.5 Pro (May '24) Google 2024-05-15
12.00
Gemini 1.5 Flash (May '24) Google 2024-05-14
10.46
GPT-4o (May '24) OpenAI 2024-05-13
14.50
DeepSeek-V2-Chat DeepSeek 2024-05-06
9.06
Qwen1.5 Chat 110B Alibaba 2024-04-25
9.55
Arctic Instruct Snowflake 2024-04-24
8.82
Phi-3 Mini Instruct 3.8B Microsoft 2024-04-23
10.10
Llama 3 Instruct 70B Meta 2024-04-18
8.88
Llama 3 Instruct 8B Meta 2024-04-18
6.38
Mixtral 8x22B Instruct Mistral 2024-04-17
9.84
Command-R+ (Apr '24) Cohere 2024-04-04
8.35
DBRX Instruct Databricks 2024-03-27
8.32
Grok-1 xAI 2024-03-17
11.69
Command-R (Mar '24) Cohere 2024-03-12
7.41
Claude 3 Haiku Anthropic 2024-03-04
12.26
Claude 3 Opus Anthropic 2024-03-04
18.00
Claude 3 Sonnet Anthropic 2024-03-04
10.27
Mistral Large (Feb '24) Mistral 2024-02-26
9.91
Mistral Small (Feb '24) Mistral 2024-02-26
9.04
Phi-4 Mini Instruct Microsoft 2024-02-26
8.39
Solar Mini Upstage 2024-01-25
11.90
OpenChat 3.5 (1210) OpenChat 2023-12-18
8.32
Mistral Medium Mistral 2023-12-11
9.01
Mixtral 8x7B Instruct Mistral 2023-12-11
7.73
Gemini 1.0 Pro Google 2023-12-06
8.50
Gemini 1.0 Ultra Google 2023-12-06
10.15
Qwen Chat 72B Alibaba 2023-11-30
8.82
DeepSeek LLM 67B Chat (V1) DeepSeek 2023-11-29
8.37
Claude 2.1 Anthropic 2023-11-21
9.32
GPT-4 Turbo OpenAI 2023-11-06
13.72
Mistral 7B Instruct Mistral 2023-09-27
7.41
Qwen Chat 14B Alibaba 2023-09-25
7.41
Llama 2 Chat 13B Meta 2023-07-18
8.36
Llama 2 Chat 70B Meta 2023-07-18
8.37
Llama 2 Chat 7B Meta 2023-07-18
9.74
Claude 2.0 Anthropic 2023-07-11
9.06
GPT-3.5 Turbo (0613) OpenAI 2023-06-13
PALM-2 Google 2023-05-10
8.59
Claude Instant Anthropic 2023-03-14
7.41
GPT-4 OpenAI 2023-03-14
12.75
Llama 65B Meta 2023-02-24
7.41
GPT-3.5 Turbo OpenAI 2022-11-30
8.99
JT-MINI China Mobile
25.37
AI模型天梯榜数据来源:Artificial Analysis - Comparison of AI Models
方法论:/r/docs/methodology
关于 ·  帮助 ·  PING ·  隐私 ·  条款   
OA0 - Omni AI 0 一个探索 AI 的社区
沪ICP备2024103595号-2
耗时 69 ms
Developed with Cursor