Leaderboard Overview

See how leading models stack up across text, image, vision, and beyond. This page gives a snapshot of each Arena; deeper insights are available in each Arena's dedicated tab.

Arena Overview

gemini-3-pro | 1 | 4 | 1 | 2 | 2 | 1 | 1 | 2
grok-4.1-thinking | 2 | 5 | 4 | 6 | 8 | 5 | 10 | 11
claude-opus-4-5-20251101-thinking-32k | 3 | 2 | 2 | 1 | 4 | 4 | 2 | 1
grok-4.1 | 4 | 25 | 7 | 11 | 16 | 14 | 14 | 12
claude-opus-4-5-20251101 | 5 | 1 | 3 | 5 | 1 | 3 | 3 | 3
gpt-5.1-high | 6 | 7 | 8 | 10 | 3 | 8 | 8 | 9
gemini-2.5-pro | 7 | 10 | 11 | 22 | 7 | 2 | 9 | 8
claude-sonnet-4-5-20250929-thinking-32k | 8 | 3 | 5 | 3 | 5 | 7 | 5 | 4
claude-opus-4-1-20250805-thinking-16k | 9 | 8 | 6 | 4 | 9 | 6 | 4 | 5
claude-sonnet-4-5-20250929 | 10 | 6 | 9 | 7 | 17 | 9 | 6 | 6
gpt-4.5-preview-2025-02-27 | 11 | 36 | 28 | 35 | 38 | 11 | 12 | 17
claude-opus-4-1-20250805 | 12 | 14 | 10 | 8 | 14 | 10 | 7 | 7
chatgpt-4o-latest-20250326 | 13 | 39 | 14 | 27 | 55 | 13 | 16 | 19
gpt-5-high | 14 | 13 | 16 | 21 | 11 | 41 | 22 | 42
gpt-5.1 | 15 | 11 | 15 | 16 | 52 | 20 | 15 | 16
o3-2025-04-16 | 16 | 18 | 26 | 34 | 6 | 40 | 43 | 49
qwen3-max-preview | 17 | 9 | 12 | 13 | 10 | 27 | 13 | 13
kimi-k2-thinking-turbo | 18 | 19 | 18 | 14 | 20 | 22 | 19 | 28
grok-4-1-fast-reasoning | 19 | 16 | 29 | 40 | 41 | 17 | 34 | 46
glm-4.6 | 20 | 23 | 23 | 28 | 18 | 23 | 17 | 25
gpt-5-chat | 21 | 22 | 19 | 31 | 36 | 37 | 23 | 23
qwen3-max-2025-09-23 | 22 | 37 | 21 | 17 | 13 | 28 | 24 | 26
claude-opus-4-20250514-thinking-16k | 23 | 20 | 13 | 9 | 27 | 12 | 11 | 10
deepseek-v3.2-exp | 24 | 41 | 20 | 24 | 35 | 21 | 25 | 18
mistral-large-3 | 25 | 59 | 22 | 12 | 47 | 58 | 32 | 36
qwen3-235b-a22b-instruct-2507 | 26 | 17 | 17 | 20 | 19 | 45 | 20 | 22
deepseek-v3.2-exp-thinking | 27 | 29 | 24 | 19 | 22 | 30 | 21 | 27
grok-4-fast-chat | 28 | 38 | 40 | 39 | 25 | 39 | 42 | 33
deepseek-v3.2-thinking | 29 | 70 | 43 | 36 | 28 | 38 | 37 | 43
kimi-k2-0905-preview | 30 | 40 | 32 | 25 | 34 | 42 | 56 | 57
deepseek-r1-0528 | 31 | 44 | 35 | 29 | 63 | 35 | 50 | 52
ernie-5.0-preview-1022 | 32 | 21 | 45 | 57 | 33 | 15 | 45 | 40
kimi-k2-0711-preview | 33 | 46 | 39 | 32 | 67 | 55 | 66 | 61
deepseek-v3.1 | 34 | 34 | 38 | 47 | 31 | 32 | 39 | 32
deepseek-v3.1-thinking | 35 | 33 | 30 | 37 | 26 | 19 | 18 | 14
deepseek-v3.1-terminus | 36 | - | 47 | 56 | 61 | 18 | 48 | 45
qwen3-vl-235b-a22b-instruct | 37 | 24 | 27 | 26 | 39 | 64 | 27 | 35
deepseek-v3.1-terminus-thinking | 38 | - | 25 | 33 | 32 | 51 | 26 | 20
deepseek-v3.2 | 39 | 45 | 34 | 46 | 15 | 31 | 30 | 30
claude-opus-4-20250514 | 40 | 35 | 33 | 30 | 57 | 16 | 31 | 15
gpt-4.1-2025-04-14 | 41 | 55 | 42 | 42 | 79 | 24 | 46 | 34
mistral-medium-2508 | 42 | 52 | 37 | 44 | 50 | 47 | 44 | 47
grok-3-preview-02-24 | 43 | 51 | 46 | 52 | 76 | 26 | 35 | 29
grok-4-0709 | 44 | 31 | 49 | 58 | 12 | 29 | 47 | 44
glm-4.5 | 45 | 26 | 36 | 38 | 30 | 52 | 33 | 38
gemini-2.5-flash | 46 | 32 | 59 | 74 | 42 | 25 | 40 | 39
gemini-2.5-flash-preview-09-2025 | 47 | 15 | 50 | 73 | 24 | 43 | 41 | 41
grok-4-fast-reasoning | 48 | 43 | 61 | 54 | 44 | 48 | 55 | 50
claude-haiku-4-5-20251001 | 49 | 27 | 31 | 15 | 68 | 49 | 29 | 24
o1-2024-12-17 | 50 | 56 | 55 | 64 | 45 | 46 | 38 | 48
qwen3-next-80b-a3b-instruct | 51 | 57 | 48 | 49 | 23 | 97 | 57 | 60
longcat-flash-chat | 52 | 42 | 44 | 18 | 21 | 84 | 51 | 67
claude-sonnet-4-20250514-thinking-32k | 53 | 30 | 41 | 23 | 48 | 33 | 28 | 21
qwen3-235b-a22b-no-thinking | 54 | 62 | 52 | 50 | 62 | 60 | 60 | 53
qwen3-235b-a22b-thinking-2507 | 55 | 12 | 51 | 53 | 53 | 56 | 52 | 56
deepseek-r1 | 56 | 63 | 54 | 51 | 43 | 54 | 49 | 58
qwen3-vl-235b-a22b-thinking | 57 | 28 | 53 | 41 | 40 | 76 | 58 | 55
gpt-5-mini-high | 58 | 49 | 63 | 65 | 37 | 87 | 64 | 78
deepseek-v3-0324 | 59 | 65 | 64 | 71 | 78 | 34 | 65 | 65
hunyuan-vision-1.5-thinking | 60 | - | 56 | 59 | - | 63 | 54 | 59
o4-mini-2025-04-16 | 61 | 47 | 65 | 66 | 29 | 78 | 71 | 86
mai-1-preview | 62 | 53 | 66 | 68 | 60 | 68 | 67 | 63
claude-sonnet-4-20250514 | 63 | 60 | 57 | 48 | 69 | 44 | 53 | 37
o1-preview | 64 | 76 | 73 | 76 | 70 | 61 | 63 | 75
claude-3-7-sonnet-20250219-thinking-32k | 65 | 48 | 58 | 45 | 74 | 36 | 36 | 31
qwen3-coder-480b-a35b-instruct | 66 | 75 | 60 | 43 | 77 | 66 | 59 | 54
hunyuan-t1-20250711 | 67 | 58 | 69 | 86 | 51 | 50 | 62 | 69
mistral-medium-2505 | 68 | 74 | 70 | 67 | 95 | 67 | 76 | 66
qwen3-30b-a3b-instruct-2507 | 69 | 69 | 62 | 55 | 72 | 90 | 72 | 72
gpt-4.1-mini-2025-04-14 | 70 | 73 | 67 | 60 | 93 | 74 | 69 | 68
hunyuan-turbos-20250416 | 71 | 92 | 77 | 90 | 97 | 62 | 83 | 76
gemini-2.5-flash-lite-preview-09-2025-no-thinking | 72 | 67 | 74 | 89 | 80 | 65 | 73 | 64
gemini-2.5-flash-lite-preview-06-17-thinking | 73 | 81 | 79 | 103 | 82 | 57 | 68 | 71
qwen3-235b-a22b | 74 | 72 | 75 | 63 | 58 | 88 | 79 | 74
qwen2.5-max | 75 | 84 | 80 | 84 | 84 | 70 | 80 | 70
claude-3-5-sonnet-20241022 | 76 | 83 | 71 | 61 | 98 | 59 | 70 | 62
claude-3-7-sonnet-20250219 | 77 | 71 | 72 | 69 | 89 | 53 | 61 | 51
glm-4.5-air | 78 | 66 | 76 | 70 | 64 | 89 | 75 | 73
qwen3-next-80b-a3b-thinking | 79 | 68 | 78 | 72 | 59 | 94 | 78 | 82
minimax-m1 | 80 | 78 | 81 | 75 | 56 | 93 | 85 | 83
gemma-3-27b-it | 81 | 100 | 93 | 126 | 105 | 72 | 89 | 87
o3-mini-high | 82 | 64 | 68 | 62 | 46 | 101 | 74 | 81
grok-3-mini-high | 83 | 54 | 83 | 95 | 65 | 86 | 77 | 77
gemini-2.0-flash-001 | 84 | 93 | 97 | 122 | 92 | 73 | 86 | 89
deepseek-v3 | 85 | 95 | 108 | 97 | 112 | 71 | 91 | 80
grok-3-mini-beta | 86 | 77 | 88 | 100 | 81 | 79 | 81 | 84
mistral-small-2506 | 87 | 102 | 85 | 79 | 99 | 92 | 95 | 91
gemini-2.0-flash-lite-preview-02-05 | 88 | 103 | 109 | 145 | 103 | 75 | 100 | 101
gpt-oss-120b | 89 | 86 | 96 | 93 | 71 | 137 | 103 | 131
command-a-03-2025 | 90 | 98 | 92 | 94 | 114 | 82 | 92 | 85
glm-4.5v | 91 | 50 | 87 | 83 | 88 | 102 | 93 | 110
gemini-1.5-pro-002 | 92 | 101 | 107 | 132 | 101 | 69 | 94 | 94
amazon-nova-experimental-chat-10-20 | 93 | 79 | 84 | 80 | 54 | 139 | 82 | 88
o3-mini | 94 | 85 | 89 | 77 | 73 | 110 | 90 | 92
hunyuan-turbos-20250226 | 95 | - | 95 | 85 | 123 | 112 | 87 | 95
ling-flash-2.0 | 96 | 89 | 91 | 78 | 91 | 148 | 109 | 125
minimax-m2 | 97 | 99 | 90 | 107 | 87 | 129 | 96 | 100
step-3 | 98 | 90 | 82 | 82 | 83 | 107 | 88 | 97
llama-3.1-nemotron-ultra-253b-v1 | 99 | - | 86 | 92 | 75 | 83 | 84 | 104
amazon-nova-experimental-chat-10-09 | 100 | - | 102 | 96 | - | 120 | 110 | 112
gpt-4o-2024-05-13 | 101 | 119 | 116 | 116 | 119 | 80 | 106 | 124
qwen3-32b | 102 | 61 | 94 | 81 | 49 | 108 | 101 | 93
qwen-plus-0125 | 103 | 82 | 106 | 106 | 104 | 100 | 98 | 90
glm-4-plus-0111 | 104 | 118 | 130 | 153 | 125 | 98 | 113 | 111
claude-3-5-sonnet-20240620 | 105 | 97 | 98 | 87 | 100 | 111 | 97 | 96
gemma-3-12b-it | 106 | 147 | 121 | 158 | 107 | 85 | 108 | 99
nvidia-llama-3.3-nemotron-super-49b-v1.5 | 107 | 80 | 99 | 88 | 66 | 106 | 107 | 106
hunyuan-turbo-0110 | 108 | - | 110 | 112 | 145 | 116 | 119 | 105
gpt-5-nano-high | 109 | 91 | 104 | 105 | 94 | 162 | 102 | 121
llama-3.1-405b-instruct-bf16 | 110 | 121 | 113 | 109 | 109 | 114 | 121 | 130
o1-mini | 111 | 94 | 100 | 98 | 86 | 140 | 99 | 103
gpt-4o-2024-08-06 | 112 | 120 | 131 | 128 | 116 | 91 | 112 | 115
grok-2-2024-08-13 | 113 | 114 | 134 | 130 | 132 | 99 | 125 | 120
qwq-32b | 114 | 88 | 103 | 101 | 85 | 122 | 104 | 113
gemini-advanced-0514 | 115 | 137 | 133 | 149 | 122 | 77 | 117 | 136
llama-3.1-405b-instruct-fp8 | 116 | 109 | 118 | 120 | 108 | 109 | 120 | 140
step-2-16k-exp-202412 | 117 | 106 | 117 | 114 | 110 | 81 | 114 | 108
yi-lightning | 118 | 104 | 114 | 115 | 117 | 119 | 127 | 133
llama-4-maverick-17b-128e-instruct | 119 | 105 | 115 | 111 | 106 | 104 | 118 | 119
qwen3-30b-a3b | 120 | 96 | 111 | 102 | 90 | 133 | 122 | 109
llama-3.3-nemotron-49b-super-v1 | 121 | - | 101 | 123 | - | 121 | 105 | 117
hunyuan-large-2025-02-10 | 122 | 108 | 126 | 121 | 130 | 113 | 115 | 79
gpt-4-turbo-2024-04-09 | 123 | 131 | 144 | 142 | 127 | 95 | 133 | 144
claude-3-5-haiku-20241022 | 124 | 115 | 112 | 104 | 141 | 115 | 124 | 107
llama-4-scout-17b-16e-instruct | 125 | 122 | 125 | 124 | 111 | 125 | 136 | 126
deepseek-v2.5-1210 | 126 | 123 | 128 | 108 | 131 | 103 | 116 | 118
claude-3-opus-20240229 | 127 | 111 | 127 | 135 | 115 | 130 | 126 | 129
gemini-1.5-pro-001 | 128 | 112 | 129 | 141 | 129 | 96 | 128 | 98
gpt-4.1-nano-2025-04-14 | 129 | 113 | 124 | 110 | 148 | 105 | 134 | 128
ring-flash-2.0 | 130 | 87 | 105 | 91 | 96 | 160 | 111 | 122
step-1o-turbo-202506 | 131 | 128 | 122 | 133 | 120 | 126 | 123 | 102
llama-3.3-70b-instruct | 132 | 132 | 137 | 144 | 128 | 132 | 144 | 148
gemma-3n-e4b-it | 133 | 143 | 145 | 162 | 157 | 117 | 149 | 143
glm-4-plus | 134 | 125 | 141 | 138 | 139 | 127 | 135 | 134
gpt-oss-20b | 135 | 110 | 136 | 113 | 102 | 171 | 152 | 150
qwen-max-0919 | 136 | 126 | 142 | 137 | 133 | 136 | 132 | 132
gpt-4o-mini-2024-07-18 | 137 | 141 | 149 | 140 | 147 | 123 | 142 | 135
qwen2.5-plus-1127 | 138 | 107 | 132 | 131 | 118 | 146 | 139 | 149
gpt-4-1106-preview | 139 | 140 | 146 | 150 | 121 | 135 | 140 | 156
mistral-large-2407 | 140 | 127 | 139 | 136 | 137 | 131 | 137 | 153
gpt-4-0125-preview | 141 | 145 | 152 | 155 | 124 | 141 | 148 | 151
athene-v2-chat | 142 | 116 | 120 | 117 | 113 | 166 | 131 | 138
olmo-3-32b-think | 143 | - | 123 | 127 | - | 153 | 129 | 114
mercury | 144 | - | 135 | 118 | - | 172 | 155 | 147
hunyuan-standard-2025-02-10 | 145 | 138 | 150 | 154 | 134 | 145 | 151 | 123
gemini-1.5-flash-002 | 146 | 144 | 156 | 160 | 135 | 118 | 146 | 137
grok-2-mini-2024-08-13 | 147 | 136 | 155 | 151 | 150 | 149 | 150 | 142
deepseek-v2.5 | 148 | 130 | 138 | 119 | 136 | 152 | 145 | 146
magistral-medium-2506 | 149 | 134 | 119 | 99 | 140 | 128 | 130 | 116
mistral-large-2411 | 150 | 151 | 147 | 143 | 142 | 144 | 141 | 152
athene-70b-0725 | 151 | 129 | 148 | 139 | 161 | 150 | 156 | 159
mistral-small-3.1-24b-instruct-2503 | 152 | 117 | 140 | 125 | 144 | 147 | 138 | 127
gemma-3-4b-it | 153 | 159 | 163 | 187 | 164 | 142 | 161 | 145
qwen2.5-72b-instruct | 154 | 133 | 143 | 134 | 126 | 164 | 143 | 141
llama-3.1-nemotron-70b-instruct | 155 | 135 | 151 | 156 | 146 | 143 | 153 | 166
hunyuan-large-vision | 156 | 124 | 154 | 129 | 138 | 134 | 147 | 139
llama-3.1-70b-instruct | 157 | 150 | 159 | 152 | 156 | 159 | 158 | 158
amazon-nova-pro-v1.0 | 158 | 149 | 158 | 147 | 154 | 176 | 157 | 154
jamba-1.5-large | 159 | 142 | 164 | 164 | 171 | 158 | 163 | 184
gemma-2-27b-it | 160 | 163 | 167 | 172 | 173 | 124 | 162 | 157
reka-core-20240904 | 161 | 139 | 170 | 161 | 169 | 151 | 167 | 169
gpt-4-0314 | 162 | 158 | 153 | 157 | 143 | 165 | 154 | 171
llama-3.1-nemotron-51b-instruct | 163 | 164 | 168 | 167 | 153 | 157 | 168 | 175
llama-3.1-tulu-3-70b | 164 | - | 174 | 168 | 159 | 167 | 159 | 173
gemini-1.5-flash-001 | 165 | 154 | 161 | 166 | 163 | 156 | 165 | 155
claude-3-sonnet-20240229 | 166 | 152 | 166 | 159 | 166 | 168 | 164 | 165
gemma-2-9b-it-simpo | 167 | 177 | 176 | 191 | 195 | 138 | 174 | 161
nemotron-4-340b-instruct | 168 | 162 | 169 | 171 | 165 | 174 | 169 | 162
command-r-plus-08-2024 | 169 | 168 | 184 | 185 | 176 | 155 | 173 | 167
llama-3-70b-instruct | 170 | 170 | 171 | 173 | 162 | 163 | 170 | 187
gpt-4-0613 | 171 | 167 | 160 | 165 | 151 | 154 | 160 | 170
mistral-small-24b-instruct-2501 | 172 | 161 | 165 | 163 | 160 | 178 | 172 | 168
glm-4-0520 | 173 | 165 | 172 | 169 | 168 | 170 | 171 | 178
reka-flash-20240904 | 174 | 148 | 180 | 180 | 174 | 169 | 177 | 181
qwen2.5-coder-32b-instruct | 175 | 146 | 157 | 146 | 152 | 188 | 166 | 163
c4ai-aya-expanse-32b | 176 | 156 | 178 | 179 | 175 | 177 | 176 | 160
gemma-2-9b-it | 177 | 176 | 186 | 195 | 186 | 161 | 181 | 180
deepseek-coder-v2 | 178 | 155 | 162 | 148 | 155 | 189 | 175 | 164
command-r-plus | 179 | 172 | 188 | 196 | 189 | 175 | 184 | 183
qwen2-72b-instruct | 180 | 160 | 175 | 177 | 149 | 180 | 183 | 185
claude-3-haiku-20240307 | 181 | 166 | 181 | 175 | 177 | 183 | 180 | 179
amazon-nova-lite-v1.0 | 182 | 157 | 177 | 174 | 170 | 181 | 179 | 172
gemini-1.5-flash-8b-001 | 183 | 171 | 185 | 192 | 179 | 173 | 185 | 177
phi-4 | 184 | 153 | 173 | 170 | 158 | 185 | 178 | 176
olmo-2-0325-32b-instruct | 185 | - | 182 | 184 | 178 | 179 | 188 | 190
command-r-08-2024 | 186 | 179 | 187 | 183 | 194 | 186 | 187 | 182
mistral-large-2402 | 187 | 181 | 183 | 178 | 172 | 184 | 186 | 191
amazon-nova-micro-v1.0 | 188 | 169 | 191 | 181 | 181 | 191 | 190 | 189
jamba-1.5-mini | 189 | 191 | 194 | 197 | 213 | 187 | 199 | 203
ministral-8b-2410 | 190 | 173 | 189 | 190 | 187 | 182 | 193 | 186
qwen1.5-110b-chat | 191 | 180 | 190 | 186 | 182 | 198 | 189 | 195
gemini-pro-dev-api | 192 | 202 | 199 | 214 | 212 | 190 | 198 | 200
qwen1.5-72b-chat | 193 | 175 | 193 | 193 | 193 | 201 | 195 | 192
reka-flash-21b-20240226-online | 194 | 183 | 196 | 188 | 190 | 206 | 201 | 206
hunyuan-standard-256k | 195 | - | 179 | 176 | 167 | 195 | 182 | 174
mixtral-8x22b-instruct-v0.1 | 196 | 188 | 192 | 189 | 180 | 200 | 192 | 204
command-r | 197 | 190 | 213 | 212 | 214 | 194 | 202 | 193
reka-flash-21b-20240226 | 198 | 189 | 197 | 198 | 198 | 208 | 208 | 205
gpt-3.5-turbo-0125 | 199 | 198 | 202 | 194 | 200 | 202 | 194 | 202
mistral-medium | 200 | 182 | 195 | 199 | 183 | 197 | 196 | 201
c4ai-aya-expanse-8b | 201 | 178 | 200 | 203 | 199 | 199 | 200 | 188
llama-3-8b-instruct | 202 | 193 | 209 | 205 | 210 | 193 | 206 | 207
llama-3.1-tulu-3-8b | 203 | - | 206 | 204 | 191 | 196 | 197 | 196
gemini-pro | 204 | - | 201 | 209 | 203 | 205 | 191 | -
zephyr-orpo-141b-A35b-v0.1 | 205 | 205 | 212 | 211 | 202 | 211 | 207 | 217
yi-1.5-34b-chat | 206 | 186 | 205 | 210 | 188 | 213 | 209 | 208
llama-3.1-8b-instruct | 207 | 196 | 207 | 201 | 209 | 207 | 204 | 197
granite-3.1-8b-instruct | 208 | 174 | 198 | 182 | 208 | 210 | 203 | 194
qwen1.5-32b-chat | 209 | 184 | 208 | 200 | 196 | 237 | 211 | 198
gpt-3.5-turbo-1106 | 210 | 192 | 203 | 202 | 197 | 218 | 205 | 215
phi-3-medium-4k-instruct | 211 | 194 | 214 | 218 | 185 | 220 | 214 | 212
gemma-2-2b-it | 212 | 206 | 224 | 231 | 219 | 204 | 217 | 214
mixtral-8x7b-instruct-v0.1 | 213 | 197 | 215 | 216 | 211 | 214 | 213 | 218
dbrx-instruct-preview | 214 | 200 | 211 | 207 | 204 | 212 | 210 | 213
qwen1.5-14b-chat | 215 | 199 | 217 | 215 | 216 | 228 | 219 | 211
internlm2_5-20b-chat | 216 | 185 | 204 | 208 | 192 | 239 | 212 | 210
wizardlm-70b | 217 | - | 231 | 234 | 221 | 192 | 220 | 219
deepseek-llm-67b-chat | 218 | - | 228 | 222 | 224 | 238 | 221 | 220
yi-34b-chat | 219 | 213 | 225 | 227 | 226 | 216 | 225 | 223
granite-3.0-8b-instruct | 220 | 195 | 216 | 213 | 205 | 229 | 218 | 209
openchat-3.5-0106 | 221 | 211 | 222 | 219 | 222 | 222 | 223 | 224
openchat-3.5 | 222 | 209 | 229 | 229 | 237 | 203 | 227 | 216
granite-3.1-2b-instruct | 223 | 187 | 210 | 206 | 201 | 219 | 215 | 199
snowflake-arctic-instruct | 224 | 210 | 223 | 220 | 218 | 225 | 229 | 242
gemma-1.1-7b-it | 225 | 207 | 219 | 221 | 223 | 221 | 226 | 226
tulu-2-dpo-70b | 226 | - | 221 | 223 | 229 | 223 | 216 | 222
openhermes-2.5-mistral-7b | 227 | - | 226 | 236 | 228 | 215 | 228 | 236
vicuna-33b | 228 | 216 | 235 | 233 | 239 | 209 | 232 | 241
starling-lm-7b-beta | 229 | 204 | 218 | 217 | 220 | 243 | 230 | 225
phi-3-small-8k-instruct | 230 | 208 | 220 | 228 | 207 | 234 | 224 | 228
llama-2-70b-chat | 231 | 220 | 236 | 239 | 231 | 242 | 234 | 237
starling-lm-7b-alpha | 232 | 219 | 234 | 226 | 233 | 231 | 233 | 230
llama-3.2-3b-instruct | 233 | 201 | 233 | 240 | 217 | 224 | 231 | 229
nous-hermes-2-mixtral-8x7b-dpo | 234 | - | 254 | 242 | 247 | 230 | 250 | 244
qwq-32b-preview | 235 | 212 | 232 | 241 | 184 | 244 | 222 | 221
llama2-70b-steerlm-chat | 236 | - | 246 | 256 | 240 | 241 | 239 | 258
granite-3.0-2b-instruct | 237 | 203 | 227 | 225 | 215 | 249 | 237 | 231
solar-10.7b-instruct-v1.0 | 238 | - | 237 | 238 | 244 | 227 | 244 | -
dolphin-2.2.1-mistral-7b | 239 | - | 241 | - | 235 | 233 | 238 | -
mistral-7b-instruct-v0.2 | 240 | 218 | 240 | 237 | 232 | 245 | 242 | 240
mpt-30b-chat | 241 | - | 239 | 246 | 248 | 236 | 235 | -
wizardlm-13b | 242 | - | 258 | 254 | 257 | 226 | 243 | 232
falcon-180b-chat | 243 | - | 255 | - | - | 217 | 236 | -
qwen1.5-7b-chat | 244 | 215 | 245 | 224 | 238 | 261 | 241 | 227
phi-3-mini-4k-instruct-june-2024 | 245 | 214 | 230 | 230 | 206 | 250 | 240 | 247
llama-2-13b-chat | 246 | 222 | 249 | 248 | 243 | 252 | 249 | 235
vicuna-13b | 247 | 224 | 252 | 247 | 253 | 240 | 246 | 233
qwen-14b-chat | 248 | - | 248 | 232 | 236 | 247 | 245 | 243
codellama-34b-instruct | 249 | - | 251 | 250 | 245 | 259 | 252 | 251
palm-2 | 250 | - | 247 | 252 | 242 | 258 | 248 | 239
gemma-7b-it | 251 | 223 | 242 | 244 | 241 | 248 | 253 | 246
zephyr-7b-beta | 252 | 226 | 257 | 253 | 252 | 232 | 258 | 248
phi-3-mini-128k-instruct | 253 | 229 | 253 | 251 | 230 | 257 | 255 | 256
phi-3-mini-4k-instruct | 254 | 217 | 244 | 235 | 225 | 265 | 247 | 249
zephyr-7b-alpha | 255 | - | 256 | 245 | - | 246 | 254 | -
guanaco-33b | 256 | - | 264 | 262 | 255 | 235 | 266 | -
stripedhyena-nous-7b | 257 | - | 262 | 261 | 250 | 253 | 259 | 257
codellama-70b-instruct | 258 | - | 238 | - | - | - | 256 | -
smollm2-1.7b-instruct | 259 | - | 243 | 249 | 227 | 262 | 251 | 238
vicuna-7b | 260 | - | 263 | 260 | 259 | 256 | 262 | 234
gemma-1.1-2b-it | 261 | 227 | 250 | 243 | 246 | 254 | 257 | 245
llama-3.2-1b-instruct | 262 | 230 | 260 | 255 | 234 | 260 | 260 | 250
mistral-7b-instruct | 263 | 231 | 259 | 257 | 254 | 251 | 261 | 252
llama-2-7b-chat | 264 | 221 | 265 | 264 | 249 | 264 | 263 | 255
gemma-2b-it | 265 | - | 261 | 258 | 256 | 263 | 265 | 253
qwen1.5-4b-chat | 266 | 228 | 266 | 259 | 251 | 268 | 264 | 254
olmo-7b-instruct | 267 | 225 | 267 | 263 | 258 | 272 | 269 | -
koala-13b | 268 | - | 270 | 267 | 265 | 269 | 270 | -
alpaca-13b | 269 | - | 275 | 273 | 262 | 255 | 271 | -
gpt4all-13b-snoozy | 270 | - | 268 | - | 261 | 266 | 267 | -
mpt-7b-chat | 271 | - | 271 | 266 | 263 | 267 | 272 | -
chatglm3-6b | 272 | - | 269 | 265 | 260 | 270 | 268 | 259
RWKV-4-Raven-14B | 273 | - | 274 | 268 | 264 | 274 | 276 | -
chatglm2-6b | 274 | - | 272 | 271 | 267 | 273 | 274 | -
oasst-pythia-12b | 275 | - | 273 | 269 | 268 | 271 | 273 | -
chatglm-6b | 276 | - | 276 | 270 | 266 | 277 | 275 | -
fastchat-t5-3b | 277 | - | 279 | 275 | 270 | 275 | 277 | -
dolly-v2-12b | 278 | - | 277 | 274 | 269 | 276 | 278 | -
llama-13b | 279 | - | 280 | 276 | 271 | 278 | 280 | -
stablelm-tuned-alpha-7b | 280 | - | 278 | 272 | 272 | 279 | 279 | -