Run #568
success · fetched 2026-06-13 05:00:18 · 9.27 MB raw HTML · 538 models
Top quality: Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) (69.2 pts)
| #? | Pareto? | Model? | Released? | Cost$? | $/Q? | Qual? | ΔTop? | Intel? | Code? | Agent? | Pen? | Score? |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ✓ | Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) Anthropic |
2026-06-09 | $9,940 | 143.74 | 69.2 | 0.0 | 64.9 | 62.0 | 80.6 | 40.0 | 69.2 |
| 2 | ✓ | Claude Opus 4.8 (Adaptive Reasoning, Max Effort) Anthropic |
2026-05-28 | $4,309 | 65.96 | 65.3 | -3.8 | 61.4 | 56.7 | 77.8 | 36.3 | 65.3 |
| 3 | ✓ | GPT-5.5 (xhigh) OpenAI |
2026-04-23 | $3,357 | 52.05 | 64.5 | -4.7 | 60.2 | 59.1 | 74.1 | 35.3 | 64.5 |
| 4 | ✓ | GPT-5.5 (high) OpenAI |
2026-04-23 | $2,159 | 34.21 | 63.1 | -6.0 | 58.9 | 58.5 | 72.0 | 33.3 | 63.1 |
| 5 | ✓ | GPT-5.5 (medium) OpenAI |
2026-04-23 | $1,199 | 19.73 | 60.8 | -8.4 | 56.7 | 56.2 | 69.4 | 30.8 | 60.8 |
| 6 | GPT-5.4 (xhigh) OpenAI |
2026-03-05 | $2,851 | 46.99 | 60.7 | -8.5 | 56.8 | 57.2 | 68.0 | 34.5 | 60.7 | |
| 7 | Claude Opus 4.7 (Adaptive Reasoning, Max Effort) Anthropic |
2026-04-16 | $4,653 | 77.09 | 60.4 | -8.8 | 57.3 | 52.5 | 71.3 | 36.7 | 60.4 | |
| 8 | Qwen3.7 Max Alibaba |
2026-05-19 | $1,202 | 20.82 | 57.8 | -11.4 | 56.6 | 50.1 | 66.6 | 30.8 | 57.8 | |
| 9 | ✓ | Gemini 3.1 Pro Preview |
2026-02-19 | $892 | 15.59 | 57.3 | -11.9 | 57.2 | 55.5 | 59.1 | 29.5 | 57.3 |
| 10 | Gemini 3.5 Flash (high) |
2026-05-19 | $1,552 | 27.28 | 56.9 | -12.3 | 55.3 | 45.0 | 70.3 | 31.9 | 56.9 | |
| 11 | Claude Opus 4.7 (Non-reasoning, High Effort) Anthropic |
2026-04-16 | $1,032 | 18.25 | 56.5 | -12.6 | 51.8 | 53.1 | 64.6 | 30.1 | 56.5 | |
| 12 | Gemini 3.5 Flash (medium) |
2026-05-19 | $1,417 | 25.14 | 56.4 | -12.8 | 54.8 | 43.9 | 70.4 | 31.5 | 56.4 | |
| 13 | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) Anthropic |
2026-02-05 | $4,970 | 88.42 | 56.2 | -12.9 | 52.9 | 48.1 | 67.6 | 37.0 | 56.2 | |
| 14 | GPT-5.3 Codex (xhigh) OpenAI |
2026-02-05 | $1,572 | 28.20 | 55.7 | -13.4 | 53.6 | 53.1 | 60.5 | 32.0 | 55.7 | |
| 15 | Kimi K2.6 Kimi |
2026-04-20 | $948 | 17.03 | 55.7 | -13.5 | 53.9 | 47.1 | 66.0 | 29.8 | 55.7 | |
| 16 | ✓ | MiMo-V2.5-Pro Xiaomi |
2026-04-22 | $161 | 2.89 | 55.6 | -13.6 | 53.8 | 45.5 | 67.4 | 22.1 | 55.6 |
| 17 | MiniMax-M3 MiniMax |
2026-06-01 | $308 | 5.55 | 55.6 | -13.6 | 54.7 | 43.4 | 68.6 | 24.9 | 55.6 | |
| 18 | DeepSeek V4 Pro (Reasoning, Max Effort) DeepSeek |
2026-04-24 | $268 | 4.83 | 55.4 | -13.8 | 51.5 | 47.5 | 67.2 | 24.3 | 55.4 | |
| 19 | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) Anthropic |
2026-02-17 | $3,959 | 71.70 | 55.2 | -13.9 | 51.7 | 50.9 | 63.0 | 36.0 | 55.2 | |
| 20 | Qwen3.7 Plus Alibaba |
2026-06-01 | $209 | 3.80 | 55.0 | -14.2 | 53.3 | 46.5 | 65.1 | 23.2 | 55.0 | |
| 21 | GPT-5.5 (low) OpenAI |
2026-04-23 | $501 | 9.24 | 54.2 | -15.0 | 50.8 | 52.1 | 59.7 | 27.0 | 54.2 | |
| 22 | Qwen3.6 Max Preview Alibaba |
2026-04-20 | $861 | 15.98 | 53.9 | -15.3 | 51.8 | 44.9 | 64.8 | 29.3 | 53.9 | |
| 23 | GPT-5.2 (xhigh) OpenAI |
2025-12-11 | $2,304 | 43.16 | 53.4 | -15.8 | 51.3 | 48.7 | 60.2 | 33.6 | 53.4 | |
| 24 | Grok 4.3 (high) xAI |
2026-04-30 | $395 | 7.40 | 53.4 | -15.8 | 53.2 | 41.0 | 65.9 | 26.0 | 53.4 | |
| 25 | DeepSeek V4 Pro (Reasoning, High Effort) DeepSeek |
2026-04-24 | $173 | 3.24 | 53.2 | -15.9 | 49.8 | 43.2 | 66.7 | 22.4 | 53.2 | |
| 26 | GPT-5.4 mini (xhigh) OpenAI |
2026-03-17 | $1,354 | 25.50 | 53.1 | -16.1 | 48.9 | 51.5 | 58.9 | 31.3 | 53.1 | |
| 27 | Claude Opus 4.6 (Non-reasoning, High Effort) Anthropic |
2026-02-05 | $1,451 | 27.51 | 52.7 | -16.4 | 46.5 | 47.6 | 64.2 | 31.6 | 52.7 | |
| 28 | Claude Opus 4.5 (Reasoning) Anthropic |
2025-11-24 | $2,734 | 52.18 | 52.4 | -16.8 | 49.7 | 47.8 | 59.6 | 34.4 | 52.4 | |
| 29 | GLM-5 (Reasoning) Z AI |
2026-02-11 | $547 | 10.45 | 52.4 | -16.8 | 49.8 | 44.2 | 63.1 | 27.4 | 52.4 | |
| 30 | ✓ | MiMo-V2.5 Xiaomi |
2026-04-22 | $49.3 | 0.94 | 52.2 | -16.9 | 49.0 | 42.1 | 65.5 | 16.9 | 52.2 |
| 31 | Qwen3.6 Plus Alibaba |
2026-04-02 | $483 | 9.37 | 51.5 | -17.6 | 50.0 | 42.9 | 61.7 | 26.8 | 51.5 | |
| 32 | MiMo-V2-Pro Xiaomi |
2026-03-18 | $351 | 6.86 | 51.1 | -18.0 | 49.2 | 41.4 | 62.8 | 25.5 | 51.1 | |
| 33 | MiniMax-M2.7 MiniMax |
2026-03-18 | $176 | 3.44 | 51.0 | -18.1 | 49.6 | 41.9 | 61.5 | 22.4 | 51.0 | |
| 34 | Claude Sonnet 4.6 (Non-reasoning, High Effort) Anthropic |
2026-02-17 | $1,397 | 27.49 | 50.8 | -18.3 | 44.4 | 46.4 | 61.6 | 31.5 | 50.8 | |
| 35 | GPT-5.4 (low) OpenAI |
2026-03-05 | $434 | 8.58 | 50.6 | -18.6 | 47.9 | 45.6 | 58.2 | 26.4 | 50.6 | |
| 36 | GPT-5.2 Codex (xhigh) OpenAI |
2025-12-11 | $3,244 | 65.54 | 49.5 | -19.6 | 49.0 | 43.0 | 56.5 | 35.1 | 49.5 | |
| 37 | DeepSeek V4 Flash (Reasoning, High Effort) DeepSeek |
2026-04-24 | $57.4 | 1.16 | 49.4 | -19.8 | 46.0 | 39.8 | 62.3 | 17.6 | 49.4 | |
| 38 | Gemini 3 Pro Preview (high) |
2025-11-18 | $820 | 16.75 | 49.0 | -20.2 | 48.4 | 46.5 | 52.0 | 29.1 | 49.0 | |
| 39 | DeepSeek V4 Flash (Reasoning, Max Effort) DeepSeek |
2026-04-24 | $113 | 2.31 | 48.8 | -20.3 | 46.5 | 38.7 | 61.3 | 20.5 | 48.8 | |
| 40 | GPT-5.2 (medium) OpenAI |
2025-12-11 | $700 | 14.40 | 48.6 | -20.6 | 46.6 | 44.2 | 54.9 | 28.4 | 48.6 | |
| 41 | GLM-5.1 (Non-reasoning) Z AI |
2026-04-07 | $618 | 12.72 | 48.5 | -20.6 | 43.8 | 35.8 | 66.0 | 27.9 | 48.5 | |
| 42 | Kimi K2.5 (Reasoning) Kimi |
2026-01-27 | $367 | 7.58 | 48.4 | -20.7 | 46.8 | 39.6 | 58.9 | 25.6 | 48.4 | |
| 43 | Claude Opus 4.5 (Non-reasoning) Anthropic |
2025-11-24 | $1,153 | 23.81 | 48.4 | -20.7 | 43.1 | 42.9 | 59.2 | 30.6 | 48.4 | |
| 44 | Qwen3.6 27B (Reasoning) Alibaba |
2026-04-22 | $659 | 13.61 | 48.4 | -20.8 | 45.8 | 36.5 | 62.9 | 28.2 | 48.4 | |
| 45 | GPT-5.1 (high) OpenAI |
2025-11-13 | $779 | 16.26 | 47.9 | -21.3 | 47.7 | 44.7 | 51.3 | 28.9 | 47.9 | |
| 46 | Grok 4.20 0309 v2 (Reasoning) xAI |
2026-04-07 | $514 | 10.74 | 47.9 | -21.3 | 49.3 | 40.5 | 53.9 | 27.1 | 47.9 | |
| 47 | Claude Sonnet 4.6 (Non-reasoning, Low Effort) Anthropic |
2026-02-17 | $554 | 11.63 | 47.7 | -21.5 | 42.6 | 43.0 | 57.5 | 27.4 | 47.7 | |
| 48 | Nemotron 3 Ultra 550B A55B (Reasoning) NVIDIA |
2026-06-04 | $409 | 8.63 | 47.4 | -21.7 | 47.7 | 37.6 | 57.1 | 26.1 | 47.4 | |
| 49 | Qwen3.5 397B A17B (Reasoning) Alibaba |
2026-02-16 | $418 | 8.81 | 47.4 | -21.8 | 45.0 | 41.3 | 55.8 | 26.2 | 47.4 | |
| 50 | Grok 4.20 0309 (Reasoning) xAI |
2026-03-10 | $484 | 10.27 | 47.2 | -22.0 | 48.5 | 42.2 | 50.9 | 26.9 | 47.2 | |
| 51 | Gemini 3.5 Flash (minimal) |
2026-05-19 | $750 | 15.91 | 47.1 | -22.0 | 43.3 | 47.1 | 51.0 | 28.7 | 47.1 | |
| 52 | Grok 4.3 (medium) xAI |
2026-04-30 | $161 | 3.43 | 47.1 | -22.0 | 48.8 | 35.1 | 57.5 | 22.1 | 47.1 | |
| 53 | DeepSeek V4 Pro (Non-reasoning) DeepSeek |
2026-04-24 | $154 | 3.28 | 47.0 | -22.2 | 39.3 | 38.4 | 63.3 | 21.9 | 47.0 | |
| 54 | MiMo-V2-Omni-0327 Xiaomi |
2026-03-27 | $218 | 4.66 | 46.8 | -22.3 | 44.9 | 36.9 | 58.6 | 23.4 | 46.8 | |
| 55 | KAT Coder Pro V2 KwaiKAT |
2026-03-27 | $73.5 | 1.57 | 46.7 | -22.5 | 43.8 | 45.6 | 50.7 | 18.7 | 46.7 | |
| 56 | Kimi K2.6 (Non-reasoning) Kimi |
2026-04-20 | $505 | 10.81 | 46.7 | -22.5 | 42.9 | 38.4 | 58.7 | 27.0 | 46.7 | |
| 57 | GLM-5 (Non-reasoning) Z AI |
2026-02-11 | $240 | 5.15 | 46.6 | -22.5 | 40.6 | 39.0 | 60.3 | 23.8 | 46.6 | |
| 58 | GPT-5.5 (Non-reasoning) OpenAI |
2026-04-23 | $361 | 7.74 | 46.6 | -22.6 | 40.9 | 48.6 | 50.2 | 25.6 | 46.6 | |
| 59 | Step 3.7 Flash StepFun |
2026-05-29 | $368 | 7.93 | 46.4 | -22.8 | 42.6 | 37.1 | 59.5 | 25.7 | 46.4 | |
| 60 | Gemini 3 Flash Preview (Reasoning) |
2025-12-17 | $278 | 6.02 | 46.2 | -22.9 | 46.4 | 42.6 | 49.7 | 24.4 | 46.2 | |
| 61 | Qwen3.6 35B A3B (Reasoning) Alibaba |
2026-04-16 | $280 | 6.13 | 45.7 | -23.5 | 43.5 | 35.2 | 58.3 | 24.5 | 45.7 | |
| 62 | GPT-5 Codex (high) OpenAI |
2025-09-23 | $995 | 21.91 | 45.4 | -23.7 | 44.6 | 38.9 | 52.7 | 30.0 | 45.4 | |
| 63 | GPT-5.4 nano (xhigh) OpenAI |
2026-03-17 | $363 | 8.04 | 45.2 | -24.0 | 44.0 | 43.9 | 47.6 | 25.6 | 45.2 | |
| 64 | GPT-5 (high) OpenAI |
2025-08-07 | $913 | 20.23 | 45.1 | -24.1 | 44.6 | 36.0 | 54.7 | 29.6 | 45.1 | |
| 65 | MiniMax-M2.5 MiniMax |
2026-02-12 | $125 | 2.77 | 45.0 | -24.2 | 41.9 | 37.4 | 55.6 | 21.0 | 45.0 | |
| 66 | Hy3-preview (Reasoning) Tencent |
2026-04-23 | $84.4 | 1.89 | 44.7 | -24.5 | 41.9 | 36.5 | 55.7 | 19.3 | 44.7 | |
| 67 | Claude 4.5 Sonnet (Reasoning) Anthropic |
2025-09-29 | $1,459 | 32.81 | 44.5 | -24.7 | 43.0 | 38.6 | 51.7 | 31.6 | 44.5 | |
| 68 | GLM-4.7 (Reasoning) Z AI |
2025-12-22 | $478 | 10.75 | 44.5 | -24.7 | 42.1 | 36.3 | 55.0 | 26.8 | 44.5 | |
| 69 | ✓ | DeepSeek V4 Flash (Non-reasoning) DeepSeek |
2026-04-24 | $40.0 | 0.90 | 44.3 | -24.8 | 36.5 | 35.2 | 61.3 | 16.0 | 44.3 |
| 70 | Qwen3.5 27B (Reasoning) Alibaba |
2026-02-24 | $299 | 6.82 | 43.8 | -25.3 | 42.1 | 34.9 | 54.6 | 24.8 | 43.8 | |
| 71 | DeepSeek V3.2 (Reasoning) DeepSeek |
2025-12-01 | $75.7 | 1.73 | 43.8 | -25.4 | 41.7 | 36.7 | 52.9 | 18.8 | 43.8 | |
| 72 | Qwen3.5 397B A17B (Non-reasoning) Alibaba |
2026-02-16 | $186 | 4.27 | 43.6 | -25.5 | 40.1 | 37.4 | 53.3 | 22.7 | 43.6 | |
| 73 | GPT-5.1 Codex (high) OpenAI |
2025-11-13 | $892 | 20.52 | 43.5 | -25.7 | 43.1 | 36.6 | 50.7 | 29.5 | 43.5 | |
| 74 | Qwen3.5 122B A10B (Reasoning) Alibaba |
2026-02-24 | $354 | 8.21 | 43.1 | -26.0 | 41.6 | 34.7 | 53.0 | 25.5 | 43.1 | |
| 75 | Mistral Medium 3.5 Mistral |
2026-04-29 | $1,001 | 23.50 | 42.6 | -26.6 | 39.2 | 35.4 | 53.2 | 30.0 | 42.6 | |
| 76 | GPT-5 (medium) OpenAI |
2025-08-07 | $552 | 13.05 | 42.3 | -26.9 | 42.0 | 38.9 | 45.8 | 27.4 | 42.3 | |
| 77 | Grok 4.3 (low) xAI |
2026-04-30 | $98.7 | 2.35 | 42.0 | -27.2 | 43.9 | 31.6 | 50.4 | 19.9 | 42.0 | |
| 78 | Gemini 3 Pro Preview (low) |
2025-11-18 | $355 | 8.47 | 41.9 | -27.3 | 41.3 | 39.4 | 45.0 | 25.5 | 41.9 | |
| 79 | GPT-5.5 Instant (May 2026) OpenAI |
2026-05-05 | $368 | 8.84 | 41.6 | -27.5 | 41.8 | 45.1 | 38.1 | 25.7 | 41.6 | |
| 80 | Qwen3.6 27B (Non-reasoning) Alibaba |
2026-04-22 | $234 | 5.64 | 41.5 | -27.6 | 37.1 | 26.6 | 60.9 | 23.7 | 41.5 | |
| 81 | MiMo-V2-Flash (Feb 2026) Xiaomi |
2025-12-16 | $66.5 | 1.61 | 41.2 | -27.9 | 41.5 | 33.5 | 48.8 | 18.2 | 41.2 | |
| 82 | Kimi K2 Thinking Kimi |
2025-11-06 | $308 | 7.48 | 41.2 | -27.9 | 40.9 | 34.8 | 47.9 | 24.9 | 41.2 | |
| 83 | Grok 4 xAI |
2025-07-10 | $2,881 | 69.98 | 41.2 | -28.0 | 41.5 | 40.5 | 41.5 | 34.6 | 41.2 | |
| 84 | Ring-2.6-1T InclusionAI |
2026-05-08 | $334 | 8.12 | 41.1 | -28.1 | 38.5 | 33.3 | 51.5 | 25.2 | 41.1 | |
| 85 | MiMo-V2-Flash (Reasoning) Xiaomi |
2025-12-16 | $47.5 | 1.16 | 41.0 | -28.1 | 39.2 | 31.8 | 52.1 | 16.8 | 41.0 | |
| 86 | MiMo-V2.5-Pro (Non-reasoning) Xiaomi |
2026-04-22 | $633 | 15.43 | 41.0 | -28.1 | 35.6 | 36.8 | 50.8 | 28.0 | 41.0 | |
| 87 | Qwen3.5 27B (Non-reasoning) Alibaba |
2026-02-24 | $128 | 3.15 | 40.7 | -28.4 | 37.2 | 33.4 | 51.5 | 21.1 | 40.7 | |
| 88 | GPT-5 mini (high) OpenAI |
2025-08-07 | $168 | 4.14 | 40.7 | -28.5 | 41.2 | 35.3 | 45.5 | 22.3 | 40.7 | |
| 89 | Step 3.5 Flash StepFun |
2026-02-02 | $74.8 | 1.85 | 40.5 | -28.7 | 37.8 | 31.6 | 52.0 | 18.7 | 40.5 | |
| 90 | Step 3.5 Flash 2603 StepFun |
2026-04-02 | $95.3 | 2.36 | 40.4 | -28.7 | 38.5 | 34.6 | 48.2 | 19.8 | 40.4 | |
| 91 | Claude 4.5 Sonnet (Non-reasoning) Anthropic |
2025-09-29 | $685 | 16.95 | 40.4 | -28.7 | 37.1 | 33.5 | 50.6 | 28.4 | 40.4 | |
| 92 | GLM-4.7 (Non-reasoning) Z AI |
2025-12-22 | $147 | 3.66 | 40.2 | -29.0 | 34.2 | 32.0 | 54.3 | 21.7 | 40.2 | |
| 93 | Qwen3 Max Thinking Alibaba |
2026-01-26 | $669 | 16.65 | 40.2 | -29.0 | 39.8 | 30.5 | 50.1 | 28.3 | 40.2 | |
| 94 | GPT-5 (low) OpenAI |
2025-08-07 | $228 | 5.71 | 39.9 | -29.3 | 39.2 | 30.7 | 49.7 | 23.6 | 39.9 | |
| 95 | MiniMax-M2.1 MiniMax |
2025-12-23 | $114 | 2.87 | 39.9 | -29.3 | 39.4 | 32.8 | 47.4 | 20.6 | 39.9 | |
| 96 | Qwen3.5 Omni Plus Alibaba |
2026-03-30 | $150 | 3.77 | 39.7 | -29.5 | 38.6 | 27.6 | 52.8 | 21.8 | 39.7 | |
| 97 | Qwen3.5 122B A10B (Non-reasoning) Alibaba |
2026-02-24 | $166 | 4.27 | 39.0 | -30.2 | 35.9 | 31.6 | 49.5 | 22.2 | 39.0 | |
| 98 | Kimi K2.5 (Non-reasoning) Kimi |
2026-01-27 | $141 | 3.65 | 38.6 | -30.5 | 37.3 | 25.8 | 52.8 | 21.5 | 38.6 | |
| 99 | Claude 4 Sonnet (Reasoning) Anthropic |
2025-05-22 | $1,246 | 32.29 | 38.6 | -30.6 | 38.7 | 34.1 | 43.0 | 31.0 | 38.6 | |
| 100 | GPT-5.4 (Non-reasoning) OpenAI |
2026-03-05 | $285 | 7.41 | 38.5 | -30.7 | 35.4 | 41.0 | 39.1 | 24.6 | 38.5 |